Update files from the datasets library (from 1.4.0)
Release notes: https://github.com/huggingface/datasets/releases/tag/1.4.0
This commit is contained in:
parent
09c7283c8b
commit
12483a7f39
42
README.md
42
README.md
@ -27,7 +27,7 @@
|
|||||||
- [Citation Information](#citation-information)
|
- [Citation Information](#citation-information)
|
||||||
- [Contributions](#contributions)
|
- [Contributions](#contributions)
|
||||||
|
|
||||||
## [Dataset Description](#dataset-description)
|
## Dataset Description
|
||||||
|
|
||||||
- **Homepage:** [http://ai.stanford.edu/~amaas/data/sentiment/](http://ai.stanford.edu/~amaas/data/sentiment/)
|
- **Homepage:** [http://ai.stanford.edu/~amaas/data/sentiment/](http://ai.stanford.edu/~amaas/data/sentiment/)
|
||||||
- **Repository:** [More Information Needed](https://github.com/huggingface/datasets/blob/master/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
|
- **Repository:** [More Information Needed](https://github.com/huggingface/datasets/blob/master/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
|
||||||
@ -37,24 +37,24 @@
|
|||||||
- **Size of the generated dataset:** 127.06 MB
|
- **Size of the generated dataset:** 127.06 MB
|
||||||
- **Total amount of disk used:** 207.28 MB
|
- **Total amount of disk used:** 207.28 MB
|
||||||
|
|
||||||
### [Dataset Summary](#dataset-summary)
|
### Dataset Summary
|
||||||
|
|
||||||
Large Movie Review Dataset.
|
Large Movie Review Dataset.
|
||||||
This is a dataset for binary sentiment classification containing substantially more data than previous benchmark datasets. We provide a set of 25,000 highly polar movie reviews for training, and 25,000 for testing. There is additional unlabeled data for use as well.
|
This is a dataset for binary sentiment classification containing substantially more data than previous benchmark datasets. We provide a set of 25,000 highly polar movie reviews for training, and 25,000 for testing. There is additional unlabeled data for use as well.
|
||||||
|
|
||||||
### [Supported Tasks](#supported-tasks)
|
### Supported Tasks
|
||||||
|
|
||||||
[More Information Needed](https://github.com/huggingface/datasets/blob/master/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
|
[More Information Needed](https://github.com/huggingface/datasets/blob/master/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
|
||||||
|
|
||||||
### [Languages](#languages)
|
### Languages
|
||||||
|
|
||||||
[More Information Needed](https://github.com/huggingface/datasets/blob/master/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
|
[More Information Needed](https://github.com/huggingface/datasets/blob/master/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
|
||||||
|
|
||||||
## [Dataset Structure](#dataset-structure)
|
## Dataset Structure
|
||||||
|
|
||||||
We show detailed information for up to 5 configurations of the dataset.
|
We show detailed information for up to 5 configurations of the dataset.
|
||||||
|
|
||||||
### [Data Instances](#data-instances)
|
### Data Instances
|
||||||
|
|
||||||
#### plain_text
|
#### plain_text
|
||||||
|
|
||||||
@ -70,7 +70,7 @@ An example of 'train' looks as follows.
|
|||||||
}
|
}
|
||||||
```
|
```
|
||||||
|
|
||||||
### [Data Fields](#data-fields)
|
### Data Fields
|
||||||
|
|
||||||
The data fields are the same among all splits.
|
The data fields are the same among all splits.
|
||||||
|
|
||||||
@ -78,55 +78,55 @@ The data fields are the same among all splits.
|
|||||||
- `text`: a `string` feature.
|
- `text`: a `string` feature.
|
||||||
- `label`: a classification label, with possible values including `neg` (0), `pos` (1).
|
- `label`: a classification label, with possible values including `neg` (0), `pos` (1).
|
||||||
|
|
||||||
### [Data Splits Sample Size](#data-splits-sample-size)
|
### Data Splits Sample Size
|
||||||
|
|
||||||
| name |train|unsupervised|test |
|
| name |train|unsupervised|test |
|
||||||
|----------|----:|-----------:|----:|
|
|----------|----:|-----------:|----:|
|
||||||
|plain_text|25000| 50000|25000|
|
|plain_text|25000| 50000|25000|
|
||||||
|
|
||||||
## [Dataset Creation](#dataset-creation)
|
## Dataset Creation
|
||||||
|
|
||||||
### [Curation Rationale](#curation-rationale)
|
### Curation Rationale
|
||||||
|
|
||||||
[More Information Needed](https://github.com/huggingface/datasets/blob/master/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
|
[More Information Needed](https://github.com/huggingface/datasets/blob/master/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
|
||||||
|
|
||||||
### [Source Data](#source-data)
|
### Source Data
|
||||||
|
|
||||||
[More Information Needed](https://github.com/huggingface/datasets/blob/master/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
|
[More Information Needed](https://github.com/huggingface/datasets/blob/master/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
|
||||||
|
|
||||||
### [Annotations](#annotations)
|
### Annotations
|
||||||
|
|
||||||
[More Information Needed](https://github.com/huggingface/datasets/blob/master/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
|
[More Information Needed](https://github.com/huggingface/datasets/blob/master/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
|
||||||
|
|
||||||
### [Personal and Sensitive Information](#personal-and-sensitive-information)
|
### Personal and Sensitive Information
|
||||||
|
|
||||||
[More Information Needed](https://github.com/huggingface/datasets/blob/master/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
|
[More Information Needed](https://github.com/huggingface/datasets/blob/master/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
|
||||||
|
|
||||||
## [Considerations for Using the Data](#considerations-for-using-the-data)
|
## Considerations for Using the Data
|
||||||
|
|
||||||
### [Social Impact of Dataset](#social-impact-of-dataset)
|
### Social Impact of Dataset
|
||||||
|
|
||||||
[More Information Needed](https://github.com/huggingface/datasets/blob/master/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
|
[More Information Needed](https://github.com/huggingface/datasets/blob/master/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
|
||||||
|
|
||||||
### [Discussion of Biases](#discussion-of-biases)
|
### Discussion of Biases
|
||||||
|
|
||||||
[More Information Needed](https://github.com/huggingface/datasets/blob/master/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
|
[More Information Needed](https://github.com/huggingface/datasets/blob/master/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
|
||||||
|
|
||||||
### [Other Known Limitations](#other-known-limitations)
|
### Other Known Limitations
|
||||||
|
|
||||||
[More Information Needed](https://github.com/huggingface/datasets/blob/master/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
|
[More Information Needed](https://github.com/huggingface/datasets/blob/master/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
|
||||||
|
|
||||||
## [Additional Information](#additional-information)
|
## Additional Information
|
||||||
|
|
||||||
### [Dataset Curators](#dataset-curators)
|
### Dataset Curators
|
||||||
|
|
||||||
[More Information Needed](https://github.com/huggingface/datasets/blob/master/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
|
[More Information Needed](https://github.com/huggingface/datasets/blob/master/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
|
||||||
|
|
||||||
### [Licensing Information](#licensing-information)
|
### Licensing Information
|
||||||
|
|
||||||
[More Information Needed](https://github.com/huggingface/datasets/blob/master/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
|
[More Information Needed](https://github.com/huggingface/datasets/blob/master/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
|
||||||
|
|
||||||
### [Citation Information](#citation-information)
|
### Citation Information
|
||||||
|
|
||||||
```
|
```
|
||||||
@InProceedings{maas-EtAl:2011:ACL-HLT2011,
|
@InProceedings{maas-EtAl:2011:ACL-HLT2011,
|
||||||
|
|||||||
Loading…
Reference in New Issue
Block a user