Hugging Face - Datasets
Imports datasets from Hugging Face (https://huggingface.co/datasets)
Only datasets with Parquet exports can be imported with this connector.
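To check ahead of time whether a dataset has a Parquet export, one option is to query the Hugging Face dataset viewer API. The sketch below assumes the public `/parquet` endpoint at `https://datasets-server.huggingface.co` and a `parquet_files` key in its JSON response; the helper names are hypothetical and are not part of this connector.

```python
import json
import urllib.parse
import urllib.request

# Assumption: base URL of the public Hugging Face dataset viewer API.
DATASET_VIEWER_URL = "https://datasets-server.huggingface.co"


def parquet_url(dataset_name: str) -> str:
    """Build the URL that lists a dataset's Parquet files (hypothetical helper)."""
    query = urllib.parse.urlencode({"dataset": dataset_name})
    return f"{DATASET_VIEWER_URL}/parquet?{query}"


def has_parquet_export(payload: dict) -> bool:
    """Return True if the response payload lists at least one Parquet file.

    Assumption: the files are listed under the "parquet_files" key.
    """
    return bool(payload.get("parquet_files"))


def fetch_parquet_listing(dataset_name: str) -> dict:
    """Fetch and decode the Parquet listing for a dataset (performs a network call)."""
    with urllib.request.urlopen(parquet_url(dataset_name), timeout=30) as response:
        return json.loads(response.read().decode("utf-8"))
```

A call such as `has_parquet_export(fetch_parquet_listing("<dataset name>"))` would then suggest whether the dataset is likely importable, under the assumptions above.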
Configuration
| Input | Type | Description | Default Value |
|---|---|---|---|
| dataset_name | string | Dataset Name. | |
| dataset_subsets | array | Dataset Subsets. Subsets to import. If none are provided, all subsets are imported (see https://huggingface.co/docs/dataset-viewer/en/configs_and_splits for more details). | |
| dataset_splits | array | Dataset Splits. Splits to import. If none are provided, all splits are imported (see https://huggingface.co/docs/dataset-viewer/en/configs_and_splits for more details). | |
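As an illustration of the "import everything when nothing is provided" behavior, here is a minimal sketch of how an empty `dataset_subsets` or `dataset_splits` list could be interpreted. The config values, the `select_pairs` helper, and the list of available pairs are hypothetical; this is not the connector's actual implementation.

```python
# Hypothetical configuration using the three inputs from the table above.
config = {
    "dataset_name": "example-org/example-dataset",  # placeholder name
    "dataset_subsets": ["default"],                 # only this subset
    "dataset_splits": [],                           # empty: all splits
}


def select_pairs(available, subsets=None, splits=None):
    """Keep the (subset, split) pairs matching the filters.

    An empty or missing filter means "no restriction", mirroring the
    documented behavior of importing everything when nothing is provided.
    """
    wanted_subsets = set(subsets or [])
    wanted_splits = set(splits or [])
    return [
        (subset, split)
        for subset, split in available
        if (not wanted_subsets or subset in wanted_subsets)
        and (not wanted_splits or split in wanted_splits)
    ]


# Hypothetical (subset, split) pairs exposed by a dataset.
available = [
    ("default", "train"),
    ("default", "test"),
    ("extra", "train"),
]

selected = select_pairs(
    available,
    subsets=config["dataset_subsets"],
    splits=config["dataset_splits"],
)
print(selected)
```

With the values above, only the pairs belonging to the `default` subset are kept, and both of its splits are included because no split filter is set.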
Streams
| Stream Name | Primary Key | Pagination | Supports Full Sync | Supports Incremental |
|---|---|---|---|---|
| rows | | DefaultPaginator | ✅ | ❌ |
| splits | | No pagination | ✅ | ❌ |
Changelog
| Version | Date | Pull Request | Subject |
|---|---|---|---|
| 0.0.1 | 2024-11-28 | | Initial release by @michel-tricot via Connector Builder |