1
0
mirror of synced 2025-12-26 14:02:10 -05:00
Files
airbyte/docs/integrations/sources/hugging-face-datasets.md

2.0 KiB

Hugging Face - Datasets

Imports datasets from Hugging Face (https://huggingface.co/datasets)

Only datasets with Parquet exports can be imported with this connector.

Configuration

Input Type Description Default Value
dataset_name string Dataset Name.
dataset_subsets array Dataset Subsets. Dataset Subsets to import. Will import all of them if nothing is provided (see https://huggingface.co/docs/dataset-viewer/en/configs_and_splits for more details)
dataset_splits array Dataset Splits. Splits to import. Will import all of them if nothing is provided (see https://huggingface.co/docs/dataset-viewer/en/configs_and_splits for more details)

Streams

Stream Name Primary Key Pagination Supports Full Sync Supports Incremental
rows DefaultPaginator
splits No pagination

Changelog

Expand to review
Version Date Pull Request Subject
0.0.5 2024-12-21 50079 Update dependencies
0.0.4 2024-12-14 49609 Update dependencies
0.0.3 2024-12-12 49233 Update dependencies
0.0.2 2024-12-11 48911 Starting with this version, the Docker image is now rootless. Please note that this and future versions will not be compatible with Airbyte versions earlier than 0.64
0.0.1 2024-11-28 Initial release by @michel-tricot via Connector Builder