Export and ImportDatasets with Ease
Datapluck simplifies dataset management with a powerful CLI and Python library designed to seamlessly work with the Hugging Face Hub and multiple data formats.
$ datapluck export imdb \
--format csv \
--output_file imdb_reviews.csv \
--split train
✓ Successfully exported 25,000 records to imdb_reviews.csv
Why Choose Datapluck?
Simplify your dataset management workflow with powerful features designed for researchers and data scientists.
Simplified Workflow
Extract, transform, and load datasets with minimal code through an intuitive CLI interface.
Multiple Formats
Support for CSV, JSON, JSONL, Excel, Parquet, SQLite, and Google Sheets formats.
Hugging Face Integration
Seamless integration with Hugging Face Hub for easy dataset portability.
Python API
Use as a Python library in your data processing pipelines and notebooks.
Cloud Scheduling
Set up recurring jobs to automate your dataset management in the cloud.
Data Exploration
Built-in tools to explore and understand your datasets before export.
Datapluck CLI in Action
Generate commands for your specific use case with our interactive tool.
The name of the dataset on Hugging Face Hub
Path to save the exported dataset
Name of the dataset subset, if applicable
Dataset split to export
Generated Command
datapluck export --format csv
Schedule Jobs in the Cloud
Automate your dataset synchronization with our premium cloud scheduling service. Set up recurring jobs to export or import datasets on your preferred schedule.
- Automated dataset synchronization
- Customizable schedules and triggers
- Email notifications and logs
- Version control for datasets
Free
$0
- CLI & Python Library
- Local Execution
Premium
$59/mo
- Everything in Free
- Cloud Scheduling
- 100GB of Data Transfer/Month
- Email Notifications
How It Works
Get started with Datapluck in three simple steps
Install
Install Datapluck using pip
pip install datapluck
Configure
Login to Hugging Face
huggingface-cli login
Use
Export or import datasets easily
datapluck export dataset_name
Ready to Get Started?
Join thousands of researchers and data scientists who simplify their dataset management with Datapluck.