Dataset Portability Tool

Export and Import Datasets with Ease

Datapluck simplifies dataset management with a CLI and Python library designed to work seamlessly with the Hugging Face Hub and multiple data formats.

Terminal

$ datapluck export imdb \
    --format csv \
    --output_file imdb_reviews.csv \
    --split train

✓ Successfully exported 25,000 records to imdb_reviews.csv
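
After an export like the one above, a quick sanity check is to count the rows in the resulting file and compare the total with the number the CLI reported. The following is a minimal sketch using only the Python standard library. It is not part of Datapluck: the helper name `count_csv_records` is hypothetical, and it assumes the exported CSV begins with a single header row.

```python
import csv
from pathlib import Path


def count_csv_records(path, has_header=True):
    """Count data rows in a CSV file; a quoted multi-line field counts as one record."""
    # newline="" lets the csv module handle newlines embedded in quoted fields.
    with Path(path).open(newline="", encoding="utf-8") as handle:
        # Blank lines parse as empty rows, so they are skipped here.
        rows = sum(1 for row in csv.reader(handle) if row)
    # Assumption: the first row is a header and is not a record.
    return max(rows - 1, 0) if has_header else rows


if __name__ == "__main__":
    csv_path = Path("imdb_reviews.csv")
    if csv_path.exists():
        print(f"{count_csv_records(csv_path):,} records in {csv_path}")
    else:
        print(f"{csv_path} not found; run the export first")
```

Counting parsed rows rather than raw lines matters for text datasets such as movie reviews, where a single field can span several lines.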

Why Choose Datapluck?

Simplify your dataset management workflow with powerful features designed for researchers and data scientists.

Simplified Workflow

Extract, transform, and load datasets with minimal code through an intuitive CLI.

Multiple Formats

Support for CSV, JSON, JSONL, Excel, Parquet, SQLite, and Google Sheets formats.

Hugging Face Integration

Seamless integration with Hugging Face Hub for easy dataset portability.

Python API

Use as a Python library in your data processing pipelines and notebooks.

Cloud Scheduling

Set up recurring jobs to automate your dataset management in the cloud.

Data Exploration

Built-in tools to explore and understand your datasets before export.

Datapluck CLI in Action

Generate commands for your specific use case with our interactive tool.

The name of the dataset on Hugging Face Hub

Path to save the exported dataset

Name of the dataset subset, if applicable

Dataset split to export

Generated Command

Shell
datapluck export dataset_name --format csv
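
For readers who want to script what the command generator does, here is a rough sketch in Python. It assembles a command line from the same fields, using only the flags that appear on this page (`--format`, `--output_file`, `--split`). The function `build_export_command` is hypothetical and not part of Datapluck, and the subset field is left out because its flag name is not shown here.

```python
import shlex


def build_export_command(dataset_name, fmt="csv", output_file=None, split=None):
    """Assemble a `datapluck export` command line from the form fields."""
    if not dataset_name:
        raise ValueError("dataset_name is required")
    args = ["datapluck", "export", dataset_name, "--format", fmt]
    # Optional fields are appended only when a value was provided.
    if output_file:
        args += ["--output_file", output_file]
    if split:
        args += ["--split", split]
    # shlex.join quotes any argument containing spaces or shell metacharacters.
    return shlex.join(args)


print(build_export_command("imdb", output_file="imdb_reviews.csv", split="train"))
```

Building the command as a list and joining it with `shlex.join` keeps file names with spaces safe to paste into a shell.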

Schedule Jobs in the Cloud

Automate your dataset synchronization with our premium cloud scheduling service. Set up recurring jobs to export or import datasets on your preferred schedule.

  • Automated dataset synchronization
  • Customizable schedules and triggers
  • Email notifications and logs
  • Version control for datasets

Free

$0

  • CLI & Python Library
  • Local Execution

Premium

Most Popular

$59/mo

  • Everything in Free
  • Cloud Scheduling
  • 100GB of Data Transfer/Month
  • Email Notifications

How It Works

Get started with Datapluck in three simple steps

1

Install

Install Datapluck using pip

pip install datapluck

2

Configure

Log in to Hugging Face

huggingface-cli login

3

Use

Export or import datasets easily

datapluck export dataset_name

Ready to Get Started?

Join thousands of researchers and data scientists who simplify their dataset management with Datapluck.