Hugging Face
Datasets: 226,401
Sort: Trending
Active filters: parquet
institutional/institutional-books-1.0 • Updated 3 days ago • 983k rows • 9.35k downloads • 139 likes
nvidia/Nemotron-Personas • Updated 10 days ago • 100k rows • 14.4k downloads • 130 likes
openbmb/Ultra-FineWeb • Updated 4 days ago • 1.29B rows • 45.6k downloads • 185 likes
open-r1/Mixture-of-Thoughts • Updated 24 days ago • 699k rows • 38.2k downloads • 230 likes
miriad/miriad-5.8M • Updated 8 days ago • 5.82M rows • 3.19k downloads • 40 likes
open-thoughts/OpenThoughts3-1.2M • Updated 10 days ago • 1.2M rows • 18.8k downloads • 110 likes
cais/hle • Updated about 1 month ago • 2.5k rows • 6.55k downloads • 359 likes
openai/gsm8k • Updated Jan 4, 2024 • 17.6k rows • 503k downloads • 771 likes
PleIAs/common_corpus • Updated 9 days ago • 470M rows • 241k downloads • 293 likes
nvidia/OpenCodeGeneticInstruct • Updated 28 days ago • 15.1M rows • 109 downloads • 8 likes
cais/mmlu • Updated Mar 8, 2024 • 231k rows • 160k downloads • 486 likes
HuggingFaceFW/fineweb • Updated Jan 31 • 25B rows • 278k downloads • 2.2k likes
zou-lab/MedCaseReasoning • Updated 17 days ago • 14.5k rows • 954 downloads • 20 likes
ByteDance-Seed/Code-Contests-Plus • Updated 10 days ago • 49.2k rows • 6.92k downloads • 15 likes
PrismaX/SFE • Updated 7 days ago • 1.66k rows • 857 downloads • 7 likes
BAAI/Infinity-Instruct • Updated 3 days ago • 21.9M rows • 3.2k downloads • 637 likes
QAQAQAQAQ/LiveCodeBench-Pro • Updated Apr 1 • 594 rows • 119 downloads • 6 likes
EssentialAI/eai-taxonomy-med-w-dclm • Updated about 2 hours ago • 81.2M rows • 31 downloads • 6 likes
wikimedia/wikipedia • Updated Jan 9, 2024 • 61.6M rows • 81.5k downloads • 847 likes
virattt/financial-qa-10K • Updated May 31, 2024 • 7k rows • 861 downloads • 90 likes
HuggingFaceTB/smoltalk • Updated Feb 10 • 2.2M rows • 6.14k downloads • 345 likes
Dataseeds/DataSeeds.AI-Sample-Dataset-DSD • Updated 4 days ago • 7.77k rows • 2.11k downloads • 21 likes
yandex/yambda • Updated 13 days ago • 5.31B rows • 48.8k downloads • 162 likes
EssentialAI/eai-taxonomy-code-w-dclm • Updated about 2 hours ago • 274M rows • 536 downloads • 5 likes
AlicanKiraz0/Cybersecurity-Dataset-v1 • Updated 2 days ago • 2.41k rows • 35 downloads • 5 likes
Salesforce/wikitext • Updated Jan 4, 2024 • 3.71M rows • 713k downloads • 466 likes
tatsu-lab/alpaca • Updated May 22, 2023 • 52k rows • 40.1k downloads • 773 likes
roneneldan/TinyStories • Updated Aug 12, 2024 • 2.14M rows • 24.2k downloads • 682 likes
uonlp/CulturaX • Updated Dec 16, 2024 • 7.18B rows • 9.07k downloads • 521 likes
TIGER-Lab/OmniEdit-Filtered-1.2M • Updated Dec 6, 2024 • 1.2M rows • 13.8k downloads • 100 likes