JonathanMiddleton's Collections
data
Updated 11 days ago
karpathy/fineweb-edu-100B-gpt2-token-shards • Updated Jul 1, 2024 • 333 • 6
bigcode/the-stack-v2-train-full-ids • Viewer • Updated Jun 6, 2024 • 60.5M • 460 • 59
HuggingFaceTB/finemath • Viewer • Updated Feb 6, 2025 • 48.3M • 9.32k • 351
nvidia/Nemotron-CC-v2 • Viewer • Updated Dec 23, 2025 • 8.79B • 23.9k • 105
nvidia/Llama-Nemotron-Post-Training-Dataset • Viewer • Updated May 8, 2025 • 3.91M • 2.14k • 643
HuggingFaceTB/smoltalk2 • Viewer • Updated Oct 31, 2025 • 8.61M • 7.63k • 143