alkinun
AtAndDev
68 followers · 93 following
AI & ML interests
LLMs, Alignment, Merging, Unsloth, DPO, SFT, ORPO, SPIN..
Recent Activity
Reacted to ajibawa-2023's post with 🔥 about 2 hours ago
Python-Code-Large Dataset: https://huggingface.co/datasets/ajibawa-2023/Python-Code-Large Python-Code-Large is a large-scale corpus of Python source code comprising more than 2 million rows of Python code. The dataset is designed to support research in large language model (LLM) pretraining, code intelligence, software engineering automation, and program analysis for the Python ecosystem. By providing a high-volume, language-specific corpus, Python-Code-Large enables systematic experimentation in Python-focused model training, domain adaptation, and downstream code understanding tasks. Python-Code-Large addresses the need for a dedicated Python-only dataset at substantial scale, enabling focused research across data science, backend systems, automation, scientific computing, and AI-driven Python environments.
Updated a dataset about 3 hours ago: MedCall/turkish-45k
Published a dataset about 3 hours ago: MedCall/turkish-45k
Organizations
AtAndDev's datasets (16)
AtAndDev/OpenOrca-tr-2k • Updated Sep 2, 2025 • 2k • 6
AtAndDev/SPRL-v0.1 • Updated May 28, 2025 • 936 • 14
AtAndDev/SelfCoder-Test • Updated May 28, 2025 • 936 • 15
AtAndDev/ranky-dataset • Updated Mar 19, 2025 • 2.86k • 16
AtAndDev/symbolm • Updated Jan 23, 2025 • 20k • 5
AtAndDev/symlm • Updated Jan 16, 2025 • 10.1k • 6
AtAndDev/chain-of-diffusion • Updated Jan 7, 2025 • 6.45k • 5
AtAndDev/clip-bicycle-e-bike • Updated Jan 2, 2025 • 6k • 15
AtAndDev/QwQ-LongCoT-59k-cleaned • Updated Dec 6, 2024 • 59.2k • 30 • 1
AtAndDev/sedir-clean • Updated Dec 5, 2024 • 11.8k • 7
AtAndDev/sedir-unclean • Updated Dec 5, 2024 • 19.9k • 5
AtAndDev/ultrachat_200k_formatted • Updated Oct 10, 2024 • 208k • 9
AtAndDev/MedInstruct • Updated Jul 20, 2024 • 216 • 4
AtAndDev/MedRag-textbooks-stella_en_400M_v5 • Updated Jul 14, 2024 • 126k • 8
AtAndDev/MedRag-textbooks-gte-large-en-v1.5 • Updated Jul 14, 2024 • 126k • 16
AtAndDev/MedRag-textbooks-mxbai-embed-large-v1 • Updated Jul 14, 2024 • 126k • 5