MiniLLM
community
https://github.com/microsoft/LMOps/tree/main/minillm
t1101675
AI & ML interests: Training efficient language models (MiniLLM, MiniPLM)
Team members: 1
MiniLLM's models (50), sorted by recently updated
MiniLLM/MiniLLM-gpt2-340M • Text Generation • Updated Apr 11, 2025 • 895 • 6
MiniLLM/SFT-gpt2-120M • Text Generation • 0.1B • Updated Mar 25, 2025 • 950
MiniLLM/SFT-gpt2-760M • Text Generation • 0.8B • Updated Mar 25, 2025 • 13
MiniLLM/MiniPLM-Qwen-500M • Text Generation • 0.5B • Updated Mar 25, 2025 • 67 • 7
MiniLLM/MiniPLM-llama3.1-212M • Text Generation • 0.2B • Updated Mar 25, 2025 • 12 • 6
MiniLLM/MiniPLM-Mamba-130M • Text Generation • 0.1B • Updated Mar 25, 2025 • 13 • 3
MiniLLM/MiniPLM-Qwen-1.2B • Text Generation • 1B • Updated Mar 25, 2025 • 62 • 4
MiniLLM/Ref-Pretrain-Qwen-104M • Text Generation • 0.1B • Updated Mar 25, 2025 • 10 • 2
MiniLLM/Pretrain-Qwen-1.2B • Text Generation • 1B • Updated Mar 25, 2025 • 5
MiniLLM/Pretrain-Qwen-500M • Text Generation • 0.5B • Updated Mar 25, 2025 • 13
MiniLLM/Pretrain-Qwen-200M • Text Generation • 0.2B • Updated Mar 25, 2025 • 32
MiniLLM/VanillaKD-Pretrain-Qwen-200M • Text Generation • 0.2B • Updated Mar 25, 2025 • 7
MiniLLM/VanillaKD-Pretrain-Qwen-500M • Text Generation • 0.5B • Updated Mar 25, 2025 • 13
MiniLLM/VanillaKD-Pretrain-Qwen-1.2B • Text Generation • 1B • Updated Mar 25, 2025 • 3
MiniLLM/init-gpt2-120M • Text Generation • 0.1B • Updated Nov 13, 2024 • 880 • 1
MiniLLM/teacher-Llama-13B • Text Generation • Updated Oct 30, 2024 • 1
MiniLLM/MiniLLM-Llama-7B • Text Generation • Updated Oct 30, 2024 • 9 • 3
MiniLLM/MiniPLM-Qwen-200M • Text Generation • 0.2B • Updated Oct 27, 2024 • 476 • 9
MiniLLM/init-Llama-7B • Text Generation • Updated Sep 26, 2024 • 6
MiniLLM/teacher-OPT-13B • Text Generation • Updated Sep 26, 2024 • 7
MiniLLM/SeqKD-Llama-7B • Text Generation • Updated Sep 26, 2024 • 1
MiniLLM/KD-Llama-7B • Text Generation • Updated Sep 26, 2024 • 1
MiniLLM/SFT-Llama-7B • Text Generation • Updated Sep 26, 2024 • 6
MiniLLM/init-OPT-6.7B • Text Generation • Updated Sep 26, 2024 • 9
MiniLLM/init-OPT-2.7B • Text Generation • Updated Sep 26, 2024 • 4
MiniLLM/init-OPT-1.3B • Text Generation • Updated Sep 26, 2024 • 7
MiniLLM/SeqKD-OPT-6.7B • Text Generation • Updated Sep 26, 2024 • 5
MiniLLM/SeqKD-OPT-2.7B • Text Generation • Updated Sep 26, 2024 • 1
MiniLLM/SeqKD-OPT-1.3B • Text Generation • Updated Sep 26, 2024 • 2
MiniLLM/KD-OPT-6.7B • Text Generation • Updated Sep 26, 2024 • 6