nightmedia/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-qx64-hi-mlx Text Generation • 27B • Updated 4 days ago • 4.28k • 6
stellarator/mxbai-embed-large-v1-Q5_K_M-GGUF Feature Extraction • 0.3B • Updated Oct 29, 2024 • 16
Running on A10G 1.88k GGUF My Repo 🦙 1.88k Quantize a Hugging Face model to GGUF and create a repo
Block Transformer: Global-to-Local Language Modeling for Fast Inference Paper • 2406.02657 • Published Jun 4, 2024 • 41
Orca-Math: Unlocking the potential of SLMs in Grade School Math Paper • 2402.14830 • Published Feb 16, 2024 • 24