lordx64/Qwen3.6-35B-A3B-Claude-4.7-Opus-Reasoning-Distilled • Text Generation • 36B • Updated 7 days ago • 91.1k • 89
huihui-ai/Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated • Text Generation • 36B • Updated 8 days ago • 4.17k • 53
TeichAI/GLM-4.7-Flash-Claude-Opus-4.5-High-Reasoning-Distill • Text Generation • Updated Feb 9 • 458 • 53
mconcat/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-NVFP4 • Text Generation • 22B • Updated Mar 26 • 10.5k • 15
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled • Image-Text-to-Text • 28B • Updated 24 days ago • 470k • 2.81k
INTELLECT-2: A Reasoning Model Trained Through Globally Decentralized Reinforcement Learning • Paper • 2505.07291 • Published May 12, 2025 • 15
MemoryRewardBench: Benchmarking Reward Models for Long-Term Memory Management in Large Language Models • Paper • 2601.11969 • Published Jan 17 • 27
Toward Efficient Agents: Memory, Tool learning, and Planning • Paper • 2601.14192 • Published Jan 20 • 57