4 1 62

Paulo Faria

Azoresman

AI & ML interests

None yet

Recent Activity

liked a model 13 days ago

prithivMLmods/Qwen3.6-35B-A3B-Uncensored-Aggressive-GGUF

liked a model 13 days ago

prithivMLmods/Qwen3.6-35B-A3B-Uncensored-Aggressive

liked a model 14 days ago

OpenYourMind/OpenYourMind-Qwen3.6-35B-A3B-abliterated-uncensored-APEX-GGUF

View all activity

Organizations

None yet

liked 2 models 13 days ago

prithivMLmods/Qwen3.6-35B-A3B-Uncensored-Aggressive-GGUF

Image-Text-to-Text • 35B • Updated 18 days ago • 714 • 2

prithivMLmods/Qwen3.6-35B-A3B-Uncensored-Aggressive

Image-Text-to-Text • 35B • Updated 18 days ago • 596 • 7

liked a model 14 days ago

OpenYourMind/OpenYourMind-Qwen3.6-35B-A3B-abliterated-uncensored-APEX-GGUF

Image-Text-to-Text • 35B • Updated 15 days ago • 5.19k • 7

reacted to SeaWolf-AI's post with 🔥 22 days ago

Post

8733

🧬 Introducing Darwin-9B-NEG — the first model with Native Entropy Gating (NEG)

🔗 Try it now: FINAL-Bench/Darwin-9B-NEG
🔗 Q4 bit : FINAL-Bench/Darwin-9B-MFP4

We're thrilled to release Darwin-9B-NEG, a 9B-parameter reasoning model
that embeds an architecturally-internalised sense of self-confidence directly
into the transformer — our proprietary Native Entropy Gating (NEG) technology.

📊 GPQA Diamond (198 PhD-level questions):

▸ Baseline Darwin-9B (no NEG) → 51.01 %
▸ Pure NEG (greedy · 1× cost) → 63.64 % 🔥 +12.63 %p
▸ + Permutation (4× cost) → 76.26 %
▸ + Ensemble Refinement (~20×) → 84.34 % 🏆

With only 9 billion parameters and 1× inference cost, Pure NEG jumps
+12.63 %p over the same model without NEG. Going all-in with ensemble
refinement pushes it to 84.34 % — surpassing the published Qwen3.5-9B
leaderboard score (81.7 %) by +2.64 %p.

🔬 What makes NEG different from Multi-Turn Iteration (MTI)?

Classical MTI needs 3-8× extra inference passes. NEG instead lives
INSIDE the single decoding loop. Two tiny modules ride with the
transformer: NEG-Head predicts per-token entropy from the last hidden
state, and NEG-Gate conditionally restricts the top-k choice when
confidence is low. The gate activates in only 4.36 % of tokens —
essentially free at inference time.

✨ Key differentiators
• Architecturally internalised — model file *is* the feature
• 1× inference cost (vs. 3-8× for MTI)
• Drop-in with vLLM / SGLang / TGI / transformers — no extra engine
• +12.63 %p reasoning at zero latency overhead
• Single-file deployment, Apache 2.0 licensed

🧬 Lineage
Qwen/Qwen3.5-9B → Darwin-9B-Opus (V7 evolutionary merge) → Darwin-9B-NEG (V8 + NEG training)

#Darwin #NEG #NativeEntropyGating #GPQA #Reasoning #LLM #OpenSource #Apache2

liked a model 26 days ago

Ex0bit/Gemma4-26B-A4B-PRISM-PRO-DQ-GGUF

Image-Text-to-Text • 25B • Updated Apr 11 • 10k • 71

reacted to danielhanchen's post with 🔥 about 2 months ago

Post

3420

Introducing Unsloth Studio ✨
A new open-source web UI to train and run LLMs.

• Run models locally on Mac, Windows, Linux
• Train 500+ models 2x faster with 70% less VRAM
• Supports GGUF, vision, audio, embedding models
• Auto-create datasets from PDF, CSV, DOCX
• Self-healing tool calling and code execution
• Compare models side by side + export to GGUF

GitHub: https://github.com/unslothai/unsloth
Blog and Guide: https://unsloth.ai/docs/new/studio

Available now on Hugging Face, NVIDIA, Docker and Colab.

New activity in HauhauCS/Qwen3.5-9B-Uncensored-HauhauCS-Aggressive 2 months ago

thinking mode help

#13 opened 2 months ago by

wicesxs

liked a model 2 months ago

HauhauCS/Qwen3.5-35B-A3B-Uncensored-HauhauCS-Aggressive

Image-Text-to-Text • 35B • Updated Apr 5 • 824k • 1.39k

reacted to prithivMLmods's post with ❤️ 2 months ago

Post

5055

The Qwen3.5 Multimodal Understanding Demo, powered by Qwen3.5-2B, is now available on HF Spaces! It is a lightweight model designed for fast image and video reasoning. Built with Gradio, the demo showcases Image QA, Video QA, object detection, and 2D point tracking, along with real-time token streaming.

🤗 Demo: prithivMLmods/Qwen-3.5-HF-Demo
✅ Collection: https://huggingface.co/collections/prithivMLmods/multimodal-implementations
🔗 Qwen3.5-2B: Qwen/Qwen3.5-2B

To learn more, visit the app page or the respective model pages.