Falcon-H1-Tiny-90M-Instruct-reasoning-aggressive

REASONING-optimized | Aggressive pruning | 35% weights pruned

This model is a aggressively pruned version of tiiuae/Falcon-H1-Tiny-90M-Instruct.

Note: Minimal quality drop detected. The Wanda pruning algorithm effectively identifies and removes less important weights while preserving model capability.

Performance Comparison

Category Original Pruned Change
Python 0.0% 0.0% β†’
Html 0.0% 0.0% β†’
Trivia 15.0% 15.0% β†’
Math 10.0% 10.0% β†’
Reasoning 15.0% 10.0% ⭐ ↓ 5.0%
Medical 5.0% 5.0% β†’
Linux 30.0% 30.0% β†’
Writing 0.0% 0.0% β†’

Average: 9.4% -> 8.8% (-0.6%)

Reasoning Retention: 66.7%

Comparison Graph

Quick Start

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("CompactAI/Falcon-H1-Tiny-90M-Instruct-reasoning-aggressive")
tokenizer = AutoTokenizer.from_pretrained("CompactAI/Falcon-H1-Tiny-90M-Instruct-reasoning-aggressive")

inputs = tokenizer("Your prompt here", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=100)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))

Technical Details

Property Value
Base Model tiiuae/Falcon-H1-Tiny-90M-Instruct
Specialization Reasoning
Prune Mode Aggressive
Weight Reduction 35% weights pruned

License

This model inherits the license from the base model.

Downloads last month
7
Safetensors
Model size
91.1M params
Tensor type
F16
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for CompactAI/Falcon-H1-Tiny-90M-Instruct-reasoning-aggressive

Finetuned
(17)
this model

Collection including CompactAI/Falcon-H1-Tiny-90M-Instruct-reasoning-aggressive