Building on HF

Maziyar Panahi PRO

MaziyarPanahi

OpenMed

https://maziyarpanahi.com/

AI & ML interests

AI in Health & Biology, Post-Training, RLHF, RL, model merging, quantization, synthetic datasets

Recent Activity

upvoted an article about 22 hours ago

Liberate your OpenClaw

liked a model 3 days ago

google/gemma-4-26B-A4B-it

liked a model 3 days ago

arcee-ai/Trinity-Large-Thinking-GGUF

Posts 8

Post

399

Training mRNA Language Models Across 25 Species for $165

We built an end-to-end protein AI pipeline covering structure prediction, sequence design, and codon optimization. After comparing multiple transformer architectures for codon-level language modeling, CodonRoBERTa-large-v2 emerged as the clear winner with a perplexity of 4.10 and a Spearman CAI correlation of 0.40, significantly outperforming ModernBERT. We then scaled to 25 species, trained 4 production models in 55 GPU-hours, and built a species-conditioned system that no other open-source project offers. Complete results, architectural decisions, and runnable code are in the article linked below.

https://huggingface.co/blog/OpenMed/training-mrna-models-25-species

Articles 6

Article

14

Training mRNA Language Models Across 25 Species for $165

Collections 20

Papers 3

arxiv:2602.17004

arxiv:2508.01630

arxiv:2412.01152

Spaces 7

Microsoft Phi-3 Vision 128k

Microsoft Phi-3 Vision 128k with Multimodal capabilities

Qwen2-VL-2B

Generate text from images or videos

FACTS Grounding Leaderboard

This is FACTS Grounding Leaderboard, but for Open LLMs!

Llava Llama-3 8B

Meta Llama3 8b with Llava Multimodal capabilities

Phi 3.5 Vision

Generate answers to questions about any image

Chat With Phi 2

Generate chat responses based on user input

Models 2,816

MaziyarPanahi/LocoOperator-4B-GGUF

Text Generation • 4B • Updated Mar 3 • 94 • 1

MaziyarPanahi/LFM2-24B-A2B-GGUF

Text Generation • 24B • Updated Feb 24 • 184 • 2

MaziyarPanahi/Qwen3-Coder-Next-GGUF

Text Generation • 80B • Updated Feb 10 • 62.8k • 1

MaziyarPanahi/EXAONE-4.0-1.2B-GGUF

Text Generation • 1B • Updated Feb 10 • 85

MaziyarPanahi/VulnLLM-R-7B-GGUF

Text Generation • 8B • Updated Feb 10 • 29

MaziyarPanahi/Schematron-3B-GGUF

Text Generation • 3B • Updated Feb 10 • 24

MaziyarPanahi/finetuned-mistral-7b-Mistral-7B-Instruct-v0.2-slerp

Text Generation • 7B • Updated Jan 7 • 8 • 2

MaziyarPanahi/Trinity-Mini-GGUF

Text Generation • 26B • Updated Dec 16, 2025 • 61.3k • 1

MaziyarPanahi/GLM-4.6V-Flash-GGUF

Text Generation • 9B • Updated Dec 8, 2025 • 61.8k • 4

MaziyarPanahi/Hermes-4.3-36B-GGUF

Text Generation • 36B • Updated Dec 6, 2025 • 162 • 3

Datasets 51

MaziyarPanahi/Nemotron-Cascade-2-SFT-Data-Small

Viewer • Updated 14 days ago • 4.9M • 112 • 3

MaziyarPanahi/MedExQA-clean

Viewer • Updated Feb 23 • 965 • 34 • 1

MaziyarPanahi/smoltalk2-think

Viewer • Updated Aug 29, 2025 • 1.48M • 251 • 4

MaziyarPanahi/smoltalk2-sft-no-think

Viewer • Updated Jul 11, 2025 • 1.9M • 37 • 6

MaziyarPanahi/AM-DeepSeek-R1-0528-Distilled-with-System

Viewer • Updated Jun 11, 2025 • 2.59M • 29 • 4

MaziyarPanahi/Llama-Nemotron-Post-Training-Dataset-v1-ShareGPT

Viewer • Updated Jun 2, 2025 • 30.2M • 130 • 41

MaziyarPanahi/OpenMathReasoning_ShareGPT

Viewer • Updated Apr 24, 2025 • 2.4M • 149 • 4

MaziyarPanahi/OpenCodeReasoning_ShareGPT

Viewer • Updated Apr 7, 2025 • 735k • 1.62k • 9

MaziyarPanahi/Llama-Nemotron-Post-Training-Dataset-v1-Smoler-ShareGPT

Viewer • Updated Mar 22, 2025 • 2.25M • 36 • 3

MaziyarPanahi/Llama-Nemotron-Post-Training-Dataset-v1-Smol-ShareGPT

Viewer • Updated Mar 19, 2025 • 6.67M • 9 • 3
