Open to Work

14 10

JP2

MJPT2

AI & ML interests

NLP Generative Multimodal Models

Recent Activity

upvoted an article 13 days ago

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

liked a dataset 24 days ago

OX-PIXL/STVQA-7K

liked a model 24 days ago

nyu-visionx/cambrian-8b

View all activity

Organizations

None yet

upvoted an article 13 days ago

Article

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

tomaarsen

•

26 days ago

• 70

liked a dataset 24 days ago

OX-PIXL/STVQA-7K

Viewer • Updated Nov 12, 2025 • 7.59k • 136 • 2

liked a model 24 days ago

nyu-visionx/cambrian-8b

Text Generation • Updated Jun 28, 2024 • 1.13k • 64

liked a dataset 25 days ago

ccvl/3DSRBench

Viewer • Updated Feb 3, 2025 • 5.16k • 1.63k • 9

updated a model about 1 month ago

MJPT2/SmolGRPO-135M

Text Generation • 0.1B • Updated Apr 2 • 76

published a model about 1 month ago

MJPT2/SmolGRPO-135M

Text Generation • 0.1B • Updated Apr 2 • 76

upvoted an article about 2 months ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego

•

Mar 10

• 147

upvoted an article 2 months ago

Article

Mixture of Experts (MoEs) in Transformers

ariG23498, pcuenq, merve, IlyasMoutawwakil, ArthurZ, sergiopaniego, Molbap

•

Feb 26

• 159

liked a Space 2 months ago

The Smol Training Playbook

📚

3.16k

The secrets to building world-class LLMs

upvoted a collection 3 months ago

L1

Collection

L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning • 7 items • Updated Jul 13, 2025 • 9

upvoted 2 articles 3 months ago

Article

What is test-time compute and how to scale it?

Kseniase

•

Feb 6, 2025

• 120

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

not-lain

•

Jan 30, 2025

• 325

liked a Space 3 months ago

Scaling test-time compute

📈

596

Run advanced search strategies to boost LLM problem solving

liked a Space 4 months ago

Model Family Tree

🌳

Generate a family tree of a given model

upvoted a collection 4 months ago

Scaling Test-Time Compute with Open Models

Collection

Models and datasets used in our blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute • 10 items • Updated Jan 6, 2025 • 30

published a model 4 months ago

MJPT2/Qwen2.5-VL-3B-Instruct-Thinking

Updated Jan 6

upvoted a paper 4 months ago

Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking

Paper • 2502.02339 • Published Feb 4, 2025 • 23

liked a dataset 4 months ago

vbdai/Ego3D-Bench

Viewer • Updated Jan 26 • 8.68k • 614 • 11

upvoted 2 articles 4 months ago

Article

From GRPO to DAPO and GSPO: What, Why, and How

NormalUhr

•

Aug 9, 2025

• 118

Article

Preference Optimization for Vision Language Models

qgallouedec, vwxyzjn, merve, kashif

•

Jul 10, 2024

• 93

JP2

AI & ML interests

Recent Activity

Organizations

MJPT2's activity

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

Mixture of Experts (MoEs) in Transformers

The Smol Training Playbook

What is test-time compute and how to scale it?

KV Caching Explained: Optimizing Transformer Inference Efficiency

Scaling test-time compute

Model Family Tree

From GRPO to DAPO and GSPO: What, Why, and How

Preference Optimization for Vision Language Models