10 4

黄炜锴

tsrigo

tsrigo

AI & ML interests

Trustworthy AI

Recent Activity

upvoted a paper 3 months ago

CiteAudit: You Cited It, But Did You Read It? A Benchmark for Verifying Scientific References in the LLM Era

liked a model 5 months ago

facebook/sam-3d-body-dinov3

upvoted a paper 5 months ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

View all activity

Organizations

upvoted a paper 3 months ago

CiteAudit: You Cited It, But Did You Read It? A Benchmark for Verifying Scientific References in the LLM Era

Paper • 2602.23452 • Published Feb 26 • 17

liked a model 5 months ago

facebook/sam-3d-body-dinov3

Updated Dec 10, 2025 • 4.66k • 209

upvoted a paper 5 months ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 268

upvoted 2 papers 6 months ago

Agent0-VL: Exploring Self-Evolving Agent for Tool-Integrated Vision-Language Reasoning

Paper • 2511.19900 • Published Nov 25, 2025 • 49

Multi-Agent Deep Research: Training Multi-Agent Systems with M-GRPO

Paper • 2511.13288 • Published Nov 17, 2025 • 19

liked a dataset 6 months ago

meta-agents-research-environments/gaia2

Viewer • Updated Sep 25, 2025 • 963 • 55.9k • 41

liked a Space 6 months ago

Gaia2 Agents Evaluation Leaderboard

🐠

Explore AI model performance on the Gaia2 benchmark

upvoted an article 6 months ago

Article

Gaia2 and ARE: Empowering the community to study agents

clefourrier, gregmialz, mlcu, mortimerp9, XciD, tfrere, evijit, RomainFroger, dheeraj7596, CarolinePascal, upiter

•

Sep 22, 2025

• 134

upvoted a collection 8 months ago

DeepSeek-V3.2

Collection

4 items • Updated Dec 1, 2025 • 544

upvoted a paper about 1 year ago

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29, 2025 • 99

liked a model over 1 year ago

tsrigo/coconut

Updated Feb 15, 2025 • 1

updated a model over 1 year ago

tsrigo/coconut

Updated Feb 15, 2025 • 1

published a model over 1 year ago

tsrigo/coconut

Updated Feb 15, 2025 • 1

updated a model over 1 year ago

tsrigo/unsloth_model

Text Generation • 3B • Updated Feb 13, 2025 • 1

published a model over 1 year ago

tsrigo/unsloth_model

Text Generation • 3B • Updated Feb 13, 2025 • 1

updated a model over 1 year ago

tsrigo/Qwen2.5-1.5B-Instruct-DPO-bad-boy

2B • Updated Jan 20, 2025 • 3

published a model over 1 year ago

tsrigo/Qwen2.5-1.5B-Instruct-DPO-bad-boy

2B • Updated Jan 20, 2025 • 3

updated a dataset over 1 year ago

tsrigo/btfChinese-DPO-small

Viewer • Updated Jan 20, 2025 • 5k • 119

published a dataset over 1 year ago

tsrigo/btfChinese-DPO-small

Viewer • Updated Jan 20, 2025 • 5k • 119

updated a model over 1 year ago

tsrigo/Qwen2.5-0.5B-Instruct-DPO-bad-boy

Updated Jan 20, 2025

黄炜锴

AI & ML interests

Recent Activity

Organizations

tsrigo's activity

Gaia2 Agents Evaluation Leaderboard

Gaia2 and ARE: Empowering the community to study agents