DHPLT: large-scale multilingual diachronic corpora and word representations for semantic change modelling Paper • 2602.11968 • Published 5 days ago • 1
Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts Paper • 2602.13367 • Published 4 days ago • 6
Inference-Time Hyper-Scaling with KV Cache Compression Paper • 2506.05345 • Published Jun 5, 2025 • 30
LLaDA2.1: Speeding Up Text Diffusion via Token Editing Paper • 2602.08676 • Published 8 days ago • 65
Echoes as Anchors: Probabilistic Costs and Attention Refocusing in LLM Reasoning Paper • 2602.06600 • Published 11 days ago • 2
F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare Paper • 2602.06717 • Published 11 days ago • 70
Aster: Autonomous Scientific Discovery over 20x Faster Than Existing Methods Paper • 2602.07040 • Published 14 days ago • 2
Judging What We Cannot Solve: A Consequence-Based Approach for Oracle-Free Evaluation of Research-Level Math Paper • 2602.06291 • Published 12 days ago • 23
Thinking Makes LLM Agents Introverted: How Mandatory Thinking Can Backfire in User-Engaged Agents Paper • 2602.07796 • Published 9 days ago • 7
compar:IA: The French Government's LLM arena to collect French-language human prompts and preference data Paper • 2602.06669 • Published 11 days ago • 7
Group-Evolving Agents: Open-Ended Self-Improvement via Experience Sharing Paper • 2602.04837 • Published 13 days ago • 8
QuantLRM: Quantization of Large Reasoning Models via Fine-Tuning Signals Paper • 2602.02581 • Published 17 days ago • 9
ReMiT: RL-Guided Mid-Training for Iterative LLM Evolution Paper • 2602.03075 • Published 14 days ago • 6
RaBiT: Residual-Aware Binarization Training for Accurate and Efficient LLMs Paper • 2602.05367 • Published 12 days ago • 7
Canzona: A Unified, Asynchronous, and Load-Balanced Framework for Distributed Matrix-based Optimizers Paper • 2602.06079 • Published 13 days ago • 18
Beyond Fixed Frames: Dynamic Character-Aligned Speech Tokenization Paper • 2601.23174 • Published 18 days ago • 3
Learning Rate Matters: Vanilla LoRA May Suffice for LLM Fine-tuning Paper • 2602.04998 • Published 13 days ago • 6
Late-to-Early Training: LET LLMs Learn Earlier, So Faster and Better Paper • 2602.05393 • Published 12 days ago • 7
DASH: Faster Shampoo via Batched Block Preconditioning and Efficient Inverse-Root Solvers Paper • 2602.02016 • Published 15 days ago • 11
Privileged Information Distillation for Language Models Paper • 2602.04942 • Published 13 days ago • 25