26 2

Katherine Tieu

kthrn22

https://kthrn22.github.io

AI & ML interests

LLMs, Agents, RL, Multimodal Learning, GNNs

Recent Activity

upvoted a paper 10 days ago

Code as Agent Harness

upvoted a paper 17 days ago

RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards

upvoted a paper 29 days ago

Heterogeneous Scientific Foundation Model Collaboration

View all activity

Organizations

upvoted a paper 10 days ago

Code as Agent Harness

Paper • 2605.18747 • Published 12 days ago • 210

upvoted a paper 17 days ago

RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards

Paper • 2605.10899 • Published 19 days ago • 76

upvoted a paper 29 days ago

Heterogeneous Scientific Foundation Model Collaboration

Paper • 2604.27351 • Published about 1 month ago • 217

upvoted 2 papers about 1 month ago

Recursive Multi-Agent Systems

Paper • 2604.25917 • Published Apr 28 • 273

Near-Future Policy Optimization

Paper • 2604.20733 • Published Apr 22 • 77

liked a model about 1 month ago

deepseek-ai/DeepSeek-V4-Pro

Text Generation • 862B • Updated 24 days ago • 5.84M • • 4.44k

upvoted a collection 2 months ago

SDAR

Collection

The models without suffixes use the default block size = 4. • 21 items • Updated Jan 2 • 9

upvoted 4 papers 3 months ago

Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone

Paper • 2512.22615 • Published Dec 27, 2025 • 51

upvoted 2 papers 4 months ago

Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published Jan 18 • 204

Your Group-Relative Advantage Is Biased

Paper • 2601.08521 • Published Jan 13 • 158

upvoted 3 papers 5 months ago

LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding

Paper • 2512.16229 • Published Dec 18, 2025 • 16

INTELLECT-3: Technical Report

Paper • 2512.16144 • Published Dec 18, 2025 • 20

Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies

Paper • 2512.19673 • Published Dec 22, 2025 • 66

upvoted 4 papers 6 months ago

SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models

Paper • 2510.09541 • Published Oct 10, 2025 • 17

Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning

Paper • 2510.25992 • Published Oct 29, 2025 • 48

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18, 2025 • 141

Latent Collaboration in Multi-Agent Systems

Paper • 2511.20639 • Published Nov 25, 2025 • 128

Katherine Tieu

AI & ML interests

Recent Activity

Organizations

kthrn22's activity