Shuyu Wu

wonderwind271

AI & ML interests

LLM (pre)training dynamics; Mechanistic Interpretability

Recent Activity

updated a model 21 minutes ago
wonderwind271/gpt2_scrub1
published a model 21 minutes ago
wonderwind271/gpt2_scrub1
liked a model 1 day ago
meta-llama/Llama-3.2-1B

Organizations

University of Michigan
Forty-Two AI Lab
The Computation, Language, Intelligence, and Grounding Laboratory at the University of Waterloo
HappyEval