JackMa
JacckMa
ยท
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
1 day ago
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger
upvoted
a
paper
about 1 month ago
Your Group-Relative Advantage Is Biased
upvoted
an
article
12 months ago
Illustrating Reinforcement Learning from Human Feedback (RLHF)
Organizations
None yet