AI & ML interests
Natural Language Processing
Papers
CCTU: A Benchmark for Tool Use under Complex Constraints
Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning
FudanNLP 's datasets
None public yet