AI & ML interests

Reinforcement Learning, Large Language Models, Value Alignment

Recent Activity

alignmentforever updated a dataset 1 day ago
PKU-Alignment/InterMT
muchvo published a dataset 25 days ago
PKU-Alignment/VLA-Arena-Scenes
XuehaiPan authored a paper about 1 month ago
AI Alignment: A Comprehensive Survey

PKU-Alignment's collections (6)

PKU-SafeRLHF
A safety alignment preference dataset for Llama-family models