Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

PKU-Alignment

university

https://github.com/PKU-Alignment

AI & ML interests

Reinforcement Learning, Large Language Models, Value Alignment

PKU-Alignment 's collections 6

Alignment with Multi-turn Multimodal Understanding and Generation

PKU-Alignment/InterMT

Preview • Updated Jan 8 • 117
PKU-Alignment/InterMT-Bench-Images

Viewer • Updated May 23, 2025 • 1.51k • 11
PKU-Alignment/InterMT-Judge

Updated May 23, 2025 • 7

Towards Safety Alignment of Text2Video Generation

PKU-Alignment/SafeSora

Viewer • Updated Jun 20, 2024 • 51.7k • 371 • 7
PKU-Alignment/SafeSora-Eval

Viewer • Updated Jun 20, 2024 • 600 • 116 • 2
PKU-Alignment/SafeSora-Label

Viewer • Updated Jun 20, 2024 • 57.3k • 199 • 2
PKU-Alignment/SafeSora-Prompt

Viewer • Updated Aug 12, 2024 • 36.6k • 11

Alignment with a millennium of moral progress

ProgressGym: Alignment with a Millennium of Moral Progress

Paper • 2406.20087 • Published Jun 28, 2024 • 4
Runtime error

Agents

4

ProgressGym LeaderBoard

🥇

4
PKU-Alignment/ProgressGym-HistText

Preview • Updated Aug 10, 2024 • 1.42k • 1
PKU-Alignment/ProgressGym-TimelessQA

Preview • Updated Aug 10, 2024 • 152 • 1

Language Model Resist Alignment

This repository hosts open-sourced models of "Language Model Resist Alignment" (ACL 2025 Main).

PKU-Alignment/Qwen1.5-0.5B-IMDB-Q1-10000

Updated May 31, 2025 • 1
PKU-Alignment/Qwen1.5-0.5B-IMDB-Q1-10000-Q2-100

Updated May 31, 2025 • 3
PKU-Alignment/Qwen1.5-0.5B-IMDB-Q1-10000-Q2-1000

Updated May 31, 2025 • 3
PKU-Alignment/Qwen1.5-0.5B-IMDB-Q1-10000-Q2-200

Updated May 31, 2025 • 2

A safety alignment preference dataset for llama family models

PKU-Alignment/PKU-SafeRLHF

Viewer • Updated Oct 18, 2024 • 164k • 14.5k • 183
PKU-Alignment/PKU-SafeRLHF-single-dimension

Viewer • Updated Jun 14, 2024 • 81.1k • 296 • 3
PKU-Alignment/PKU-SafeRLHF-QA

Viewer • Updated Jun 14, 2024 • 265k • 534 • 8
PKU-Alignment/PKU-SafeRLHF-prompt

Viewer • Updated Jun 14, 2024 • 44.6k • 314 • 5

PKU-Alignment/align-anything

Viewer • Updated Apr 5, 2025 • 69.4k • 5.04k • 48
PKU-Alignment/Align-Anything-Instruction-100K-zh

Viewer • Updated Oct 10, 2024 • 105k • 165 • 10
PKU-Alignment/Align-Anything-Instruction-100K

Viewer • Updated Oct 10, 2024 • 105k • 272 • 9
PKU-Alignment/Align-Anything-TI2T-Instruction-100K

Viewer • Updated Nov 20, 2024 • 103k • 356 • 1

Alignment with Multi-turn Multimodal Understanding and Generation

PKU-Alignment/InterMT

Preview • Updated Jan 8 • 117
PKU-Alignment/InterMT-Bench-Images

Viewer • Updated May 23, 2025 • 1.51k • 11
PKU-Alignment/InterMT-Judge

Updated May 23, 2025 • 7

Language Model Resist Alignment

This repository hosts open-sourced models of "Language Model Resist Alignment" (ACL 2025 Main).

PKU-Alignment/Qwen1.5-0.5B-IMDB-Q1-10000

Updated May 31, 2025 • 1
PKU-Alignment/Qwen1.5-0.5B-IMDB-Q1-10000-Q2-100

Updated May 31, 2025 • 3
PKU-Alignment/Qwen1.5-0.5B-IMDB-Q1-10000-Q2-1000

Updated May 31, 2025 • 3
PKU-Alignment/Qwen1.5-0.5B-IMDB-Q1-10000-Q2-200

Updated May 31, 2025 • 2

Towards Safety Alignment of Text2Video Generation

PKU-Alignment/SafeSora

Viewer • Updated Jun 20, 2024 • 51.7k • 371 • 7
PKU-Alignment/SafeSora-Eval

Viewer • Updated Jun 20, 2024 • 600 • 116 • 2
PKU-Alignment/SafeSora-Label

Viewer • Updated Jun 20, 2024 • 57.3k • 199 • 2
PKU-Alignment/SafeSora-Prompt

Viewer • Updated Aug 12, 2024 • 36.6k • 11

A safety alignment preference dataset for llama family models

PKU-Alignment/PKU-SafeRLHF

Viewer • Updated Oct 18, 2024 • 164k • 14.5k • 183
PKU-Alignment/PKU-SafeRLHF-single-dimension

Viewer • Updated Jun 14, 2024 • 81.1k • 296 • 3
PKU-Alignment/PKU-SafeRLHF-QA

Viewer • Updated Jun 14, 2024 • 265k • 534 • 8
PKU-Alignment/PKU-SafeRLHF-prompt

Viewer • Updated Jun 14, 2024 • 44.6k • 314 • 5

Alignment with a millennium of moral progress

ProgressGym: Alignment with a Millennium of Moral Progress

Paper • 2406.20087 • Published Jun 28, 2024 • 4
Runtime error

Agents

4

ProgressGym LeaderBoard

🥇

4
PKU-Alignment/ProgressGym-HistText

Preview • Updated Aug 10, 2024 • 1.42k • 1
PKU-Alignment/ProgressGym-TimelessQA

Preview • Updated Aug 10, 2024 • 152 • 1

PKU-Alignment/align-anything

Viewer • Updated Apr 5, 2025 • 69.4k • 5.04k • 48
PKU-Alignment/Align-Anything-Instruction-100K-zh

Viewer • Updated Oct 10, 2024 • 105k • 165 • 10
PKU-Alignment/Align-Anything-Instruction-100K

Viewer • Updated Oct 10, 2024 • 105k • 272 • 9
PKU-Alignment/Align-Anything-TI2T-Instruction-100K

Viewer • Updated Nov 20, 2024 • 103k • 356 • 1

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs