Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Benjamin's picture
Building on HF
10 24 4

Benjamin

Benjamin-eecs
hngl's profile picture Kumagrimbel's profile picture shuyuej's profile picture
·

AI & ML interests

None yet

Recent Activity

authored a paper 29 days ago
Reasoning over mathematical objects: on-policy reward modeling and test time aggregation
upvoted a paper about 1 month ago
Reasoning over mathematical objects: on-policy reward modeling and test time aggregation
upvoted a paper 3 months ago
Paying Less Generalization Tax: A Cross-Domain Generalization Study of RL Training for LLM Agents
View all activity

Organizations

CAMEL-AI.org's profile picture National University of Singapore's profile picture HuangLab Test's profile picture Meta Research's profile picture Computer Intelligence Project's profile picture AcornAI's profile picture Spiral RL's profile picture

Benjamin-eecs 's models 2

Benjamin-eecs/Llama-3.1-8B-Instruct-NLRL-TicTacToe-Policy

Feature Extraction • 8B • Updated Nov 24, 2024 • 5

Benjamin-eecs/Llama-3.1-8B-Instruct-NLRL-TicTacToe-Value

Feature Extraction • 8B • Updated Nov 24, 2024 • 1
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs