Itay Itzhak

itay1itzhak

https://itay1itzhak.github.io/

AI & ML interests

NLP & Deep learning

authored 3 papers 2 days ago

ManagerBench: Evaluating the Safety-Pragmatism Trade-off in Autonomous LLMs

Paper • 2510.00857 • Published Oct 1, 2025 • 1

Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures

Paper • 2510.24081 • Published Oct 28, 2025 • 22

Models In a Spelling Bee: Language Models Implicitly Learn the Character Composition of Tokens

Paper • 2108.11193 • Published Jun 8, 2022

authored a paper 9 months ago

Planted in Pretraining, Swayed by Finetuning: A Case Study on the Origins of Cognitive Biases in LLMs

Paper • 2507.07186 • Published Jul 9, 2025 • 3

authored 2 papers about 1 year ago

DOVE: A Large-Scale Multi-Dimensional Predictions Dataset Towards Meaningful LLM Evaluation

Paper • 2503.01622 • Published Mar 3, 2025

Trust Me, I'm Wrong: High-Certainty Hallucinations in LLMs

Paper • 2502.12964 • Published Feb 18, 2025 • 3

authored a paper over 2 years ago

Instructed to Bias: Instruction-Tuned Language Models Exhibit Emergent Cognitive Bias

Paper • 2308.00225 • Published Aug 1, 2023