Jonas Geiping

JonasGeiping

https://jonasgeiping.github.io/

AI & ML interests

Machine Learning Safety, Security and Privacy; Optimization in Deep Learning; Mathematical Optimization; Federated Learning

Recent Activity

authored a paper about 18 hours ago

Answer Matching Outperforms Multiple Choice for Language Model Evaluation

authored a paper about 18 hours ago

MedSAMix: A Training-Free Model Merging Approach for Medical Image Segmentation

authored a paper about 18 hours ago

The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs

authored 8 papers about 18 hours ago

Efficient Parallel Samplers for Recurrent-Depth Models and Their Connection to Diffusion Language Models

Paper • 2510.14961 • Published Oct 16, 2025 • 8

Rewiring Experts on the Fly: Continuous Rerouting for Better Online Adaptation in Mixture-of-Expert models

Paper • 2510.14853 • Published Oct 16, 2025 • 5

Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence

Paper • 2511.07384 • Published Nov 10, 2025 • 19

authored 6 papers about 19 hours ago

Training AI Co-Scientists Using Rubric Rewards

Paper • 2512.23707 • Published Dec 29, 2025 • 21

Scaling Open-Ended Reasoning to Predict the Future

Paper • 2512.25070 • Published Dec 31, 2025 • 20

NESSiE: The Necessary Safety Benchmark -- Identifying Errors that should not Exist

Paper • 2602.16756 • Published Feb 18 • 4

Multi-Stream LLMs: Unblocking Language Models with Parallel Streams of Thoughts, Inputs and Outputs

Paper • 2605.12460 • Published 7 days ago • 17

Claudini: Autoresearch Discovers State-of-the-Art Adversarial Attack Algorithms for LLMs

Paper • 2603.24511 • Published Mar 25

FutureSim: Replaying World Events to Evaluate Adaptive Agents

Paper • 2605.15188 • Published 5 days ago • 6

submitted a paper to Daily Papers 6 days ago

Multi-Stream LLMs: Unblocking Language Models with Parallel Streams of Thoughts, Inputs and Outputs

Paper • 2605.12460 • Published 7 days ago • 17

submitted a paper to Daily Papers 3 months ago

NESSiE: The Necessary Safety Benchmark -- Identifying Errors that should not Exist

Paper • 2602.16756 • Published Feb 18 • 4

authored a paper 8 months ago

Strategic Dishonesty Can Undermine AI Safety Evaluations of Frontier LLMs

Paper • 2509.18058 • Published Sep 22, 2025 • 12

authored 3 papers 11 months ago

Baseline Defenses for Adversarial Attacks Against Aligned Language Models

Paper • 2309.00614 • Published Sep 1, 2023 • 2

What do we learn from inverting CLIP models?

Paper • 2403.02580 • Published Mar 5, 2024 • 4

Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion

Paper • 2403.16365 • Published Mar 25, 2024 • 1
