Felix Tuma's picture

117 59

Felix Tuma

floom

·

AI & ML interests

NLP

Recent Activity

upvoted a paper 5 days ago

Adaptation of Agentic AI

upvoted a paper 6 days ago

GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators

upvoted a paper 6 days ago

Can LLMs Estimate Student Struggles? Human-AI Difficulty Alignment with Proficiency Simulation for Item Difficulty Prediction

View all activity

Organizations

None yet

upvoted a paper 5 days ago

Adaptation of Agentic AI

Paper • 2512.16301 • Published 11 days ago • 95

upvoted 2 papers 6 days ago

GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators

Paper • 2512.19682 • Published 6 days ago • 15

Can LLMs Estimate Student Struggles? Human-AI Difficulty Alignment with Proficiency Simulation for Item Difficulty Prediction

Paper • 2512.18880 • Published 7 days ago • 23

upvoted 4 papers about 2 months ago

NeuroAda: Activating Each Neuron's Potential for Parameter-Efficient Fine-Tuning

Paper • 2510.18940 • Published Oct 21 • 8

Redefining Retrieval Evaluation in the Era of LLMs

Paper • 2510.21440 • Published Oct 24 • 8

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29 • 221

Defeating the Training-Inference Mismatch via FP16

Paper • 2510.26788 • Published Oct 30 • 29

upvoted 2 papers 2 months ago

DeepAgent: A General Reasoning Agent with Scalable Toolsets

Paper • 2510.21618 • Published Oct 24 • 99

Multi-Agent Evolve: LLM Self-Improve through Co-evolution

Paper • 2510.23595 • Published Oct 27 • 11

upvoted 2 papers 3 months ago

From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones

Paper • 2509.25123 • Published Sep 29 • 20

TrustJudge: Inconsistencies of LLM-as-a-Judge and How to Alleviate Them

Paper • 2509.21117 • Published Sep 25 • 29

upvoted 6 papers 4 months ago

MachineLearningLM: Continued Pretraining Language Models on Millions of Synthetic Tabular Prediction Tasks Scales In-Context ML

Paper • 2509.06806 • Published Sep 8 • 63

Language Self-Play For Data-Free Training

Paper • 2509.07414 • Published Sep 9 • 29

StepWiser: Stepwise Generative Judges for Wiser Reasoning

Paper • 2508.19229 • Published Aug 26 • 20

Hermes 4 Technical Report

Paper • 2508.18255 • Published Aug 25 • 43

Prompt Orchestration Markup Language

Paper • 2508.13948 • Published Aug 19 • 48

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6 • 129

upvoted 3 papers 5 months ago

AutoCodeBench: Large Language Models are Automatic Code Benchmark Generators

Paper • 2508.09101 • Published Aug 12 • 8

Can LLM-Generated Textual Explanations Enhance Model Classification Performance? An Empirical Study

Paper • 2508.09776 • Published Aug 13 • 3

Noise Hypernetworks: Amortizing Test-Time Compute in Diffusion Models

Paper • 2508.09968 • Published Aug 13 • 15