view article Article DeepMath: A lightweight math reasoning Agent with smolagents +1 danf, mber, moshew • Dec 4, 2025 • 40
view article Article Introducing HELMET: Holistically Evaluating Long-context Language Models +5 hyen, gaotianyu1350, houminmin, kding1, danf, moshew, cdq10131 • Apr 16, 2025 • 42
view article Article Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques jmamou • Mar 24, 2025 • 20
view article Article Universal Assisted Generation: Faster Decoding with Any Assistant Model +6 danielkorat, orenpereg, mber, jmamou, joaogante, lewtun, Nadav-Timor, moshew • Oct 29, 2024 • 61
view article Article Faster Assisted Generation with Dynamic Speculation +5 jmamou, orenpereg, joaogante, lewtun, danielkorat, Nadav-Timor, moshew • Oct 8, 2024 • 51
view article Article Blazing Fast SetFit Inference with 🤗 Optimum Intel on Xeon +4 danielkorat, tomaarsen, orenpereg, moshew, echarlaix, aprabh2 • Apr 3, 2024 • 11
view article Article A Chatbot on your Laptop: Phi-2 on Intel Meteor Lake +4 juliensimon, echarlaix, ofirzaf, imargulis, guybd, moshew • Mar 20, 2024 • 7
view article Article CPU Optimized Embeddings with 🤗 Optimum Intel and fastRAG +4 peterizsak, mber, danf, echarlaix, mfuntowicz, moshew • Mar 15, 2024 • 14
view article Article Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding +9 ofirzaf, echarlaix, imargulis, danielkorat, jmamou, guybd, orenpereg, moshew, Haihao, aayasin, FanZhao • Jan 30, 2024 • 9
view article Article SetFitABSA: Few-Shot Aspect Based Sentiment Analysis using SetFit +4 ronenlap, tomaarsen, lewtun, danielkorat, orenpereg, moshew • Dec 6, 2023 • 15
view article Article SetFit: Efficient Few-Shot Learning Without Prompts +4 Unso, lewtun, luketheduke, danielkorat, orenpereg, moshew • Sep 26, 2022 • 40
view article Article SetFit: Efficient Few-Shot Learning Without Prompts +4 Unso, lewtun, luketheduke, danielkorat, orenpereg, moshew • Sep 26, 2022 • 40