
📈 Performance Dashboard

Overview

To make the performance of the SpecBundle draft models easier to explore, we have built an interactive dashboard for the evaluation results. We evaluate the draft models under different speculative decoding configurations (i.e., steps, topk, num_draft_tokens) on the following benchmarks:

  • Conversation
    • MTBench
  • General Knowledge
    • GPQA
    • FinanceQA
  • Math
    • GSM8K
    • Math500
  • Coding
    • HumanEval
    • LiveCodeBench
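
As a rough illustration of how the evaluation space is laid out, the sketch below pairs each benchmark listed above with a speculative decoding configuration. The configuration values and the `evaluation_runs` helper are hypothetical placeholders chosen for the example; they are not the settings or tooling actually used to produce the dashboard.

```python
from itertools import product

# Benchmarks from the list above, grouped by category.
BENCHMARKS = {
    "Conversation": ["MTBench"],
    "General Knowledge": ["GPQA", "FinanceQA"],
    "Math": ["GSM8K", "Math500"],
    "Coding": ["HumanEval", "LiveCodeBench"],
}

# Hypothetical (steps, topk, num_draft_tokens) settings.
# Placeholder values for illustration only, not the evaluated configurations.
CONFIGS = [
    {"steps": 3, "topk": 1, "num_draft_tokens": 4},
    {"steps": 5, "topk": 4, "num_draft_tokens": 8},
]


def evaluation_runs(benchmarks, configs):
    """Return one run description per (benchmark, configuration) pair."""
    names = [name for group in benchmarks.values() for name in group]
    return [{"benchmark": b, **c} for b, c in product(names, configs)]


runs = evaluation_runs(BENCHMARKS, CONFIGS)
for run in runs:
    print(run)
```

Each resulting entry corresponds to one benchmark evaluated under one configuration, which is the kind of pairing the dashboard lets you browse.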

Check out the Performance Dashboard for more details.