R's picture

In a Training Loop 🔄

R PRO

juiceb0xc0de

·

JuiceB0xC0de

AI & ML interests

destroying heuristic determination in 4 dimensions to flood the engines with diversity and a lot of swear words

Recent Activity

updated a Space about 5 hours ago

juiceb0xc0de/lr-scheduler-benchmark

updated a model about 14 hours ago

juiceb0xc0de/bella-bartender-gemma-4-e2b-gguf

published a model about 14 hours ago

juiceb0xc0de/bella-bartender-gemma-4-e2b-gguf

View all activity

Organizations

buckets 1

juiceb0xc0de/juicebucket

Posts 7

Post

40

I'm not obsessed with LR schedulers you are.

juiceb0xc0de/lr-scheduler-benchmark

Okay maybe I'm a little obsessed with LR schedulers ATM. I ran a SST-2 Sentiment Classification eval using the nyu-mll/glue dataset on distilbert/distilbert-base-uncased-67M to see how different schedulers perform.

I think I've graduated from ML enthusiast to full blown data hoarder and I don't know if I can turn back now.

Anyways I evaluated the 2 schedulers that i designed as well and was pretty happy with the performance of both over all so hell ya to that guess I'll go and grab some more graphs.

https://github.com/JuiceB0xC0de/aecs-scheduler.git
https://github.com/JuiceB0xC0de/lucky-pick-scheduler.git

nyu-mll/glue
distilbert/distilbert-base-uncased

Articles 4

Article

1

lucky_pick_scheduler

View all Articles

Collections 1

spaces 4

LR Scheduler Benchmark

Standardized benchmark comparing learning rate schedulers

Test Dev Agent

Run and manage code in a Hugging Face sandbox

Test Dev Agent

Run code in a secure Hugging Face sandbox from AI commands

Trackio

Track and visualize data sequences with interactive displays

models 12

juiceb0xc0de/bella-bartender-gemma-4-e2b-gguf

Text Generation • 5B • Updated about 14 hours ago • 82

juiceb0xc0de/bella-bartender-gemma-e2b

Text Generation • 5B • Updated about 14 hours ago • 14

juiceb0xc0de/bella-bartender-8b-llama3.1

Text Generation • 8B • Updated Mar 28 • 298 • • 4

juiceb0xc0de/bella-bartender-3b

Text Generation • 3B • Updated Mar 25 • 197 • 2

juiceb0xc0de/bella-bartender-v2-8b

Text Generation • 8B • Updated Mar 25 • 159 • 3

juiceb0xc0de/bella-bartender-9b-yi

Text Generation • 9B • Updated Mar 24 • 590 • 1

juiceb0xc0de/bella-bartender-heretic-1b

Text Generation • 1B • Updated Mar 21 • 990 • 1

juiceb0xc0de/bella-bartender-1b

Text Generation • 1B • Updated Mar 21 • 677 • 1

juiceb0xc0de/bella-bartender-heretic-3b

Text Generation • 3B • Updated Mar 20 • 693 • 2

juiceb0xc0de/bella-tao-merged-qwen2_5-coder-7b

Text Generation • 8B • Updated Mar 19 • 131

datasets 2

juiceb0xc0de/chaotic-absurdity

Viewer • Updated Mar 12 • 100 • 23

juiceb0xc0de/bella-tao

Viewer • Updated Feb 26 • 4.37k • 53