t.d.a.g.'s picture

t.d.a.g. PRO

sequelbox

·

sequelbox.bsky.social

AI & ML interests

open source, infinite games. (they/them)

Recent Activity

replied to their post 1 day ago

IMPORTANT corrective note/apology: the initial upload of ValiantLabs/Qwen3.5-27B-Esper3.1 contained improperly merged weights, meaning it was effectively just Qwen 3.5. We've merged properly now and re-uploaded the correct weights to the existing repository. The model link is here: https://huggingface.co/ValiantLabs/Qwen3.5-27B-Esper3.1 This is not at all the fault of Qwen 3.5, transformers, or anything other than our own flawed upload pipeline and insufficient post-upload validation. We have to do better. We'll immediately improve our validation procedures to be more rigorous. This will never happen again. We are proud to build quickly: by providing a quick finetune of a new high-performance model like Qwen 3.5, we seek to provide value not only in the model's direct use but especially to our fellow open-source creators, who can learn from our initial training attempt. We are proud that building fast helps you build fast. In this case, our poor work has produced the opposite result - we've wasted your time. We are really sorry to everyone, but especially our fellow builders. We feel about one inch tall right now, but we're going to get back to work and do better. Our crew deserves better and so do our users. Humbly, your captain, t.d.a.g.

posted an update 1 day ago

IMPORTANT corrective note/apology: the initial upload of ValiantLabs/Qwen3.5-27B-Esper3.1 contained improperly merged weights, meaning it was effectively just Qwen 3.5. We've merged properly now and re-uploaded the correct weights to the existing repository. The model link is here: https://huggingface.co/ValiantLabs/Qwen3.5-27B-Esper3.1 This is not at all the fault of Qwen 3.5, transformers, or anything other than our own flawed upload pipeline and insufficient post-upload validation. We have to do better. We'll immediately improve our validation procedures to be more rigorous. This will never happen again. We are proud to build quickly: by providing a quick finetune of a new high-performance model like Qwen 3.5, we seek to provide value not only in the model's direct use but especially to our fellow open-source creators, who can learn from our initial training attempt. We are proud that building fast helps you build fast. In this case, our poor work has produced the opposite result - we've wasted your time. We are really sorry to everyone, but especially our fellow builders. We feel about one inch tall right now, but we're going to get back to work and do better. Our crew deserves better and so do our users. Humbly, your captain, t.d.a.g.

liked a model 1 day ago

ValiantLabs/Qwen3.5-27B-Esper3.1

View all activity

Organizations

sequelbox 's models 30

sequelbox/Llama2-70B-SunsetBoulevard

Text Generation • 69B • Updated Jan 5 • 171 • 12

sequelbox/Llama2-70B-SharpBalance

Text Generation • 69B • Updated Jan 4 • 1 • 6

sequelbox/Ministral-3-14B-Reasoning-2512-PlumEsper1.1

Image-Text-to-Text • 14B • Updated Dec 9, 2025 • 22 • 3

sequelbox/Qwen3-14B-UML-Generator

Text Generation • 15B • Updated Nov 12, 2025 • 6 • 4

sequelbox/Qwen3-4B-Thinking-2507-UML-Generator

Text Generation • 4B • Updated Nov 12, 2025 • 8 • 4

sequelbox/gpt-oss-120b-UML-Generator

Text Generation • 117B • Updated Nov 12, 2025 • 2 • 4

sequelbox/gpt-oss-20b-UML-Generator

Text Generation • 21B • Updated Nov 12, 2025 • 6 • 3

sequelbox/Qwen3-4B-Thinking-2507-DES-Reasoning

Text Generation • 4B • Updated Sep 4, 2025 • 2 • 3

sequelbox/gpt-oss-20b-DES-Reasoning

Text Generation • 21B • Updated Sep 4, 2025 • 4 • 4

sequelbox/gpt-oss-20b-DAG-Reasoning

Text Generation • 21B • Updated Aug 11, 2025 • 3

sequelbox/Qwen3-14B-DAG-Reasoning

Text Generation • 15B • Updated Aug 11, 2025 • 1 • 6

sequelbox/Qwen3-8B-DAG-Reasoning

Text Generation • 8B • Updated Aug 11, 2025 • 6 • 2

sequelbox/Qwen3-4B-Thinking-2507-DAG-Reasoning

Text Generation • 4B • Updated Aug 11, 2025 • 2 • 7

sequelbox/Qwen3-4B-PlumEsper

Text Generation • 4B • Updated Jul 15, 2025 • 5

sequelbox/Qwen3-8B-PlumEsper

Text Generation • 8B • Updated Jul 14, 2025 • 5

sequelbox/Qwen3-14B-Esper3Mix

Text Generation • 15B • Updated May 28, 2025 • 3

sequelbox/Qwen3-14B-Esper3Grayline

Text Generation • 15B • Updated May 28, 2025 • 2

sequelbox/Qwen3-14B-Esper3Web3

Text Generation • 15B • Updated May 28, 2025 • 3

sequelbox/Qwen3-14B-Esper3Math

Text Generation • 15B • Updated May 28, 2025 • 3

sequelbox/Qwen3-8B-Esper3-PREVIEW

Text Generation • 8B • Updated May 7, 2025 • 4 • 2

sequelbox/Llama3.1-70B-PlumChat

Text Generation • 71B • Updated Dec 4, 2024 • 10

sequelbox/Llama3.1-8B-PlumCode

Text Generation • 8B • Updated Nov 17, 2024 • 14

sequelbox/Llama3.1-8B-PlumChat

Text Generation • 8B • Updated Nov 17, 2024 • 5

sequelbox/Llama3.1-8B-PlumMath

Text Generation • 8B • Updated Oct 4, 2024 • 10 • 1

sequelbox/Llama3.1-8B-MOTH

Text Generation • 8B • Updated Sep 19, 2024 • 12

sequelbox/gemma-2-9B-MOTH

Text Generation • 9B • Updated Sep 13, 2024 • 9

sequelbox/Llama2-70B-SpellBlade

Text Generation • 69B • Updated May 15, 2024 • 3 • 2

sequelbox/Llama2-13B-DiamondForce

Text Generation • 13B • Updated May 15, 2024 • 4

sequelbox/Llama2-70B-StellarBright

Text Generation • 69B • Updated May 15, 2024 • 144 • 45

sequelbox/Llama2-13B-DaringFortitude

Text Generation • 13B • Updated May 15, 2024 • 12 • 13