·
AI & ML interests
open source, infinite games. (they/them)
Recent Activity
replied to their post 1 day ago IMPORTANT corrective note/apology: the initial upload of ValiantLabs/Qwen3.5-27B-Esper3.1 contained improperly merged weights, meaning it was effectively just Qwen 3.5. We've merged properly now and re-uploaded the correct weights to the existing repository.
The model link is here: https://huggingface.co/ValiantLabs/Qwen3.5-27B-Esper3.1
This is not at all the fault of Qwen 3.5, transformers, or anything other than our own flawed upload pipeline and insufficient post-upload validation. We have to do better. We'll immediately improve our validation procedures to be more rigorous. This will never happen again.
We are proud to build quickly: by providing a quick finetune of a new high-performance model like Qwen 3.5, we seek to provide value not only in the model's direct use but especially to our fellow open-source creators, who can learn from our initial training attempt. We are proud that building fast helps you build fast. In this case, our poor work has produced the opposite result - we've wasted your time. We are really sorry to everyone, but especially our fellow builders.
We feel about one inch tall right now, but we're going to get back to work and do better. Our crew deserves better and so do our users.
Humbly, your captain,
t.d.a.g. posted an
update
1 day ago IMPORTANT corrective note/apology: the initial upload of ValiantLabs/Qwen3.5-27B-Esper3.1 contained improperly merged weights, meaning it was effectively just Qwen 3.5. We've merged properly now and re-uploaded the correct weights to the existing repository.
The model link is here: https://huggingface.co/ValiantLabs/Qwen3.5-27B-Esper3.1
This is not at all the fault of Qwen 3.5, transformers, or anything other than our own flawed upload pipeline and insufficient post-upload validation. We have to do better. We'll immediately improve our validation procedures to be more rigorous. This will never happen again.
We are proud to build quickly: by providing a quick finetune of a new high-performance model like Qwen 3.5, we seek to provide value not only in the model's direct use but especially to our fellow open-source creators, who can learn from our initial training attempt. We are proud that building fast helps you build fast. In this case, our poor work has produced the opposite result - we've wasted your time. We are really sorry to everyone, but especially our fellow builders.
We feel about one inch tall right now, but we're going to get back to work and do better. Our crew deserves better and so do our users.
Humbly, your captain,
t.d.a.g. View all activity
Organizations
sequelbox/Llama2-70B-SunsetBoulevard
Text Generation
• 69B • Updated
• 171
• 12
sequelbox/Llama2-70B-SharpBalance
Text Generation
• 69B • Updated
• 1
• 6
sequelbox/Ministral-3-14B-Reasoning-2512-PlumEsper1.1
Image-Text-to-Text
• 14B • Updated
• 22
• 3
sequelbox/Qwen3-14B-UML-Generator
Text Generation
• 15B • Updated
• 6
• 4
sequelbox/Qwen3-4B-Thinking-2507-UML-Generator
Text Generation
• 4B • Updated
• 8
• 4
sequelbox/gpt-oss-120b-UML-Generator
Text Generation
• 117B • Updated
• 2
• 4
sequelbox/gpt-oss-20b-UML-Generator
Text Generation
• 21B • Updated
• 6
• 3
sequelbox/Qwen3-4B-Thinking-2507-DES-Reasoning
Text Generation
• 4B • Updated
• 2
• 3
sequelbox/gpt-oss-20b-DES-Reasoning
Text Generation
• 21B • Updated
• 4
• 4
sequelbox/gpt-oss-20b-DAG-Reasoning
Text Generation
• 21B • Updated
• 3
sequelbox/Qwen3-14B-DAG-Reasoning
Text Generation
• 15B • Updated
• 1
• 6
sequelbox/Qwen3-8B-DAG-Reasoning
Text Generation
• 8B • Updated
• 6
• 2
sequelbox/Qwen3-4B-Thinking-2507-DAG-Reasoning
Text Generation
• 4B • Updated
• 2
• 7
sequelbox/Qwen3-4B-PlumEsper
Text Generation
• 4B • Updated
• 5
sequelbox/Qwen3-8B-PlumEsper
Text Generation
• 8B • Updated
• 5
sequelbox/Qwen3-14B-Esper3Mix
Text Generation
• 15B • Updated
• 3
sequelbox/Qwen3-14B-Esper3Grayline
Text Generation
• 15B • Updated
• 2
sequelbox/Qwen3-14B-Esper3Web3
Text Generation
• 15B • Updated
• 3
sequelbox/Qwen3-14B-Esper3Math
Text Generation
• 15B • Updated
• 3
sequelbox/Qwen3-8B-Esper3-PREVIEW
Text Generation
• 8B • Updated
• 4
• 2
sequelbox/Llama3.1-70B-PlumChat
Text Generation
• 71B • Updated
• 10
sequelbox/Llama3.1-8B-PlumCode
Text Generation
• 8B • Updated
• 14
sequelbox/Llama3.1-8B-PlumChat
Text Generation
• 8B • Updated
• 5
sequelbox/Llama3.1-8B-PlumMath
Text Generation
• 8B • Updated
• 10
• 1
sequelbox/Llama3.1-8B-MOTH
Text Generation
• 8B • Updated
• 12
sequelbox/gemma-2-9B-MOTH
Text Generation
• 9B • Updated
• 9
sequelbox/Llama2-70B-SpellBlade
Text Generation
• 69B • Updated
• 3
• 2
sequelbox/Llama2-13B-DiamondForce
Text Generation
• 13B • Updated
• 4
sequelbox/Llama2-70B-StellarBright
Text Generation
• 69B • Updated
• 144
• 45
sequelbox/Llama2-13B-DaringFortitude
Text Generation
• 13B • Updated
• 12
• 13