Oleg Kibirev
catplusplus
AI & ML interests
None yet
Recent Activity
new activity 1 day ago
catplusplus/Qwen3.6-27B-uncensored-heretic-v2-NVFP4-MTP:Update README.md updated a model 1 day ago
catplusplus/Qwen3.6-27B-uncensored-heretic-v2-NVFP4-MTP published a model 1 day ago
catplusplus/Qwen3.6-27B-uncensored-heretic-v2-NVFP4-MTPOrganizations
None yet
Update README.md
#1 opened 1 day ago
by
catplusplus
Pathname confusion
#1 opened 6 days ago
by
catplusplus
Made a version with full precision attention and kv cache scales
3
#1 opened 10 days ago
by
catplusplus
Would you consider model adoption to solve your storage issues?
1
#2 opened 16 days ago
by
catplusplus
quantizing to MLX 4bit removed abliteration
9
#2 opened about 2 months ago
by
giediprime
do the same thing you did with 27b because it does a lot better.
1
#3 opened about 1 month ago
by
drmcbride
Uploaded an NVFP4 quant for Thor/Spark
#5 opened 20 days ago
by
catplusplus
Only generates nulls for me
2
#2 opened 30 days ago
by
catplusplus
Generates incoherent outputs for me with VLLM 0.18
1
#5 opened 30 days ago
by
catplusplus
VLLM + MTP + NVFP4 doesn't work
👀 1
2
#16 opened about 1 month ago
by
catplusplus
Run on DGX Spark
16
#14 opened about 1 month ago
by
LimeemiL
Doesn't work with latest vllm, even tried to recompile vLLM and transformers from git
➕ 1
4
#8 opened about 2 months ago
by
catplusplus
All this talk about NVFP4 - why is it dog slow?
14
#13 opened about 2 months ago
by
josephbreda
NVFP4 cannot be loaded in SGLang
4
#12 opened about 2 months ago
by
mratsim
vLLM MTP unusable on RTX 6000 Pro, as spec decoding consumes 20GB+ VRAM at start-up, causing OOM
5
#9 opened about 2 months ago
by
lsmc
Made NVFP4 version
#3 opened about 2 months ago
by
catplusplus
Got better even though the higher KLD!
👍 4
3
#2 opened about 2 months ago
by
FORNAX20
Any revision plan?
6
#2 opened about 2 months ago
by
Lck0427
Made catplusplus/Qwen3.5-35B-A3B-heretic-NVFP4 for Blackwell users
1
#3 opened about 2 months ago
by
catplusplus
Made catplusplus/Qwen3.5-35B-A3B-heretic-NVFP4 for Blackwell users
1
#3 opened about 2 months ago
by
catplusplus