Man Cub

mancub

·

AI & ML interests

None yet

Recent Activity

new activity 10 days ago

Hikari07jp/gemma4-repe-uncensor:Why not just patch the model?

new activity 30 days ago

llmfan46/gemma-4-31B-it-qat-q4_0-uncensored-heretic-GPTQ-Int4:Does not load into vllm- layer 5 fusion issue

new activity 30 days ago

cyankiwi/gemma-4-12B-it-qat-AWQ-INT4:Vllm and SgLang command please

View all activity

Organizations

None yet

New activity in Hikari07jp/gemma4-repe-uncensor 10 days ago

Why not just patch the model?

#1 opened 10 days ago by

New activity in llmfan46/gemma-4-31B-it-qat-q4_0-uncensored-heretic-GPTQ-Int4 30 days ago

Does not load into vllm- layer 5 fusion issue

#1 opened about 1 month ago by

New activity in cyankiwi/gemma-4-12B-it-qat-AWQ-INT4 30 days ago

Vllm and SgLang command please

#1 opened about 1 month ago by

New activity in OBLITERATUS/Gemma-4-12B-OBLITERATED about 1 month ago

Does this jaibreak version suppor mtp?

#1 opened about 1 month ago by

New activity in z-lab/Qwen3.6-27B-DFlash about 2 months ago

Are we going to see an update to this model?

#15 opened about 2 months ago by

New activity in Intel/gemma-4-31B-it-int4-AutoRound about 2 months ago

INT8 version for TP=2 / dual Ampere GPUs?

#6 opened 2 months ago by

New activity in z-lab/Qwen3.6-27B-DFlash about 2 months ago

RuntimeError: expected mat1 and mat2 to have the same dtype, but got: float != c10::Half

#10 opened 2 months ago by

New activity in AesSedai/Qwen3.6-35B-A3B-GGUF about 2 months ago

Q6_K?

#1 opened 3 months ago by

New activity in froggeric/Qwen-Fixed-Chat-Templates 2 months ago

final v16 does not appear to work correctly, it stops after the first prompt.

#19 opened 2 months ago by

v13 stops dead after the first response

#14 opened 2 months ago by

New activity in Minachist/Qwen3.6-35B-A3B-INT8-AutoRound 2 months ago

Crashes with newest vllm version (v0.20.1)

#1 opened 2 months ago by

New activity in froggeric/Qwen-Fixed-Chat-Templates 2 months ago

v11/v12 performance considerations with Claude Code?

#11 opened 2 months ago by

When using Claude Code, tool calls end up broken with this chat template in Qwen3.6-27B

#6 opened 2 months ago by

New activity in Minachist/Qwen3.6-27B-INT8-AutoRound 2 months ago

Good quant!

#1 opened 2 months ago by

New activity in QuantTrio/gemma-4-31B-it-AWQ 2 months ago

Does not appear to work with the new google drafter MTP model

#2 opened 2 months ago by

New activity in google/gemma-4-31B-it-assistant 2 months ago

Is it supposed to work in vllm?

#2 opened 2 months ago by

New activity in z-lab/Qwen3.6-27B-DFlash 3 months ago

Avg Draft acceptance rate is low.

#2 opened 3 months ago by

New activity in rdtand/Qwen3.6-27B-PrismaQuant-5.5bit-vllm 3 months ago

Unable to run on 3090

#1 opened 3 months ago by

New activity in ubergarm/Qwen3.5-122B-A10B-GGUF 3 months ago

How to split this model between 2 (3) GPUs and CPU/RAM ?

#12 opened 4 months ago by

New activity in QuantTrio/Qwen3.5-27B-AWQ 3 months ago

My personal vLLM launch cmd on my old personal 2x3090 workstation

#1 opened 5 months ago by