DeepSeek-MoE-16B-Chat GPTQ Quantized Collection DeepSeek-MoE-16B-Chat quantized with GPTQ via llm-compressor: W8A8, W4A16, FP8, NVFP4. • 4 items • Updated 25 days ago
Llama-3.2-1B-Instruct GPTQ Quantized Collection GPTQ quantized across W4A16, W8A8, FP8, NVFP4 using llm-compressor. • 4 items • Updated 25 days ago