ISTA-DASLab 's Collections
Extreme Compression of Large Language Models via Additive Quantization
Paper
• 2401.06118
• Published • 14
ISTA-DASLab/Meta-Llama-3-70B-Instruct-AQLM-2Bit-1x16
Text Generation
• 11B • Updated • 19
• 20
ISTA-DASLab/Meta-Llama-3-70B-AQLM-2Bit-1x16
Text Generation
• Updated • 9
• 14
ISTA-DASLab/Meta-Llama-3-8B-Instruct-AQLM-2Bit-1x16
Text Generation
• 2B • Updated • 64
• 12
ISTA-DASLab/Meta-Llama-3-8B-AQLM-2Bit-1x16
Text Generation
• 2B • Updated • 100
• 8
ISTA-DASLab/c4ai-command-r-v01-AQLM-2Bit-1x16
Text Generation
• 6B • Updated • 2
• 10
ISTA-DASLab/c4ai-command-r-plus-AQLM-2Bit-1x16
Text Generation
• 16B • Updated • 7
• 10
ISTA-DASLab/Mixtral-8x7B-Instruct-v0_1-AQLM-2Bit-1x16-hf
Text Generation
• 7B • Updated • 23
• 19
ISTA-DASLab/Mixtral-8x7b-AQLM-2Bit-1x16-hf
Text Generation
• 7B • Updated • 51
• 23
ISTA-DASLab/Mistral-7B-Instruct-v0.2-AQLM-2Bit-2x8
Text Generation
• 2B • Updated • 37
• 3
ISTA-DASLab/Mistral-7B-v0.1-AQLM-2Bit-1x16-hf
Text Generation
• 1B • Updated • 21
• 2
ISTA-DASLab/gemma-2b-AQLM-2Bit-1x16-hf
Text Generation
• 0.8B • Updated • 9
• 6
ISTA-DASLab/gemma-2b-AQLM-2Bit-2x8-hf
Text Generation
• 1B • Updated • 60
• 4
ISTA-DASLab/Llama-2-7b-AQLM-2Bit-1x16-hf
Text Generation
• 1B • Updated • 94
• 5
ISTA-DASLab/Llama-2-7b-AQLM-2Bit-2x8-hf
Text Generation
• 2B • Updated • 109
• 2
ISTA-DASLab/Llama-2-7b-AQLM-2Bit-8x8-hf
Text Generation
• 2B • Updated • 9
ISTA-DASLab/Llama-2-13b-AQLM-2Bit-1x16-hf
Text Generation
• 2B • Updated • 7
ISTA-DASLab/Llama-2-13b-AQLM-4Bit-2x16-hf
Text Generation
• Updated • 8
ISTA-DASLab/Llama-2-70b-AQLM-2Bit-1x16-hf
Text Generation
• 9B • Updated • 11
• 6
ISTA-DASLab/Llama-2-70b-AQLM-2Bit-2x8-hf
Text Generation
• 18B • Updated • 43
• 1
ISTA-DASLab/Llama-2-70b-AQLM-4Bit-2x16-hf
Text Generation
• 18B • Updated • 9