QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models

Jan 1, 2024·
Jing Liu
Ruihao Gong
Ruihao Gong
,
Xiuying Wei
,
Zhiwei Dong
,
Jianfei Cai
,
Bohan Zhuang
· 0 min read
Type
Publication
The Twelfth International Conference on Learning Representations