QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models

Jan 1, 2024ยท
Jing Liu
Ruihao Gong
Ruihao Gong
,
Xiuying Wei
,
Zhiwei Dong
,
Jianfei Cai
,
Bohan Zhuang
ยท 0 min read
Type
Publication
The Twelfth International Conference on Learning Representations