Compressing Large Language Models by Joint Sparsification and Quantization

Jan 1, 2024
Jinyang Guo, Jianyu Wu, Zining Wang, Jiaheng Liu, Ge Yang, Yifu Ding, Ruihao Gong, Haotong Qin, Xianglong Liu
Publication: Forty-first International Conference on Machine Learning