DA-KD: Difficulty-Aware Knowledge Distillation for Efficient Large Language Models

Jan 1, 2025
Changyi He, Yifu Ding, Jinyang Guo, Ruihao Gong, Haotong Qin, Xianglong Liu
Publication: Proceedings of the 42nd International Conference on Machine Learning (ICML)