Publications

(2024). TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models. The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

PDF Cite

(2024). Selective Focus: Investigating Semantics Sensitivity in Post-training Quantization for Lane Detection. Proceedings of the AAAI Conference on Artificial Intelligence.

PDF Cite

(2024). Fast and Controllable Post-training Sparsity: Learning Optimal Sparsity Allocation with Global Constraint in Minutes. Proceedings of the AAAI Conference on Artificial Intelligence.

PDF Cite

(2024). Rectify representation bias in vision-language models for long-tailed recognition. Neural Networks.

Cite DOI URL

(2024). QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models. The Twelfth International Conference on Learning Representations.

PDF Cite Code URL

(2023). Outlier Suppression+: Accurate quantization of large language models by equivalent and effective shifting and scaling. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing.

PDF Cite Code DOI URL

(2023). Lossy and Lossless (L2) Post-training Model Size Compression. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV).

PDF Cite Code

(2023). Exploring the Relationship Between Architectural Design and Adversarially Robust Generalization. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

Cite

(2023). Annealing-Based Label-Transfer Learning for Open World Object Detection. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

Cite

(2023). SysNoise: Exploring and Benchmarking Training-Deployment System Inconsistency. Proceedings of Machine Learning and Systems.

PDF Cite Code Dataset Project

(2023). Exploiting Subgraph Similarities for Efficient Auto-tuning of Tensor Programs. Proceedings of the 52nd International Conference on Parallel Processing.

Cite DOI URL

(2023). Discrepant Semantic Diffusion Boosts Transfer Learning Robustness. Electronics.

Cite DOI URL

(2022). Distribution-Sensitive Information Retention for Accurate Binary Neural Network. International Journal of Computer Vision.

Cite DOI URL

(2022). Outlier Suppression: Pushing the Limit of Low-bit Transformer Language Models. Thirty-Sixth Conference on Neural Information Processing Systems.

PDF Cite URL

(2022). QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quantization. International Conference on Learning Representations.

PDF Cite Code Project URL

(2022). NNLQP: A Multi-Platform Neural Network Latency Query and Prediction System with An Evolving Database. 51 International Conference on Parallel Processing - ICPP.

PDF Cite Code DOI URL

(2022). Generating Transferable Adversarial Examples against Vision Transformers. Proceedings of the 30th ACM International Conference on Multimedia.

Cite DOI URL

(2021). Once Quantization-Aware Training: High Performance Extremely Low-Bit Architecture Search. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV).

PDF Cite

(2021). MixMix: All You Need for Data-Free Compression Are Feature and Data Mixing. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV).

PDF Cite

(2021). MQBench: Towards Reproducible and Deployable Model Quantization Benchmark. Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks.

PDF Cite Code Project

(2021). Diversifying Sample Generation for Accurate Data-Free Quantization. The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

PDF Cite

(2021). RobustART: Benchmarking Robustness on Architecture Design and Training Techniques. Arxiv.

PDF Cite Dataset Project

(2021). BRECQ: Pushing the Limit of Post-Training Quantization by Block Reconstruction. International Conference on Learning Representations.

PDF Cite Code Project URL

(2020). Towards Unified INT8 Training for Convolutional Neural Network. The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

PDF Cite Video

(2020). Rotation Consistent Margin Loss for Efficient Low-bit Face Recognition. The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

PDF Cite Video

(2020). Forward and Backward Information Retention for Accurate Binary Neural Networks. The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

PDF Cite Code Video

(2020). Balanced Binary Neural Networks with Gated Residual. International Conference on Acoustics, Speech, and Signal Processing (ICASSP).

PDF Cite Video

(2020). DMS: Differentiable Dimension Search for Binary Neural Networks. ICLR 2020 NAS Workshop.

PDF Cite Video

(2020). Extremely Low-Bit Convolution Optimization for Quantized Neural Network on Modern Computer Architectures. 49th International Conference on Parallel Processing - ICPP.

PDF Cite Video DOI URL

(2020). Efficient Bitwidth Search for Practical Mixed Precision Neural Network.

PDF Cite arXiv

(2020). Binary neural networks: A survey. Pattern Recognition.

PDF Cite DOI URL

(2019). Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks. The IEEE International Conference on Computer Vision (ICCV).

PDF Cite Poster

(2013). An example conference paper. Source Themes Conference.

Cite