Ruihao Gong
Open Menu
Close Menu
Bio
Papers
Projects
Award
Paper-Conference
SysNoise: Exploring and Benchmarking Training-Deployment System Inconsistency
Jan 1, 2023
Exploiting Subgraph Similarities for Efficient Auto-tuning of Tensor Programs
Jan 1, 2023
Outlier Suppression: Pushing the Limit of Low-bit Transformer Language Models
Sep 27, 2022
QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quantization
Jan 1, 2022
NNLQP: A Multi-Platform Neural Network Latency Query and Prediction System with An Evolving Database
Jan 1, 2022
Generating Transferable Adversarial Examples against Vision Transformers
Jan 1, 2022
Once Quantization-Aware Training: High Performance Extremely Low-Bit Architecture Search
Oct 1, 2021
MixMix: All You Need for Data-Free Compression Are Feature and Data Mixing
Oct 1, 2021
MQBench: Towards Reproducible and Deployable Model Quantization Benchmark
Jul 1, 2021
A Free Lunch From ANN: Towards Efficient, Accurate Spiking Neural Networks Calibration
Jul 1, 2021
« Previous
Next »