Ruihao Gong
Open Menu
Close Menu
Bio
Papers
Projects
Award
Paper-Conference
AtomNet: Designing Tiny Models from Operators Under Extreme MCU Constraints
Apr 1, 2025
Tool Playgrounds: A Comprehensive and Analyzable Benchmark for LLM Tool Invocation
Jan 1, 2025
Robust long-tailed recognition with distribution-aware adversarial example generation
Jan 1, 2025
ProPD: Dynamic Token Tree Pruning and Generation for LLM Parallel Decoding
Jan 1, 2025
Past-Future Scheduler for LLM Serving under SLA Guarantees
Jan 1, 2025
LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit
Nov 1, 2024
TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models
Jun 1, 2024
Selective Focus: Investigating Semantics Sensitivity in Post-training Quantization for Lane Detection
Jun 1, 2024
Fast and Controllable Post-training Sparsity: Learning Optimal Sparsity Allocation with Global Constraint in Minutes
Jun 1, 2024
QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models
Jan 1, 2024
Next »