Light Video Generation Inference Framework.
Feb 28, 2025
An off-the-shell tool designed for compressing LLM, leveraging state-of-the-art compression algorithms to enhance efficiency and reduce model size without compromising performance
Nov 26, 2024
A Python-based LLM inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Jan 27, 2024
The implementaion of winner solution for LPCV 2023 Challenge
Aug 27, 2023
The implementaion of winner solution for LPCV 2021 FPGA track
Aug 27, 2021
An open-source model quantization toolkit based on PyTorch fx.
Jul 27, 2021