NVIDIA GPU

Extremely Low-Bit Convolution Optimization for Quantized Neural Network on Modern Computer Architectures