Publications

(2019). Additive Powers-of-Two Quantization: A Non-uniform Discretization for Neural Networks. To appear at the International Conference on Learning Representations (ICLR 2020).

Preprint

(2019). RTN: Reparameterized Ternary Network. AAAI Conference on Artificial Intelligence (AAAI 2020).

Preprint

(2019). Maestro: A Memory-on-Logic Architecture for Coordinated Parallel Use of Many Systolic Arrays. The 30th IEEE International Conference on Application-specific Systems, Architectures and Processors (ASAP 2019).

Preprint

(2019). Full-stack Optimization for Accelerating CNNs with FPGA Validation. The 33rd ACM International Conference on Supercomputing (ICS 2019).

Preprint