Experience

June 2019 – September 2019
Seattle

Research Intern

Tencent AI Lab

Worked on NLP techniques.
May 2018 – August 2019

Research Intern

Megvii Inc.

Service

Oct 2019

Reviewer

Conference on Computer Vision and Pattern Recognition (CVPR 2019 and 2020)

May 2019

Reviewer

Neurocomputing

May 2019

Reviewer

Conference on Neural Information Processing Systems (NeurIPS 2019)

Jan 2019

Reviewer

International Conference on Machine Learning (ICML 2019)

Recent Publications

*: Equal Contribution

(2019). RTN: Reparameterized Ternary Network. AAAI Conference on Artificial Intelligence (AAAI 2020).

Preprint

(2019). Full-stack Optimization for Accelerating CNNs with FPGA Validation. The 33rd ACM International Conference on Supercomputing (ICS 2019).

Preprint

(2019). Maestro: A Memory-on-Logic Architecture for Coordinated Parallel Use of Many Systolic Arrays. The 30th IEEE International Conference on Application-specific Systems, Architectures and Processors (ASAP 2019).

Preprint

Ongoing Projects

An Efficient Partitioning Scheme of DNNs for IoT

In this work, we study the DNN partitioning problem for CNNs and propose CININ, an efficient scheme for partitioning a large-scale CNN across edge devices with limited computing power. Evaluation over numerous CNN models and datasets demonstrates that CININ can greatly reduce inference latency with almost no loss in performance.