Xuechao Wei
Title
Cited by
Year
Automated systolic array architecture synthesis for high throughput CNN inference on FPGAs
X Wei, CH Yu, P Zhang, Y Chen, Y Wang, H Hu, Y Liang, J Cong
Proceedings of the 54th Annual Design Automation Conference 2017, 1-6, 2017
Cited by: 126; Year: 2017
Throughput optimization for streaming applications on CPU-FPGA heterogeneous systems
X Wei, Y Liang, T Wang, S Lu, J Cong
2017 22nd Asia and South Pacific Design Automation Conference (ASP-DAC), 488-493, 2017
Cited by: 14; Year: 2017
TGPA: tile-grained pipeline architecture for low latency CNN inference
X Wei, Y Liang, X Li, CH Yu, P Zhang, J Cong
Proceedings of the International Conference on Computer-Aided Design, 1-8, 2018
Cited by: 13; Year: 2018
FlexBFS: a parallelism-aware implementation of breadth-first search on GPU
G Liu, H An, W Han, X Li, T Sun, W Zhou, X Wei, X Tang
Proceedings of the 17th ACM SIGPLAN symposium on Principles and Practice of …, 2012
Cited by: 12; Year: 2012
Overcoming data transfer bottlenecks in FPGA-based DNN accelerators via layer conscious memory management
X Wei, Y Liang, J Cong
2019 56th ACM/IEEE Design Automation Conference (DAC), 1-6, 2019
Cited by: 6; Year: 2019
Frequency Improvement of Systolic Array-Based CNNs on FPGAs
J Zhang, W Zhang, G Luo, X Wei, Y Liang, J Cong
2019 IEEE International Symposium on Circuits and Systems (ISCAS), 1-4, 2019
Cited by: 3; Year: 2019
Overcoming data transfer bottlenecks in DNN accelerators via layer-conscious memory management
X Wei, Y Liang, P Zhang, CH Yu, J Cong
Proceedings of the 2019 ACM/SIGDA International Symposium on Field …, 2019
Cited by: 1; Year: 2019
Distributed Control Independence for Composable Multi-processors
M Mao, H An, T Sun, Q Li, B Deng, X Wei, J Zhou
2012 IEEE/ACIS 11th International Conference on Computer and Information …, 2012
Cited by: 1; Year: 2012
FTDL: An FPGA-tailored Architecture for Deep Learning Systems
R Shi, Y Ding, X Wei, H Liu, H So, C Ding
The 2020 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays …, 2020
2020
Systems And Methods For Systolic Array Design From A High-Level Program
P Zhang, CH Yu, X Wei, P Pan
US Patent App. 15/962,916, 2018
2018
Framework to Accelerate Single-threaded Applications by Hyperblock Reformation on EDGE Architectures
XC Wei, H An, MJ Mao
Journal of Chinese Computer Systems 33 (10), 2249-2254, 2012
2012
Distributed replay protocol for distributed uniprocessors
M Mao, H An, B Deng, T Sun, X Wei, W Zhou, W Han
Proceedings of the 26th ACM international conference on Supercomputing, 3-14, 2012
2012
Automated Systolic Array Architecture Synthesis for High Throughput CNN Inference on AWS F1 FPGA
X Wei, P Zhang, CH Yu, J Wu