Jiajun Huang
Verified email at ucr.edu
Title
Cited by
Year
C-coll: Introducing error-bounded lossy compression into mpi collectives
J Huang, S Di, X Yu, Y Zhai, J Liu, K Raffenetti, H Zhou, K Zhao, Z Chen, ...
arXiv preprint arXiv:2304.03890, 2023
Cited by 7, 2023
Anatomy of high-performance gemm with online fault tolerance on gpus
S Wu, Y Zhai, J Liu, J Huang, Z Jian, B Wong, Z Chen
Proceedings of the 37th International Conference on Supercomputing, 360-372, 2023
Cited by 5, 2023
High-performance effective scientific error-bounded lossy compression with auto-tuned multi-component interpolation
J Liu, S Di, K Zhao, X Liang, S Jin, Z Jian, J Huang, S Wu, Z Chen, ...
Proceedings of the ACM on Management of Data 2 (1), 1-27, 2024
Cited by 3, 2024
gzccl: Compression-accelerated collective communication framework for gpu clusters
J Huang, S Di, X Yu, Y Zhai, J Liu, Y Huang, K Raffenetti, H Zhou, K Zhao, ...
arXiv preprint arXiv:2308.05199, 2023
Cited by 2, 2023
Ft-gemm: A fault tolerant high performance gemm implementation on x86 cpus
S Wu, Y Zhai, J Huang, Z Jian, Z Chen
Proceedings of the 32nd International Symposium on High-Performance Parallel …, 2023
Cited by 2, 2023
POSTER: Optimizing Collective Communications with Error-bounded Lossy Compression for GPU Clusters
J Huang, S Di, X Yu, Y Zhai, J Liu, Y Huang, K Raffenetti, H Zhou, K Zhao, ...
Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and …, 2024
Cited by 1, 2024
Accelerating mpi collectives with process-in-process-based multi-object techniques
J Huang, K Ouyang, Y Zhai, J Liu, M Si, K Raffenetti, H Zhou, A Hori, ...
Proceedings of the 32nd International Symposium on High-Performance Parallel …, 2023
Cited by 1, 2023
Accelerating fault-tolerant blas on x86 cpus
Y Zhai, E Giem, K Zhao, J Liu, J Huang, B Wong, C Shelton, Z Chen
July, 2022
Cited by 1, 2022
TurboFFT: A High-Performance Fast Fourier Transform with Fault Tolerance on GPU
S Wu, Y Zhai, J Liu, J Huang, Z Jian, H Dai, S Di, Z Chen, F Cappello
arXiv preprint arXiv:2405.02520, 2024
2024
A Survey on Error-Bounded Lossy Compression for Scientific Datasets
S Di, J Liu, K Zhao, X Liang, R Underwood, Z Zhang, M Shah, Y Huang, ...
arXiv preprint arXiv:2404.02840, 2024
2024
Exploring Wavelet Transform Usages for Error-bounded Scientific Data Compression
J Huang, J Liu, S Di, Y Zhai, Z Jian, S Wu, K Zhao, Z Chen, Y Guo, ...
2023 IEEE International Conference on Big Data (BigData), 4233-4239, 2023
2023
PiP-MColl: Process-in-Process-based Multi-object MPI Collectives
J Huang, K Ouyang, Y Zhai, J Liu, M Si, K Raffenetti, H Zhou, A Hori, ...
2023 IEEE International Conference on Cluster Computing (CLUSTER), 354-364, 2023
2023
FT-BLAS: A Fault Tolerant High Performance BLAS Implementation on x86 CPUs
Y Zhai, E Giem, K Zhao, J Liu, J Huang, BM Wong, CR Shelton, Z Chen
IEEE Transactions on Parallel and Distributed Systems, 2023
2023
Accelerating Collective Communications with Lossy Compression on GPU
J Huang, S Di, X Yu, Y Guo
Dimensions 449, 849X849X235, 0
Articles 1–14