Ahmad Lashgar
Senior SE at Zoox
Verified email at uvic.ca
Title
Cited by
Year
Performance in GPU architectures: Potentials and distances
A Lashgar, A Baniasadi
9th Annual Workshop on Duplicating, Deconstructing, and Debunking (WDDD 2011 …, 2011
Cited by: 29, Year: 2011
Dynamic Warp Resizing: Analysis and Benefits in High-Performance SIMT
A Lashgar, A Baniasadi, A Khonsari
30th International IEEE Conference on Computer Design, ICCD 2012, 502-503, 2012
Cited by: 18, Year: 2012
IPMACC: open source OpenACC to CUDA/OpenCL translator
A Lashgar, A Majidi, A Baniasadi
arXiv preprint arXiv:1412.1127, 2014
Cited by: 16, Year: 2014
Employing software-managed caches in OpenACC: Opportunities and benefits
A Lashgar, A Baniasadi
ACM Transactions on Modeling and Performance Evaluation of Computing Systems …, 2016
Cited by: 15, Year: 2016
Inter-Warp Instruction Temporal Locality in Deep-Multithreaded GPUs
A Lashgar, A Baniasadi, A Khonsari
26th International Conference on Architecture of Computing Systems, ARCS …, 2013
Cited by: 12, Year: 2013
OpenACC cache directive: Opportunities and optimizations
A Lashgar, A Baniasadi
2016 Third Workshop on Accelerator Programming Using Directives (WACCPD), 46-56, 2016
Cited by: 11, Year: 2016
HARP: Harnessing Inactive Threads in Many-Core Processors
A Lashgar, A Khonsari, A Baniasadi
ACM Transactions on Embedded Computing Systems 13 (3s), Article 114, 2014
Cited by: 9, Year: 2014
Warp size impact in GPUs: large or small?
A Lashgar, A Baniasadi, A Khonsari
Proceedings of the 6th Workshop on General Purpose Processor Using Graphics …, 2013
Cited by: 9, Year: 2013
Understanding outstanding memory request handling resources in GPGPUs
A Lashgar, E Salehi, A Baniasadi
Proceedings of the Sixth International Symposium on Highly Efficient …, 2015
Cited by: 7, Year: 2015
Investigating Warp Size Impact in GPUs
A Lashgar, A Baniasadi, A Khonsari
arXiv preprint arXiv:1205.4967, 2012
Cited by: 7, Year: 2012
Loop perforation in OpenACC
A Lashgar, E Atoofian, A Baniasadi
2018 IEEE Intl Conf on Parallel & Distributed Processing with Applications …, 2018
Cited by: 6, Year: 2018
A case against small data types in GPGPUs
A Lashgar, A Baniasadi
2014 IEEE 25th International Conference on Application-Specific Systems …, 2014
Cited by: 5, Year: 2014
A case study in reverse engineering GPGPUs: Outstanding memory handling resources
A Lashgar, E Salehi, A Baniasadi
ACM SIGARCH Computer Architecture News 43 (4), 15-21, 2016
Cited by: 4, Year: 2016
IPMACC: Translating OpenACC API to OpenCL
A Lashgar, A Majidi, A Baniasadi
In poster session of The 3rd International Workshop on OpenCL (IWOCL), IWOCL, 2015
Cited by: 4, Year: 2015
Towards green GPUs: Warp size impact analysis
A Lashgar, A Baniasadi, A Khonsari
2013 International Green Computing Conference Proceedings, 1-6, 2013
Cited by: 3, Year: 2013
Efficient implementation of OpenACC cache directive on NVIDIA GPUs
A Lashgar, A Baniasadi
International Journal of High Performance Computing and Networking 13 (1), 35-53, 2019
Cited by: 2, Year: 2019
Dynamic Warp Resizing in High-Performance SIMT
A Lashgar, A Baniasadi, A Khonsari
arXiv preprint arXiv:1208.2374, 2012
Cited by: 2, Year: 2012
TELEPORT: Hardware/software alternative to CUDA shared memory programming
A Lashgar, E Atoofian, A Baniasadi
Microprocessors and Microsystems 63, 169-181, 2018
Cited by: 1, Year: 2018
Rethinking prefetching in GPGPUs: Exploiting unique opportunities
A Lashgar, A Baniasadi
2015 IEEE 17th International Conference on High Performance Computing and …, 2015
Cited by: 1, Year: 2015
Addressing software-managed cache development effort in GPGPUs
A Lashgar
2017