An overview of the BlueGene/L supercomputer NR Adiga, G Almási, GS Almasi, Y Aridor, R Barik, D Beece, R Bellofatto, ... SC'02: Proceedings of the 2002 ACM/IEEE Conference on Supercomputing, 60-60, 2002 | 667 | 2002 |
Software transactional memory: Why is it only a research toy? C Cascaval, C Blundell, M Michael, HW Cain, P Wu, S Chiras, ... Communications of the ACM 51 (11), 40-46, 2008 | 410 | 2008 |
Implementation of a portable nested data-parallel language GE Blelloch, JC Hardwick, J Sipelstein, M Zagha, S Chatterjee Journal of Parallel and Distributed Computing 21 (1), 4-14, 1994 | 330 | 1994 |
Recursive array layouts and fast parallel matrix multiplication S Chatterjee, AR Lebeck, PK Patnala, M Thottethodi Proceedings of the eleventh annual ACM symposium on Parallel algorithms and …, 1999 | 240 | 1999 |
Exact analysis of the cache behavior of nested loops S Chatterjee, E Parker, PJ Hanlon, AR Lebeck ACM SIGPLAN Notices 36 (5), 286-297, 2001 | 238 | 2001 |
Implementation of a portable nested data-parallel language GE Blelloch, JC Hardwick, S Chatterjee, J Sipelstein, M Zagha ACM SIGPLAN Notices 28 (7), 102-111, 1993 | 236 | 1993 |
Nonlinear array layouts for hierarchical memory systems S Chatterjee, VV Jain, AR Lebeck, S Mundhra, M Thottethodi Proceedings of the 13th international conference on Supercomputing, 444-453, 1999 | 213 | 1999 |
Automatic array alignment in data-parallel programs S Chatterjee, JR Gilbert, R Schreiber, SH Teng Proceedings of the 20th ACM SIGPLAN-SIGACT symposium on Principles of …, 1993 | 196 | 1993 |
Scan primitives for vector computers S Chatterjee, GE Blelloch, M Zagha Carnegie Mellon University, 1990 | 157 | 1990 |
Generating local addresses and communication sets for data-parallel programs S Chatterjee, JR Gilbert, FJE Long, R Schreiber, SH Teng ACM SIGPLAN Notices 28 (7), 149-158, 1993 | 150 | 1993 |
Towards a theory of cache-efficient algorithms S Sen, S Chatterjee, N Dumir Journal of the ACM (JACM) 49 (6), 828-858, 2002 | 133 | 2002 |
Generating local addresses and communication sets for data-parallel programs S Chatterjee, JR Gilbert, FJE Long, R Schreiber, SH Teng Journal of Parallel and Distributed Computing 26 (1), 72-84, 1995 | 126 | 1995 |
Tuning Strassen's matrix multiplication for memory efficiency M Thottethodi, S Chatterjee, AR Lebeck SC'98: Proceedings of the 1998 ACM/IEEE Conference on Supercomputing, 36-36, 1998 | 112* | 1998 |
Tuning Strassen's matrix multiplication for memory efficiency M Thottethodi, S Chatterjee, AR Lebeck SC'98: Proceedings of the 1998 ACM/IEEE Conference on Supercomputing, 36-36, 1998 | 100 | 1998 |
VCODE: A data-parallel intermediate language GE Blelloch, S Chatterjee Proceedings Frontiers of Massively Parallel Computation, 471-480, 1990 | 96 | 1990 |
Cache-efficient matrix transposition S Chatterjee, S Sen Proceedings Sixth International Symposium on High-Performance Computer …, 2000 | 92 | 2000 |
Shared memory programming for large scale machines C Barton, C Caşcaval, G Almási, Y Zheng, M Farreras, S Chatterjee, ... ACM SIGPLAN Notices 41 (6), 108-117, 2006 | 82 | 2006 |
Method for improving performance of executable code G Cascaval, S Chatterjee, E Duesterwald, A Kielstra, K Stoodley US Patent 7,954,094, 2011 | 77 | 2011 |
Computer architecture: Challenges and opportunities for the next decade T Agerwala, S Chatterjee IEEE Micro 25 (3), 58-69, 2005 | 76 | 2005 |
Cache-efficient multigrid algorithms S Sellappa, S Chatterjee The International Journal of High Performance Computing Applications 18 (1 …, 2004 | 67 | 2004 |