Presentation Schedule and Readings
Mar 20, 22:  Cache-oblivious algorithms, HTA etc.: Jeevitha + Manoj
  - New abstractions for data-parallel programming, Brodman et al, HotPar 2009
- An experimental comparison of cache-oblivious and cache-conscious programs, Yotov et al, SPAA 2007
Mar 27, 29:  Auto-tuning (ATLAS, FFTW) - Hemant + Kumar + Siddharth
  - Is search really necessary to generate high-performance BLAS? - http://www.cs.utexas.edu/~pingali/CS395T/2009fa/papers/ieee05.pdf
- FFTW: An adaptive software architecture for the FFT - http://www.fftw.org/fftw-paper-icassp.pdf
 
Apr 3, 5: Large-scale data analysis - Nikhil + Hector
  - MapReduce - http://research.google.com/archive/mapreduce.html
- HDFS - http://www.cs.utexas.edu/~nikhil/The_Hadoop_Distributed_File_System.pdf
 
Apr 10, 12: Synthesis - Tyler Smith + Martin Schatz
  - Tensor Contraction Engine - http://www.csc.lsu.edu/%7Egb/TCE/
- Stencil compiler -http://www.cs.utexas.edu/~pingali/CS395T/2012sp/papers/stencilCompiler.pdf
 
Apr 17, 19: Approximate computing - Diego + Milad + Wei-Lun
  - Green:
        A Framework for Supporting Energy-conscious Programming using
        Controlled Approximation, Baek and Chilimbi, PLDI 2010.
- Exploiting
        the  Forgiving Nature of Applications for Scalable Parallel
        Execution. Meng, Raghunathan, Chakradhar, and Byna, IPDPS 2010.
- Dynamic
        knobs for power-aware computing. Hoffmann et al ASPLOS, 2011
      
Apr 24, 26: Parallel data structures - Jeffrey + Yilin
  - Transactional Memory Architectural Support for Lock-Free Data Structures, Maurice Herlihy, J. Eliot B. Moss ISCA 1993.
- An efficient heuristic procedure for partitioning graphs, Kernighan and Lin, Bell System Technical Journal, 1970.