ENEE 698B, Fall 2000

List of papers selected by students

Main papers are marked with a (*)
 
Sept 19 Himanshu Khurana (*) P. Ranganathan, S Adve and N.P. Jouppi, "Performance of image and
video processing with general-purpose processors and media ISA
extensions", Proceedings of the 26th annual international symposium on
Computer Architecture, May 1999, Pages 124 - 135.

Thomas M. Conte, Pradeep K. Dubey, Matthew D. Jennings, Ruby B. Lee,
Alex Peleg, Salliah Rathnam, Mike Schlansker, Peter Song, and Andrew
Wolfe, "Challenges to Combining General-Purpose and Multimedia
Processors", IEEE Computer, Vol. 30. No. 12, December 1997.
 

  Radostina Koleva (*) Rainer Leupers, "Code Selection for Media Processors with SIMD
Instructions", Proceedings of the Design, Automation and Test in Europe,
27 - 30 March, 2000, Paris, France. (alternate link).

Ruby B. Lee and Michael D. Smith, "Media Processing: A New Design Target",
IEEE Micro, Vol. 16, No. 4, August 1996, pp. 6-9

 

Oct 3 Steve Haga (*) Jay Bharadwaj, Kishore Menezes and Chris McKinsey, "Wavefront scheduling: path based data representation and scheduling of subgraphs", Proceedings of the 32nd Annual ACM/IEEE international symposium on microarchitecture on MICRO-32, 1999, Pages 262 - 271.

Wen-mei W. Hwu, Scott A. Mahlke, William Y. Chen, Pohua P. Chang, Nancy J. Warter, Roger A. Bringmann, Roland G. Ouellette, Richard E. Hank, Tokuzo Kiyohara, Grant E. Haab, John G. Holm, and Daniel M. Lavery, "The Superblock: An Effective Technique for VLIW and Superscalar Compilation", Journal of Supercomputing, Kluwer Academic Publishers, 1993, pp. 229-248.

  Qingmin Shi (*) A.Y. Zomaya, C. Ward, and B. Macey, "Genetic scheduling for parallel processor systems: comparative studies and performance issues", IEEE Trans. on Parallel and Distributed Systems, 10(8), Aug. 1999, pp. 795-812.

Yu-Kwong Kwok and Ishfaq Ahmad, "Static scheduling algorithms for allocating directed task graphs to multiprocessors", ACM Computing Surverys, 31(4), Dec. 1999, pp. 406-471.

Oct 10 Mohamed Zahran (*) Pedro Marcuello and Antonio Gonzalez, " Clustered Speculative
Multithreaded Processors", Proceedings  of the 1999 International
Conference on SuperComputing, June 20-25,1999, Greece,pp.365-372.

David W.Wall, " Speculative Execution and Instruction Level Parallelism", WRL Technical Note TN-42, March 1994.

Gurindar S. Sohi, Scott E. Breach and T. N. Vijaykumar, "The Multiscalar Processors", Proceedings of the 22nd annual international symposium on Computer architecture, 1995, pp 414 - 425.

Oct 17 Neil Bambha (*) C. Lee, J. Lee, T. Hwang, and S. Tsai, "Compiler Optimization on Instruction Scheduling for Low Power", Proceedings of the 13th International Symposium on Systems Synthesis,  September 2000, pp. 55 - 60.   (paper also available outside my door)

T. Achteren, R. Lauwereins, and F. Cathoor, "Systematic Data Reuse Exploration Methodology for Irregular Access Patterns", Proceedings of the 13th International Symposium on Systems Synthesis, September 2000, pp. 115 - 121.  (paper outside my door)

  Ming-Yung Ko (*) K. Chung, T. Kim, and C. I. Liu, "Behavioral-level partitioning for low 
power design in control-dominated application", Proceedings of the tenth
Great Lakes Symposium on VLSI, 2000, Pages 156-161.

S. Devadas, and S. Malik, "A survey of optimization techniques
targeting low power VLSI circuits", Proceedings of the 32nd ACM/IEEE
conference on Design automation conference, 1995, Pages 242-247.

Oct 24 Tom Carley (*) Jeffrey Dean, Craig Chambers and David Grove, "Selective specialization for object-oriented languages", Proceedings of the conference on Programming language design and implementation June 18 - 21, 1995, La Jolla, CA USA, Pages 93-102.

Ole Agesen and Urs Holzle, "Type Feedback vs. Concrete Type inference: A comparison of optimization techniques for object oriented languages"  Proceedings of the tenth annual conference on Object-oriented programming systems, languages, and applications. October 15 - 19, 1995, Austin, TX USA.

J. Palsberg, "Compiler technology for object-oriented languages", ACM Computing Surveys Volume 28 , Issue 4es (1996) Article No. 161.

  Aneesh Aggarwal (*) Sanjay Ghemawat, Keith Randall and Daniel Scales, "Field Analysis: Getting
Useful and Low-cost Interprocedural Information", Proceedings of the ACM
SIGPLAN '00 conference on Programming language design and implementation. June 18 - 21, 2000, Vancouver, BC Canada. pages 334-344.

Ole Agesen and Urs Holzle, "Type Feedback vs. Concrete Type inference: A
comparison of optimization techniques for object oriented languages"  Proceedings of the tenth annual conference on Object-oriented programming systems, languages, and applications. October 15 - 19, 1995, Austin, TX USA. 

Oct 31 Jesus Molina (*) F. Balarin and M. Chiodo "Software Synthesis for Complex Reactive Embedded Systems", Proceedings of the 1999 IEEE International Conference on Computer Design.

M. Chiodo, P. Giusto, H. Hsieh, A. Jurecska, L. Lavagno, and A. Sangiovanni-Vincentelli. "A formal methodology for Hardware/software co-design of embedded systems" IEEE Micro, 14(4):26--36, August 1994. 

  Kapil Dakhane (*) Bharat P. Dave, Ganesh Lakshminarayana, and Niraj K. Jha, "COSYN:  Hardware-Software Co-Synthesis of Embedded Systems", Proceedings of the 34th annual conference on Design automation conference, 1997, Pages 703 - 708.

Edward A. Lee, "Embedded Software - An Agenda for Research," ERL Technical Report UCB/ERL No. M99/63, Dept. EECS, University of California, Berkeley, CA 94720, December 15, 1999.

Nov 21 Vida Kianzad (*) Matthew C. Merten, Andrew R. Trick, Christopher N. George, John C. Gyllenhaal and Wen-mei W. Hwu, "A hardware-driven profiling scheme for identifying program hot spots to support runtime optimization", Proceedings of the 26th annual international symposium on Computer architecture, 1999.

Jennifer M. Anderson, Lance M. Berc, Jeffrey Dean, Sanjay Ghemawat, Monika R. Henzinger, Shun-Tak A. Leung, Richard L. Sites, Mark T. Vandevoorde, Carl A. Waldspurger and William E. Weihl,'Continuous profiling where have all the cycles gone? ', ACM Trans. Computer Systems 15, 4 (Nov. 1997), Pages 357 - 39.

Thomas Ball and James R. Larus, 'Optimally profiling and tracing programs',  ACM Trans. on Programming Languages and Systems, 16, 4 (Jul. 1994), Pages 1319 - 1360.

  Abdel-Hameed Badawy (*) "Efficacy and Performance Impact of Value Prediction", Bohuslav Rychlik, John Faistl, Bryon Krug, and John P.Shen. Parallel Architectures and Compilation Techniques, Paris,October 1998.

(Seminal paper) Exceeding the dataflow limit via value prediction; Mikko H. Lipasti
and John Paul Shen; Proceedings of the 29th annual IEEE/ACM international symposium on Microarchitecture, 1996, Pages 226.

(Long Journal paper) Using value prediction to increase the power of speculative execution hardware; Freddy Gabbay and Avi Mendelson; ACM Trans. Comput. Syst. 16, 3 (Aug. 1998), Pages 234 - 270.

Nov 28 Zhang Yi (*)Exploiting superword level parallelism with multimedia instruction sets, Samuel Larsen and Saman Amarasinghe ,   Proceedings of the ACM SIGPLAN '00 conference on Programming language design and implementation June 18 - 21, 2000, Vancouver, BC Canada.

Performance of image and video processing with general-purpose processors and media ISA extensions, P. Ranganathan, S Adve and N.P. Jouppi, Proceedings of the 26th annual international symposium on Computer Architecture, May 1999, Pages 124 - 135.

  Kun Luo  (*) "A Vectorizing Compiler for Multimedia Extensions", by N. Sreraman , R. Govindarajan. To appear in the International Journal of Parallel Programming, special issue on Instruction and Loop-Level Parallelism, 2000.

Second paper is Larsen & Amarasinghe2000 (See Zhang Yi's papers above).