Reading List | CMPE293, Winter 19, Section 01

Date	Paper
1/7	Intro Lecture
1/9	Foundation Lecture
1/11	Balfour et al., “An Energy-Efficient Processor Architecture for Embedded Systems,” CAL 2008 Hameed et al., “Understanding Sources of Inefficiency in General-Purpose Chips,” ISCA 2010
1/14	Shaw et al., “Anton 2: Raising the Bar for Performance and Programmability in a Special-Purpose Molecular Dynamics Supercomputer,” SC 2014
1/16	Ozdal et al., “Energy Efficient Architecture for Graph Analytics Accelerators,” ISCA 2016
1/18	Eyerman et al., “Many-Core Graph Workload Analysis,” SC 2018
1/21	Holiday
1/23	Ahn et al., “A Scalable Processing-in-Memory Accelerator for Parallel Graph Processing,” ISCA 2015
1/25	Dysart et al., “Highly Scalable Near Memory Processing with Migrating Threads on the Emu System Architecture,” IA^3 2016 Rolinger et al., “Impact of Traditional Sparse Optimizations on a Migratory Thread Architecture,” IA^3 2018
1/28	Shao et al., "Aladdin: a Pre-RTL, Power-Performance Accelerator Simulator Enabling Large Design Space Exploration of Customized Architectures," ISCA 2014 Guest: Sophia Shao
1/30	Magaki et al., "ASIC Clouds: Specializing the Datacenter," ISCA 2016 Guest: Heiner Litz
2/1	Chen et al., “DaDianNao: A Machine-Learning Supercomputer,” MICRO 2014
2/4	Han et al., “EIE: Efficient Inference Engine on Compressed Deep Neural Network,” ISCA 2016
2/6	Chen et al., “Eyeriss: A Spatial Architecture for Energy-Efficient Dataflow for Convolutional Neural Networks,” ISCA 2016
2/8	Jouppi et al., “In-Datacenter Performance Analysis of a Tensor Processing Unit,” ISCA 2017 Guest: David Patterson
2/11	Goulding-Hotta et al., “The GreenDroid Mobile Application Processor: An Architecture for Silicon's Dark Future,” IEEE Micro 2011
2/13	Huang et al., “Hardware Acceleration of the Pair-HMM Algorithm for DNA Variant Calling,” FPGA 2017
2/15	Turakhia et al., “Darwin: A Genomics Co-processor Provides up to 15,000X Acceleration on Long Read Assembly,” ASPLOS 2018
2/18	Holiday
2/20	Putnam et al., “A Reconfigurable Fabric for Accelerating Large-Scale Datacenter Services,” ISCA 2014
2/22	Fowers et al., “A Configurable Cloud-Scale DNN Processor for Real-Time AI,” ISCA 2018
2/25	Prabhakar et al., “Plasticine: A Reconfigurable Architecture For Parallel Paterns,” ISCA 2017
2/27	Nowatzki et al., “Stream-Dataflow Acceleration,” ISCA 2017
3/1	Fuchs et al., "The Accelerator Wall: Limits of Chip Specialization," HPCA 2019
3/4	Khailany et al., “Imagine: media processing with streams,” IEEE Micro 2001
3/6	Ahn et al., “Multicore DIMM: an Energy Efficient Memory Module with Independently Controlled DRAMs,” CAL 2009 McSherry et al., “Scalability! But at what COST?,” HotOS 2015
3/8	Wu et al., “Q100: The Architecture and Design of a Database Processing Unit,” ASPLOS 2014
3/11	Project Presentations
3/13	Project Presentations
3/15	Project Presentations

UCSC Library Instructions for off-campus access

Additional Reading

Gordon Moore, "Cramming More Components onto Integrated Circuits," Electronics 1965

Esmaeilzadeh et al., "Dark Silicon and the End of Multicore Scaling," ISCA 2011

Ho et al., "The Future of Wires," IEEE 2001

Asanovic et al., "The Landscape of Parallel Computing Research: A View from Berkeley", TR 2006

Shao et al., "Research Infrastructures for Hardware Accelerators," M&C 2015

Shao et al., "ISA-Independent Workload Characterization and Its Implications for Specialized Architectures," ISPASS 2013

Dally et al., "Efficient Embedded Computing," IEEE Computer 2008

Shaw et al., "Anton, A Special-Purpose Machine for Molecular Dynamics Simulation," ISCA 2007

Ham et al., "Graphicionado: A High-Performance and Energy-Efficient Accelerator for Graph Analytics," MICRO 2016

Beamer et al., "Locality Exists in Graph Processing: Workload Characterization on an Ivy Bridge Server," IISWC 2015

Taylor, "Bitcoin and The Age of Bespoke Silicon," CASES 2013

Taylor, "The Evolution of Bitcoin Hardware," Computer 2017

Khazraee et al., "Moonwalk: NRE Optimization in ASIC Clouds or, Accelerators Will Use Old Silicon," ASPLOS 2017

Govindaraju et al., "DySER: Unifying Functionality and Parallelism Specialization for Energy-Efficient Computing," IEEE Micro 2012

Caulfield et al., "A Cloud-Scale Acceleration Architecture," MICRO 2016