
Stream Processing
•[abadi03] ``Aurora: A New Model and Architecture for Data Stream Management’’, Daniel J. Abadi, Don Carney, Ugur Cetintemel, Mitch Cherniack, Christian Convey, Sangdon Lee, Michael Stonebraker, Nesime Tatbul, and Stan Zdonik, The International Journal on Very Large Databases, 12(2):120-139, 2003.
•[hwang08] ``Fast and Highly-Available Stream Processing over Wide Area Networks’’, Jeong-Hyon Hwang, Ugur Cetintemel, and Stan Zdonik, In Proceedings of the 24th International Conference on Data Engineering (ICDE), 804-813, 2008.
•[mcconnell10] ``Detouring and Replication for Fast and Reliable Internet-Scale Stream Processing’’, Christopher McConnell, Fan Ping, and Jeong-Hyon Hwang, In Proceedings of the 3rd International Workshop on Data Intensive Distributed Computing (DIDC), held in conjunction with the 19th International Symposium on High Performance Distributed Computing (HPDC), 2010.
•[aggarwal03]* ``A Framework for Clustering Evolving Data Streams’’, Charu C. Aggarwal, Jiawei Han, Jianyong Wang, and Philip S. Yu, In Proceedings of the 29th International Conference on Very Large Data Bases (VLDB), 2003.
Network Positioning
•[dabek04] ``Vivaldi: A Decentralized Network Coordinate System’’, Frank Dabek, Russ Cox, M. Frans Kaashoek, and Robert Morris, In Proceedings of the ACM SIGCOMM 2004 Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication, 15-26, 2004.
•[ping10] ``A Retrospective Approach for Accurate Network Latency Prediction’’, Fan Ping, Christopher McConnell, and Jeong-Hyon Hwang, In Proceedings of the 2nd Workshop on Grid and P2P Systems and Applications (GridPeer), held in conjunction with 19th International Conference on Computer Communications and Networks (ICCCN), 2010.
Cloud Computing
•[armbrust10] ``A View of Cloud Computing’’, Michael Armbrust, Armando Fox, Rean Griffith, Anthony D. Joseph, Randy Katz, Andy Konwinski, Gunho Lee, David Patterson, Ariel Rabkin, Ion Stoica, and Matei Zaharia, Communications of the ACM, 53(4): 50-58, 2010.
•[barroso09] ``The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines’’, Luiz André Barroso and Urs Hölzle, Synthesis Lectures on Computer Architecture, 4 (1): 1-108, 2009,
Distributed Storage
•[chang06]* ``Bigtable: A Distributed Storage System for Structured Data’’, Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Michael Burrows, Tushar Chandra, Andrew Fikes, and Robert Gruber, In Proceedings of the 7th Symposium on Operating Systems Design and Implementation (OSDI), 205-218, 2006 (Awarded Best Paper).
•[ghemawat03] ``The Google File System’’, In Proceedings of the 19th ACM Symposium on Operating Systems Principle (SOSP), 29-43, 2003.
•[hdfs] ``The HDFS Architecture’’.
•[decandia07]* ``Dynamo: Amazon’s Highly Available Key-value Store”, Giuseppe DeCandia, Deniz Hastorun, Madan Jampani, Gunavardhan Kakulapati, Avinash Lakshman, Alex Pilchin, Swaminathan Sivasubramanian, Peter Vosshall, and Werner Vogels, In Proceedings of the 21st ACM Symposium on Operating Systems Principles (SOSP), 205-220, 2007 (Awarded Best Paper).
• [lakshman09] ``Cassandra - A Decentralized Structured Storage System’’, Avinash Lakshman and Prashant Mali, In Proceedings of the 3rd ACM SIGOPS Workshop on Large-Scale Distributed Systems and Middleware (LADIS), 2009.
•[cassandra] ``Cassandra’’.
Indexing
•[beckmann90]* ``The R*-tree: An Efficient and Robust Access Method for Points and Rectangles’’, Norbert Beckmann, Hans-Peter Kriegel, Ralf Schneider, and Bernhard Seeger, In Proceedings of the 1990 ACM SIGMOD International Conference on Management of Data, 321-331, 1990.
Parallel Database Systems
•[dewitt90]* ``The Gamma Database Machine Project’’, David J. DeWitt, Shahram Ghandeharizadeh, Donovan A. Schneider, Allan Bricker, Hui-I Hsiao, and Rick Rasmussen, IEEE Transactions on Knowledge and Data Engineering, 2(1): 44-62, 1990.
•[dewitt92] ``Parallel Database Systems: the Future of High Performance Database Systems’’, David DeWitt and Jim Gray, Communications of the ACM, 35(6): 85 - 98, 1992.
•[boral90]* ``Prototyping Bubba, A Highly Parallel Database System’’, Haran Boral, William Alexander, Larry Clay, George P. Copeland, Scott Danforth, Michael J. Franklin, Brian E. Hart, Marc G. Smith, Patrick Valduriez, IEEE Transactions on Knowledge and Data Engineering 2(1): 4-24, 1990.
MapReduce
•[dean04]* ``MapReduce: Simplified Data Processing on Large Clusters’’, Jeffrey Dean and Sanjay Ghemawat, In Proceedings of the 6th Symposium on Operating System Design and Implementation (OSDI), 137-150, 2004.
•[hadoop] ``Hadoop’’
•[gates09]* ``Building a High-Level Dataflow System on top of Map-Reduce: The Pig Experience’’, In Proceedings of the 35th International Conference on Very Large Data Bases (VLDB), 2009.
•[olston08] ``Pig Latin: A Not-So-Foreign Language for Data Processing’’, In Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, 1099-1100, 2008.
•[chaiken08]* ``SCOPE: easy and efficient parallel processing of massive data sets’’, Ronnie Chaiken, Bob Jenkins, Per-Ake Larson, Bill Ramsey, Darren Shakib, Simon Weaver, and Jingren Zhou, The Proceedings of the VLDB Endowment (PVLDB), 1(2):1265-1276, 2008.
•[zhou10] Jingren Zhou, Per-Ake Larson and Ronnie Chaiken, ``Incorporating partitioning and parallel plans into the SCOPE optimizer’’, In Proceedings of the 26th International Conference on Data Engineering (ICDE), 1060-1071, 2010.
•[yu08] “DryadLINQ: A System for General-Purpose Distributed Data-Parallel Computing Using a High-Level Language”, Yuan Yu, Michael Isard, Dennis Fetterly, Mihai Budiu, Úlfar Erlingsson, Pradeep Kumar Gunda, and Jon Currey, In Proceedings of the 8th USENIX Symposium on Operating Systems Design and Implementation (OSDI), 1-14, 2008 (Awarded Best Paper).
Distributed Databases
•[mackert86] ``R* Optimizer Validation and Performance Evaluation for Distributed Queries'', Lothar Mackert and Guy Lohman, In Proceedings of the 12th International Conference on Very Large Data Bases (VLDB), 149-159, 1986.
•[stonebraker96] ``Mariposa: a wide-area distributed database system'', Michael Stonebraker, Paul M. Aoki, Witold Litwin, Avi Pfeffer, Adam Sah, Jeff Sidell, Carl Staelin, and Andrew Yu, The VLDB Journal, 5(1): 048--063, 1996.