Bibliography
Total Page:16
File Type:pdf, Size:1020Kb
Bibliography App, 2017. (2017). appc: App container specification and tooling. https://github.com/appc/spec. Accetta, M. J., Baron, R. V., Bolosky, W. J., Golub, D. B., Rashid, R. F., Tevanian, A., et al. (1986). Mach: A new kernel foundation for UNIX development. In Proceedings of the USENIX Summer Conference. Ahn, D. H., Garlick, J., Grondona, M., Lipari, D., Springmeyer, B., & Schulz, M. (2014). Flux: A next-generation resource management framework for large HPC centers. In 43rd International Conference on Parallel Processing Workshops (ICCPW), 2014 (pp. 9–17). IEEE. Ajima, Y., Inoue, T., Hiramoto, S., Takagi, Y., & Shimizu, T. (2012). The Tofu interconnect. IEEE Micro, 32(1), 21–31. Akkan, H., Ionkov, L., & Lang, M. (2013). Transparently consistent asynchronous shared mem- ory. In Proceedings of the 3rd International Workshop on Runtime and Operating Systems for Supercomputers, ROSS ’13. New York, NY, USA: ACM. Alam, S., Barrett, R., Bast, M., Fahey, M. R., Kuehn, J., McCurdy, C., et al. (2008). Early evaluation of IBM BlueGene/P. In Proceedings of the 2008 ACM/IEEE Conference on Supercomputing, SC ’08 (pp. 23:1–23:12). Piscataway, NJ, USA: IEEE Press. Ali, N., Carns, P., Iskra, K., Kimpe, D., Lang, S., Latham, R., et al. (2009). Scalable I/O forward- ing framework for high-performance computing systems. In IEEE International Conference on Cluster Computing and Workshops, 2009. CLUSTER ’09 (pp. 1–10). Alverson, B., Froese, E., Kaplan, L., & Roweth, D. (2012). Cray Inc., white paper WP-Aries01- 1112. Technical report, Cray Inc. Alverson, G. A., Kahan, S., Korry, R., McCann, C., & Smith, B. J. (1995). Scheduling on the Tera MTA. In Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing, IPPS ’95 (pp. 19–44). London, UK: Springer. Alverson, R., Callahan, D., Cummings, D., Koblenz, B., Porterfield, A., & Smith, B. (1990). The Tera computer system. In Proceedings of the 4th International Conference on Supercomputing, ICS ’90 (pp. 1–6). New York, NY, USA: ACM. Andersen, E. (2010). µClibc. https://uclibc.org. Anderson, T. E., Culler, D. E., & Patterson, D. A. (1995). The berkeley networks of workstations (NOW) project. In Proceedings of the 40th IEEE Computer Society International Conference, COMPCON ’95 (p. 322). Washington, DC, USA: IEEE Computer Society. Arcangeli, A. (2010). Transparent hugepage support. In KVM forum. https://www.linux-kvm.org/ images/9/9e/2010-forum-thp.pdf. Hori, A. (2009). PMX Specification –DRAFT–. Allinea Software. Bailey, D., Barszcz, E., Barton, J., Browning, D., Carter, R., Dagum, L., et al. (1991). The nas parallel benchmarks. International Journal of High Performance Computing Applications, 5(3), 63–73. © Springer Nature Singapore Pte Ltd. 2019 375 B. Gerofi et al. (eds.), Operating Systems for Supercomputers and High Performance Computing, High-Performance Computing Series, https://doi.org/10.1007/978-981-13-6624-6 376 Bibliography Balan, R., & Gollhardt, K. (1992). A scalable implementation of virtual memory HAT layer for shared memory multiprocessor machines. In Proceedings of USENIX Summer 1992 Technical Conference. Barach, D. R., Wells, R., Uban, T., & Gibson, J. (1990). Highly parallel virtual memory management on the TC2000. In Proceedings of the 1990 International Conference on Parallel Processin, ICPP ’90 (pp. 549–550). Barak, A., Drezner, Z., Levy, E., Lieber, M., & Shiloh, A. (2015). Resilient gossip algorithms for collecting online management information in exascale clusters. Concurrency and Computation: Practice and Experience, 27(17), 4797–4818. Baskett, F., Howard, J. H., & Montague, J. T. (1977). Task communication in DEMOS. In Proceed- ings of the Sixth ACM Symposium on Operating Systems Principles, SOSP ’77 (pp. 23–31). New York, NY, USA: ACM. Bautista-Gomez, L., Gainaru, A., Perarnau, S., Tiwari, D., Gupta, S., Cappello, F., et al. (2016). Reducing waste in large scale systems through introspective analysis. In IEEE International Parallel and Distributed Processing Symposium (IPDPS). BDEC Committee, (2017). The BDEC “Pathways to convergence” report. http://www.exascale. org/bdec/. Beckman, P. et al. (2015). Argo: An exascale operating system. http://www.argo-osr.org/. Retrieved November 20, 2015. Beckman, P., Iskra, K., Yoshii, K., & Coghlan, S. (2006a). The influence of operating systems on the performance of collective operations at extreme scale. In IEEE International Conference on Cluster Computing. Cluster. Beckman, P., Iskra, K., Yoshii, K., & Coghlan, S. (2006b). Operating system issues for petascale systems. ACM SIGOPS Operating Systems Review, 40(2), 29–33. Beckman, P., Iskra, K., Yoshii, K., Coghlan, S., & Nataraj, A. (2008). Benchmarking the effects of operating system interference on extreme-scale parallel machines. Cluster Computing, 11(1), 3–16. Beeler, M. (1990). Inside the TC2000 computer. Beserra, D., Moreno, E. D., Endo, P. T., Barreto, J., Sadok, D., & Fernandes, S. (2015). Performance analysis of LXC for HPC environments. In International Conference on Complex, Intelligent, and Software Intensive Systems (CISIS). Black, D. L., Tevanian, A., Jr., Golub, D. B., & Young,M. W.(1991). Locking and reference counting in the Mach kernel. In In Proceedings of the 1991 ICPP, Volume II, Software (pp. 167–173). CRC Press. Blumofe, R. D., Joerg, C. F., Kuszmaul, B. C., Leiserson, C. E., Randall, K. H., & Zhou, Y. (1995). Cilk: An efficient multithreaded runtime system. In Proceedings of the Fifth ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP ’95 (pp. 207–216). New York, NY, USA: ACM. Boden, N. J., Cohen, D., Felderman, R. E., Kulawik, A. E., Seitz, C. L., Seizovic, J. N., et al. (1995). Myrinet: A gigabit-per-second local area network. IEEE Micro, 15(1), 29–36. Boehme, D., Gamblin, T., Beckingsale, D., Bremer, P.-T., Gimenez, A., LeGendre, M., et al. (2016). Caliper: Performance introspection for HPC software stacks. In Proceedings of the 29th ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, (SC). Boku, T., Itakura, K., Nakamura, H., & Nakazawa, K. (1997). CP-PACS: A massively parallel processor for large scale scientific calculations. In Proceedings of ACM 11th International Con- ference on Supercomputing (pp 108–115). Vienna, Austria. Bolen, J., Davis, A., Dazey, B., Gupta, S., Henry, G., Robboy, D., et al. (1995). Massively parallel distributed computing. In Proceedings of the Intel Supercomputer Users’ Group. 1995 Annual North America Users’ Conference. Bratterud, A., Walla, A., Haugerud, H., Engelstad, P.E., & Begnum, K. (2015). IncludeOS: A resource efficient unikernel for cloud services. In Proceedings of the 2015 IEEE 7th International Conference on Cloud Computing Technology and Science (CloudCom). Bibliography 377 Breitbart, J., Pickartz, S., Weidendorfer, J., Lankes, S., & Monti, A. (2017). Dynamic co-scheduling driven by main memory bandwidth utilization. In 2017 IEEE International Conference on Cluster Computing (CLUSTER 2017). Accepted for Publication. Brightwell, R., Fisk, L. A., Greenberg, D. S., Hudson, T., Levenhagen, M., Maccabe, A. B., et al. (2000). Massively parallel computing using commodity components. Parallel Computing, 26(2– 3), 243–266. Brightwell, R., Hudson, T., & Pedretti, K. (2008). SMARTMAP: Operating system support for effi- cient data sharing among processes on a multi-core processor. In Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis (SC’08). Brightwell, R., Hudson, T., Riesen, R., & Maccabe, A. B. (1999). The Portals 3.0 message passing interface. Technical report SAND99-2959, Sandia National Laboratories. Brightwell, R., Maccabe, A. B., & Riesen, R. (2002). Design and implementation of MPI on Portals 3.0. In D. Kranzlmüller, P. Kacsuk, J. Dongarra & J. Volkert (Eds.), Recent Advances in Parallel Virtual Machine and Message Passing Interface: 9th European PVM/MPI Users’ Group Meeting, Linz, Austria, September 29–October 2, 2002. Proceedings. Lecture notes in computer science (Vol. 2474, pp. 331–340). Springer. Brightwell, R., Maccabe, A. B., & Riesen, R. (2003a). Design, implementation, and performance of MPI on Portals 3.0. The International Journal of High Performance Computing Applications, 17(1), 7–20. Brightwell, R., Oldfield, R., Maccabe, A. B., & Bernholdt, D. E. (2013). Hobbes: Composition and virtualization as the foundations of an extreme-scale OS/R. In Proceedings of the 3rd International Workshop on Runtime and Operating Systems for Supercomputers, ROSS ’13 (pp. 2:1–2:8). Brightwell, R., Riesen, R., Underwood, K., Bridges, P. G., Maccabe, A. B., & Hudson, T. (2003b). A performance comparison of Linux and a lightweight kernel. In IEEE International Conference on Cluster Computing (pp. 251–258). Cluster. Brooks, E. (1990). Attack of the killer micros. In Talk at. Supercomputing’91. Brooks, E. D., Gorda, B. C., Warren, K. H., & Welcome, T. S. (1991). BBN TC2000 architecture and programming models. In Compcon Spring ’91. Digest of papers (pp. 46–50). Brown, N. (2018). Overlay filesystem documentation. https://www.kernel.org/doc/Documentation/ filesystems/overlayfs.txt. Brugger, G., & Streletz. (2001). Network livermore time sharing system (NLTSS). http://www. computer-history.info/Page4.dir/pages/LTSS.NLTSS.dir/pages/NLTSS.pdf. Bull, J. M., Reid, F., & McDonnell, N. (2012). A microbenchmark suite for OpenMP tasks. In Pro- ceedings of the 8th International Conference on OpenMP in a Heterogeneous World, IWOMP’12 (pp. 271–274). Berlin, Heidelberg: Springer. Buntinas,