The heterogeneous adaptable reconfigurable networked systems (Harness) research project focuses on the design and development of a pluggable lightweight heterogeneous Distributed Virtual Machine (DVM) environment, where clusters of PCs, workstations, and “big iron” supercomputers can be aggregated to form one giant DVM (in the spirit of its widely-used predecessor, Parallel Virtual Machine (PVM)).
As part of the Harness project, a variety of experiments and system prototypes are developed to explore lightweight pluggable frameworks, adaptive reconfigurable runtime environments, assembly of scientific applications from software modules, parallel plug-in paradigms, highly available DVMs, fault-tolerant message passing (FT-MPI), fine-grain security mechanisms, and heterogeneous reconfigurable communication frameworks. Three different Harness system prototypes are developed, two C variants and one Java-based alternative, each concentrating on different research issues.
Funding Sources
- Office of Advanced Scientific Computing Research, Office of Science, U.S. Department of Energy
Participating Institutions
Peer-reviewed Conference Publications
- Alessandro Valentini, Christian Di Biagio, Fabrizio Batino, Guido Pennella, Fabrizio Palma, and Christian Engelmann. High Performance Computing with Harness over InfiniBand. In Proceedings of the 17th Euromicro International Conference on Parallel, Distributed, and network-based Processing (PDP) 2009, pages 151-154, Weimar, Germany, February 18-20, 2009. IEEE Computer Society, Los Alamitos, CA, USA. ISBN 978-0-7695-3544-9. ISSN 1066-6192. DOI 10.1109/PDP.2009.64. Acceptance rate 42.0% (58/138).
- Emanuele Di Saverio, Marco Cesati, Christian Di Biagio, Guido Pennella, and Christian Engelmann. Distributed Real-Time Computing with Harness. In Lecture Notes in Computer Science: Proceedings of the 14th European PVM/MPI Users` Group Meeting (EuroPVM/MPI) 2007, pages 281-288, Paris, France, September 30 – October 3, 2007. Springer Verlag, Berlin, Germany. ISBN 978-3-540-75415-2. ISSN 0302-9743. DOI 10.1007/978-3-540-75416-9_39.
- Ronald Baumann, Christian Engelmann, and George A. (Al) Geist. A Parallel Plug-in Programming Paradigm. In Lecture Notes in Computer Science: Proceedings of the 7th International Conference on High Performance Computing and Communications (HPCC) 2006, pages 823-832, Munich, Germany, September 13-15, 2006. Springer Verlag, Berlin, Germany. ISBN 978-3-540-39368-9. ISSN 0302-9743. DOI 10.1007/11847366_85.
Peer-reviewed Workshop Publications
- Christian Engelmann and George A. (Al) Geist. RMIX: A Dynamic, Heterogeneous, Reconfigurable Communication Framework. In Lecture Notes in Computer Science: Proceedings of the 6th International Conference on Computational Science (ICCS) 2006, Part II: 3rd Special Session on Collaborative and Cooperative Environments (CCE) 2006, pages 573-580, Reading, UK, May 28-31, 2006. Springer Verlag, Berlin, Germany. ISBN 3-540-34381-4. ISSN 0302-9743. DOI 10.1007/11758525_77.
- Christian Engelmann and George A. (Al) Geist. A Lightweight Kernel for the Harness Metacomputing Framework. In Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS) 2005: 14th Heterogeneous Computing Workshop (HCW) 2005, Denver, CO, USA, April 4, 2005. IEEE Computer Society, Los Alamitos, CA, USA. ISBN 0-7695-2312-9. ISSN 1530-2075. DOI 10.1109/IPDPS.2005.34.
- Christian Engelmann, Stephen L. Scott, and George A. (Al) Geist. High Availability through Distributed Control. In Proceedings of the 2nd High Availability and Performance Workshop (HAPCW) 2004, in conjunction with the 5th Los Alamos Computer Science Institute (LACSI) Symposium 2004, Santa Fe, NM, USA, October 12, 2004.
- Christian Engelmann, Stephen L. Scott, and George A. (Al) Geist. Distributed Peer-to-Peer Control in Harness. In Lecture Notes in Computer Science: Proceedings of the 2nd International Conference on Computational Science (ICCS) 2002, Part II: Workshop on Global and Collaborative Computing, pages 720-727, Amsterdam, The Netherlands, April 21-24, 2002. Springer Verlag, Berlin, Germany. ISBN 3-540-43593-X. ISSN 0302-9743. DOI content/l537ujfwt8yta2dp.
Talks and Lectures
- Christian Engelmann. Distributed Peer-to-Peer Control for Harness. Seminar at the Department of Computer Science, North Carolina State University, Raleigh, NC, USA, February 11, 2004.
Theses
- Christian Engelmann. Distributed Peer-to-Peer Control for Harness. Master’s thesis, Department of Computer Science, University of Reading, UK, July 7, 2001. Thesis research performed at Oak Ridge National Laboratory. Double diploma in conjunction with the Department of Engineering I, Technical College for Engineering and Economics (FHTW) Berlin, Germany. Advisors: Prof. Vassil N. Alexandrov (University of Reading); George A. (Al) Geist (Oak Ridge National Laboratory).
- Christian Engelmann. Distributed Peer-to-Peer Control for Harness. Master’s thesis, Department of Engineering I, Technical College for Engineering and Economics (FHTW) Berlin, Germany, February 23, 2001. Thesis research performed at Oak Ridge National Laboratory. Double diploma in conjunction with the Department of Computer Science, University of Reading, UK. Advisors: Prof. Uwe Metzler (Technical College for Engineering and Economics (FHTW) Berlin); George A. (Al) Geist (Oak Ridge National Laboratory).
Symbols: Abstract, Publication, Presentation, BibTeX Citation