GARDMON: A Java-based Monitoring Tool for Gardens
Non-dedicated Cluster Computing System
Abstract
The QUT's Gardens project aims to create a vir-tual parallel machine out of
a network of non-dedicated computers (workstations/PCs). These systems are
interconnected through low latency and high bandwidth communication links
such as Myrinet. Gardens is an integrated programming language and system
designed to utilize the idle workstation's CPU cycles to support adaptive
par-allel computing.
A Gardens computation consists of a network of communicating tasks,
dynamically mapped onto a network of processors. Tasks are created
dynami-cally and each task consists of a stack and a col-lection of heap
segments in which dynamic data structures are stored.
We designed and developed a Gardens cluster monitoring system called
Gardmon. It is a port-able, flexible, interactive, scalable,
location-transparent, and comprehensive environment for monitoring of
Gardens runtime activities. It follows client-server methodology and
provides transpar-ent access to all nodes to be monitored from a monitoring
machine. The features of Gardmon in monitoring Gardens adaptive parallel
computing system seem satisfactory.
Click here to download the GARDMON Paper
(which is in Postscript format).
Appeared in:
Proceedings of the Sixth International Conference on Parallel and
Distributed Processing
Techniques and Applications (PDPTA'98), Las Vegas, Nevada, USA,
CSREA Press, 1999.