IMPLEMENTATION AND EVALUATION OF RUNTIME DATA DECLUSTERING METHOD OVER SAN-CONNECTED PC CLUSTER
DOI:
https://doi.org/10.47839/ijc.1.2.123Keywords:
Cluster Computing, Data Mining, Storage Area Network, Runtime Data DeclusteringAbstract
In this paper, a PC cluster connected with Storage Area Network (SAN) is built and evaluated. In the case of SANconnected cluster, each node can access all shared disks directly without LAN; thus, SANconnected clusters achieve better performance than LANconnected clusters for disk access operations. However, if a lot of nodes access the sameshared disk simultaneously, application performance degrades due to I/Obottleneck. A runtime data declustering method, in which data is declustered to several other disks dynamically during the execution of application, is proposed to resolve this problem. Parallel data mining is implemented and evaluated on the SANconnected PC cluster. This application requires iterative scans of a shared disk, which degrade execution performance severely due to I/Obottleneck. The runtime data declustering method is applied to this case. According to the results of experiments, the proposed method prevents performance degradation caused by shared disk bottleneck in SANconnected clusters.References
T. Tamura, M. Oguchi, and M. Kitsuregawa: “Parallel Database Processing on a 100 Node PC Cluster: Cases for Decision Support Query Processing and Data Mining”, Proceedings of SC97: High Performance Networking and Computing (SuperComputing ’97), November 1997.
B. Phillips: “Have Storage Area Networks Come of Age?”, IEEE Computer, Vol. 31, No. 7, pp. 10-12, July 1998.
M. J. Zaki: “Parallel and Distributed Association Mining: A Sur vey”, IEEE Concurrency, Vol. 7, No. 4, pp. 14-25, 1999.
R. Agrawal and R. Srikant: “Fast Algorithms for Mining Association Rules”, Proceedings of the Twentieth International Conference on Very Large Data Bases, pp. 487-499, September 1994.
T. Shintani and M. Kitsuregawa: “Hash Based Parallel Algorithms for Mining Association Rules”, Proceedings of the Fourth IEEE International Conference on Parallel and Distributed Information Systems, pp. 19-30, December 1996.
M. Blumrich, K. Li, R. Alpert, C. Dubnicki, E. Felten, and J. Sandberg: “Virtual Memory Mapped Network Interface for the SHRIMP Multicomputer”, Proceedings of the Twenty First International Symposium on Computer Architecture, pp. 142-153, April 1994.
D. E. Culler, A. A. Dusseau, R. A. Dusseau, B. Chun, S. Lumetta, A. Mainwaring, R. Martin, C. Yoshikawa, and F. Wong: “Parallel Computing on the Berkeley NOW”, Proceedings of the 1997 Joint Symposium on Parallel Processing (JSPP ’97), pp. 237-247, May 1997.
T. Sterling, D. Saverese, D. J. Becker, B. Fryxell, and K. Olson: “Communication Overhead for Space Science Applications on the Beowulf Parallel Workstation”, Proceedings of the Fourth IEEE International Symposium on High Performance Dis tributed Computing, pp. 23-30, August 1995.
M. Oguchi, T. Shintani, T. Tamura, and Masaru Kitsuregawa: “Characteristics of a Parallel Data Mining Application Implemented on an ATM Connected PC Cluster’’, Proceedings of the HPCN Europe 1997, pp. 303-317, April 1997.
Y. Ishikawa, A. Hori, H. Tezuka, S. Sumimoto, T. Takahashi, F. O’Carroll, and H. Harada: “RWC PC Cluster II and SCore Cluster System Software – High Performance Linux Cluster”, Proceedings of the Fifth Annual Linux Expo, pp. 55-62, 1999.
M. Oguchi and M. Kitsuregawa: “Dynamic Remote Memory Acquisition for Parallel Data Mining on ATMConnected PC Cluster”, Proceedings of the Thirteenth ACM International Conference on Supercomputing, pp. 246-252, June 1999.
Downloads
Published
How to Cite
Issue
Section
License
International Journal of Computing is an open access journal. Authors who publish with this journal agree to the following terms:• Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
• Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
• Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.