A Novel Parallel Transmission Strategy for Data Grid
Keywords:
Data grid, distributed storage model, parallel transmission
Abstract
Creating multiple copies of data accelerates transmission and reduces network traffic, but it also incurs storage overhead and additional network traffic. Various parallel transmission algorithms based on GridFTP and multiple copies can accelerate data transmission further, but they cannot adapt to a wide range of networks and do not solve the problem of wasted storage space and network traffic. GridTorrent, which combines BitTorrent with GridFTP, is compatible with the grid and scales flexibly, but it is very slow when there are few peers; solving this problem also requires multiple copies. To meet several optimization objectives at once (saving storage space, supporting two application modes, i.e. parallel transfer based on GridFTP and on BitTorrent, adapting to a wide range of networks, and achieving higher performance when there are few peers), a distributed storage model, a parallel transfer algorithm, and a virtual peer strategy are proposed, building on the idea of GridTorrent. In experiments, the verification system VPG-Torrent is compared with the original parallel transfer algorithm (DCDA), which is based only on GridFTP and multiple copies, and with GridTorrent. When the same amount of data is deployed, VPG-Torrent performs better than DCDA, and when there are few peers VPG-Torrent also outperforms GridTorrent, which demonstrates the effectiveness of VPG-Torrent.
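The general idea of parallel transfer from multiple copies can be illustrated with a small simulation. This is a minimal sketch under stated assumptions, not the VPG-Torrent or DCDA algorithm, neither of which is specified in the abstract: a file is split into equal blocks, each source holding a copy serves one block at a time, and whichever source becomes idle first takes the next unassigned block. The function name `schedule_blocks`, the source names, and the transfer rates are all hypothetical.

```python
import heapq


def schedule_blocks(num_blocks, block_size_mb, source_rates_mb_per_s):
    """Greedy dynamic allocation of blocks to sources (illustrative only).

    Each source transfers one block at a time; the source that becomes
    idle first is handed the next unassigned block. Returns a tuple
    (assignment, finish_time_s), where assignment maps each source name
    to the list of block indices it transferred.
    """
    # Min-heap of (time at which the source is idle, source name).
    idle_at = [(0.0, name) for name in sorted(source_rates_mb_per_s)]
    heapq.heapify(idle_at)

    assignment = {name: [] for name in source_rates_mb_per_s}
    finish_time_s = 0.0

    for block in range(num_blocks):
        now, name = heapq.heappop(idle_at)
        done = now + block_size_mb / source_rates_mb_per_s[name]
        assignment[name].append(block)
        finish_time_s = max(finish_time_s, done)
        heapq.heappush(idle_at, (done, name))

    return assignment, finish_time_s


if __name__ == "__main__":
    # Hypothetical rates: source "a" is twice as fast as source "b".
    plan, total = schedule_blocks(6, 10.0, {"a": 10.0, "b": 5.0})
    print(plan, total)
```

Because blocks are handed out on demand rather than split evenly in advance, the faster source ends up serving more blocks, which is the basic reason dynamic allocation copes better with sources of unequal speed than a static split.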
License
ONLINE OPEN ACCESS: Access to the full text of each article and each issue is free under the terms of the Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) license.
You are free to:
-Share: copy and redistribute the material in any medium or format;
-Adapt: remix, transform, and build upon the material.
The licensor cannot revoke these freedoms as long as you follow the license terms.
DISCLAIMER: The author(s) of each article appearing in International Journal of Computers Communications & Control is/are solely responsible for the content thereof; the publication of an article shall not constitute or be deemed to constitute any representation by the Editors or Agora University Press that the data presented therein are original, correct or sufficient to support the conclusions reached or that the experiment design or methodology is adequate.