Automatic Parallelism Tuning Mechanism for Heterogeneous IP-SAN Protocols in Long-Fat Networks

Automatic Parallelism Tuning Mechanism for Heterogeneous IP-SAN Protocols in Long-Fat Networks

Information and Media Technologies 8(3): 739-748 (2013) reprinted from: Journal of Information Processing 21(3): 423-432 (2013) © Information Processing Society of Japan Regular Paper Automatic Parallelism Tuning Mechanism for Heterogeneous IP-SAN Protocols in Long-fat Networks Takamichi Nishijima1,a) Hiroyuki Ohsaki1,b) Makoto Imase1,c) Received: October 22, 2012, Accepted: March 1, 2013 Abstract: In this paper we propose Block Device Layer with Automatic Parallelism Tuning (BDL-APT), a mecha- nism that maximizes the goodput of heterogeneous IP-based Storage Area Network (IP-SAN) protocols in long-fat networks. BDL-APT parallelizes data transfer using multiple IP-SAN sessions at a block device layer on an IP-SAN client, automatically optimizing the number of active IP-SAN sessions according to network status. A block device layer is a layer that receives read/write requests from an application or a file system, and relays those requests to a stor- age device. BDL-APT parallelizes data transfer by dividing aggregated read/write requests into multiple chunks, then transferring a chunk of requests on every IP-SAN session in parallel. BDL-APT automatically optimizes the number of active IP-SAN sessions based on the monitored network status using our parallelism tuning mechanism. We evaluate the performance of BDL-APT with heterogeneous IP-SAN protocols (NBD, GNBD and iSCSI) in a long-fat network. We implement BDL-APT as a layer of the Multiple Device driver, one of major software RAID implementations in- cluded in the Linux kernel. Through experiments, we demonstrate the effectiveness of BDL-APT with heterogeneous IP-SAN protocols in long-fat networks regardless of protocol specifics. Keywords: Automatic Parallelism Tuning (APT), IP-based Storage Area Network (IP-SAN), Block Device Layer, long-fat networks the goodput of heterogeneous IP-SAN protocols in long-fat net- 1. Introduction works and that requires no modification to IP-SAN storage de- In recent years, IP-based Storage Area Networks (IP-SANs) vices. BDL-APT parallelizes data transfer using multiple IP-SAN have attracted attention for building SANs on IP networks, owing sessions at a block device layer on an IP-SAN client, automati- to the low cost and high compatibility of IP-SANs with existing cally optimizing the number of active IP-SAN sessions according network infrastructures [1], [2], [3], [4]. to network status. A block device layer is a layer that receives Several IP-SAN protocols such as NBD (Network Block De- read/write requests from an application or a file system, and re- vice) [5], GNBD (Global Network Block Device) [6], iSCSI (In- lays those requests to a storage device. BDL-APT parallelizes ternet Small Computer System Interface) [7], FCIP (Fibre Chan- data transfer by dividing aggregated read/write requests into mul- nel over TCP/IP) [8], and iFCP (Internet Fibre Channel Pro- tiple chunks, then transferring a chunk of requests on every IP- tocol) [9] have been widely utilized and deployed for building SAN session in parallel. BDL-APT automatically optimizes the SANs on IP networks. IP-SAN protocols allow interconnection number of active IP-SAN sessions based on the monitored net- between clients and remote storage via a TCP/IP network. work status by means of our APT mechanism [13], [14]. IP-SAN protocols realize connectivity to remote storage de- We evaluate the performance of BDL-APT with heterogeneous vices over conventional TCP/IP networks, but still have unre- IP-SAN protocols (NBD, GNBD and iSCSI) in a long-fat net- solved issues, performance in particular Refs. [3], [10], [11]. Sev- work. We implement BDL-APT as a layer of the Multiple De- eral factors affect the performance of IP-SAN protocols in a long- vice (MD) driver [15], one of the major software RAID imple- fat network. One significant factor is TCP performance degrada- mentations included in the Linux kernel. Through experiments, tion in long-fat networks [12]; IP-SAN protocols generally uti- we demonstrate the effectiveness of BDL-APT with heteroge- lize TCP for data delivery, which performs poorly in long-fat net- neous IP-SAN protocols in long-fat networks regardless of proto- works. col specifics. In this paper, we propose Block Device Layer with Automatic This paper is organized as follows. Section 2 summarizes re- Parallelism Tuning (BDL-APT), a mechanism that maximizes lated works. Section 3 introduces the IP-SAN protocols used in 1 The Graduate School of Information Science and Technology, Osaka our BDL-APT experiments. Section 4 describes the overview and University, Suita, Osaka 565–0871, Japan the main features of our BDL-APT. Section 5 explains our BDL- a) [email protected] APT implementation. Section 6 gives a performance evaluation b) [email protected] c) [email protected] of our BDL-APT implementation in heterogeneous IP-SAN pro- 739 Information and Media Technologies 8(3): 739-748 (2013) reprinted from: Journal of Information Processing 21(3): 423-432 (2013) © Information Processing Society of Japan tocols. Finally, Section 7 summarizes this paper and discusses and NBD is that GNBD allows simultaneous connections be- areas for future work. tween multiple clients and a single server (a block device). • iSCSI (Internet Small Computer System Interface) 2. Related Work The Internet Engineering Task Force standardized the iSCSI Several solutions for preventing IP-SAN performance degra- protocol in 2004 [7]. iSCSI protocol encapsulates a stream dation in long-fat networks have been proposed [12], [13], [16], of SCSI command descriptor blocks (CDBs) in IP packets, [17]. For instance, solutions utilizing multiple links [16] or paral- allowing communication between a SCSI initiator (a client) lel TCP connections [13] have been proposed to prevent through- and its target (a storage device) via a TCP/IP network. When put degradation of the iSCSI protocol. a SCSI initiator receives read/write requests from an appli- Yang [16] improves iSCSI throughput by using multiple con- cation or a file system, it generates SCSI CDBs and transfers nections, each of which traverses a different path using multi- those CDBs to the SCSI storage through a TCP connection. ple LAN ports and dedicated routers. However, LAN ports and It is known that iSCSI performance is significantly degraded dedicated routers are not always available, forcing significant re- when the end-to-end delay (the delay between the iSCSI ini- strictions on the network environment. Inoue et al. [13] improve tiator and its target) is large [11], [13], [19], [20]. iSCSI throughput by adjusting the number of parallel TCP con- 4. Block Device Layer with Automatic Paral- nections using the iSCSI protocol’s parallel data transfer feature. However, this solution cannot be used in IP-SAN protocols with- lelism Tuning (BDL-APT) out that feature. Changes to the TCP congestion control algorithm 4.1 Overview for improving fairness and throughput of the iSCSI protocol have We propose BDL-APT, a mechanism that maximizes the good- also been proposed [12]. Oguchi et al. analyze iSCSI with the put of heterogeneous IP-SAN protocols in long-fat networks. proposed monitoring tool, and improve the throughput of iSCSI BDL-APT realizes the data delivery over multiple TCP connec- by adjustment of TCP socket buffer, or the correction inside a tions by using multiple IP-SAN sessions, and optimizes the num- Linux kernel [17]. However, solutions that replace or modify the ber of parallel TCP connections automatically based on the IP- transport protocol are unrealistic because they require changes in SAN goodput. BDL-APT is a mechanism that operates as a block all IP-SAN targets (storage devices) and initiators (clients). device layer in an IP-SAN initiator (the client) (see Fig. 1). BDL- While these and other solutions for specific IP-SAN protocols APT parallelizes data transfer by dividing aggregated read/write have been proposed, many heterogeneous IP-SAN devices have requests into multiple chunks, then transferring a chunk of re- already been deployed, so it is desirable to improve IP-SAN per- quests on every IP-SAN session in parallel. BDL-APT automat- formance independent of protocol and without need for device ically optimizes the number of active IP-SAN sessions based on modifications. the monitored network status using our parallelism tuning mech- In this paper, we propose an initiator-side solution, which re- anism APT [13], which is based on a numerical computation al- quires no modification to IP-SAN storage devices, to the perfor- gorithm called the Golden Section Search method. mance degradation of heterogeneous IP-SAN protocols in long- The main advantage of BDL-APT is its independence from fat networks. the underlying IP-SAN protocol. Namely, BDL-APT can operate with any IP-SAN protocol since it works as a block device layer 3. IP-SAN Protocols without reliance on features specific to the underlying block de- In this section, we briefly introduces three IP-SAN protocols vice or protocol. BDL-APT realizes both parallel data transfer used in our BDL-APT experiments. and network status monitoring independently from the underly- • NBD (Network Block Device) ing block device or protocol. The NBD protocol, initially developed by Pavel Machek in Another advantage of BDL-APT is its initiator-side implemen- 1997 [5], is a lightweight IP-SAN protocol for accessing tation. Namely, BDL-APT works within an IP-SAN initiator (a remote block devices over a TCP/IP network. The NBD protocol allows transparent access of remote block devices via a TCP/IP network, and supports primitive block-level operations such as read/write/disconnect and several other I/O controls. All communication between NBD clients and servers occurs using the TCP protocol. • GNBD (Global Network Block Device) The GNBD protocol is another lightweight IP-SAN protocol for accessing remote block devices over a TCP/IP network.

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    10 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us