BRKDCN-1902.Pdf

BRKDCN-1902.Pdf

Ethernet Storage Network Designs and Fibre Channel Integration Greg McClellan – Network Consulting Engineer BRKDCN-1902 Agenda • Introduction • SCSI and Fibre Channel • Data Centre Bridging • FCoE, iSCSI and RDMA • SAN Designs – moving toward unified infrastructure • Conclusion Block storage protocols: built on SCSI • SCSI is a block-transfer protocol that enables data transfer between various independent peripheral devices and computers. • SCSI connects disks in a storage array and the tape drives of a tape library to the servers. • Fibre Channel, Fibre Channel over Ethernet (FCoE), and iSCSI, are the most widely deployed block-storage protocols. BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 4 SCSI basics BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 5 SCSI-3 Stack BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 6 Fibre Channel Stack BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 7 Fibre Channel Frame BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 8 Block Protocol Comparisons BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 9 Data Center Bridging • EEE 802.1 Data Center Bridging (DCB) is a collection of standards-based extensions to classical Ethernet. It provides a lossless data center transport layer that helps enable the convergence of LANs and SANs onto a single unified fabric. • In addition to supporting Fibre Channel over Ethernet (FCoE), DCB can enhance the operation of iSCSI, network-attached storage (NAS), and other business-critical storage traffic. BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 10 Why do we need DBC? • Block storage networks have different requirements than IP networks • Low latency requirements for I/O • No packet loss requirement for SCSI transactions • Different flow control mechanisms (buffer credits vs. pause) • In-order delivery for SCSI transactions (load balancing) BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 11 DCB Framework The DCB framework defines the capabilities for switches and endpoints to be part of a data center fabric. It includes: •Priority-based flow control (PFC, IEEE 802.1Qbb) •Enhanced transmission selection (ETS, IEEE 802.1Qaz) •Congestion notification (IEEE 802.1 Qau) •Extensions to the Link Layer Discovery Protocol standard (IEEE 802.1AB) that support Data Center Bridging Capability Exchange Protocol (DCBX) BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 12 Priority Based Flow Control • Provides capability to manage bursty, single traffic source on a multiprotocol link • Large bursts from one traffic type will not affect other traffic types, large queues of traffic from one traffic type will not starve other traffic types' resources, and optimization for one traffic type will not create high latency for small messages of other traffic types. • PFC is an enhancement to the pause mechanism. PFC enables pause based on user priorities or classes of service instead of link layer pause BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 13 PFC creates eight separate virtual links on the physical link and allows any of these links to be paused and restarted independently. BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 14 Enhanced Transmission Selection (ETS) • Enables bandwidth management between traffic types for multiprotocol links • Used to assign traffic to a particular virtual lane using IEEE 802.1p class of service (CoS) values to identify which virtual lane traffic belongs to BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 15 ETS Bandwidth Consumption Example BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 16 Delayed Drop • PFC works very well for protocols like FCoE that require a reliable medium, but it makes short-lived congestion and persistent congestion undistinguishable • Delayed drop mediates between traditional Ethernet behavior and PFC behavior. With delayed drop, a CoS can be flow controlled and the duration of the congestion monitored, so that the traditional drop behavior follows if the congestion is not resolved. • Delayed drop offers the capability to tune the definition of "short-lived congestion" with PCF, hence removing the need to increase physical buffers on the interfaces. BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 17 Congestion Notification • Pushes congestion to the edge switches with a proxy buffer • Instead of dropping packets, they are marked as congested in the DSCP header to the receiving host • Transmission rate is decreased from the edge switch BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 18 Explicit Congestion Notification BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 19 Data Center Bridging Exchange (DCBX) Protocol • DCBX is used to exchange all the DCB features across the devices and maintain consistency. DCBX helps ensure consistent quality-of-service (QoS) parameters across the network and servers. The features are advertised to the servers in type-length-value (TLV) format using the Link Layer Discovery Protocol (LLDP). • IEEE DCB builds on classical Ethernet's strengths, adds several crucial extensions to provide a unified fabric. BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 20 Logical Link Down • There are many features and error conditions in both Ethernet and particularly FC that counts on flapping the link between the initiator and the first hop switch to cause the initiator to resend FLOGI. This is a common recovery mechanism in FC. • Now that both Ethernet and FC are on the same wire, if the link flaps due to an FC error condition, the Ethernet side is also brought down, and vice versa. • As a result, we need to introduce a protocol message that the NIC and switch can use to just flap the “logical” FC link or the “logical” ethernet link. Using this new message, the FC side can recover and resend FLOGI and the Ethernet side is never distributed; the same is true in the opposite direction. BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 21 FCoE encapsulation BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 22 FCoE frame payload BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 23 FCoE benefit • Integrate with existing FC network • Existing staff skill • Single wire • Easy to implement with default settings once “feature fcoe” is enabled BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 24 FCoE considerations • Specialized host adapters • Careful consideration application requirements and existing network resources • Limited distance capability • Difficult cost justification in the core layer for conversion BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 25 iSCSI payload BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 26 iSCSI packets BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 27 iSCSI benefit • Existing infrastructure • Existing staff skill • Single wire • No special interface* • Easy to implement BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 28 iSCSI considerations • Software based initiators consume a considerable amount of CPU resource • TCP and IP encapsulation additional overhead • Can use TOE or iSCSI HBA but that limits the cost benefit of standard hardware • Limited scale for iSCSI gateways to FC • Pdu segmentation • Jumbo frames • TCP Windowing BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 29 RDMA vs. Sockets BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 30 RoCE Packet format BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 31 RoCE benefit • Very low latency due to kernel bypass • Allows for Host compute cycles to be used for application processing instead of data transfer BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 32 RoCE considerations • Specialized host adapters • Limited application awareness BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 33 Ethernet Storage Network Considerations • Know your application workload requirements • Know your network capabilities • CoS • DCB – PFC, ETS, Congestion Notification • Link aggregation • Network design to limit oversubscription • Ongoing verification of configuration settings (workloads change) BRKDCN-1902 © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Public 34 Network vs. Fabric Classical Ethernet • Ethernet/IP ? ? ? ? •Goal: Provide any-to-any connectivity ? •Unaware of packet loss (“lossy”) – relies on ULPs for ? retransmission and windowing ? Switch Switch •Provides the transport without worrying about the services - services provided by upper layers Switch ? •East-west vs north-south traffic ratios are undefined ? ? • Network design has been optimized for: ? ? ? •High Availability from a transport perspective by connecting nodes in mesh architectures Fabric Topology and Traffic Flows •Service HA is implemented separately

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    51 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us