SAN Fabric Resiliency and Administration Best Practices User Guide
Broadcom 53-1004609-04
July 8, 2021

Copyright © 2016–2021 Broadcom. All Rights Reserved. Broadcom, the pulse logo, Brocade, the stylized B logo, ClearLink, Fabric OS, and SANnav are among the trademarks of Broadcom in the United States, the EU, and/or other countries. The term “Broadcom” refers to Broadcom Inc. and/or its subsidiaries.

Broadcom reserves the right to make changes without further notice to any products or data herein to improve reliability, function, or design. Information furnished by Broadcom is believed to be accurate and reliable. However, Broadcom does not assume any liability arising out of the application or use of this information, nor the application or use of any product or circuit described herein, neither does it convey any license under its patent rights nor the rights of others.

The product described by this document may contain open source software covered by the GNU General Public License or other open source license agreements. To find out which open source software is included in Brocade products, to view the licensing terms applicable to the open source software, and to obtain a copy of the programming source code, please download the open source disclosure documents in the Broadcom Customer Support Portal (CSP). If you do not have a CSP account or are unable to log in, please contact your support provider for this information.

Table of Contents

Chapter 1: Introduction
Chapter 2: Trends in Data Center Storage Networking
Chapter 3: Brocade Fabric OS and Fabric Vision
Chapter 4: Feature Availability
Chapter 5: Fabric Resiliency
Chapter 6: Faulty Media
    6.1 Description
    6.2 Detection
    6.3 Mitigation
Chapter 7: Congestion
    7.1 Oversubscription
        7.1.1 Description
        7.1.2 Detection
        7.1.3 Mitigation
    7.2 Credit-Stalled Devices
        7.2.1 Description
            7.2.1.1 Latency Caused by a Credit-Stalled Device
            7.2.1.2 Moderate Device Latency
            7.2.1.3 Severe Device Latency
            7.2.1.4 Latency on ISLs
        7.2.2 Detection
        7.2.3 Mitigation
            7.2.3.1 Initiators Compared to Targets
    7.3 Loss of Buffer Credits
        7.3.1 Description
            7.3.1.1 Gen 5 and Later ASIC Enhancements
        7.3.2 Detection
        7.3.3 Mitigation
            7.3.3.1 Credit Recovery on Back-End Ports
Chapter 8: Tools
    8.1 ClearLink Diagnostics
    8.2 Brocade MAPS
    8.3 Fabric Performance Impact Monitoring
    8.4 Flow Vision and IO Insight
    8.5 Edge Hold Time
    8.6 Frame Viewer
    8.7 Fabric Notification
Chapter 9: Designing Resiliency into the Fabric
    9.1 Factors Affecting Congestion
    9.2 Resiliency
    9.3 Redundancy
Chapter 10: Fabric Configuration
    10.1 Fabric-Wide Parameters
    10.2 Event and Change Log Level Settings
    10.3 Zoning
    10.4 Advanced Zoning Considerations
    10.5 Zoning Recommendations
    10.6 Firmware Management
    10.7 Firmware Recommendations
Chapter 11: Routing Policies