
Reliability Optimization in Dorado 6000 V6 iSCSI Scenarios

Latest reply: Sep 21, 2020 14:04:36

Hello all,

Today I want to share a case about optimizing reliability in Dorado 6000 V6 iSCSI scenarios.

Environment configuration

Dorado 6000 V6 (with 50 km remote replication), two Linux hosts, Veritas DMP multipathing, two dual-port 10GE HBAs, and four physical paths between each host and storage. Hosts and storage devices are connected through 10GE switches.


Symptom

In the Linux + Veritas DMP iSCSI scenario with the default configuration, I/O drops to zero for 10 seconds after a controller is removed. When multiple physical links fail, I/O drops to zero for more than 7 seconds. Neither result meets user requirements. After optimization, I/O does not drop to zero when a single controller fails, and it drops to zero for only 1 second when multiple physical links fail.


Optimization Solution

1. Modify the following parameters in the /etc/iscsi/iscsid.conf file:


node.session.timeo.replacement_timeout = 1

# Time, in seconds, to wait for the session to be re-established before I/O errors are reported to the upper layer.


node.conn[0].timeo.noop_out_interval = 1

# Interval, in seconds, between iSCSI NOP-Out (ping) requests.


node.conn[0].timeo.noop_out_timeout = 1

# Time, in seconds, to wait for a NOP-Out (heartbeat) response before the connection is considered failed.


Note: After modifying the file, restart the host and re-establish the iSCSI connection so that the configuration takes effect permanently.
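The edit in step 1 can also be scripted. The following is only a minimal sketch, not part of the original case: the helper names `set_iscsid_param` and `apply_fast_failover_timeouts` are hypothetical, and the file to edit is passed as an argument so that the sketch can be tried on a copy of the configuration first.

```shell
#!/bin/sh
# Sketch: set one "key = value" line in an iscsid.conf-style file.
# Usage: set_iscsid_param FILE KEY VALUE   (helper name is hypothetical)
set_iscsid_param() {
    file="$1"; key="$2"; val="$3"
    tmp="${file}.tmp.$$"
    # The text left of "=" is compared as a plain string, so the "[0]" in
    # node.conn[0] needs no regex escaping. Commented-out lines are left alone.
    awk -v k="$key" -v v="$val" '
        {
            name = $0
            sub(/=.*/, "", name)                 # keep the part before "="
            gsub(/^[ \t]+|[ \t]+$/, "", name)    # trim surrounding whitespace
            if (index($0, "=") && name == k) { print k " = " v; found = 1 }
            else print
        }
        END { if (!found) print k " = " v }      # append if no active line existed
    ' "$file" > "$tmp" && cat "$tmp" > "$file" && rm -f "$tmp"
}

# Apply the three values used in this post to the file given as $1.
apply_fast_failover_timeouts() {
    set_iscsid_param "$1" node.session.timeo.replacement_timeout 1
    set_iscsid_param "$1" 'node.conn[0].timeo.noop_out_interval' 1
    set_iscsid_param "$1" 'node.conn[0].timeo.noop_out_timeout' 1
}
```

For example, `apply_fast_failover_timeouts ./iscsid.conf.copy` rewrites the copy, which can then be reviewed before it replaces /etc/iscsi/iscsid.conf.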

[Figure: iscsi dorado v6-1]


With the preceding parameters modified, I/O stops for 6 seconds after one controller is removed.

[Figure: iscsi dorado v6-2]


Shut down one port of the storage device, and then the other three ports of the storage device, on the switch. I/O stops for 7 seconds.

[Figure: iscsi dorado v6-3]


2. Optimize the iSCSI network mounting solution so that host HBAs and storage SmartIO cards are fully interconnected.


Solution before optimization: One-to-one connection is used. Each host is connected to every SmartIO card of the storage device, but not every HBA on a host is connected to every SmartIO card.

[Figure: iscsi dorado v6-4]


iSCSI mounting script:

[Figure: iscsi dorado v6-5]

[Figure: iscsi dorado v6-5]


After one controller is removed, only one HBA on each host carries traffic, and more I/O has to be forwarded.

[Figure: iscsi dorado v6-6]


After the four ports on the storage side are shut down, each host remains connected to only half of the SmartIO cards on the storage side, so more I/O is forwarded.

[Figure: iscsi dorado v6-7]


Optimized solution: Each HBA on the host is fully interconnected with each SmartIO card on the storage device.

[Figure: iscsi dorado v6-8]


iSCSI mounting script:

[Figure: iscsi dorado v6-9]

[Figure: iscsi dorado v6-10]
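The mounting script in the original post is available only as images. As a rough illustration of the full-interconnection idea, the dry-run sketch below prints one discovery command and one login command for every pair of host iface and storage portal. The function name `full_mesh_commands`, the iface names, and the portal addresses are all hypothetical. The sketch also assumes that one open-iscsi iface has already been created and bound to each HBA port.

```shell
#!/bin/sh
# Dry-run sketch: print one discovery and one login command per
# (host iface, storage portal) pair, i.e. a full mesh.
# Iface names and portal addresses are placeholders, not values from this case.
full_mesh_commands() {
    ifaces="$1"    # space-separated open-iscsi iface names, one per HBA port
    portals="$2"   # space-separated storage portal IPs, one per SmartIO port
    for iface in $ifaces; do
        for portal in $portals; do
            echo "iscsiadm -m discovery -t sendtargets -p ${portal}:3260 -I ${iface}"
            echo "iscsiadm -m node -p ${portal}:3260 -I ${iface} --login"
        done
    done
}
```

Calling `full_mesh_commands "ifaceA ifaceB" "192.0.2.11 192.0.2.12"` prints eight commands (two per pair). Review the output before running any of it on a host.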


After one controller is removed, each HBA on the host is still connected to each SmartIO card on the storage device, and I/O forwarding is reduced.

[Figure: iscsi dorado v6-11]


After the four ports on the storage side are shut down, each host is still connected to each SmartIO card on the storage side, reducing I/O forwarding.

[Figure: iscsi dorado v6-12]


After the parameters in the iSCSI configuration file are modified and the host HBAs and storage SmartIO cards are fully interconnected, I/O does not drop to zero when a single controller fails.

[Figure: iscsi dorado v6-13]


Shut down one port of the storage device, and then the other three ports of the storage device, on the switch. I/O stops for 1 second.

[Figure: iscsi dorado v6-14]


Summary

In the controller-fault test for the iSCSI scenario, I/O was interrupted for 6 seconds. Of this, 949 ms was spent on the storage side: the storage BSP detects the controller removal and reports the interrupt, the system controller receives the interrupt, and system control switches from dual-host mode to single mode. The remaining time is spent on the host. The multipathing software on the host takes little time between receiving the error code returned for the I/O and completing the path switchover. The time is therefore consumed mainly by fault detection and I/O forwarding related to the host HBA and iSCSI driver. The solution is to modify the iSCSI timeout parameters and use full interconnection to reduce I/O forwarding when faults occur.
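The split between storage-side and host-side time follows from simple subtraction. The sketch below only restates the two figures from the summary (6 seconds in total, 949 ms on the storage side). The function names are hypothetical.

```shell
#!/bin/sh
# Split a measured I/O interruption into storage-side and host-side time.
host_side_ms() {
    total_ms="$1"     # total interruption observed, in milliseconds
    storage_ms="$2"   # time spent on the storage side, in milliseconds
    echo $(( total_ms - storage_ms ))
}

# Host-side share of the interruption as a whole-number percentage (rounded down).
host_side_percent() {
    echo $(( ($1 - $2) * 100 / $1 ))
}

host_side_ms 6000 949        # prints 5051
host_side_percent 6000 949   # prints 84
```

Roughly 5 of the 6 seconds are therefore spent on the host side, which is why both optimizations target the host iSCSI settings and the path layout.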


Thank you.


Good to read

little_fish
little_fish Created Sep 16, 2020 00:33:43 (0) (0)
Is this case too long?  
Unicef
Unicef Reply little_fish  Created Sep 16, 2020 07:38:46 (0) (0)
Yes, a bit long  
little_fish
little_fish Reply Unicef  Created Sep 16, 2020 08:28:18 (0) (0)
yes, many pictures, I put all the steps in this post.  
Good!

little_fish
little_fish Created Sep 16, 2020 01:21:58 (0) (0)
 
Thank you for sharing!

little_fish
little_fish Created Sep 16, 2020 01:22:06 (0) (0)
 
Thank you for sharing!

little_fish
little_fish Created Sep 18, 2020 09:18:58 (0) (0)
welcome  
Reliability Optimization in Dorado 6000 V6 iSCSI Scenarios-3440157-1

little_fish
little_fish Created Sep 22, 2020 00:55:06 (0) (0)
 
