ms_sql_server:microsoft_clustering_fail-over_have_downtime

Microsoft Clustering Fail-over Have Downtime

According to https://www.mssqltips.com/sqlservertip/1882/understanding-why-there-is-still-downtime-for-sql-server-with-microsoft-clustering/, Microsoft clustering fail-over solution involve the following steps that cause the downtime.

  1. Stops the SQL Server Agent service and any services dependent on the SQL Server service.
  2. Stops the SQL Server service (meaning SQL Server isn't available at this point).
  3. Releases NodeA's hold on the IP address and network name for VirtualNode1.
  4. Releases NodeA's hold on the shared storage.
  5. Tells NodeB to take control of the shared storage (which it does).
  6. Tells NodeB to take control of the IP address and network name for VirtualNode1.
  7. Starts the SQL Server service on NodeB (meaning SQL Server is coming back on-line for its clients).
  8. Starts the SQL Server Agent service and any services dependent on the SQL Server service.

I test the down time, it would be a couple of seconds. Sound little, but it would be bad enough to cause some application server (WebLogic for exmaple) 20s to 30s revcover.

  • ms_sql_server/microsoft_clustering_fail-over_have_downtime.txt
  • Last modified: 2020/11/13 09:02
  • by chongtin