
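Verifying healing and manual switchback

You can test the healing and manual switchback operations by switching the cluster back to the original data center after a negotiated switchover. This verifies that data availability is not affected (except for SMB configurations).

About this task

This test should take about 30 minutes.

The expected result of this procedure is that services are switched back to their home nodes.

The healing steps are not required on systems running ONTAP 9.5 or later, which perform healing automatically after a negotiated switchover. On systems running ONTAP 9.6 and later, healing is also performed automatically after an unscheduled switchover.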
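The version rule above can be sketched as a small decision helper. This is only an illustrative sketch: the function name `manual_heal_required` and the `negotiated` flag are hypothetical and not part of ONTAP, the version is assumed to be a dotted string such as "9.5" or "9.10.1", and the sketch ignores both the case where automatic healing fails and the configuration-specific note in step 2.

```python
def manual_heal_required(ontap_version: str, negotiated: bool) -> bool:
    """Return True if the heal steps are expected to be run manually.

    Hypothetical helper, not an ONTAP API. It encodes only the rule
    stated in the text:
      - ONTAP 9.5 and later heal automatically after a negotiated switchover.
      - ONTAP 9.6 and later also heal automatically after an unscheduled one.
    """
    # Compare (major, minor) as integers so that "9.10" sorts after "9.5".
    major, minor = (int(part) for part in ontap_version.split(".")[:2])
    version = (major, minor)
    if negotiated:
        return version < (9, 5)
    return version < (9, 6)


if __name__ == "__main__":
    for ver in ("9.4", "9.5", "9.6"):
        print(ver,
              "negotiated:", manual_heal_required(ver, negotiated=True),
              "unscheduled:", manual_heal_required(ver, negotiated=False))
```

Comparing integer tuples rather than raw strings avoids treating a release such as 9.10 as older than 9.5.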

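  1. If the system is running ONTAP 9.4 or earlier, heal the data aggregates: metrocluster heal aggregates

    Example

    The following example shows the successful completion of the command: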
    cluster_A::> metrocluster heal aggregates
    [Job 936] Job succeeded: Heal Aggregates is successful.
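  2. If the system is running ONTAP 9.4 or earlier, heal the root aggregates: metrocluster heal root-aggregates

    This step is required on the following configurations:
    • MetroCluster FC configurations.

    • MetroCluster IP configurations running ONTAP 9.4 or earlier.

    Example

    The following example shows the successful completion of the command: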
    cluster_A::> metrocluster heal root-aggregates
    [Job 937] Job succeeded: Heal Root Aggregates is successful.
  3. Verify that healing is complete: metrocluster node show

    Example

    The following example shows the successful completion of the command:
    cluster_A::> metrocluster node show
    DR                               Configuration  DR
    Group Cluster Node               State          Mirroring Mode
    ----- ------- ------------------ -------------- --------- --------------------
    1     cluster_A
                  node_A_1           configured     enabled   heal roots completed
          cluster_B
                  node_B_2           unreachable    -         switched over
    2 entries were displayed.

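    If the automatic healing operation fails for any reason, you must issue the metrocluster heal commands manually, as in ONTAP versions earlier than ONTAP 9.5. You can use the metrocluster operation show and metrocluster operation history show -instance commands to monitor the healing status and determine the cause of a failure.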

  4. Verify that all aggregates are mirrored: storage aggregate show

    Example

    The following example shows that all aggregates have a RAID status of mirrored:
    cluster_A::> storage aggregate show
    cluster Aggregates:
    Aggregate     Size Available Used% State    #Vols Nodes       RAID Status
    --------- -------- --------- ----- ------- ------ ----------- ------------
    data_cluster
                4.19TB    4.13TB    2% online       8 node_A_1    raid_dp,
                                                                  mirrored,
                                                                  normal
    root_cluster
               715.5GB   212.7GB   70% online       1 node_A_1    raid4,
                                                                  mirrored,
                                                                  normal
    cluster_B Switched Over Aggregates:
    Aggregate     Size Available Used% State    #Vols Nodes       RAID Status
    --------- -------- --------- ----- ------- ------ ----------- ------------
    data_cluster_B
                4.19TB    4.11TB    2% online       5 node_A_1    raid_dp,
                                                                  mirrored,
                                                                  normal
    root_cluster_B   -         -     - unknown      - node_A_1    -

  5. Boot the nodes at the disaster site.
  6. Check the status of the switchback recovery: metrocluster node show

    Example

    cluster_A::> metrocluster node show
    DR                               Configuration  DR
    Group Cluster Node               State          Mirroring Mode
    ----- ------- ------------------ -------------- --------- --------------------
    1     cluster_A
                  node_A_1           configured     enabled   heal roots completed
          cluster_B
                  node_B_2           configured     enabled   waiting for switchback
                                                              recovery
    2 entries were displayed.

  7. Perform the switchback: metrocluster switchback

    Example

    cluster_A::> metrocluster switchback
    [Job 938] Job succeeded: Switchback is successful.
  8. Confirm the status of the nodes: metrocluster node show

    Example

    cluster_A::> metrocluster node show
    DR                               Configuration  DR
    Group Cluster Node               State          Mirroring Mode
    ----- ------- ------------------ -------------- --------- --------------------
    1     cluster_A
                  node_A_1           configured     enabled   normal
          cluster_B
                  node_B_2           configured     enabled   normal

    2 entries were displayed.

  9. Confirm the status of the metrocluster operation: metrocluster operation show

    Example

    The output should show a successful state.
    cluster_A::> metrocluster operation show
    Operation: switchback
    State: successful
    Start Time: 2/6/2016 13:54:25
    End Time: 2/6/2016 13:56:15
    Errors: -
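
If the final check is scripted, the "Field: value" output shown in step 9 could be parsed along the following lines. This is a minimal sketch under stated assumptions: the helper names `parse_operation_show` and `switchback_succeeded` are hypothetical, the command output is assumed to have already been captured as text (how it is collected is out of scope), and the only format assumed is the one visible in the example above.

```python
def parse_operation_show(output: str) -> dict:
    """Parse 'Field: value' lines from captured 'metrocluster operation show' text.

    Hypothetical helper; it assumes the layout shown in the example output
    and skips the CLI prompt line (the one containing '::>').
    """
    fields = {}
    for line in output.splitlines():
        if "::>" in line:
            continue  # prompt line, not a field
        key, sep, value = line.partition(":")
        if sep:
            # Split on the first colon only, so timestamps stay intact.
            fields[key.strip()] = value.strip()
    return fields


def switchback_succeeded(output: str) -> bool:
    """Return True if the text reports a successful switchback operation."""
    fields = parse_operation_show(output)
    return (fields.get("Operation") == "switchback"
            and fields.get("State") == "successful")


if __name__ == "__main__":
    # Sample text copied from the example in step 9.
    sample = """\
cluster_A::> metrocluster operation show
Operation: switchback
State: successful
Start Time: 2/6/2016 13:54:25
End Time: 2/6/2016 13:56:15
Errors: -
"""
    print(switchback_succeeded(sample))
```

Splitting on the first colon keeps values such as the start and end times whole, since those values contain colons themselves.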