
Perform aggregate healing and restore mirroring (MetroCluster IP configurations)

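After replacing hardware and assigning disks, you can perform the MetroCluster healing operations on systems running ONTAP 9.5 or earlier. In all versions of ONTAP, you must then confirm that the aggregates are mirrored and, if necessary, restart mirroring.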

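About this task

Starting with ONTAP 9.6, the healing operations run automatically when the disaster site nodes boot. The healing commands are not required.

Perform these steps on the surviving cluster.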
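
The version-dependent flow of this procedure can be summarized in a short sketch. This is only an illustration, not a NetApp tool: the function names `parse_ontap_version` and `healing_plan` and the `disks_replaced` flag are hypothetical, and the sketch only lists the commands named in the steps below in the order they appear. It does not run anything against a cluster.

```python
import re


def parse_ontap_version(text):
    """Return (major, minor) from a release string such as '9.5' or '9.6P2'."""
    match = re.match(r"\s*(\d+)\.(\d+)", text)
    if match is None:
        raise ValueError(f"unrecognized ONTAP version: {text!r}")
    return int(match.group(1)), int(match.group(2))


# Commands for the mirroring step (only needed when disks were replaced).
MIRROR_COMMANDS = [
    "storage aggregate show",
    "storage aggregate mirror -aggregate aggregate-name",  # once per aggregate
    "storage aggregate show",       # watch resynchronization
    "storage aggregate plex show",  # confirm resynchronization finished
]


def healing_plan(version, disks_replaced=False):
    """List the commands from this procedure that apply to `version`."""
    mirror = MIRROR_COMMANDS if disks_replaced else []
    if parse_ontap_version(version) >= (9, 6):
        # Healing ran automatically at boot; only verification is needed.
        return [
            "metrocluster operation history show",
            "metrocluster node show",
            *mirror,
        ]
    # ONTAP 9.5 and earlier: both healing phases are started manually.
    return [
        "metrocluster node show",
        "metrocluster heal -phase aggregates",
        "metrocluster node show",
        *mirror,
        "metrocluster heal -phase root-aggregates",
        "metrocluster node show",
    ]


if __name__ == "__main__":
    for command in healing_plan("9.5", disks_replaced=True):
        print(command)
```

Comparing versions as `(major, minor)` tuples rather than as strings avoids ordering mistakes with releases such as 9.10, which sorts before 9.6 as text.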

  1. If you are using ONTAP 9.6 or later, you must verify that automatic healing completed successfully:
    1. Confirm that the heal-aggr-auto and heal-root-aggr-auto operations completed: metrocluster operation history show

      Example

      The following output shows that the operations completed successfully on cluster_A.

      cluster_B::*> metrocluster operation history show
      Operation                     State          Start Time       End Time
      ----------------------------- -------------- ---------------- ----------------
      heal-root-aggr-auto           successful     2/25/2019 06:45:58
                                                                    2/25/2019 06:46:02
      heal-aggr-auto                successful     2/25/2019 06:45:48
                                                                    2/25/2019 06:45:52
      .
      .
      .

    2. Confirm that the disaster site is ready for switchback: metrocluster node show

      Example

      The following output shows that the heal operations completed successfully on cluster_A.

      cluster_B::*> metrocluster node show
      DR                          Configuration  DR
      Group Cluster Node          State          Mirroring Mode
      ----- ------- ------------- -------------- --------- --------------------
      1     cluster_A
                    node_A_1      configured     enabled   heal roots completed
                    node_A_2      configured     enabled   heal roots completed
            cluster_B
                    node_B_1      configured     enabled   waiting for switchback recovery
                    node_B_2      configured     enabled   waiting for switchback recovery
      4 entries were displayed.
  2. If you are using ONTAP 9.5 or earlier, you must perform aggregate healing:
    1. Verify the state of the nodes: metrocluster node show

      Example

      The following output shows that switchover has completed, so healing can be performed.

      cluster_B::> metrocluster node show
      DR                               Configuration  DR
      Group Cluster Node               State          Mirroring Mode
      ----- ------- ------------------ -------------- --------- --------------------
      1     cluster_B
                    node_B_1           configured     enabled   switchover completed
                    node_B_2           configured     enabled   switchover completed
            cluster_A
                    node_A_1           configured     enabled   waiting for switchback recovery
                    node_A_2           configured     enabled   waiting for switchback recovery
      4 entries were displayed.

      cluster_B::>
    2. Perform the aggregates healing phase: metrocluster heal -phase aggregates

      Example

      The following output shows a typical aggregates healing operation.

      cluster_B::*> metrocluster heal -phase aggregates
      [Job 647] Job succeeded: Heal Aggregates is successful.

      cluster_B::*> metrocluster operation show
      Operation: heal-aggregates
      State: successful
      Start Time: 10/26/2017 12:01:15
      End Time: 10/26/2017 12:01:17
      Errors: -

      cluster_B::*>
    3. Verify that aggregate healing has completed and the disaster site is ready for switchback: metrocluster node show

      Example

      The following output shows that the heal aggregates phase has completed on cluster_A.

      cluster_B::> metrocluster node show
      DR                               Configuration  DR
      Group Cluster Node               State          Mirroring Mode
      ----- ------- ------------------ -------------- --------- --------------------
      1     cluster_A
                    node_A_1           configured     enabled   heal aggregates completed
                    node_A_2           configured     enabled   heal aggregates completed
            cluster_B
                    node_B_1           configured     enabled   waiting for switchback recovery
                    node_B_2           configured     enabled   waiting for switchback recovery
      4 entries were displayed.

      cluster_B::>

  3. If disks have been replaced, you must mirror the local and switched-over aggregates:
    1. Display the aggregates: storage aggregate show

      Example

      cluster_B::> storage aggregate show
      cluster_B Aggregates:
      Aggregate          Size Available Used% State    #Vols Nodes            RAID Status
      -------------- -------- --------- ----- ------- ------ ---------------- ------------
      node_B_1_aggr0   1.49TB   74.12GB   95% online       1 node_B_1         raid4,
                                                                              normal
      node_B_2_aggr0   1.49TB   74.12GB   95% online       1 node_B_2         raid4,
                                                                              normal
      node_B_1_aggr1   3.14TB    3.04TB    3% online      15 node_B_1         raid_dp,
                                                                              normal
      node_B_1_aggr2   3.14TB    3.06TB    3% online      14 node_B_1         raid_tec,
                                                                              normal
      node_B_2_aggr1   3.14TB    2.99TB    5% online      37 node_B_2         raid_dp,
                                                                              normal
      node_B_2_aggr2   3.14TB    3.02TB    4% online      35 node_B_2         raid_tec,
                                                                              normal

      cluster_A Switched Over Aggregates:
      Aggregate          Size Available Used% State    #Vols Nodes            RAID Status
      -------------- -------- --------- ----- ------- ------ ---------------- ------------
      node_A_1_aggr1   2.36TB    2.12TB   10% online      91 node_B_1         raid_dp,
                                                                              normal
      node_A_1_aggr2   3.14TB    2.90TB    8% online      90 node_B_1         raid_tec,
                                                                              normal
      node_A_2_aggr1   2.36TB    2.10TB   11% online      91 node_B_2         raid_dp,
                                                                              normal
      node_A_2_aggr2   3.14TB    2.89TB    8% online      90 node_B_2         raid_tec,
                                                                              normal
      12 entries were displayed.

      cluster_B::>
    2. Mirror the aggregate: storage aggregate mirror -aggregate aggregate-name

      Example

      The following output shows a typical mirroring operation.

      cluster_B::> storage aggregate mirror -aggregate node_B_1_aggr1

      Info: Disks would be added to aggregate "node_B_1_aggr1" on node "node_B_1" in
      the following manner:

      Second Plex

      RAID Group rg0, 6 disks (block checksum, raid_dp)
      Position   Disk                      Type       Size
      ---------- ------------------------- ---------- ---------------
      dparity    5.20.6                    SSD        -
      parity     5.20.14                   SSD        -
      data       5.21.1                    SSD        894.0GB
      data       5.21.3                    SSD        894.0GB
      data       5.22.3                    SSD        894.0GB
      data       5.21.13                   SSD        894.0GB

      Aggregate capacity available for volume use would be 2.99TB.

      Do you want to continue? {y|n}: y
    3. Repeat the previous step for each of the aggregates from the surviving site.
    4. Wait for the aggregates to resynchronize; you can check the status with the storage aggregate show command.

      Example

      The following output shows that a number of aggregates are resynchronizing.

      cluster_B::> storage aggregate show

      cluster_B Aggregates:
      Aggregate          Size Available Used% State    #Vols Nodes            RAID Status
      -------------- -------- --------- ----- ------- ------ ---------------- ------------
      node_B_1_aggr0   1.49TB   74.12GB   95% online       1 node_B_1         raid4,
                                                                              mirrored,
                                                                              normal
      node_B_2_aggr0   1.49TB   74.12GB   95% online       1 node_B_2         raid4,
                                                                              mirrored,
                                                                              normal
      node_B_1_aggr1   2.86TB    2.76TB    4% online      15 node_B_1         raid_dp,
                                                                              resyncing
      node_B_1_aggr2   2.89TB    2.81TB    3% online      14 node_B_1         raid_tec,
                                                                              resyncing
      node_B_2_aggr1   2.73TB    2.58TB    6% online      37 node_B_2         raid_dp,
                                                                              resyncing
      node_B_2_aggr2   2.83TB    2.71TB    4% online      35 node_B_2         raid_tec,
                                                                              resyncing

      cluster_A Switched Over Aggregates:
      Aggregate          Size Available Used% State    #Vols Nodes            RAID Status
      -------------- -------- --------- ----- ------- ------ ---------------- ------------
      node_A_1_aggr1   1.86TB    1.62TB   13% online      91 node_B_1         raid_dp,
                                                                              resyncing
      node_A_1_aggr2   2.58TB    2.33TB   10% online      90 node_B_1         raid_tec,
                                                                              resyncing
      node_A_2_aggr1   1.79TB    1.53TB   14% online      91 node_B_2         raid_dp,
                                                                              resyncing
      node_A_2_aggr2   2.64TB    2.39TB    9% online      90 node_B_2         raid_tec,
                                                                              resyncing
      12 entries were displayed.
    5. Confirm that all aggregates are online and have resynchronized: storage aggregate plex show

      Example

      The following output shows that all aggregates have resynchronized.

      cluster_B::> storage aggregate plex show
                               Is      Is         Resyncing
      Aggregate      Plex      Online  Resyncing    Percent Status
      -------------- --------- ------- ---------- --------- ---------------
      node_B_1_aggr0 plex0     true    false              - normal,active
      node_B_1_aggr0 plex8     true    false              - normal,active
      node_B_2_aggr0 plex0     true    false              - normal,active
      node_B_2_aggr0 plex8     true    false              - normal,active
      node_B_1_aggr1 plex0     true    false              - normal,active
      node_B_1_aggr1 plex9     true    false              - normal,active
      node_B_1_aggr2 plex0     true    false              - normal,active
      node_B_1_aggr2 plex5     true    false              - normal,active
      node_B_2_aggr1 plex0     true    false              - normal,active
      node_B_2_aggr1 plex9     true    false              - normal,active
      node_B_2_aggr2 plex0     true    false              - normal,active
      node_B_2_aggr2 plex5     true    false              - normal,active
      node_A_1_aggr1 plex4     true    false              - normal,active
      node_A_1_aggr1 plex8     true    false              - normal,active
      node_A_1_aggr2 plex1     true    false              - normal,active
      node_A_1_aggr2 plex5     true    false              - normal,active
      node_A_2_aggr1 plex4     true    false              - normal,active
      node_A_2_aggr1 plex8     true    false              - normal,active
      node_A_2_aggr2 plex1     true    false              - normal,active
      node_A_2_aggr2 plex5     true    false              - normal,active
      20 entries were displayed.
  4. On systems running ONTAP 9.5 and earlier, perform the root-aggregates healing phase: metrocluster heal -phase root-aggregates

    Example

    cluster_B::> metrocluster heal -phase root-aggregates
    [Job 651] Job is queued: MetroCluster Heal Root Aggregates Job.Oct 26 13:05:00
    [Job 651] Job succeeded: Heal Root Aggregates is successful.
  5. Verify that root aggregate healing has completed and the disaster site is ready for switchback: metrocluster node show

    Example

    The following output shows that the heal roots phase has completed on cluster_A.

    cluster_B::> metrocluster node show
    DR                               Configuration  DR
    Group Cluster Node               State          Mirroring Mode
    ----- ------- ------------------ -------------- --------- --------------------
    1     cluster_A
                  node_A_1           configured     enabled   heal roots completed
                  node_A_2           configured     enabled   heal roots completed
          cluster_B
                  node_B_1           configured     enabled   waiting for switchback recovery
                  node_B_2           configured     enabled   waiting for switchback recovery
    4 entries were displayed.

    cluster_B::>
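
If you capture the output of storage aggregate plex show from the mirroring step, the resynchronization check can be scripted. The following is a minimal sketch that assumes the six-column layout shown in the example for that step; the function names are hypothetical, and the parser only inspects text that you supply. It does not connect to a cluster.

```python
def parse_plex_rows(output):
    """Extract (aggregate, plex, online, resyncing) tuples from the command output."""
    rows = []
    for line in output.splitlines():
        fields = line.split()
        # Data rows have six columns, with true/false in columns three and four.
        if len(fields) == 6 and {fields[2], fields[3]} <= {"true", "false"}:
            rows.append((fields[0], fields[1], fields[2] == "true", fields[3] == "true"))
    return rows


def plexes_not_ready(output):
    """Return the plexes that are offline or still resynchronizing."""
    return [
        (aggregate, plex)
        for aggregate, plex, online, resyncing in parse_plex_rows(output)
        if not online or resyncing
    ]


def all_plexes_resynchronized(output):
    """True only if at least one plex row was parsed and none is pending."""
    return bool(parse_plex_rows(output)) and not plexes_not_ready(output)


# A few rows copied from the example output in this procedure.
SAMPLE = """\
Aggregate      Plex      Online  Resyncing    Percent Status
-------------- --------- ------- ---------- --------- ---------------
node_B_1_aggr0 plex0     true    false              - normal,active
node_B_1_aggr0 plex8     true    false              - normal,active
node_A_1_aggr1 plex4     true    false              - normal,active
"""

if __name__ == "__main__":
    pending = plexes_not_ready(SAMPLE)
    print("all plexes resynchronized" if not pending else f"still waiting on: {pending}")
```

The `all_plexes_resynchronized` helper treats empty or unparseable output as not ready, so a failed capture is not mistaken for a completed resynchronization.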

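The final readiness check can be sketched in the same way. This example assumes that the disaster site nodes report the configured state and that "heal roots completed" in the DR Mode column indicates readiness, as in the example outputs of this procedure. The names `node_modes` and `ready_for_switchback` are hypothetical, and the node names must be supplied by the caller.

```python
import re

# Matches node rows such as:
#   node_A_1  configured  enabled  heal roots completed
# Only nodes in the "configured" state are recognized (an assumption of this sketch).
NODE_ROW = re.compile(r"^\s*(\S+)\s+configured\s+(\S+)\s+(\S.*?)\s*$")


def node_modes(output):
    """Map each node name to the text of its DR Mode column."""
    modes = {}
    for line in output.splitlines():
        match = NODE_ROW.match(line)
        if match:
            node, _mirroring, mode = match.groups()
            modes[node] = mode
    return modes


def ready_for_switchback(output, disaster_nodes):
    """True if every listed disaster site node reports 'heal roots completed'."""
    modes = node_modes(output)
    return bool(disaster_nodes) and all(
        modes.get(node) == "heal roots completed" for node in disaster_nodes
    )


# Rows copied from the example output in this procedure.
SAMPLE = """\
Group Cluster Node               State          Mirroring Mode
----- ------- ------------------ -------------- --------- --------------------
1     cluster_A
              node_A_1           configured     enabled   heal roots completed
              node_A_2           configured     enabled   heal roots completed
      cluster_B
              node_B_1           configured     enabled   waiting for switchback recovery
              node_B_2           configured     enabled   waiting for switchback recovery
4 entries were displayed.
"""

if __name__ == "__main__":
    print(ready_for_switchback(SAMPLE, ["node_A_1", "node_A_2"]))
```

A node that is missing from the output counts as not ready, which keeps the check conservative if the capture is truncated.
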
After you finish

Proceed to verify the licenses on the replacement nodes.
