7-Mode Data ONTAP upgrade HA pair

What should we do before the upgrade?

1) Download the system files for both the current and the desired version of Data ONTAP from the MySupport pages

http://mysupport.netapp.com/NOW/cgi-bin/software

and upload them to the /etc/software directory on your filer.

2) Before the upgrade, unmount the exports that contain configuration files from your servers and back up /etc/hosts and /etc/rc

3) Choose the upgrade method that fits your environment configuration:

Type of SnapMirror: Upgrade method
Volume SnapMirror: Upgrade the storage systems containing the destination volumes, beginning with the furthest system in the topology (that is, the last system in a series of cascading destinations), and then upgrade the storage systems containing the source volumes.
Synchronous or semi-synchronous SnapMirror: Upgrade the storage systems containing the source volumes and the storage systems containing the destination volumes simultaneously.
Qtree SnapMirror: Upgrade the storage systems containing the source volumes before or after you upgrade the storage systems containing the destination volumes.
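
The volume SnapMirror rule (destinations first, starting from the end of the cascade) is a reverse topological ordering. Here is a minimal Python sketch of it, assuming you can list your relationships as (source system, destination system) pairs. The function name and the filer names are made up for illustration and are not part of any NetApp tool. Two systems that mirror to each other do not fit the rule, so the sketch raises an error for them.

```python
from collections import defaultdict, deque


def vsm_upgrade_order(relationships):
    """Order storage systems so that every destination comes before its source.

    relationships: iterable of (source_system, destination_system) pairs.
    """
    sources_of = defaultdict(set)     # destination -> systems replicating to it
    pending_dests = defaultdict(int)  # system -> destinations not yet upgraded
    systems = set()
    for src, dst in relationships:
        systems.update((src, dst))
        if src not in sources_of[dst]:  # ignore duplicate pairs (many volumes)
            sources_of[dst].add(src)
            pending_dests[src] += 1

    # Start with the systems that replicate to nobody: the end of each cascade.
    ready = deque(sorted(s for s in systems if pending_dests[s] == 0))
    order = []
    while ready:
        system = ready.popleft()
        order.append(system)
        for src in sorted(sources_of[system]):
            pending_dests[src] -= 1
            if pending_dests[src] == 0:
                ready.append(src)

    if len(order) != len(systems):
        raise ValueError("systems mirror to each other; no strict order exists")
    return order


# Hypothetical cascade: filerA -> filerB -> filerC
print(vsm_upgrade_order([("filerA", "filerB"), ("filerB", "filerC")]))
```

For the cascade above the furthest destination (filerC) comes first and the original source (filerA) last.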

Data ONTAP 7-Mode in High Availability pair – non-disruptive upgrade step by step

Just before you start with the plan below, ensure that everything on both filers is online (disks, volumes, aggregates, interfaces) and that no maintenance jobs such as RAID scrubbing, reconstruction or disk wiping are running.

1) Monitor the performance on both filers

filer1(2)> sysstat -c 30 -x 3

It is worth monitoring the performance of both nodes for a while to ensure that CPU and disk utilization stay below 50% on each node, because during takeover one node has to handle the load of both. Low disk utilization is also recommended so that ONTAP can perform the disk firmware upgrade.
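
If you collect the numbers on a management server, the 50% rule can be checked mechanically. The sketch below assumes the CPU and disk utilization percentages have already been extracted from the sysstat output into (cpu, disk) pairs. It does not parse sysstat itself, and the function name and sample values are hypothetical.

```python
def below_limit(samples, limit=50.0):
    """Return True if every (cpu_percent, disk_percent) sample is under the limit.

    samples: list of (cpu_percent, disk_percent) tuples for ONE node,
             already extracted from sysstat output (extraction not shown here).
    """
    if not samples:
        raise ValueError("no samples collected")
    return all(cpu < limit and disk < limit for cpu, disk in samples)


# Hypothetical samples for two nodes
filer1_samples = [(22, 31), (35, 28), (41, 19)]
filer2_samples = [(18, 12), (63, 40), (25, 17)]  # one CPU spike above 50%

print(below_limit(filer1_samples))
print(below_limit(filer2_samples))
```

Run the check per node: the requirement is that each node stays below the limit on its own, so that the survivor can carry both workloads during takeover.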

2) Trigger an AutoSupport on both filers. The notification is sent immediately and its subject makes our actions clear

filer1(2)> options autosupport.doit "starting_NDU 8.X.XPX"

3) Verify the status of the HA pair from both filers

filer1(2)> cf status

4) Ensure that automatic giveback is off

filer1(2)> options cf.giveback.auto.enable off

5) Install the system files using the software command on both filers

filer1(2)> software update 8XXPX_q_image.tgz -r

6) Take care of SnapMirror (optional step). Decide whether you want to wait until the current transfers finish. Perform the steps below on both filers.

6.1) Look for any ongoing transfers. It helps to have a management server where you can use grep

unix-mgmt> ssh filer1(2) snapmirror status | grep Transferring

If you don’t need the current update to finish, abort it

filer1(2)> snapmirror abort dest_vol

Otherwise use quiesce and wait for the update to complete

filer1(2)> snapmirror quiesce dest_vol

If you know that more relationships are scheduled to update within the next hour, quiesce them as well.
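
On a management server the grep step can also be done in a few lines of Python, which makes it easy to feed the result into abort or quiesce. This is a sketch only: it assumes the status text has already been captured (for example over ssh) and that the destination is the second whitespace-separated column of each relationship line. The sample text is illustrative, not real output, so check the layout against your own filers.

```python
def transferring_destinations(status_output):
    """Return the destination column of every line whose fields include 'Transferring'."""
    destinations = []
    for line in status_output.splitlines():
        fields = line.split()
        # Assumed layout: Source, Destination, State, Lag, Status
        if "Transferring" in fields and len(fields) >= 2:
            destinations.append(fields[1])
    return destinations


# Illustrative text only; names and numbers are made up.
SAMPLE = """\
Snapmirror is on.
Source          Destination        State         Lag       Status
filer1:vol_a    filer2:vol_a_dr    Snapmirrored  00:12:41  Transferring  (120 MB done)
filer1:vol_b    filer2:vol_b_dr    Snapmirrored  01:02:03  Idle
"""

for dest in transferring_destinations(SAMPLE):
    print(dest)
```

Unlike grep, which matches any line containing the substring, this matches the whole word, so a volume that merely has "Transferring" inside its name is not reported.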

6.2) Turn off snapmirror

filer1(2)> snapmirror off

7) Terminate CIFS on the node to be taken over

filer2> cifs terminate

8) Time for the takeover and giveback sequence on our HA pair. Be careful: statistics say that most failures happen in cluster configurations, because administrators become overconfident.

filer1> cf takeover

During takeover the second filer goes down. Wait around 10 minutes for the environment to stabilize, meaning that client multipathing has recovered and I/O requests are being served again after the short pause caused by the takeover.

9) Ensure that filer2 is ready for giveback and give back the data service from filer1

filer1> cf giveback

Again, wait some time before the next step.

10) From the newly upgraded filer2, take over the data service from filer1

filer2> cf takeover

Once again, wait some time before the next step.

11) After the first node reboots and displays “waiting for giveback”, give back the data service from filer2

filer2> cf giveback
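
Steps 8 to 11 are easy to mix up under pressure, so it can help to write the sequence down before the change window. The sketch below just generates the four commands in order, with the console they belong to. The function name is hypothetical and nothing is executed against a filer.

```python
def rolling_upgrade_plan(first="filer1", second="filer2"):
    """Return the takeover/giveback sequence as (console, command, effect) tuples."""
    plan = []
    # Round 1: `first` takes over so `second` can restart on the new image.
    # Round 2: the roles are swapped.
    for survivor, partner in ((first, second), (second, first)):
        plan.append((survivor, "cf takeover",
                     f"{partner} goes down and restarts on the new image"))
        plan.append((survivor, "cf giveback",
                     f"{partner} serves its own data again"))
    return plan


for console, command, effect in rolling_upgrade_plan():
    print(f"{console}> {command}    # {effect}")
```

Each takeover and its matching giveback are typed on the same console, and the console switches from filer1 to filer2 between the two rounds.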

12) On filer1 and filer2, check the current version of ONTAP

filer1(2)> version
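
If you capture the version output from both nodes on a management server, a small check can confirm that they report the same release. The sketch assumes the output contains the word "Release" followed by the release string. The sample strings are made up for illustration, so compare them with what your filers actually print.

```python
import re


def release_of(version_output):
    """Extract the token that follows 'Release' in the version output."""
    match = re.search(r"Release\s+(\S+)", version_output)
    if match is None:
        raise ValueError(f"no release string found in: {version_output!r}")
    return match.group(1)


def same_release(output_a, output_b):
    """True if both nodes report the same release string."""
    return release_of(output_a) == release_of(output_b)


# Illustrative strings only; release numbers and dates are made up.
filer1_out = "NetApp Release 8.1.4P1 7-Mode: Tue Feb 11 23:23:13 PST 2014"
filer2_out = "NetApp Release 8.1.4P1 7-Mode: Tue Feb 11 23:23:13 PST 2014"

print(release_of(filer1_out))
print(same_release(filer1_out, filer2_out))
```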

13) Turn on automatic giveback, if that is your preferred setup

filer1(2)> options cf.giveback.auto.enable on

14) Check performance statistics

filer1(2)> sysstat -c 30 -x 3

You will probably see high disk utilization now. It means that the disk firmware is being updated or that some WAFL block reallocation is taking place. Either way, it can take a few hours before your system returns to fully operational status.

15) On both filers turn on snapmirror

filer1(2)> snapmirror on

and resume any quiesced SnapMirror relationships

filer1(2)> snapmirror resume dest_vol

16) Trigger post-upgrade AutoSupport

filer1(2)> options autosupport.doit "finishing_NDU 8.X.XPX"

17) Verify the HA relationship is operational on both filers

filer1(2)> cf status

18) Confirm that all paths to disk are visible to the system by entering the following command on both filers

filer1(2)> storage show disk -p

19) Confirm that there are no failed disks by entering the following command on both filers

filer1(2)> vol status -f

20) Verify that all aggregates are online by entering the following command on both filers

filer1(2)> aggr status

21) Confirm that network interfaces are online by entering the following command on both filers

filer1(2)> ifconfig -a

What else is worth knowing?

  • Before you implement anything, verify your plan with the NetApp tool called Upgrade Advisor. In most cases it will help you determine the required steps. But always think through your own production environment so that you are not surprised if something goes wrong. That leads us to the next point.
  • Backout plan: if you like your job, always make sure you have one. For a backout you have to turn off most of the features and make sure that the downgrade does not require any additional changes to the system.
  • With current versions of Data ONTAP, firmware for disks and disk shelves is bundled with the system files. After the filer reboots, the disk and disk shelf firmware is installed automatically.

Recommended manuals provided by NetApp

Upgrade and Revert/Downgrade Guide For 7-Mode

Related topics:

7-Mode Data ONTAP upgrade single node
