  #1 (permalink)  
Old 04-05-10, 15:34
shashankj shashankj is offline
Registered User
 
Join Date: Mar 2010
Posts: 13
DB2 takes time to switch over on passive node – DB2HA using db2haicu

Hi All,

I have configured DB2 HA using the db2haicu utility.
Setup details:
test14 and test15 use a shared filesystem, which is always mounted as /DB2.

The lssam command shows the expected output on both machines.

db2inst1@test14:~> lssam
Online IBM.ResourceGroup:db2_db2inst1_0-rg Nominal=Online
        |- Online IBM.Application:db2_db2inst1_0-rs
                |- Online IBM.Application:db2_db2inst1_0-rs:test14
                '- Offline IBM.Application:db2_db2inst1_0-rs:test15
        '- Online IBM.ServiceIP:db2ip_9_1_69_84-rs
                |- Online IBM.ServiceIP:db2ip_9_1_69_84-rs:test14
                '- Offline IBM.ServiceIP:db2ip_9_1_69_84-rs:test15
db2inst1@test14:~>

db2inst1@test15:~> lssam
Online IBM.ResourceGroup:db2_db2inst1_0-rg Nominal=Online
        |- Online IBM.Application:db2_db2inst1_0-rs
                |- Online IBM.Application:db2_db2inst1_0-rs:test14
                '- Offline IBM.Application:db2_db2inst1_0-rs:test15
        '- Online IBM.ServiceIP:db2ip_9_1_69_84-rs
                |- Online IBM.ServiceIP:db2ip_9_1_69_84-rs:test14
                '- Offline IBM.ServiceIP:db2ip_9_1_69_84-rs:test15
db2inst1@test15:~>


I have also verified that the switchover happens as expected after rebooting the active node. The IP switches over from the active node to the passive node immediately after the reboot, but DB2 goes into a pending state and takes around 3 to 4 minutes to come online.
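
To put a number on the 3 to 4 minutes, one option is to poll lssam on the surviving node and print a timestamp each time the member states change. This is a minimal sketch rather than a tested procedure: member_states and log_state_changes are hypothetical helper names, and the parser assumes uncoloured lssam output in the same layout as the listings in this post.

```shell
# Hypothetical helpers; they assume lssam output laid out as in this thread.

# Read lssam output on stdin and print one "node resource state" line
# per member resource (entries of the form Class:name:node).
member_states() {
  awk '{
    for (i = 1; i <= NF; i++) {
      if ($i ~ /^IBM\./ && split($i, part, ":") == 3) {
        state = ""
        # The state is every word before the resource name, e.g. "Pending online".
        for (j = 1; j < i; j++)
          if ($j ~ /^[A-Za-z]/) state = (state == "" ? $j : state " " $j)
        print part[3], part[2], state
      }
    }
  }'
}

# Poll lssam every $1 seconds (default 5) and print a timestamped
# snapshot whenever the member states change. Stop with Ctrl-C.
log_state_changes() {
  interval=${1:-5}
  previous=""
  while :; do
    current=$(lssam | member_states)
    if [ "$current" != "$previous" ]; then
      date '+%Y-%m-%d %H:%M:%S'
      printf '%s\n' "$current"
      previous=$current
    fi
    sleep "$interval"
  done
}
```

Starting log_state_changes 2 on test15 shortly before rebooting test14 should show when the ServiceIP and Application members leave Offline and when each one reaches Online, which separates the IP takeover time from the DB2 start time.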

Questions:
Is a delay of this length expected when switching over from the active node to the passive node? If not, what might have gone wrong, and how should I fix it?

Thanks and Regards,
-Shashank
  #2 (permalink)  
Old 04-05-10, 18:57
sathyaram_s sathyaram_s is offline
Super Moderator
 
Join Date: Aug 2001
Location: UK
Posts: 4,534
3-4 minutes does not sound unreasonable.

Are there any messages in db2diag.log? There should be a few about crash recovery. Are there any other messages?

Monitor the resource failover on the TSA side and look for error messages.
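
A minimal sketch of that db2diag.log check, assuming the usual layout in which each record starts with a timestamp line: it prints the record timestamp next to every line that mentions crash recovery, so the start and end of recovery can be compared with the time of the failover. crash_recovery_events is a hypothetical helper name, and the path to db2diag.log has to be supplied for your own instance.

```shell
# Hypothetical helper: list db2diag.log lines that mention crash recovery,
# each prefixed with the timestamp of the record it belongs to.
# Assumes records begin with a line like "2010-04-05-15.34.12.123456+330 ...".
crash_recovery_events() {
  awk '
    /^[0-9][0-9][0-9][0-9]-[0-9][0-9]-[0-9][0-9]-[0-9][0-9]\.[0-9][0-9]\.[0-9][0-9]/ {
      stamp = $1               # remember the header timestamp of the current record
    }
    tolower($0) ~ /crash recovery/ {
      line = $0
      sub(/^[ \t]+/, "", line) # trim leading indentation for readability
      print stamp ": " line
    }
  ' "$1"
}

# Usage: crash_recovery_events /path/to/db2diag.log
```

If the first and last timestamps it prints are only seconds apart, crash recovery is probably not where the 3-4 minutes go, and the TSA side deserves the closer look.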
  #3 (permalink)  
Old 04-05-10, 19:09
shashankj shashankj is offline
Registered User
 
Join Date: Mar 2010
Posts: 13
Hi,

I am using a shared filesystem, so my logs are also shared. The db2diag.log file is written by the active node.

I have shared a partition called /DB2 on both machines. It holds the db2inst1 user's home directory, and the installation is also in this directory.
# ls /DB2/
drwxr-xr-x 9 dasusr1 dasadm1 65536 2010-03-29 21:11 dasusr1
drwxr-xr-x 8 db2fenc1 db2fadm1 65536 2010-03-29 21:07 db2fenc1
drwxr-xr-x 15 db2inst1 db2iadm1 65536 2010-04-05 17:04 db2inst1
drwxr-xr-x 38 root root 65536 2010-03-29 21:10 v9.7
#


Do you think this can be an issue?

I will try rebooting the active node and check for errors.

Thanks and Regards,
-Shashank
  #4 (permalink)  
Old 04-07-10, 08:35
muffaddalDBA muffaddalDBA is offline
Registered User
 
Join Date: Mar 2010
Location: uk
Posts: 4
I agree with sathyaram that 3-4 minutes is reasonable. The longer recovery time has to do with the loss of the group buffer pools when the disaster event occurs, which puts database objects into group buffer pool recover pending status. It takes a while to get the objects out of pending status when the data sharing group is recovered at the DR site.

One solution is to keep the standby system always fully up to date with the primary system, which can reduce the time it takes to come online.

thanks
  #5 (permalink)  
Old 04-09-10, 14:58
shashankj shashankj is offline
Registered User
 
Join Date: Mar 2010
Posts: 13
Thanks for the response.

In my setup above, the other machine is not used for anything else.
I would like to understand exactly what I should check to see whether it is ‘fully up-to-date with the primary system’.

Regards,
-Shashank
  #6 (permalink)  
Old 04-12-10, 05:44
muffaddalDBA muffaddalDBA is offline
Registered User
 
Join Date: Mar 2010
Location: uk
Posts: 4
Hi Shashank,

Here's a good link for understanding what HADR SYNC mode (fully up-to-date) means.

IBM DB2 9.7 for Linux, UNIX and Windows Information Center
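
For reference, the synchronization mode is held in the HADR_SYNCMODE database configuration parameter, so one way to see what a database is set to is to filter the output of db2 get db cfg. The sketch below is only an illustration under assumptions: hadr_syncmode is a hypothetical helper, SAMPLE in the usage comment is a placeholder database name, and the filter expects the usual "description (PARAMETER) = value" layout.

```shell
# Hypothetical helper: read "db2 get db cfg" output on stdin and print
# the value of HADR_SYNCMODE (for example SYNC, NEARSYNC or ASYNC).
hadr_syncmode() {
  awk -F'=' '/\(HADR_SYNCMODE\)/ {
    value = $2
    gsub(/^[ \t]+|[ \t]+$/, "", value)  # strip surrounding whitespace
    print value
  }'
}

# Usage (SAMPLE is a placeholder database name):
#   db2 get db cfg for SAMPLE | hadr_syncmode
```

On a shared-disk cluster like the one in post #1, HADR is not necessarily configured at all, so this value only matters once HADR has actually been set up.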

Thanks