Hello,
During the execution of a load test for a WPS based application a lot of archive logs are created on DB2 server. I have several instances of DB2 set on userexit - logs are moved into another partition that becomes full 100% and then I got archiving logs error.
Soon after this it happened that the load test failed. WAS behind WPS (located on some other app server) reported the cause is due to a time out connection and a WAS transaction has to be rolled-back - the WAS logs are not relevant if indeed this is DB2 fault.
This is the error in my USEREXIT.ERR:
HTML Code:
********************************************************************************
Time of Error:
...
Logfile path: ...
Node number: NODE0000
Operating system: AIX64BIT
Release: SQL08029
Request: ARCHIVE
Audit Log File: .../ARCHIVE.LOG
System Call Parms: rm -f ..../NODE0000/S0000xxx.LOG
Media Type: disk
User Exit RC: 28
> Error isolation: Error archiving file. Return code 256 received from the system call
Audit Log File ARCHIVE.LOG reports:
HTML Code:
********************************************************************************
Time Started: Wed Mar 11 ...
Parameter Count: 8
Parameters Passed:
Database name: ...
Logfile name: S0000xxx.LOG
Logfile path: .../NODE0000/SQL00001/SQLOGDIR/
Node number: NODE0000
Operating system: AIX64BIT
Release: SQL08029
Request: ARCHIVE
System Action: ARCHIVE from ..../NODE0000/SQL00001/SQLOGDIR/ file S0000xxx.LOG to ...
Media Type: disk
User Exit RC: 28 |||> ERROR <|||
Time Completed: Wed Mar 11 ...
However I have 2 questions to be clarified:
1. What can I do to see what is happening during the load test?
What do you suggest is better for me to monitor - to take db2 get snapshot from time to time or do you have a better idea - to see if indeed this is cause by archive log failure and what DB2 is doing during the test.
Anyway I need to establish a kind of monitoring and it would be helpfully for me.
2. I inherited this systems without any knowledge transition.
I have several DB2 servers each with about 3 to 10 instances each with one DB SMS, LOGARCHMETH1 is userexit and no partioned is used since db are small sized.
I found the path where db2uext2.c is located and I understand when some conditions are fullfilled the logs are moved from logfile path to another location/AIX partition - and there they are purged by a crontab script if they become old enough.
My problem is to understand in what conditions / when the logs are moved - if someone can help I will attach the db2uext2.c file here - I am not an expert in C.
HTML Code:
$ db2level
DB21085I Instance "za7d1006" uses "64" bits and DB2 code release "SQL08029" with level identifier "030A0106".
Informational tokens are "DB2 v8.1.1.144", "s080111", "U811269", and FixPak
"16".
Thank you,