  1. #1
    Join Date
    Aug 2003
    Posts
    9

    DB2 crashes after 60 DIA3003E Entries

    Hi everyone,

    The system info is at the bottom of this message.
    Within one second I get about 60 entries of the following form in db2diag.log:
    ----------------
    2003-08-19-15.50.27.459815 Instance:pdb2ins1 Node:000
    PID:17821(db2tcpcm) Appid:none
    common_communication sqlcctcpconnmgr_child Probe:125
    DIA3003E Error encountered in "TCPIP" protocol support. Return code from
    "sqleGetAgent" was "-6036".
    ----------------
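To confirm how tightly the DIA3003E entries cluster, the log can be scanned and the records counted per second. The following is a minimal sketch, assuming each db2diag.log record starts with a timestamp line like the one in the excerpt above. The function names `count_per_second` and `bursts` and the threshold of 50 are made up for illustration and are not part of any DB2 tool.

```python
import re
from collections import Counter

# Timestamp that opens each record in the excerpt, e.g.
# "2003-08-19-15.50.27.459815"; the capture group truncates it to the second.
TS_RE = re.compile(r"^\s*(\d{4}-\d{2}-\d{2}-\d{2}\.\d{2}\.\d{2})\.\d{6}")


def count_per_second(lines, message_id="DIA3003E"):
    """Count lines carrying message_id, grouped by the second of their record."""
    counts = Counter()
    current_second = None
    for line in lines:
        match = TS_RE.match(line)
        if match:
            current_second = match.group(1)  # a new record starts here
        if message_id in line and current_second is not None:
            counts[current_second] += 1
    return counts


def bursts(counts, threshold=50):
    """Keep only the seconds with at least `threshold` matching records."""
    return {second: n for second, n in counts.items() if n >= threshold}


# Synthetic sample: one record shaped like the excerpt, repeated 60 times.
record = [
    "2003-08-19-15.50.27.459815 <instance> Node:000",
    "PID:17821(db2tcpcm) Appid:none",
    "common_communication sqlcctcpconnmgr_child Probe:125",
    'DIA3003E Error encountered in "TCPIP" protocol support. Return code from',
    '"sqleGetAgent" was "-6036".',
]
print(bursts(count_per_second(record * 60)))
```

On a real system the lines would come from the file itself, for example `count_per_second(open("db2diag.log", errors="replace"))`.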

    The next entries come from the db2gds process (see below).

    Then the instance crashes. This has happened twice this week and it's
    getting nasty. I have searched all over for the DIA3003E message but cannot find a good answer anywhere. I also get a dump file (appended below).

    Has anyone had a similar problem, or even solved it already? I would appreciate any help.

    regards
    roman



    System Info:
    ----------------------------------------------
    - SunOS 5.8 Generic_108528-15 sun4u sparc SUNW,Sun-Fire-480R (4 CPU)
    - IBM DB2 7.2fp8
    - Instance running as DCS Gateway Server to z/OS Host

    Instance Config:
    ----------------------------------------------
    Database manager configuration release level = 0x0900

    CPU speed (millisec/instruction) (CPUSPEED) = 4.873018e-07

    Database monitor heap size (4KB) (MON_HEAP_SZ) = 256
    UDF shared memory set size (4KB) (UDF_MEM_SZ) = 256
    Java Virtual Machine heap size (4KB) (JAVA_HEAP_SZ) = 2048
    Audit buffer size (4KB) (AUDIT_BUF_SZ) = 0

    Backup buffer default size (4KB) (BACKBUFSZ) = 1024
    Restore buffer default size (4KB) (RESTBUFSZ) = 1024

    Sort heap threshold (4KB) (SHEAPTHRES) = 20000

    Directory cache support (DIR_CACHE) = YES

    Application support layer heap size (4KB) (ASLHEAPSZ) = 15
    Max requester I/O block size (bytes) (RQRIOBLK) = 32767
    Query heap size (4KB) (QUERY_HEAP_SZ) = 1000
    DRDA services heap size (4KB) (DRDA_HEAP_SZ) = 1000

    Priority of agents (AGENTPRI) = SYSTEM
    Max number of existing agents (MAXAGENTS) = 2000
    Agent pool size (NUM_POOLAGENTS) = 220
    Initial number of agents in pool (NUM_INITAGENTS) = 0
    Max number of coordinating agents (MAX_COORDAGENTS) = MAXAGENTS
    Max no. of concurrent coordinating agents (MAXCAGENTS) = MAX_COORDAGENTS
    Max number of logical agents (MAX_LOGICAGENTS) = MAX_COORDAGENTS

    Keep DARI process (KEEPDARI) = YES
    Max number of DARI processes (MAXDARI) = MAX_COORDAGENTS
    Initialize DARI process with JVM (INITDARI_JVM) = NO
    Initial number of fenced DARI process (NUM_INITDARIS) = 0

    Index re-creation time (INDEXREC) = RESTART

    Transaction manager database name (TM_DATABASE) = 1ST_CONN
    Transaction resync interval (sec) (RESYNC_INTERVAL) = 180

    SPM name (SPM_NAME) =
    SPM log size (SPM_LOG_FILE_SZ) = 256
    SPM resync agent limit (SPM_MAX_RESYNC) = 20
    SPM log path (SPM_LOG_PATH) =

    TCP/IP Service name (SVCENAME) = pdb2ins1
    APPC Transaction program name (TPNAME) =
    IPX/SPX File server name (FILESERVER) =
    IPX/SPX DB2 server object name (OBJECTNAME) =
    IPX/SPX Socket number (IPX_SOCKET) = 879E

    Discovery mode (DISCOVER) = SEARCH
    Discovery communication protocols (DISCOVER_COMM) = TCPIP
    Discover server instance (DISCOVER_INST) = ENABLE

    Maximum query degree of parallelism (MAX_QUERYDEGREE) = ANY
    Enable intra-partition parallelism (INTRA_PARALLEL) = NO

    No. of int. communication buffers(4KB)(FCM_NUM_BUFFERS) = 1024
    Number of FCM request blocks (FCM_NUM_RQB) = 512
    Number of FCM connection entries (FCM_NUM_CONNECT) = (FCM_NUM_RQB * 0.75)
    Number of FCM message anchors (FCM_NUM_ANCHORS) = (FCM_NUM_RQB * 0.75)
    ------------------------------------------

    Error messages in db2diag.log before the crash:
    ------------------
    2003-08-19-15.50.28.510117 Instance:pdb2ins1 Node:000
    PID:17819(db2gds) Appid:none
    oper_system_services sqloEDUSIGCHLDHandler Probe:20

    PID of abnormally terminated child process:
    0000 45a1 ..E.


    2003-08-19-15.50.28.526703 Instance:pdb2ins1 Node:000
    PID:17819(db2gds) Appid:none
    oper_system_services sqloEDUSIGCHLDHandler Probe:30

    waitpid() status of abnormally terminated child process:
    0000 0009 ....


    2003-08-19-15.50.28.529560 Instance:pdb2ins1 Node:000
    PID:17819(db2gds) Appid:none
    oper_system_services sqloEDUSIGCHLDHandler Probe:50

    Signal that terminated the child process:
    0000 0009 ....
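The three hex dumps above can be read by hand, but a small decoder makes the values explicit. This is a sketch that assumes the dumps are big-endian 32-bit words (the machine is SPARC) and that the waitpid() status uses the traditional UNIX layout, with the terminating signal in the low 7 bits and the core-dump flag in bit 0x80. The helper names are hypothetical.

```python
import signal


def dump_word(text):
    """Turn a db2diag hex dump such as '0000 45a1' into an integer."""
    return int(text.replace(" ", ""), 16)


def decode_wait_status(status):
    """Split a raw waitpid() status word (traditional UNIX layout assumed)."""
    low7 = status & 0x7F
    if low7 == 0:
        return {"kind": "exited", "exit_code": (status >> 8) & 0xFF}
    if low7 == 0x7F:
        return {"kind": "stopped", "signal": (status >> 8) & 0xFF}
    return {"kind": "signaled", "signal": low7, "core": bool(status & 0x80)}


def signal_name(number):
    """Best-effort name lookup on the machine running this script."""
    try:
        return signal.Signals(number).name
    except ValueError:
        return f"signal {number}"


child_pid = dump_word("0000 45a1")
status = decode_wait_status(dump_word("0000 0009"))
print(child_pid, status, signal_name(status["signal"]))
```

Under those assumptions the terminated child is PID 17825 and it was ended by signal 9 without a core dump. Signal 9 is SIGKILL, which is normally sent by another process or by the kernel rather than raised by a fault inside the process itself.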

    --------------------------------------------------------


    dump file generated:
    -----------------------------
    2003-08-19-15.50.28.532847 : DB2 v7.1.0.72 s021110 SQL07026
    SunOS R:5.8 V:Generic_108528-15 M:sun4u N:chr7ca27
    pdb2ins1.000 : db2gds (0x1)
    Signal #6

    Data seg top [sbrk(0)] = 0x01A55968
    Cur data size (bytes) = 0xFFFFFFFFFFFFFFFD
    Cur stack size (bytes) = 0x800000
    Cur core size (bytes) = 0xFFFFFFFFFFFFFFFD

    siginfo_t (length=128)
    00000006 ffffffff 00000000 0000459b
    00000000 00000000 00000000 00000000
    00000000 00000000 00000000 00000000
    00000000 00000000 00000000 00000000
    00000000 00000000 00000000 00000000
    00000000 00000000 00000000 00000000
    00000000 00000000 00000000 00000000
    00000000 00000000 00000000 00000000
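The first row of the siginfo_t dump can be decoded in the same way. This sketch assumes the 32-bit Solaris layout, in which the structure starts with si_signo, si_code and si_errno, followed by si_pid for signals sent by a process. That layout is an assumption taken from memory of the Solaris headers, not something stated in the dump, and the function name is hypothetical.

```python
import struct


def decode_siginfo_head(hex_row):
    """Unpack the first four big-endian 32-bit words of a siginfo_t hex dump."""
    raw = bytes.fromhex("".join(hex_row.split()))
    signo, code, errno_, pid = struct.unpack(">iiii", raw[:16])
    return {"si_signo": signo, "si_code": code, "si_errno": errno_, "si_pid": pid}


head = decode_siginfo_head("00000006 ffffffff 00000000 0000459b")
print(head)
```

If the layout assumption holds, the row reads as signal 6 (SIGABRT) with a negative si_code and a sender PID of 17819. That PID is db2gds itself according to the db2diag.log entries, which fits the stack trace, where the SIGCHLD handler calls abort() after noticing the killed child.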


    ucontext_t information:
    UCONTEXT: addr=ffbee1b0 flags=0000002f
    next=ffbee888 stack=ffbea000
    regset: psr= fe401002 pc= ff09bdc4
    npc= ff09bdc8 gwins=00000000
    g: 00000000 000000a3 01680c00 00000000
    00000000 018d2bfc 00000000 ff1475a8
    o: 00000000 00000001 00000005 ff0ba000
    00000000 ffbee5b8 ffbee4e8 ff04b758
    l: 150f0000 0000002e 00000011 00000031
    00000010 00000004 7efefeff 81010100
    i: 00000006 00000006 ffbee5a8 00000006
    ffbee6cc 00000010 ffbee548 ff035a3c
    SP==o6 FP==i6 SavePC=i7
    PC location: _lwp_kill + 0x8
    Object file: /usr/lib/libc.so.1 (offset 0x9bdc4)
    UCONTEXT: addr=ffbee888 flags=0000002f
    next=00000000 stack=ffbec000
    regset: psr= fe501001 pc= ff09bbdc
    npc= ff09bbe0 gwins=00000000
    g: 00000000 00000000 00be6400 00000000
    00000000 00000068 00000000 ff1475a8
    o: 0000005b 00000002 00008820 ffbeece4
    00000094 00000001 ffbeebc0 ff09362c
    l: 00000000 00000000 00000000 00000000
    00000000 00000000 00000000 00000000
    i: 00008820 ffbeece4 00000094 00000001
    00000000 ff0941d8 ffbeec20 ff12e460
    SP==o6 FP==i6 SavePC=i7
    PC location: _syscall + 0x8
    Object file: /usr/lib/libc.so.1 (offset 0x9bbdc)

    Signal Handlers

    SIGABRT : 2d2aec
    SIGBUS : default
    SIGCHLD : 2d21a8
    SIGEMT : default
    SIGILL : default
    SIGINT : 2d2a30
    SIGPRE : 2d311c
    SIGSEGV : bc7514
    SIGSYS : default
    SIGTRAP : default
    SIGALRM : ignored
    SIGURG : 2d311c
    SIGPROF : 2d43fc
    SIGPIPE : ignored
    SIGHUP : ignored
    SIGFPE : 2d2aec
    SIGUSR1 : 2d2d94
    SIGUSR2 : ignored

    ##### Object: /usr/lib/libc.so.1
    _lwp_kill(0x6,0x6,0xffbee5a8,0x6,0xffbee6cc,0x10) + 0x8
    abort(0xff0ba000,0x0,0x0,0xffbee6bc,0x1680ddc,0x140823c) + 0xc0
    ##### Object: /usr/pdb2ins1/sqllib/adm/db2sysc
    __0FVsqloEDUSIGCHLDHandleri(0x0,0x4,0x4001,0x194f800,0x4,0x1660) + 0x260
    ##### Object: /usr/lib/lwp//libthread.so.1
    setitimer(0x12,0xffbeeb40,0xffbee888,0x2d21a8,0x0,0x0) + 0xfc
    sema_post(0xff1475a8,0x1,0xff1479c0,0xffbee888,0xffbeeb40,0x12) + 0x510
    sema_post(0xff1475a8,0xffbeeb40,0xffbee888,0xff146000,0xffbeeb40,0x12) + 0x6dc
    ##### Object: /usr/lib/libc.so.1
    _msgrcv(0x8820,0xffbeece4,0x94,0x1,0x0,0xff0941d8) + 0x1c
    ##### Object: /usr/lib/lwp//libthread.so.1
    msgrcv(0x8820,0xff1475a8,0x94,0x1,0x0,0x94) + 0x68
    ##### Object: /usr/pdb2ins1/sqllib/adm/db2sysc
    __0FKsqloRunGDSv(0x194fbe0,0x18d2c04,0x0,0xff1475a8,0xffbeee04,0x0) + 0x12c
    __0FTsqloInitEDUServicesv(0x18d2c00,0x0,0x0,0xffbef6fc,0xffbef704,0x2) + 0x340
    __0FPsqloRunInstancePFv_iPFi_vPPvPlTE(0x1408dec,0x3,0x0,0x459a,0x10039394,0x1d080c) + 0x51c
    main(0x1944b50,0x3a,0x18cbe2c,0x1400,0x2000,0x0) + 0xb6c
    _start(0x0,0x0,0x0,0x0,0x0,0x0) + 0xdc

  2. #2
    Join Date
    Aug 2003
    Posts
    9

    Re: DB2 crashes after 60 DIA3003E Entries

    I opened a PMR with IBM for this error. When I get a solution I'll report it here. If anyone has a tip in the meantime, I would appreciate hearing it.

    regards
    roman

  3. #3
    Join Date
    Jan 2004
    Posts
    1

    Re: DB2 crashes after 60 DIA3003E Entries

    Do you have the PMR number for this problem, and did you get a fix
    for it? We are having similar issues and wanted to know if you could
    help.

    Thanks



  4. #4
    Join Date
    Aug 2003
    Posts
    9

    PMR closed

    Hi.

    The problem disappeared when we moved to a different machine (new hardware). Nothing special was changed on the new Solaris box. The problem itself was never solved, though. The old Solaris machine was pretty old and did have memory problems.

    Sorry I can't help you there.
    gr
    roman
