Hello Gurus,
Quick question - my application is DB2 'partitioning-aware'. It initiates 2 application partitions to connect to each of the 2 physical database partitions. I can see messages in the application log indicating each of the application connections connected to individual database partitions.
This particular ETL routine is performing a join between 2 tables A1 and A2 - A1 is hash partitioned on AK1 and A2 is hashed on AK2.
When I look at the queries generated by the application partitions, I see -
Application Partition 1:
Select * from A1 , A2 where A1.AK1 = A2.AK1 and Nodenumber(A1.AK1) = 1
Application Partition 2:
Select * from A1 , A2 where A1.AK1 = A2.AK1 and Nodenumber(A1.AK1) = 2
While I can confirm if there will be data lost due to the above partitioning (application) by reviewing the data, the data load is in progress, and is going to take quite a while,
I was wondering if you guys could tell by just reviewing the description above that my application would miss data because of how it is connecting to the data nodes and performing a non-collocated join.
Thanks in advance for your help!
- G