hi all,
I'm not confident to post the topic in this section, but not yet find batter match
Recently we had a mail server failed because of the database broken, I made an investigation that is because the disk used by database (mysql) has broken sectors just at the ibdata and ib_logfiles that ruin the whole database. The disk is a normal seagate sata disk using under 24x7 loads for several months without stop.
I then take an investigations to all hard disks we have from workstations to servers. I then find another 4 disks has broken sectors and all of them has installed with database (mysql or postgresql). All other disks using in application server or other workstations even running 24x7 has no problem. (we use normal SATA or SATAII disks with S.M.A.R.T. enabled)
This gives me an conclusion that database uses will easily give broken sectors to hard disks. So I'm looking solutions to monitor bad sectors in a running linux.
I think the usual solution is periodically reboot the server to force check the disk or use RAID, but in my concern even use RAID we need to have a knowledge of the broken sectors on the disk, for managed repair or change.
I'll do some google of the topic and update here, also appreciate any of your knowledge!
P.S. for modern disks that has extra blank sectors that the firmware can detect broken sectors and relocate it with transfer the data before it is impossible to read. but it seems not 100% reliable as our disks are those bad samples.