• Michael Meierruth (11/8/2012)


    So in those several thousand cases what do you see as the main cause of these corruptions. It's ovbiously something bad at the hardware level. But what I'm looking for is things like 'having bought cheap hardware', 'not changing hardware often enough', 'bad physical environments', 'plain bad luck' i.e. 'philosophical' things like that.

    I've had it happen due to a harddrive controller crapping out intermittently. And then once with a bad memory chip. These were solid machines that weren't cheap or old. Philosophically, you plan for the worst and hope for the best. Or something like that 🙂

    Look at several companies with server hardware in Lower Manhattan - they are blaming their host company for flooded hardware, saying the host company shouldn't have servers in basements. But that wouldn't have been a problem if they'd prepared for the worst and ensured that they had good offsite backups or mirrored co-location facilities outside of the disaster area. DR and BCP should not be overlooked anymore or an afterthought.

    MWise