Introduction When you’re running MongoDB at scale with data distributed across multiple Pure Storage FlashArrays, achieving truly consistent backups becomes a critical and interesting technical challenge. In this post, I’m walking through an automated snapshot and recovery solution for a sharded MongoDB cluster running across two separate FlashArrays.
Important: This demonstration uses a sharded cluster with data distributed across multiple shard servers, not a replica set. Sharded deployments introduce additional complexity for backup operations because data is partitioned across multiple servers and storage arrays.