Is there a best practice for what source data to use during development.
I could use data from the source development environment, but this poses the problem that it may have architectural differences to production, or have data anomalies not in production (or lack data anomalies in production) and not show the same changes to data that occurs in the production environment.
I could use the production environment, but it introduces risk.
I could take a backup of source data, and work from the back up, but this isn't always practical with large databases and wouldn't allow me to see changes.
Currently I have a tendency to use a backup to understand a new data source initially and then work with production data so there are no surprises.
I'm wondering how others approach though