• So the whole business should stop seeing crucial data that supports their daily decision-making, and wait for a data scientist to sanitise the data (however long it takes)?

    The fact is the end users of those data are the domain experts who can tell what is rogue data and what is real trend better than anybody else, including the data scientist who has generic data knowledge but not necessarily the domain knowledge.

    IMHO, we should just give the data to the business, and give them the tools that highlights abnormal trends and help them do the analysis. That way you don't stop them seeing the data, but also help them identify rogue data.