Books on implementing data lineage

  • Hi,

    We are currently implementing data lineage in our data warehouse, and I can't seem to find a book that really covers the subject. I have checked the 2012 certification PDF and a few other books, with no success.

    I know there are tools for implementing lineage and audit data, but we would like to create a db to hold the metadata (mainly for completeness and accuracy). Is there any book that covers this subject in detail?

    Thanks a lot!

  • I have also looked and not been able to find anything. Although in general I'm a big fan of Ralph Kimball's books, I find it disappointing that he talks about the importance of lineage without making any practical suggestions about how to implement it.

  • Some ETL tools, such as Informatica, have data lineage built in. To my knowledge, SSIS does not. You'll have to either build on SSIS's auditing features or find a third-party add-on.

  • Hi,

    Thank you for the replies. Our workflows are built in Informatica, but they charge extra for the lineage info, so we need to do it ourselves :-).

    We will do it step by step and improve it as we go along.

    Basically, we created a metadata db that will be populated from the Informatica workflows.

    If I find anything useful, I will post it here, but I'm not very optimistic :-).
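
A minimal sketch of what a home-grown metadata db like the one described above might look like, using SQLite from Python. Everything here is an assumption made for illustration: the table names (`workflow_run`, `lineage_step`), the columns, and the helper functions are hypothetical and are not part of Informatica or SSIS. The idea is that each workflow writes one row per source-to-target load with its row counts, so completeness can be checked afterwards and lineage can be traced by walking the source/target pairs.

```python
import sqlite3

# Hypothetical schema: names are illustrative, not taken from any ETL tool.
SCHEMA = """
CREATE TABLE IF NOT EXISTS workflow_run (
    run_id        INTEGER PRIMARY KEY,
    workflow_name TEXT NOT NULL,
    started_at    TEXT NOT NULL DEFAULT CURRENT_TIMESTAMP
);
CREATE TABLE IF NOT EXISTS lineage_step (
    step_id       INTEGER PRIMARY KEY,
    run_id        INTEGER NOT NULL REFERENCES workflow_run(run_id),
    source_table  TEXT NOT NULL,
    target_table  TEXT NOT NULL,
    rows_read     INTEGER NOT NULL,
    rows_written  INTEGER NOT NULL,
    rows_rejected INTEGER NOT NULL DEFAULT 0
);
"""


def open_metadata_db(path=":memory:"):
    """Open (or create) the metadata db and make sure the tables exist."""
    conn = sqlite3.connect(path)
    conn.executescript(SCHEMA)
    return conn


def start_run(conn, workflow_name):
    """Register one execution of a workflow and return its run_id."""
    cur = conn.execute(
        "INSERT INTO workflow_run (workflow_name) VALUES (?)", (workflow_name,)
    )
    conn.commit()
    return cur.lastrowid


def record_step(conn, run_id, source_table, target_table,
                rows_read, rows_written, rows_rejected=0):
    """Record one source-to-target load together with its row counts."""
    conn.execute(
        "INSERT INTO lineage_step (run_id, source_table, target_table,"
        " rows_read, rows_written, rows_rejected) VALUES (?, ?, ?, ?, ?, ?)",
        (run_id, source_table, target_table,
         rows_read, rows_written, rows_rejected),
    )
    conn.commit()


def incomplete_steps(conn, run_id):
    """Completeness check: steps where some rows are unaccounted for.

    Assumes a one-to-one row mapping; aggregating steps need another rule.
    """
    return conn.execute(
        "SELECT source_table, target_table,"
        " rows_read - rows_written - rows_rejected"
        " FROM lineage_step"
        " WHERE run_id = ? AND rows_read <> rows_written + rows_rejected"
        " ORDER BY step_id",
        (run_id,),
    ).fetchall()


def upstream_tables(conn, run_id, target_table):
    """Lineage query: every table that feeds target_table, directly or not."""
    rows = conn.execute(
        "WITH RECURSIVE upstream(name) AS ("
        "  SELECT source_table FROM lineage_step"
        "  WHERE run_id = ? AND target_table = ?"
        "  UNION"
        "  SELECT s.source_table FROM lineage_step s"
        "  JOIN upstream u ON s.target_table = u.name"
        "  WHERE s.run_id = ?"
        ") SELECT name FROM upstream ORDER BY name",
        (run_id, target_table, run_id),
    ).fetchall()
    return [name for (name,) in rows]
```

In this sketch, each workflow would call `start_run` once and `record_step` after every load. `incomplete_steps` then covers the completeness side, and `upstream_tables` answers "where did this table's data come from". Accuracy checks (for example, comparing control totals) would need additional columns that are not shown here.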
