Sunday, September 1, 2013

How to design a Source to Target Matrix

After working a Source to Target Matrix (STTM) for weeks I came to a rather disappointing conclusion ... we laid out the matrix by sources and a developer can't pick that up and start developing. To say the least I was upset at myself and frustrated that we needed to reorganize this document again.  So I decided to pass on this piece of advice to you so you don't have to go through this like did.
Organize your STTM by targets and not sources

The target is what the developer is using as a starting point.  As a result, the developer needs to have his work, the mappings, organized by targets.  I'm not suggesting deviating from the left to right convention of source to target, but rather building a STTM for each target.  In many cases this means having a Excel tab for each target.

Sounds simple enough, right?  In reality it is simple but the complexity comes in how the information is gathered or relayed from clients.  If you start conversations with the client in the context of a target the approach is straight forward.  But that's not reality.  Clients start by talking about sources because that is what they know.  At the end of that session you end up with one source and multiple targets and a document you need to reorganize.
Have migration conversations with clients in the context of the target and save everyone the time of reorganizing and confirming the requirements

This approach is rooted in the principle of beginning with the end in mind.  In my experience, when everyone understands the goal in detail it is much easier, efficient and accurate to hit the target (no pun intended! Well maybe a little).

2 comments:

  1. I would strongly suggest not using spreadsheets for this because spreadsheets and Data Quality seldom appear in the same conversation.

    The source to target matrix will be multidimensional and needs to be easily viewed and validated through all of its dimensions. Doing this on a spreadsheet will result in duplication of entries, which is flies in the face of DQ practice.

    A simple intersection table sitting between source and target is an ideal way of viewing the the matrix in all dimensions.

    You just need to take care with defining the unique identifiers (these will not be the primary keys) and run a unique index on them, to prevent making duplicate entries.

    When you this approach it does not matter where you start the conversation.

    Kind regards
    John

    ReplyDelete
  2. All valid points, John! I have thought about building an STTM app a dozen times. Unfortunately, I am too busy dedupping spreadsheets! LOL. Thanks for the comment!

    ReplyDelete

What data quality is (and what it is not)

Like the radar system pictured above, data quality is a sentinel; a detection system put in place to warn of threats to valuable assets. ...