Saturday, November 19, 2011

Too much push, not enough pull

As you might have noticed from my long absence in the blogosphere, I have been very busy managing and developing EMC Consulting's Data Quality Practice.  One thing has become very clear to me over the past few months ... data quality is an ambiguous term that means different things to different people.

I have also become aware of the fact that blogging lends itself to publishing a perspective, rather than collecting perspectives.

In an attempt to put some structure around the term data quality by collecting perspectives, I have decided to post some polls and then write about them.

I have chosen three rather basic polls in an attempt to build a baseline.  If you are reading this post, you are obviously interested in data quality (or an insomniac looking for a cure!).  Please participate in the polls and comment on this post if you feel I have missed a baseline measure that will help add clarity to data quality.

When is data quality useful?


[polldaddy poll=5684585]

When using data quality tools, what do you include?


[polldaddy poll=5684590]

Data Quality: underutilized or over-hyped?


[polldaddy poll=5684595]

Of course I have my own opinions, but I want this post, and the series of posts that follow, to be more about what others think than my own thoughts.

In an attempt to gather as many perspectives as possible, please pass this post around your network and feel free to add comments if you feel I have neglected to add something important.

3 comments:

  1. Unfortunately the people that will respond to this poll provide a somewhat singular worldview on data quality. Few outside the relatively small community of data quality participants consider data quality a problem. Professing that organizations need to adopt data quality is not itself sufficient to prove the need for data quality. A justification can be made for improving business process quality that is even more pervasive in organizations than data quality but there des not appear to be the same degree of concern for business process quality. Organizations have not adopted a quality culture as a result attempting to deploy data quality in this environment will prove to be limited.

    ReplyDelete
  2. Richard
    I, sadly, have to agree with you. I think one of the largest obstacles for data quality is that it is primarily a reaction to poor systems design. This reduces the justification of data quality to "we can correct mistakes". There are some exceptions, like Address Validation and Matching, but those are special cases.
    This could change once we enter the "Big Data" world and the deluge of data becomes so degraded that quality wins over quamtity.
    Thanks for the comment, Richard!

    ReplyDelete

What data quality is (and what it is not)

Like the radar system pictured above, data quality is a sentinel; a detection system put in place to warn of threats to valuable assets. ...