Monday, June 14, 2010

Informatica v9 shifts and how to manage them: Source to Target

Introduction


I've recently made the shift from Informatica Data Quality / Data Explorer version 8.x to Informatica v9.  In the process I have discovered quite a few shifts in how certain tasks are performed.  Notice I called them shifts.  I did so because after learning them, I didn't feel as though there were fundamental changes.

A prime example and the focus of this post is the source-to-target process.  This process is so fundamental to ETL and data quality that it is often the starting point for all development.  As such I have chosen it as my starting point in mapping out the shifts in v9.

Shifting from the start ...


In IDQ v8.x there was a source component that you added to the plan and defined either with custom SQL or by selecting the required data attributes.  In v9 you'll have a hard time finding that component.  I know I did.  That's because a shift has been made by Informatica away from source/target components and toward physical / logical data objects.  For simplicity purposes we'll use physical data objects in our example.  However a logical data object is a construct that merits mention here and I'll follow up with in a later posting.

Logically speaking


Briefly, a logical data object is a generic data object that is not customized to one particular data environment but rather is constructed to represent a typical instance of metadata.  An example is a logical data object that models addresses.  This data is fairly consistent in most environments, so it is advantageous to an ETL/DQ developer to build a generic object that represents the typical metadata associated with an address.  Once this object is built it can be defined as reusable and dropped into any mapping that requires address data.

In the physical world


Physical data objects are different than logical data objects in that they more directly represent the data environment as it exists in reality.  Physical data objects can be defined from an existing data source, such as a flat file or data store.  Once the physical data object is created it can be used as a read or write object.  Defining a physical data object as a read object is analogous to using the source component.

To create a read physical data object follow the steps defined below:

  1. Right-click on the Physical Data Object entry in the Data Explorer pane on the left-hand side of the Informatica Developer utility

  2. Select Create Physical Data Object from menu options provided

  3. Select either the flat file or data store source of the source data

  4. Be sure to double check the data types and precision of the attributes created from the source

  5. In the properties section of the data object specify run time properties like out file name and directory


Once the physical data object is created, drag-and-drop the object into a mapping and select the Read option from the dialog that appears.  Once this step is complete, you've made the shift to using data object instead of components to define sources in Informatica Data Quality v9!

I'll cover write objects next month and move on to logical data objects shortly after, so stay tuned!

Please feel free to submit comments and questions and I will respond as quickly as possible ...

10 comments:

  1. Very helpful, as I am just starting to work with version 9, and am starting to test our version 8 address validation routines.

    ReplyDelete
  2. Thanks for reading and commenting on the blog post, John. I have some content on here on v9 changes and plan to add more. V9 should provide a conversion utility to convert your v8 code to the v9 platform, if I am not mistaken. I'll double check on that and get back to you.
    If you'd like to see a v9 topic covered, feel free to shoot me a note and I can do my best to cover it!

    ReplyDelete
  3. Very Helpful William! Thanks for sharing u'r experience.

    ReplyDelete
  4. Pl. keep posting u'r experience with IDQ V-9

    ReplyDelete
  5. Thanks for stopping by and commenting Ranjeeta! Glad I could shed light on the changes in v9 for you. Please come back and comment again soon.

    ReplyDelete
  6. Of course, Ranjeeta. Glad it is helpful!

    ReplyDelete
  7. very informative..Please share some more exp if you have in Informatica like wt things should be taken care when shifting from V8 to V9 and at the same time server is changing to different server..

    Thanks!!

    ReplyDelete
  8. [...] v9 shifts and how to manage them: Source to Target – Introduction I've… http://thedataqualitychronicle.org/… #dataquality [...]

    ReplyDelete
  9. Hi, very useful informn.

    can u pls provide some materials which can demonstrate how to use Informatica's data validation tool bcoz i am beginner for this tool.



    thanks.

    ReplyDelete
  10. Ashish

    Glad you liked the information! I have a section in the blog dedicated to Informatica Data Quality. Please feel free to check that section out or contact me via the contact form on the right hand side of the blog to submit a specific request. I look forward to our exchange in the future!

    ReplyDelete

What data quality is (and what it is not)

Like the radar system pictured above, data quality is a sentinel; a detection system put in place to warn of threats to valuable assets. ...