Results 1 to 3 of 3
  1. #1
    Join Date
    Jul 2009
    Posts
    10

    Unanswered: Data Profiling in SQL Server

    Hi - does anyone have any tips for profiling and cleansing data with T-SQL? I've got a lot of columns of data to clean up and it seems the only way of cleaning it all is to handcraft T-SQL code to:

    1) first profile my data, then,
    2) Clean it.

    For example, I have a column of datetimes with a lot of noise in it - nulls, future dates (there shouldn't be!) and dates that are far too old (i.e. they were incorrectly entered). I need to end up with some script that strips out (or possibly even cleans) the invalid rows. Trouble is, I need to do this for a lot of columns...

    Does anyone know of a quicker way of doing this - is there something like this in SSIS or elsewhere that will make this job easier/quicker? A profiling tool of some sort?

    Thank!

  2. #2
    Join Date
    Nov 2002
    Location
    Jersey
    Posts
    10,322
    k

    do you know what information_schema.columns are?
    Brett
    8-)

    It's a Great Day for America everybody!

    dbforums Yak CorralRadio 'Rita
    dbForums Member List
    I'm Good Once as I ever was

    The physical order of data in a database has no meaning.

  3. #3
    Join Date
    Jul 2009
    Posts
    10
    Hi Brett

    Yes - I can get access to the information in information_schema.columns. And I did think about writing something which would interrogate this information and generate some SQL script for me - is this what you would suggest?

    I guess I was just wondering if there was a quicker way of doing this (e.g. some part of SSMS that I've missed!) or even a tool I could download that could help.. If not, I'm happy to start coding something myself - but I didn't want to do this if somebody had already done it for me!

    Ben

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •