Quote:
|
Originally Posted by pootle flump
Sounds interesting - please could I request that people respond to the thread, rather than email or pm. This way you gain from the exposure on the forum and the forum gains by keeping much of your source material here for others to reference. It would also be jolly nice if you could somehow link to your eventual article, or at least paste in the recommendations, upon completion.
|
Thanks for looking kindly on the thread, especially if you are a mod!
Anyway, I will happily post further details here. After collating the ideas of people like you about these things, I will be able to produce a framework for an automated solution.
Background/Purpose Of The Study
Poor data quality costs enterprises money; by making business processes less efficient, by increasing the cost of maintaining contact with their customer base and through loss of customers due to poor customer service provision. The purpose of this research is to help demonstrate that automation remains worthwhile, bringing many perceptible benefits to an enterprise, particularly in terms of the quality of service it provides.
There has been much investigation into the benefits of automation in other areas of computing, such as automation of testing in software development. As data loading and cleaning involves similarly programmable recurring processes, and automation programmes are being initiated on this basis, I believe it is now worth fully investigating whether the benefits of automation can be replicated in this area. Although there is some research on the benefits of automation in other areas, the research into the benefits of automation in data loading and cleaning is meagre.
Thanks for taking the time to have a look over my research – if you do respond, you are of course free to withdraw at any time, and without giving a reason. The results will be written up and a copy will be made available in the university library. I will also, following pootle flump's request, link to the paper if and when it is published. If you would like a copy please don’t hesitate to contact me. I would like to make clear that all results would be made anonymous unless you would like me to cite you as a source, please say and I would be happy to do so...
The questions I am looking to discuss are as follows.
1) What are the main data quality problems in marketing databases?
2) What are the main factors that cause these issues to arise?
3) What are the main costs of poor data quality?
4) How would you go about measuring data quality? What are the key indicators?
5) Do you believe that automated or manual loading and/or cleaning can best remedy these problems?
6) How possible do you think it is to achieve fully automated processes to improve data quality?
7) Are there any major impediments to being able to improve data quality using automated processes?
8) What change do you think automated data loading and cleaning solutions may have on an organisation?
9) Are there any other side benefits of automated processing in improving data quality?
10) Do you think the impact of these is felt to be beneficial to an organisation's clients/stakeholders?
11) Therefore, are manual or automated solutions preferable?
Many thanks.