Info Request  | Help |  Site Map |  Contact Us | Home   

Search


Advanced Search

RESOURCES

White Papers
Articles
Brochures
Books
 

Complimentary Products

matchIT Contact

Back Download PDF Web
 
 
 

 
 Data cleansing and deduplication software

Your customers are unique — don't send them duplicate mail

Duplicate records within databases are created for a number of reasons — keying errors, misheard names and addresses, merging files. This is a huge problem for anyone responsible for the quality and integrity of a database. As data is collected from a wide variety of sources, the number of duplicate entries will just keep growing and growing. An average database can contain 5% duplicate records; in many cases, significantly more. With the explosion of web data collection, the data is often badly cased and structured too. Quite simply, when you send duplicate and poorly addressed mail to customers and prospects, it is a huge waste of money and will seriously damage your reputation.


 

Eliminate duplicate records easily

Anybody can dedupe records when they match exactly. But what happens when the differences are phonetic variations or typing errors? Phonetic and miskeyed data accounts for a huge proportion of duplicate records. Most of these are left undetected. Fortunately, there is a solution. is the result of more than 10 years of development by helpIT systems. It incorporates the very latest fuzzy matching algorithms and will allow you to find these duplicates. It is easy to use and can carry out a whole host of other data cleansing tasks simultaneously, saving vast amounts of time and money. If you send large mailings, using to dedupe and correct your data can pay for itself immediately.


 
  • Identify duplicates in any data; business or consumer, UK or international
  • Case data intelligently and create salutations from unstructured names
  • Merge files, purge customers from lists, transfer valuable information between matching records
  • Verify matches found interactively and/or delete them automatically.

Matching records shown during the dedupe process

scores each pair of duplicate records that it finds — the higher the score, the more likely the match. You can easily review the lower scoring matches (or all matches) before deletion. You can transfer information from one record to the other, to ensure that none of your valuable data is lost.

is an easy-to-use Windows based deduplication system. It can process both business and personal data from any country, which means that all of your data cleansing and duplication worries can be easily solved. And you don't have to be a "techie" — anyone can use . You just select the processing options that you want and then sit back while does all the work.


 
How does work?

matches records by comparing their data in great detail, without relying on any single item of data being consistent. It uses powerful fuzzy matching techniques to find the similarity between Mrs E Shaw and Lizzie Shore, for example. also matches business names, allowing for optional elements of company names and acronyms e.g. The Dalton Group, Dolton Ltd and TDG.

grades duplicates by "score" — a higher matching score indicates a closer match. You can even set your own rules for scoring, to allow for the differing nature of data files or specific requirements. You can choose to individually inspect the results, or save time and delete records that score above a certain threshold. can even identify which of a pair of records is "best" and should be preserved, or combine data from duplicate records so nothing is lost — and you can change these rules as well. Of course, you can print detailed reports at every stage in the process, or export the results for external database processing if desired.


 
  • Find the common records across multiple databases — even if they are in different formats
  • Set up automatic job scripts for tasks that you will repeat on a regular basis
  • Print address labels or mail-merge letters, or output a file to be printed elsewhere
  • Import all the common data formats e.g. Access, Excel, CSV, DBF, SQL Server, Oracle

 
Output options after deduplication

Once your data processing has been completed, you are able to choose what to do with it. Produce labels, letters or output to any major format.

 

 
 
Back to GoldMine