SSIS Data Quality - SSIS Merge/Purge
A typical merge/purge process involves two important steps - first merging data from different
sources and second, ensuring that the merging process does not create duplicate records by purging
records already in the combined data source. The IST SSIS merge/purge component is a great
tool that offers a solution for both of these challenges. The component utilizes the SSIS framework,
which makes it is easy to chain and merge many data sources and IST’s intelligent fuzzy matching
engine to identify and purge duplicate records.

Fuzzy Grouping and fuzzy lookup is not Enough

IST’s fuzzy matching engine is easier to use and provides better results than the fuzzy grouping
and fuzzy lookup transformations provided with SSIS.

Our SSIS deduplication component provides extremely powerful and accurate data deduplication
functionality. At its core is IST’s powerful searching and matching technology, which allows it to
outperform the competition. How? By using sophisticated techniques, such as “fuzzy” matching,
heuristic algorithms, phonetic analysis, and much more. The sophisticated techniques at the heart
of IST SSIS deduplication are hidden from the end-user through an easy-to-use, point-and-click
graphical user interface. Using only a few mouse clicks, users can send their deduping jobs to
the matching engine and receive processed results in a specified format. Combining advanced
deduping with SSIS built-in extensibility and data integration features provides a powerful toolset
for every data steward or data warehousing specialist.

Our SSIS merge/purge component finds duplicates across two data sources while our SSIS
deduplication component
finds duplicates within a single data sources. If you want to dedupe or
find matches across multiple data sources you can chain multiple SSIS merge/purge components
in the SSIS work pane.

.: To find out more about IST technology click here.
To find out more, call (800) 287-0412
www.intelligentsearch.com, Copyright © 1993-2007 Intelligent Search Technology Ltd.