Intelligent Search Technology Extends Microsoft SSIS Data Cleansing Functionality
Intelligent Search Technology now provides SSIS users with
a flexible and efficient way of cleansing their data directly within their
SSIS solutions.
White Plains, NY (January 23, 2007) - Intelligent Search Technology (IST)
today announced the availability of its Data Quality (DQ) solution for SQL
Server 2005 Integration Services (SSIS). This solution extends the data
cleansing capabilities of SSIS by providing powerful components that utilize
the intelligence of IST’s NameSearch® engine. The DQ components
available at this time are SSIS Deduplication, Merge-Purge, and Address
Correction. These components integrate directly into the SSIS data flow
engine, allowing users to seamlessly utilize IST’s advanced data
matching technology within their SSIS solutions.
The Deduplication transformation component uses the NameSearch
algorithms for advanced searching and matching to filter duplicate records
from a single data stream. The user can customize the deduplication process
by selecting match criteria, weight factors, score thresholds and match
algorithms. The match criteria can include any combination of personal
names, corporate names, addresses, dates, numbers and other alphanumeric
sequences. Duplicate records are filtered into a separate table where
they are grouped together based on their duplicate identifiers. Users
may also choose to remove duplicate records.
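As an illustration only, the weighted scoring and threshold approach described above can be sketched in Python. This is not the NameSearch algorithm: the field names, the weights, the 0.85 threshold, and the use of the standard library's difflib as the match algorithm are all assumptions made for the example.

```python
from difflib import SequenceMatcher

# Hypothetical match criteria and weight factors (not IST's actual settings).
WEIGHTS = {"name": 0.6, "address": 0.4}
THRESHOLD = 0.85  # hypothetical score threshold


def similarity(a: str, b: str) -> float:
    """Generic string similarity in [0, 1]; a stand-in for a real match algorithm."""
    return SequenceMatcher(None, a.lower().strip(), b.lower().strip()).ratio()


def score(rec_a: dict, rec_b: dict) -> float:
    """Weighted sum of the per-field similarities."""
    return sum(w * similarity(rec_a[f], rec_b[f]) for f, w in WEIGHTS.items())


def deduplicate(records: list[dict]) -> tuple[list[dict], list[dict]]:
    """Split one stream into unique rows and duplicates tagged with a group id."""
    unique, duplicates = [], []
    for rec in records:
        for group_id, kept in enumerate(unique):
            if score(rec, kept) >= THRESHOLD:
                # Route the duplicate to a separate output, grouped by identifier.
                duplicates.append({**rec, "duplicate_group": group_id})
                break
        else:
            unique.append(rec)
    return unique, duplicates
```

In this sketch each incoming record is compared against the records already kept, which is quadratic in the worst case; a production component would be expected to use indexing or blocking to limit comparisons.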
The Merge-Purge transformation component allows merging of two
data streams by creating full records from partial duplicates while simultaneously
filtering duplicate records. Similar to deduplication, merge-purge derives
results by searching and matching on different combinations of match criteria.
The output is stored in a master table containing deduplicated records
from both data sources.
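The merge-purge idea described above, building a full record from partial duplicates while dropping the duplicates, can be sketched as follows. This is a simplified illustration, not IST's implementation: matching on a single hypothetical "name" field, the 0.85 threshold, and the rule that empty fields are filled from the matching record are all assumptions.

```python
from difflib import SequenceMatcher

THRESHOLD = 0.85  # hypothetical score threshold


def is_match(a: dict, b: dict) -> bool:
    """Treat two records as duplicates when their names are similar enough."""
    ratio = SequenceMatcher(None, a["name"].lower(), b["name"].lower()).ratio()
    return ratio >= THRESHOLD


def merge_records(primary: dict, secondary: dict) -> dict:
    """Keep the primary record's values and fill its empty fields from the secondary."""
    merged = dict(primary)
    for field, value in secondary.items():
        if not merged.get(field):
            merged[field] = value
    return merged


def merge_purge(stream_a: list[dict], stream_b: list[dict]) -> list[dict]:
    """Combine two streams into one master list of deduplicated, merged records."""
    master: list[dict] = []
    for rec in [*stream_a, *stream_b]:
        for i, kept in enumerate(master):
            if is_match(rec, kept):
                master[i] = merge_records(kept, rec)
                break
        else:
            master.append(dict(rec))
    return master
```

Here the first record seen for a group is treated as the primary one, so the order of the input streams decides which values survive when both records have a field populated.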
The Address Correction component corrects addresses according
to U.S. Postal Service™ standards while flagging or filtering
incorrect addresses. Additionally, this component will provide possible
alternative addresses when the input address is ambiguous.
Combining the flexibility and ease of use of SSIS with
NameSearch’s data cleansing intelligence offers users an extremely
powerful and efficient solution for solving data quality issues during
the data integration process.
IST is proud to partner with Microsoft on its SSIS Data Quality
initiatives. With this set of components, IST continues to solidify its
position as the leading provider of database-integrated searching and
matching solutions.
About Microsoft SSIS
Microsoft SQL Server 2005 Integration Services (SSIS) is a platform
for building high-performance data integration solutions, including
extraction, transformation, and load (ETL) packages for data warehousing.
Integration Services includes graphical tools and wizards for building
and debugging packages; tasks for performing workflow functions such
as FTP operations, for executing SQL statements, or for sending e-mail
messages; data sources and destinations for extracting and loading data;
transformations for cleaning, aggregating, merging, and copying data;
a management service, the Integration Services service, for administering
Integration Services; and Application Programming Interfaces (APIs)
for programming the Integration Services object model.
For more information about SSIS Data Quality, please visit:
http://msdn2.microsoft.com/en-us/library/aa964137.aspx
About Intelligent Search Technology
Founded in 1993, Intelligent Search Technology, Ltd.
(IST) has devoted its resources to the development of the fastest
and most accurate data searching and matching software. IST is a privately
held company and has shown continued growth since its inception. With
the development of MerlinMerge® SpeedPro (duplicate detection and
merge-purge software), CorrectAddress® (address verification and address
correction software), and ISTwatch© (OFAC compliance and terrorist
searching software), IST continues to develop and market core technologies
used for data management and retrieval of information from large systems
running under diverse hardware and software platforms.