Intelligent Search Technology, Ltd. specializes in search and matching software.  Name Search our flagship product provides intelligence to both online and batch search and matching applications.  Name Search not only enables systems to find and match information based on personal and corporate names but also comes with powerful address searching and e-mail searching services.  Correct Address is address verification, validation and correction software harnesses the intelligence of the Name Search.  Name search also powers ISTwatch.  ISTwatch is terrorist checking software to enabling compliance with US patriots act.   Merlin Merge supplied with the name search is used for duplicate record identification and merge purge operations. The Intelligent Choice 
HOME  |  PRODUCTS  |  SERVICES  |  CUSTOMERS  |  NEWS |  ABOUT IST  |  MY ACCOUNT
NameSearch®
 


NameSearch®
» Overview
» Features
» Intelligence
» Architecture
» Integration
» Applications
» Company Name Search
» FAQ
» White Paper


Product Demo
» Free trial
» Personal demo

Technical Information
» System requirements
» SDKs
» Technical support








 
 

How NameSearch® Works - Sanitization

The sanitization module removes noise characters, extra spaces, control characters and converts lower case letters to uppercase. Examples of noise characters are: @, #. $, %, ^, &, *, (, ), }, {, [, ]. The following characters are handled separately and have special meanings: commas, hyphens and quotes. Commas usually indicate the insertion of a last name. Sanitization places words followed by commas at the end of the string. Quotes are deleted and the space between them is removed. A space replaces the hyphens.

Examples of Sanitization:

Before Sanitization After Sanitization
Scott Lions SCOTT LIONS
Smith, John F. JOHN F SMITH
Rose Stone-Shield ROSE STONE SHIELD
James O'Tool JAMES OTOOL
James O. Tool JAMES OTOOL
Owen, Tool, James JAMES OWEN TOOL
# Williams, $Richard RICHARD WILLIAMS

The sanitization module also contains a small rulebase. The rulebase is applied after all the alpha characters have been converted to upper case letters and extra blanks are removed. This rulebase is used to recognize words that contain noise characters or prefixes that could be effected by the sanitization process. The sanitization rulebase also gives you the ability to convert non-alpha-numeric characters to other symbols or words. The First Word rule type was designed for commercial name searches where a word in the first position of a name would be considered noise. There are times when a word in the middle of a commercial or cooperate name would help contribute to the identification of a record but the same word found in the first position would obscure the search. Classifying noise words based on position could effect NameSearch®’s ability to overcome sequence variations. The application of this rule should be used judiciously and with great thought. The sanitization rulebase can be easily modified using the NameSearch® Graphical User Interface, the "Generation Shell."

Before Sanitization After Sanitization Sanitization (without rulebase expertise)
c\o CARE OF C O
Mc Donald, Old OLD MCDONALD MC OLD DONALD
% CARE OF  


How NameSearch® works




    Home |  Privacy  |  Legal  |  Partners  |  Contact  |  Support

To find out more, call (800) 287-0412
Copyright © 1993-2006 Intelligent Search Technology Ltd.
IBM Business Partner emblem is a registered trademark of IBM Corporation.
Microsoft is a registered trademark of Microsoft Corporation.
'Java and all Java-based marks', Sun and Solaris are trademarks or registered trademarks of
Sun Microsystems, Inc. in the United States and other countries.
Oracle is a registered trademark of Oracle Corporation.