Data Profiling Software for Data Discovery,Data Assessment,Data Analysis
The primary functionality of the DataProfiler is to automatically extract statistical information from a data source using definitions stored in its metadata and to either display the retrieved results on screen, or generate a report which can be printed or saved. The target data store can be Microsoft® SQL Server® 2000/2005/2008, Oracle® 9i or higher
or IBM® DB2 (v9).




The two databases installed together with the DataProfiler are Repository and File Storage. The repository is the main storage for the profiling data, metadata describing target data sources and audit information about profiling activities and changes in the data sources structure.

The File Storage database is used to support profiling of the data stored in plain text format (i.e., CSV files). It is usually deployed into a local instance of SQL Server Express but can be deployed into local instance of any edition of SQL Server 2005/2008. The DataProfiler has built-in functionality allowing the user to load text files into this database and then to profile them just as any other data store.  

Data Sampling

Even the most sophisticated data profiling algorithms cannot replace sampling and visual analysis of the original data records. The DataProfiler provides robust and convenient methods of the data sampling. The features include random data sampling, column and record filtering, presentation of data in printable format and ability to save retrieved data sample into a file. The application automatically recalls the most recent data sampling settings for each table and allows you to save custom data sampling definitions.

>>Page Up<<


HomePrivacyLegalContactSite Map
Follow IST on Linkedin®