Talend Releases Enhanced Version of Its Open Source Data Quality Platform

Cost-Effective and Simple to use Products Enable More Efficient Processes to Provide Better Customer Service and Greater Customer Satisfaction

SAN DIEGO, Calif. - TDWI World Conference - August 4, 2009 - Talend, the recognized market leader in open source data integration software, today announced the availability of enhanced versions of its Talend Open Profiler and Talend Data Quality solutions.

From the largest Fortune 500 organization to the smallest company, the ability to move and combine reliable data across information systems can mean the difference between profit and loss. While data quality has been out of reach for most organizations, the new Talend Open Profiler and Talend Data Quality bring even more control and consistency to corporate data, at a fraction of the cost and without the complexity of other systems. “Data quality is essential to the success of any data-driven project, but has often been an afterthought for many organizations,” said Fabrice Bonan, co-founder and COO of Talend. “Talend's open source approach democratizes data quality, making it available to all companies as an easy-to-use offering. We have already seen strong adoption of our products and these new versions will further reinforce our position within the market.”

Talend Open Profiler - which is freely downloadable - is the first open source data profiler. It provides businesses with a snapshot assessment of the quality of their data, based on indicators they create for each data element. Built with the end user in mind, Talend Open Profiler can be used by anyone regardless of technical acumen, including business analysts and non-technical personnel.

Talend Data Quality is the first open source enterprise data quality platform to combine data profiling and data cleansing in the same environment. It not only enables organizations to get correct data into their information systems, but it identifies and eliminates bad, corrupted or duplicate data.

Talend's data quality products are used across a broad range of industries including retail, digital media, healthcare, government and more. The new version introduces the following capabilities:

  • Custom data quality rules - allows users to define their own business rules applied to data quality and validate their data sets against these business rules. For example, users can define a rule that will check the validity of a postal code, based on the country defined in another data item, or check that the area code of a phone number matches the geographical location of an address.
  • Pattern finder - identifies predominant patterns in data sets. For example, this could be used to detect that a “comments” field has actually been used to store Social Security numbers, or mobile phone numbers.
  • Advanced data profiling - includes redundancy profiling, used to detect relationships between entities (foreign keys candidates) and correlation profiling, used to identify outlying values and possible incorrect data points.
  • New data cleansing components - perform deduplication or data joins based on fuzzy matching technologies. For example, even though Marlborough and Marlboro are spelled differently, the two city names are often interchangeable. Fuzzy matching alleviates the concern that they will be classified separately.

Talend has also added a Web-based Data Quality Portal to Talend Data Quality which allows any user in charge of the quality and compliance of corporate data to understand, at a glance and in real-time, the improvement or degradation of data in order to identify and remedy any points of failure. The portal allows for granular reporting, the ability to better define key alerts and drill down into data issues.

Press contact:
press@talend.com