Data Quality Specifications


Talend Data Quality, as part of Talend Platforms, supports the following third-party components, products and operating systems. Support varies across Talend products. For detailed information, refer to the product installation documentation and release notes.

Address Validation, Standardization and Enrichment 

Through a combination of components and services, Talend supports the following address validation partners: Google, Loqate, Melissa Data and QAS/Experian.

Matching Algorithms

Exact Match, Soundex, Soundex FR, Metaphone, Double Metaphone, Levenshtein, Jaro, Jaro-Winkler, Q-gram, Custom/User-Defined, Hamming, Swoosh, VSR
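
For readers unfamiliar with the distance-based algorithms in this list, the following is a minimal Python sketch of two of them, Levenshtein and Hamming. It is an illustration of the general techniques only, not Talend's implementation. The function names and the normalized `similarity` helper are assumptions introduced here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions,
    or substitutions needed to turn a into b."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution or match
            ))
        previous = current
    return previous[-1]


def hamming(a: str, b: str) -> int:
    """Number of positions at which two equal-length strings differ."""
    if len(a) != len(b):
        raise ValueError("Hamming distance requires strings of equal length")
    return sum(ca != cb for ca, cb in zip(a, b))


def similarity(a: str, b: str) -> float:
    """Hypothetical helper: Levenshtein distance scaled to the range 0..1,
    where 1.0 means the strings are identical."""
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1.0 - levenshtein(a, b) / longest
```

A matching rule built on such a score would typically treat two values as a match when their similarity exceeds a chosen threshold; the threshold itself depends on the data and is not specified here.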

Supported Database and Storage Connectivity

Amazon Aurora, Amazon RDS, Amazon Redshift, Amazon S3, AS400, DB2, Derby DB, EXASOL, eXist-db, Firebird, Google Cloud Storage, Greenplum, H2, HSQLDB, Informix, Ingres, InterBase, JavaDB, JDBC, MariaDB, MaxDB, Microsoft Azure Blob Storage, Azure Queue Storage, Azure SQL Data Warehouse, Azure Table Storage, Microsoft OLE-DB, Microsoft SQL Server, MySQL, Netezza, Oracle, ParAccel, PostgreSQL, Postgres Plus, SAP Business Warehouse, SAS, Snowflake, SQLite, Sybase, Sybase IQ, Teradata, VectorWise, Vertica

Connectors to SaaS, Enterprise, and More

  • SaaS Connectors: Marketo, ServiceNow, Salesforce.com and Salesforce Wave, NetSuite, Microsoft Dynamics CRM 2016, Microsoft Dynamics AX
  • Packaged Application Connectors: SAP (table extract, BAPI, IDoc), SugarCRM, Microsoft Dynamics CRM/365, Sage X3, CentricCRM, Vtiger CRM, Openbravo
  • Technical Connectors: Amazon S3, Amazon SQS, Bonita, Box, Dropbox, Elasticsearch, Google Drive, JIRA, Email (SMTP), FTP/SFTP, LDAP, REST, Splunk

Supported Big Data Distributions and Technologies

Amazon Redshift, Amazon DynamoDB, Amazon EMR (including Apache Spark), Apache Hadoop (HBase, HDFS, Hive), Apache Spark, Cassandra, Couchbase, CouchDB, Cloudera Altus, Cloudera Enterprise, Google BigQuery, Google Cloud Dataproc, Hortonworks Data Platform, Microsoft HDInsight, Microsoft Azure Cosmos DB, Microsoft Azure Data Lake Store, IBM PureData System for Hadoop, MapR, MapR-DB, MarkLogic, MongoDB, Neo4j, Pivotal HD, Riak, SAP HANA, Teradata, Vertica

Supported Operating Systems

  • CentOS Linux
  • OS X
  • Red Hat Enterprise Linux
  • Solaris
  • SUSE Linux
  • Ubuntu Linux
  • Microsoft Windows

Additional Details

Supported Systems and Databases
Talend Open Studio for Data Quality
Talend Data Management Platform

All Components & Connectors
Component List

Product Documentation
help.talend.com