Big Data Specifications

Specifications: Big Data Integration

Talend Big Data supports the following third-party components, products, and operating systems. Support varies across products.

Supported Operating Systems

  • CentOS Linux
  • OS X
  • Red Hat Enterprise Linux
  • Solaris
  • SUSE Linux
  • Ubuntu Linux
  • Microsoft Windows

Supported Big Data Distributions and Technologies

Amazon Redshift, Amazon DynamoDB, Amazon EMR (including Apache Spark), Apache Hadoop (HBase, HDFS, Hive), Apache Spark, Cassandra, Couchbase, CouchDB, Cloudera Altus, Cloudera Enterprise, Google BigQuery, Google Cloud Dataproc, Hortonworks Data Platform, Microsoft HDInsight, Microsoft Azure Cosmos DB, Microsoft Azure Data Lake Store, IBM PureData System for Hadoop, MapR, MapR-DB, MarkLogic, MongoDB, Neo4j, Pivotal HD, Riak, SAP HANA, Teradata, Vertica, WebHDFS (AFS)

Big Data File Format Support


Supported Database and Storage Connectivity

Amazon Aurora, Amazon RDS, Amazon Redshift, Amazon S3, AS400, DB2, Derby DB, EXASOL, eXist-db, Firebird, Google Cloud Storage, Greenplum, H2, Hive, HSQLDB, Informix, Ingres, InterBase, JavaDB, JDBC, MaxDB, Microsoft Azure Blob Storage, Azure Queue Storage, Azure SQL Data Warehouse, Azure Table Storage, Microsoft OLE-DB, Microsoft SQL Server, MySQL, Netezza, Oracle, ParAccel, PostgreSQL, PostgresPlus, SAP Business Warehouse, SAS, Snowflake, SQLite, Sybase, Sybase IQ, Teradata, VectorWise, Vertica

Connectors to SaaS, Enterprise, and More

  • SaaS Connectors: Marketo, ServiceNow, Salesforce Wave, NetSuite, Microsoft Dynamics CRM 2016, Microsoft Dynamics
  • Packaged Application Connectors: SAP (table extract, BAPI, IDOC), SugarCRM, Microsoft Dynamics CRM/365, Sage X3, CentricCRM, Vtiger CRM, Openbravo
  • Technical Connectors: Amazon S3, Amazon SQS, Bonita, Box, Dropbox, Elasticsearch, Google Drive, JIRA, Email (SMTP), FTP/SFTP, LDAP, REST, Splunk

Talend Big Data Platform

Address Validation, Standardization and Enrichment

Through a combination of components and services, Talend supports the following address validation partners: Google, Loqate, Melissa Data, and QAS.

Matching Algorithms

Exact Match, SoundEx, SoundEx FR, Metaphone, Double Metaphone, Levenshtein, Q-gram, Jaro, Jaro-Winkler, Custom/User-Defined, Hamming, Swoosh, VSR
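As an illustration of what one of the listed algorithms computes, the following is a minimal Python sketch of Levenshtein distance and a normalized similarity score derived from it. It is not Talend's implementation; the function names `levenshtein` and `similarity`, and the choice to normalize by the longer string's length, are assumptions made for this example only.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions,
    or substitutions needed to turn string a into string b."""
    # Keep the shorter string in b so each row stays as small as possible.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] holds the distance between the processed prefix of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # substitute, or keep if equal
            ))
        previous = current
    return previous[-1]


def similarity(a: str, b: str) -> float:
    """Hypothetical normalization: 1.0 for identical strings,
    0.0 when every character position differs."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein(a, b) / longest
```

In a record-matching setting, a score like this would typically be compared against a threshold to decide whether two values (for example, two spellings of a customer name) refer to the same entity. The threshold is a tuning choice and is not specified here.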

Talend Real-Time Big Data Platform also includes:

Big Data Supported Messaging Services

Apache Spark Streaming, Apache Kafka, Amazon Kinesis, Google Pub/Sub, MapR-Streams

Support for Enterprise Messaging Standards, Transports and other ESB-related Capabilities


Additional Details

Supported Systems and Databases
Talend Open Studio for Big Data
Talend Big Data
Talend Big Data Platform
Talend Real-Time Big Data Platform

All Components & Connectors
Component List

Product Documentation

Product Certifications

SAP Certification