Big Data Specifications

Specifications: Big Data Integration

Talend Big Data supports the following third party components, products and operating systems. Support varies across products.

Supported Operating Systems

  • CentOS Linux
  • OS X
  • Red Hat Enterprise Linux
  • Solaris
  • SUSE Linux
  • Ubuntu Linux
  • Microsoft Windows

Supported Big Data Hadoop Distributions and NoSQL

Amazon Redshift, Amazon DynamoDB, Amazon EMR (including Apache Spark), Apache Hadoop (HBase, HDFS, Hive), Apache Spark, Cassandra, Couchbase, CouchDB, Cloudera Enterprise, Google BigQuery, Hortonworks Data Platform, Microsoft HDInsight, IBM PureData System for Hadoop, MapR, MapR DB, MarkLogic, MongoDB, Neo4J, Pivotal HD, Riak, SAP HANA, Teradata, THD, Vertica

Big Data File Format Support

SEQ, JSON, RC, ORC and AVRO

Supported Database and Storage Connectivity

Amazon Aurora, Amazon RDS, Amazon Redshift, Amazon S3, AS400, DB2, Derby DB, Exasol, eXist-db, Firebird, Google Storage, Greenplum, H2, HIVE, HSQLDB, Informix, Ingres, InterBase, JavaDB, JDBC, MaxDB, Microsoft OLE-DB, Microsoft SQL Server, MySQL, Netezza, Oracle, ParAccel, PostgresSQL, PostgresPlus, SAP Business Warehouse, SAS, SQLite, Sybase, Sybase IQ, Teradata, VectorWise, Vertica, Windows Azure Blob Storage

Connectors to SaaS, Enterprise, and More

  • SaaS Connectors: Marketo, ServiceNow, Salesforce.com and Salesforce Wave, NetSuite, MS CRM & AX
  • Packaged Application Connectors: SAP (table extract, BAPI, IDOC), Sugar CRM, Microsoft, Sage X3, CentricCRM, Vtiger CRM, Open Bravo
  • Technical Connectors: Amazon S3, Amazon SQS, Bonita, Box, Dropbox, ElasticSearch, GoogleDrive, JIRA, Email (SMTP), FTP/SFTP, LDAP, REST, Splunk

Talend Big Data Platform

Address Validation, Standardization and Enrichment

Through a combination of components and services, Talend supports the following address validation partners: Google, Loqate, QAS, Melissa Data and QAS.

Matching Algorithms

Exact Match, SoundEx, SoundEx FR, Metaphone, Double Metaphone, Levenshtein, Q-gram, Jaro, Jaro-Winkler, Custom/User-Defined, Hamming, Swoosh, VSR

Talend Real-Time Big Data Platform also includes:

Big Data Supported Messaging Services

Apache Spark Streaming , Apache Kafka, Amazon Kinesis, MapR-Streams

Support for Enterprise Messaging Standards, Transports and other ESB-related Capabilities

Learn more

additional details

Supported Systems and Databases
Talend Open Studio for Big Data
Talend Big Data
Talend Big Data Platform
Talend Real-Time Big Data Platform

All Components & Connectors
Component List

Product Documentation
help.talend.com

Product Certifications

SAP Certification