Big Data Platform

Turn big data into trusted insights

Talend Big Data Platform simplifies complex integrations to take advantage of Spark, Hadoop, NoSQL and cloud, so your enterprise can turn more data into trusted insights. Leverage the full power and scale of your big data framework with the leading data integration and data quality platform built on Spark for cloud and on-premises.

Big Data Platform Features


  • Subscription license with warranty and indemnification

Design & Productivity Tools

  • Visual mapping for complex XML & EDI on Spark
  • Spark & MapReduce job designer
  • Generates native MapReduce & Spark batch code
  • Hadoop job scheduler with YARN
  • Hadoop security for Kerberos
  • Ingestion, loading, and unloading data into a data lake
  • Eclipse-based developer tooling & job designer
  • Continuous delivery integration & team collaboration with shared repository
  • Audit, job compare, impact analysis, testing, debugging & tuning
  • Metadata bridge for metadata import/export & centralized metadata management
  • Distant run & parallelization
  • Dynamic schema, re-usable joblets & reference projects
  • Repository manager
  • ETL & ELT support
  • Wizards & interactive data viewer
  • Versioning
  • Change data capture (CDC)
  • Drools business rule management system
  • Automatic documentation

Advanced Data Profiling

  • Fraud pattern detection using Benford's Law
  • Column set analysis
  • Advanced matching analysis
  • Time column correlation analysis
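
As a rough illustration of the idea behind Benford's Law checks (not Talend's implementation), the sketch below compares the observed leading-digit frequencies of a numeric column with the expected probability log10(1 + 1/d) for each digit d. The function names and the deviation measure are hypothetical choices made for this example.

```python
import math
from collections import Counter


def benford_expected():
    """Expected leading-digit probabilities under Benford's Law."""
    return {d: math.log10(1 + 1 / d) for d in range(1, 10)}


def first_digit(value):
    """Return the leading non-zero digit of a number, or None for zero."""
    # Scientific notation puts the leading digit first, e.g. '3.2e+02'.
    digit = int(f"{abs(value):.15e}"[0])
    return digit if digit != 0 else None


def first_digit_frequencies(values):
    """Observed share of each leading digit 1-9 among the non-zero values."""
    counts = Counter(d for d in map(first_digit, values) if d is not None)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("need at least one non-zero value")
    return {d: counts.get(d, 0) / total for d in range(1, 10)}


def max_deviation(values):
    """Largest absolute gap between observed and expected digit shares."""
    expected = benford_expected()
    observed = first_digit_frequencies(values)
    return max(abs(observed[d] - expected[d]) for d in range(1, 10))
```

A large deviation only flags a column for review; many legitimate datasets (for example, values confined to a narrow range) do not follow Benford's Law.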

Data Quality & Governance

  • Data profiling & analytics with graphical charts & drilldown data
  • Automate data quality error resolution and enforce rules
  • Data masking
  • Data quality portal with monitoring, reporting & dashboards
  • Semantic discovery with automatic detection of patterns
  • Comprehensive survivorship
  • Data sampling
  • Enrichment, harmonization, fuzzy matching & de-duplication
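
To make fuzzy matching and de-duplication concrete, here is a minimal sketch that uses only Python's standard library. It is not how the platform matches records: the normalization step, the `threshold` value, and the function names are assumptions chosen for the example.

```python
from difflib import SequenceMatcher


def normalize(text):
    """Lower-case the text and collapse runs of whitespace."""
    return " ".join(text.lower().split())


def similarity(a, b):
    """Similarity ratio in [0, 1] between two normalized strings."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def deduplicate(records, threshold=0.85):
    """Keep the first record of each group of near-duplicates."""
    kept = []
    for record in records:
        # A record survives only if it is unlike everything kept so far.
        if all(similarity(record, other) < threshold for other in kept):
            kept.append(record)
    return kept
```

For instance, `deduplicate(["Acme Corp", "acme  corp", "Globex Inc"])` keeps the first and third entries, because the second normalizes to the same string as the first. The pairwise comparison is quadratic in the number of records, so it suits small samples rather than large datasets.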


  • Cloud: Amazon Web Services (AWS), Microsoft Azure, Google Cloud Platform, and more
  • Supported big data distributions: Amazon EMR, Azure HDInsight, Cloudera, Google Dataproc, Hortonworks, MapR
  • Spark MLlib (classification, clustering, recommendation, regression)
  • NoSQL: Cassandra, Couchbase, DynamoDB, MongoDB, Neo4j, and more
  • RDBMS: Oracle, Teradata, Microsoft SQL Server, and more
  • SaaS: Marketo, Salesforce, NetSuite, and more
  • Packaged Apps: SAP, Microsoft Dynamics, SugarCRM, and more
  • Technologies: Dropbox, Box, SMTP, FTP/SFTP, LDAP, and more
  • Optional 3rd-party address validation services


  • Hadoop components: HDFS, HBase, Hive, Pig, Sqoop
  • File management: open, move, compress, decompress without scripting
  • Control and orchestrate data flows and data integrations with master jobs
  • Map, aggregate, sort, enrich & merge data

Data Preparation & Stewardship

  • Import, export & combine data from any database, Excel or CSV file
  • Import, export & combine CSV, Parquet & Avro files from/to Hadoop
  • Export to Tableau
  • Self-service on-demand access to sanctioned datasets
  • Share data preparations & datasets
  • Operationalize preparations into any big data & cloud integration flow
  • Run preparations on Apache Beam
  • Auto-discovery, profiling, smart suggestions, and data visualization
  • Auto-discovery & auto-profiling of custom semantic types
  • Smart & selective sampling & full-runs
  • Data tracking & masking with role-based security
  • Cleansing and enrichment functions
  • Data Stewardship App for data curation and certification

Management & Monitoring

  • High availability, load balancing, failover for jobs
  • Deployment manager & team collaboration
  • Talend Administration Center
  • Amazon EC2 lifecycle control
  • Execution plan, time & event-based scheduler
  • Checkpoints, error recovery
  • Context management (dev, QA, prod)
  • Activity Monitoring Console
  • Log server with dashboard

Big Data Quality

  • Data cleansing, profiling, masking, parsing & matching on Spark & Hadoop
  • Machine learning for data matching and deduplication
  • Support for Cloudera Navigator & Apache Atlas
  • HDFS file profiling

Gartner Recognizes Talend as a Leader

Get the 2017 Gartner Magic Quadrant for Data Integration report today.

Read the Report

Increase productivity by 10x without coding

Talend Studio provides over 900 pre-built connectors and components for broad connectivity with native code generation. The graphical drag-and-drop UI and wizards increase productivity and speed development. Automate routine tasks so your team can focus on high-value ones.


How to Fast Track Your Real-Time Big Data Project

Accelerate insight with trusted data

Talend Big Data Platform builds data quality into the integration process so your team can make trusted data available. With easy onboarding, embedded quality controls, and rules management, data is enriched, protected, and available from a single, unified platform.


Harnessing Data Quality for Better Decisions

Adopt any cloud technology, anywhere

With broad support across cloud services, you can quickly adopt new cloud technologies, maximize performance, and lower total cost of ownership (TCO). Take advantage of the capacity and performance of AWS, Microsoft Azure, Google Cloud Platform, and more with Talend. Build data pipelines once to integrate Salesforce, Marketo, NetSuite, or any other app, and run them anywhere.

Cloud Integration Options

What’s New in Talend

  • New connectors and components for Microsoft Azure, Google Cloud, Amazon Web Services, Snowflake, and Cloudera Altus
  • Smarter data quality with machine learning and natural language processing
  • Agile DevOps to speed up your big data projects

Watch Now: What's New for Big Data in Summer '17

Additional Benefits

Self-Service and Collaborative Governance

Data Quality

Improve the accuracy and integrity of your data

Data Preparation

Accelerate data usage and collaboration

Data Stewardship

Ensure data integrity through curation

Metadata Bridge

Synchronize, track and trace data pipelines

Featured Resources

Getting Started Resources


How It Works

Customer Success Stories

Subscription Pricing

Talend significantly lowers the upfront cost and total cost of ownership by charging per user.
Start where you are and scale as you need to.

Technical Support and Professional Services

An active community, dedicated customer success managers, and a global network of technical support and professional services are ready to help you on your integration journey.

Self-Service Support

Help Center



Direct Support

Talend Customer Portal, web, email, phone


Support & Services

Contact Sales