`

Big Data Platform

Turn big data into trusted insights

Talend Big Data Platform simplifies complex integrations to take advantage of Spark, Hadoop, NoSQL, and cloud, so your enterprise can turn more data into trusted insights. Leverage the full power and scale of your big data framework with the leading data integration and data quality platform built on Spark for cloud and on-premises.

Big Data Platform Features

License

  • Subscription license with warranty and indemnification

Design & Productivity Tools

  • Visual mapping for complex XML and EDI on Spark
  • Spark and MapReduce job designer
  • Generates native MapReduce and Spark batch code
  • Hadoop job scheduler with YARN
  • Hadoop security for Kerberos
  • Ingestion, loading, and unloading data into a data lake
  • Eclipse-based developer tooling and job designer
  • Continuous delivery integration and team collaboration with shared repository
  • Audit, job compare, impact analysis, testing, debugging, and tuning
  • Metadata bridge for metadata import/export and centralized metadata management
  • Distant run and parallelization
  • Dynamic schema, re-usable joblets, and reference projects
  • Repository manager
  • ETL and ELT support
  • Wizards and interactive data viewer
  • Versioning
  • Change data capture (CDC)
  • Drools business rule management system
  • Automatic documentation
+   Show more features

Advanced Data Profiling

  • Fraud pattern detection using Benford Law
  • Column set analysis
  • Advanced matching analysis
  • Time column correlation analysis
+   Show more features

Data Quality & Governance

  • Data profiling and analytics with graphical charts and drilldown data
  • Automate data quality error resolution and enforce rules
  • Data masking
  • Data quality portal with monitoring, reporting, and dashboards
  • Semantic discovery with automatic detection of patterns
  • Comprehensive survivorship
  • Data sampling
  • Enrichment, harmonization, fuzzy matching, and de-duplication
+   Show more features

Connectors

  • Cloud: Amazon Web Services (AWS), Microsoft Azure, Google Cloud Platform, and more
  • Supported big data distributions: Amazon EMR, Azure HDInsight, Cloudera, Google Dataproc, Hortonworks, MapR
  • Spark MLlib (classification, clustering, recommendation, regression)
  • NoSQL: Cassandra, Couchbase, DynamoDB, MongoDB, Neo4j, and more
  • RDBMS: Oracle, Teradata, Microsoft SQL server, and more
  • SaaS: Marketo, Salesforce, NetSuite, and more
  • Packaged Apps: SAP, Microsoft Dynamics, Sugar CRM, and more
  • Technologies: Dropbox, Box, SMTP, FTP/SFTP, LDAP, and more
  • Optional 3rd-party address validation services
+   Show more features

Components

  • Hadoop components: HDFS, Hbase, Hive, Pig, Sqoop
  • File management: open, move, compress, decompress without scripting
  • Control and orchestrate data flows and data integrations with master jobs
  • Map, aggregate, sort, enrich, and merge data
+   Show more features

Data Preparation & Stewardship

  • Import, export, and combine data from any database, Excel, or CSV file
  • Import, export, and combine CSV, Parquet, and AVRO files from/to Hadoop
  • Export to Tableau
  • Self-service on-demand access to sanctioned datasets
  • Share data preparations and datasets
  • Operationalize preparations into any big data and cloud integration flow
  • Run preparations on Apache Beam
  • Auto-discovery, profiling, smart suggestions, and data visualization
  • Auto-discovery and auto-profiling of custom semantic types
  • Smart and selective sampling and full-runs
  • Data tracking and masking with role-based security
  • Cleansing and enrichment functions
  • Data Stewardship App for data curation and certification
+   Show more features

Management & Monitoring

  • High availability, load balancing, failover for jobs
  • Deployment manager and team collaboration
  • Talend Administration Center
  • Amazon EC2 lifecycle control
  • Execution plan, time, and event-based scheduler
  • Check points, error recovery
  • Context management (dev, QA, prod)
  • Activity Monitoring Console
  • Log server with dashboard
+   Show more features

Big Data Quality

  • Data cleansing, profiling, masking, parsing, and matching on Spark and Hadoop
  • Machine learning for data matching and deduplication
  • Support for Cloudera Navigator and Apache Atlas
  • HDFS file profiling
+   Show more features

Gartner Recognizes Talend as a Leader

Gartner Magic Quadrant for Data Integration Get the 2017 Gartner Magic Quadrant for Data Integration report today.

Read the Report

Increase productivity by 10x without coding

Talend Studio provides over 900 pre-built connectors and components for broad connectivity with native code generation. The graphical drag-and-drop UI and wizards increase productivity and speed development. Automate routine tasks so your team can focus on high-value ones.

Watch:

How to Fast Track Your Real-Time Big Data Project

Accelerate insight with trusted data

Talend Big Data Platform builds data quality into the integration process so your team can make trusted data available. With easy onboarding, embedded quality controls, and rules management, data is enriched, protected, and available from a single, unified platform.

Watch:

Harnessing Data Quality for Better Decisions

Adopt any cloud technology, anywhere

With the broadest support for the most cloud services, you can quickly adopt new cloud technologies, maximize performance, and lower TCO. Take advantage of the capacity and performance of AWS, Microsoft Azure, Google, and more with Talend. Build data pipelines once to integrate Salesforce, Marketo, NetSuite, or any apps and run them anywhere.

Cloud Integration Options

What’s New in Talend

  • New connectors and components for Microsoft Azure, Google Cloud, Amazon Web Services, Snowflake, and Cloudera Altus
  • Smarter data quality with machine learning and natural language processing
  • Agile DevOps to speed up your big data projects

Watch Now: What's New for Big Data in Summer '17

Additional Benefits

Self-Service and Collaborative Governance

Data Quality

Improve the accuracy and integrity of your data

Data Preparation

Accelerate data usage and collaboration

Data Stewardship

Ensure data integrity through curation

Metadata Bridge

Synchronize, track, and trace data pipelines

Featured Resources

Getting Started Resources

GET STARTED

How It Works

Customer Success Stories

Subscription Pricing

Talend significantly lowers the upfront cost and total cost of ownership by charging per user.
Start where you are and scale as you need to.

Technical Support and Professional Services

An active community, dedicated customer success managers, and a global network of technical support and professional services are ready
to help you on your integration journey.

Self-Service Support

Help Center

Help Center

Community

Community

Direct Support

Talend Customer Portal,
web, email, phone


 


>
Support & Services

Contact Sales