Talend Products

Big Data Integration

The first Hadoop-based data integration platform

Talend simplifies the integration of big data so you can respond to business demands without having to write or maintain complicated Apache Hadoop code. Enable existing developers to start working with Hadoop and NoSQL databases today. Use simple, graphical tools and wizards to generate native code that leverages the full power of Hadoop and accelerates your path to informed decisions.

Develop integration jobs 10 times as fast


Run at the speed and scale of Hadoop


Protect your investments with a future-proof architecture


Talend Customers Get to Market Faster

“We have to continually increase our velocity in acquiring data, and the ease of use of the Talend platform allows us to deliver on those requests.”
Marc Gallman, Manager of Data Architecture, Lenovo

 

Go from zero to big data in under 10 minutes.

Get big data going without coding. The Talend Big Data Sandbox is a ready-to-run virtual environment that includes Talend Platform for Big Data, popular Hadoop distributions and big data examples.

 

 


 

Run data integration at the speed and scale of Hadoop.

Only Talend takes advantage of the massively parallel environment of Hadoop distributions through native and optimized code generation. Load, transform, enrich, and cleanse data inside Hadoop without additional storage or computing expense. Running data quality inside of Hadoop increases performance and data accuracy, so you can make more informed decisions with full confidence in data quality.

Get the White Paper: Hadoop in the Enterprise

Protect big data investments with a future-proof architecture.

Talend was the first to provide support and technical previews for MapReduce and YARN, and now Spark and Storm. As new Hadoop frameworks are released, you can stay ahead of the innovation curve without learning new coding languages. Subscription pricing, based on users rather than CPUs or connectors, sets a predictable cost basis even as data volumes and systems grow exponentially. Only Talend delivers a unified platform for data, application, and process integration to meet today’s and tomorrow’s business needs.

Speed up Your Big Data Integration Projects

Design Faster
Use Talend Studio to design big data integration jobs with a drag-and-drop user interface.
Collaborate Better
Enable teams of developers to collaborate better using a shared repository.
Cleanse Earlier
Use native Hadoop profiling and data matching to understand and cleanse your data more accurately.
Manage More
Leverage big data consoles to centrally manage and monitor your projects.
Scale Easier
Achieve infinite scale with native Hadoop code.

Right Size Your Big Data Integration Solution

Choose a Talend Big Data Integration solution with the feature set and licensing options to best fit your project and budget.

 
Feature | Open Studio for Big Data | Enterprise Big Data | Platform for Big Data
License | Apache | Subscription | Subscription

Big Data | Big data components | + Certified for Hadoop, native connectors & support | + Certified for Hadoop, native connectors & support
Big data components: HDFS, HBase, HCatalog, Hive, Pig, Sqoop | Included | Included | Included
Hadoop job scheduler | Included | Included | Included
Hadoop security for Kerberos | Included | Included | Included
NoSQL connectivity | Included | Included | Included
YARN support | Included | Included | Included
Certified on Hadoop distributions (Cloudera, Hortonworks, MapR, Pivotal) | Unavailable | Included | Included
MapReduce job designer with visual code optimization | Unavailable | Included | Included
Hadoop cleansing, profiling, parsing and matching | Unavailable | Unavailable | Included

Design Faster & Scale Easily | 800+ Components & Connectors | + Modeling, Testing, Sharing, & Debugging | + Repository Manager & Visual Mapping
On Demand Documentation | Included | Included | Included
Business Modeler | Included | Included | Included
Eclipse-based developer tooling | Included | Included | Included
ETL & ELT support | Included | Included | Included
Job designer | Included | Included | Included
Versioning | Included | Included | Included
Audit | Unavailable | Included | Included
Automatic documentation | Unavailable | Included | Included
Change data capture (CDC) | Unavailable | Included | Included
Drools business rule management system (BRMS) | Unavailable | Included | Included
Distant run | Unavailable | Included | Included
Dynamic schema | Unavailable | Included | Included
Impact analysis | Unavailable | Included | Included
Interactive data viewer | Unavailable | Included | Included
Jobs compare | Unavailable | Included | Included
Parallelization | Unavailable | Included | Included
Reference projects | Unavailable | Included | Included
Re-usable joblets | Unavailable | Included | Included
Team collaboration with shared repository | Unavailable | Included | Included
Testing, debugging and tuning | Unavailable | Included | Included
Centralized metadata management | Unavailable | Included | Included
Wizards | Unavailable | Included | Included
Repository manager | Unavailable | Unavailable | Included
Visual mapping for complex XML and EDI | Unavailable | Unavailable | Included

Collaborate Better & Manage More | Unavailable | Manage Administration, Deployment, & Automate Tasks | + High availability, load balancing, and failover
Amazon EC2 lifecycle control | Unavailable | Included | Included
Check points, error recovery | Unavailable | Included | Included
Context management (dev, QA, prod) | Unavailable | Included | Included
Deployment manager and team collaboration | Unavailable | Included | Included
Execution plan, time and event-based scheduler | Unavailable | Included | Included
Log server with dashboard | Unavailable | Included | Included
Activity Monitoring Console | Unavailable | Included | Included
Talend Administration Center | Unavailable | Included | Included
High availability, load balancing, failover | Unavailable | Included | Included

Increase Trust with Data Quality | Unavailable | Unavailable | Cleansing, Profiling, Stewardship
Batch execution of analyses | Unavailable | Unavailable | Included
Big data quality capabilities (parsing & matching) | Unavailable | Unavailable | Included
Comprehensive survivorship | Unavailable | Unavailable | Included
Data cleansing | Unavailable | Unavailable | Included
Data profiling | Unavailable | Unavailable | Included
Data quality monitoring, reporting & dashboards | Unavailable | Unavailable | Included
Data standardization | Unavailable | Unavailable | Included
Data stewardship | Unavailable | Unavailable | Included
Enrichment, fuzzy matching & de-duplication | Unavailable | Unavailable | Included
Graphical charts with drilldown data | Unavailable | Unavailable | Included
Third-party address validation services | Unavailable | Unavailable | Optional

Support | TalendForge Community, Help Center access | + Guaranteed Response Times, Web & Email Support | + Phone Support
Indemnification/Warranty | Unavailable | Included | Included
Specifications → | Free Download | Free Trial | Request Info

 

Develop integration jobs 10 times as fast and do more with data.

Talend Studio gives you access to over 800 connectors and components, including native support for Hadoop, NoSQL, and all your structured and unstructured data sources. Graphical drag-and-drop tools and wizards speed design, deployment and maintenance. There is no need to learn MapReduce or tweak custom code: just design your integration jobs and Talend does the heavy lifting.
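For readers unfamiliar with the MapReduce model mentioned above, the following is a minimal, single-process sketch of its three phases (map, shuffle, reduce) applied to a word count. It is an illustration only: it is not code generated by Talend and does not run on Hadoop, and the function names `map_phase`, `shuffle`, `reduce_phase` and `word_count` are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(lines: Iterable[str]) -> Iterator[Tuple[str, int]]:
    """Emit a (word, 1) pair for every word in every input line."""
    for line in lines:
        for word in line.lower().split():
            yield word, 1


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Group all emitted values by key, as the framework would between phases."""
    grouped: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped: Dict[str, List[int]]) -> Dict[str, int]:
    """Collapse each key's list of values into a single count."""
    return {key: sum(values) for key, values in grouped.items()}


def word_count(lines: Iterable[str]) -> Dict[str, int]:
    """Run the three phases in sequence on an in-memory input."""
    return reduce_phase(shuffle(map_phase(lines)))
```

On a real cluster the map and reduce steps run in parallel across many nodes and the shuffle moves data over the network; this sketch only shows the shape of the logic a developer would otherwise write by hand.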

Get the E-Book: Big Data Management

Why Talend?

The more connected the world becomes, the more quickly a business must adapt. By design, Talend integration software simplifies the development process, reduces the learning curve, and decreases total cost of ownership with a unified, open, and predictable platform.

Executive Header: 
Simplify Big Data Integration
Executive Copy: 
Talend provides a powerful and versatile open source big data product that makes the job of working with big data technologies easy and helps drive and improve business performance, without the need for specialist knowledge or resources.
What it Does: 
Integration at Cluster Scale
Manager Copy: 

Talend redefines the development skills needed for big data and facilitates the organization and orchestration required by these projects so that you can focus on the key question: “What use should we make of data, big and small, and how am I going to be the leader in using data to help my business?”

Talend’s big data product combines big data components for MapReduce 2.0 (YARN), Hadoop, HBase, Hive, HCatalog, Oozie, Sqoop and Pig into a unified open source environment so you can quickly load, extract, transform and process large and diverse data sets from disparate systems.

How it Works: 
Big Data Without The Need To Write / Maintain Code
Implementer Copy: 

Ready to Use Big Data Connectors

Talend provides an easy-to-use graphical environment that allows developers to visually map big data sources and targets without the need to learn and write complicated code. Running 100% natively on Hadoop, Talend Big Data provides massive scalability. Once a big data connection is configured, the underlying code is automatically generated and can be deployed remotely as a job that runs natively on your big data cluster using HDFS, Pig, HCatalog, HBase, Sqoop or Hive.

Big Data Distribution and Big Data Appliance Support

Talend's big data components have been tested and certified to work with leading big data Hadoop distributions, including Amazon EMR, Cloudera, IBM PureData, Hortonworks, MapR, Pivotal Greenplum, Pivotal HD, and SAP HANA. Talend provides out-of-the-box support for big data platforms from the leading appliance vendors including Greenplum/Pivotal, Netezza, Teradata, and Vertica.


Open Source

Using the Apache software license means developers can use the Studio without restrictions. As Talend’s big data products rely on standard Hadoop APIs, users can easily migrate their data integration jobs between different Hadoop distributions without any concerns about underlying platform dependencies. Support for Apache Oozie is provided out-of-the-box, allowing operators to schedule their data jobs through open source software.

Pull Source Data from Anywhere Including NoSQL

With 800+ connectors, Talend integrates almost any data source so you can transform and integrate data in real time or in batch. Pre-built connectors for HBase, MongoDB, Cassandra, CouchDB, Couchbase, Neo4j and Riak speed development without requiring specific NoSQL knowledge. Talend big data components can be configured to bulk upload data to Hadoop or another big data appliance, either as a manual process or on an automatic schedule for incremental data updates.
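As a rough illustration of what an incremental data update involves, the sketch below selects only the rows changed since the previous run by comparing a timestamp against a stored high-water mark. This is one common approach and an assumption on our part, not a description of how Talend components implement it; the function `select_incremental` and the `updated_at` field are hypothetical.

```python
from datetime import datetime
from typing import Dict, List, Tuple

Row = Dict[str, object]


def select_incremental(rows: List[Row], watermark: datetime) -> Tuple[List[Row], datetime]:
    """Return rows modified after `watermark`, plus the watermark for the next run.

    Each row is assumed to carry an `updated_at` datetime (hypothetical field).
    """
    fresh = [row for row in rows if row["updated_at"] > watermark]
    # If nothing changed, keep the old watermark so the next run starts from the same point.
    new_watermark = max((row["updated_at"] for row in fresh), default=watermark)
    return fresh, new_watermark
```

A scheduler would call this on each run, upload the returned batch, and persist the new watermark, so that a run with no changes uploads nothing.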

Support for Google BigQuery

Quote: 
The strategy for data quality with Big Data will depend on whether the application is mission-critical, whether regulatory compliance ramifications are involved, and the degree to which bad quality data will materially impact the business.
Quote Author: 
Tony Baer
Quote Author Title: 
Ovum
Product Screenshot: 
Feature Grid: 
FEATURES | Talend Open Studio for Big Data | Talend Enterprise Big Data | Talend Platform for Big Data
Job Designer | x | x | x
Components for HDFS, HBase, HCatalog, Hive, Pig, Sqoop | x | x | x
Hadoop Job Scheduler | x | x | x
NoSQL Support | x | x | x
Versioning and Centralized Metadata Management | - | x | x
Shared Repository | - | x | x
Reporting and Dashboards | - | - | x
Big Data Profiling, Parsing and Matching | - | - | x
Indemnification/Warranty and Talend Support | - | x | x
License | Apache | Subscription | Subscription
Section Landing Page Text: 

Talend Open Studio for Big Data combines big data technologies into a unified open source environment simplifying the loading, extraction, transformation and processing of large and diverse data sets.

Feature Grid Description: 

Talend Open Studio for Big Data is an Apache-licensed, open source development tool. Talend Enterprise Big Data adds teamwork and management features. Talend Platform for Big Data adds data quality and clustering features, along with extended support services.

Site Section:

Product Specifications: 
Specifications: Big Data

Link to Downloads:

Download Page Text: 

The product:

  • Provides graphical development productivity tools for interaction with big data sources and targets.
  • Provides 800+ connectors and components to almost any data source with support for big data Hadoop distributions including Cloudera, Google BigQuery, Greenplum, Hortonworks and MapR.
  • Supports HDFS, Pig, HCatalog, HBase, Sqoop, Oozie and Hive.

Talend Open Studio for Big Data is provided under the Apache License v2 agreement terms.

Select the appropriate tabs below to download the Current Version, or to download Other Releases, or to download the User Manuals.

Download Landing Page Text: 

Talend Open Studio for Big Data combines big data technologies into a unified open source environment simplifying the loading, extraction, transformation and processing of large and diverse data sets. Users are also presented with a full palette of components for NoSQL connectivity, all under an open source Apache license.

What's new text: 

Talend provides a powerful and versatile open source big data product that makes the job of working with big data technologies easy and helps drive and improve business performance, without the need for specialist knowledge or resources.

Why Upgrade: 

Talend Platform for Big Data is a powerful and versatile big data integration and data quality solution that simplifies the loading, extraction and processing of large and diverse data sets so you can make more informed and timely decisions.

See the different solutions Talend offers for Big Data

Basic Version: 
Introduction: 
Talend makes the task of working with big data technologies easy.
Product Title: 
Talend Open Studio for Big Data
Product Subtitle: 
Free Open Source
Product version: 
Version 5.6.0
Product type version: 
Basic Big Data
Product Info: 
Open Studio Capabilities Include:
Eclipse-Based Tooling
Hadoop 2.0 and YARN Support
Big Data ETL and ELT
HDFS, HBase, HCatalog, Hive, Pig, Sqoop Components
Job Designer
Apache License 2.0
Broadest NoSQL Support
Fully Open Source

Advanced Version: 
Product Title: 
Talend Enterprise Big Data
Product Subtitle: 
Free 30-day Full Product Trial
Product version: 
Version 5.6.1
Product type version: 
Advanced Big Data Integration
Product Info: 
Open Studio Capabilities, Plus:
Design and Generate 100% MapReduce Code
Visual MapReduce Job Optimizer
Data Viewer for Hadoop
Collaborative Team Development
Compare Changes and Impact Analysis
Versioning
Data Lineage
Testing, Debugging and Tuning Tools
Advanced Management and Monitoring
Integrated Business Rules
Graphical Wizards

 

© 2015 Talend All rights reserved.
