Talend Products

Real-Time Big Data Integration

The first data integration platform for Hadoop and Spark

Talend simplifies real-time big data integration for advanced analytics and the real-time use cases that are driving business innovation. The Talend Real-Time Big Data Platform generates native code so you can start working with Apache Spark and Spark Streaming today.

As part of the Data Fabric solution, Talend Real-Time Big Data Platform delivers high-scale, in-memory data processing that turns more data into business decisions in real time.

  • Blazing fast speed and scale with Spark and Hadoop
  • Gain real-time insight with IoT integration
  • Deliver self-service data preparation to everyone
  • Protect big data investments with a future-proof architecture

Talend Customers Get to Market Faster

"We have to continually increase our velocity in acquiring data, and the ease of use of the Talend platform allows us to deliver on those requests."
Marc Gallman, Manager of Data Architecture, Lenovo

Fast and First in Real Time

Blazing fast speed and scale with Spark Streaming and Hadoop.

Only Talend takes advantage of the massively parallel environment of Hadoop by generating native MapReduce and Spark code. Load, transform, enrich, and cleanse data inside Hadoop to leverage Hadoop's power and scale. Run 5 times faster than MapReduce using Spark and Spark Streaming in-memory data processing. Talend Studio provides access to over 900 connectors and components, including Spark components for Hadoop/HDFS, cloud SaaS and storage, databases, NoSQL, data masking and transformations plus messaging services, IoT integration, and machine learning.

Gain real-time insights with Internet of Things (IoT) integration.

As billions of sensors and Internet-enabled devices come online, companies have access to more data, in more ways, more quickly. Talend Real-Time Big Data Platform uses Spark in-memory analytics and machine learning components to analyze streaming data in memory so your enterprise can act on big data in real time. The first end-to-end integration platform combines IoT connectivity (AMQP, MQTT); high-speed, reliable messaging (Apache Kafka, Amazon Kinesis, Talend ESB); and high-speed big data processing (Apache Spark). Capture and deliver millions of events per second, then instantly ingest, process, and deliver insight to real-time applications and fast NoSQL data stores.
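To make the idea of analyzing streaming sensor data in memory more concrete, here is a minimal sketch in plain Python. It is not Talend-generated code and does not use Spark or any Talend API. It only illustrates the windowed aggregation that a streaming engine typically performs. The event shape `(timestamp, device_id, reading)` and the function name `window_averages` are assumptions made for this illustration.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

# Hypothetical event shape: (timestamp_seconds, device_id, reading)
Event = Tuple[float, str, float]


def window_averages(
    events: Iterable[Event], window_seconds: float
) -> Dict[int, Dict[str, float]]:
    """Group events into fixed time windows and average readings per device."""
    # window index -> device id -> [running total, event count]
    sums: Dict[int, Dict[str, List[float]]] = defaultdict(
        lambda: defaultdict(lambda: [0.0, 0])
    )
    for ts, device, reading in events:
        window = int(ts // window_seconds)  # index of the window this event falls in
        acc = sums[window][device]
        acc[0] += reading
        acc[1] += 1
    # Convert the running totals into per-device averages for each window.
    return {
        window: {device: total / count for device, (total, count) in devices.items()}
        for window, devices in sums.items()
    }
```

A real deployment would read events from a message broker such as Kafka and keep state across a cluster. This sketch holds everything in one process to show only the windowing logic.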

Deliver self-service data preparation to everyone.

Talend combines data preparation with big data integration in a single unified platform to transform how IT and business can turn data into insight. Empower any decision maker with self-service tools to catalog, cleanse, and shape data from any source for use anywhere. Your data experts design the integration rules, while IT governs the use of data and facilitates collaboration across batch, bulk, and master data management scenarios.

Protect big data investments with a future-proof architecture.

Talend released the first integration platform to run MapReduce, Spark, and Spark Streaming on YARN. With each new Hadoop framework, Talend makes it possible to convert data integration jobs to the latest frameworks with the push of a button, so you can stay ahead of the innovation curve. Subscription pricing, based on users, not CPUs or connectors, sets a predictable cost basis even as data volumes and systems grow exponentially.

Subscription pricing at 1/5th the cost.

Talend Data Integration significantly lowers the cost of ownership for integration solutions with a much lower up-front investment and a predictable spend over time. Talend charges per developer user with no hidden fees per connector so you can start where you are and scale as you need to.

Speed up Your Real-Time Big Data Integration Projects

Design Faster
Use Talend Studio to design batch, real-time and streaming integration jobs with a drag-and-drop user interface.
Collaborate Better
Improve collaboration with a shared repository, continuous delivery methods, and metadata bridge sharing.
Cleanse Earlier
Use machine learning on Spark to train and score your data more accurately.
Ingest More
Load and ingest more device, log and sensor data into Hadoop.
Scale Easier
Achieve infinite scale with built-in Lambda architecture and in-memory processing.
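For readers unfamiliar with the Lambda architecture named above, the following is a minimal sketch of the general pattern in plain Python. It is not Talend's implementation. A batch layer precomputes a view over historical data, a speed layer keeps a view over recent events, and queries merge the two. The `(key, count)` event shape and the names `build_view` and `query` are hypothetical.

```python
from collections import Counter
from typing import Iterable, Tuple

# Hypothetical event shape: (key, count)
Event = Tuple[str, int]


def build_view(events: Iterable[Event]) -> Counter:
    """Aggregate events into a per-key total (used for both layers here)."""
    view: Counter = Counter()
    for key, count in events:
        view[key] += count
    return view


def query(batch_view: Counter, speed_view: Counter, key: str) -> int:
    """Serve a query by merging the batch view with the real-time view."""
    # Counter returns 0 for keys it has not seen, so unknown keys yield 0.
    return batch_view[key] + speed_view[key]
```

In practice the batch view is recomputed periodically over the full data set and the speed view is reset to cover only events that arrived since the last batch run.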

Right Size Your Real-Time Big Data Integration Solution

Choose a Talend Big Data Integration solution with the feature set and licensing options to best fit your project and budget.

 
Edition: Open Studio for Big Data; Big Data; Big Data Platform; Real-Time Big Data Platform
License Apache Subscription Subscription Subscription
Big Data: Hadoop and NoSQL components; + Batch Processing (MapReduce, Spark), Native Hadoop Connectors; + Batch Processing (MapReduce, Spark), Native Hadoop Connectors; + Real-Time Processing (Spark Streaming), Machine Learning, High-Speed Messaging, and IoT Connectivity
Big data components: HDFS, HBase, HCatalog, Hive, Pig, Sqoop Included Included Included Included
Hadoop job scheduler Included Included Included Included
Hadoop security for Kerberos Included Included Included Included
NoSQL connectivity Included Included Included Included
YARN support Included Included Included Included
Certified on Hadoop distributions (Amazon EMR, Azure HDInsight, Cloudera, Hortonworks, MapR, Pivotal) Unavailable Included Included Included
Spark and MapReduce job designer Unavailable Included Included Included
MapReduce visual code optimization Unavailable Included Included Included
Hadoop cleansing, profiling, parsing and matching Unavailable Unavailable Included Included
HDFS File Profiling Unavailable Unavailable Included Included
Spark batch Unavailable Included Included Included
Spark Streaming Unavailable Unavailable Unavailable Included
Spark machine learning Unavailable Unavailable Included Included
Complex data mapping on Spark Unavailable Unavailable Included Included
High-Speed Messaging Components (Kafka, Kinesis, Flume) Unavailable Unavailable Unavailable Included
Enterprise Messaging (JMS, ActiveMQ, AMQP) Unavailable Unavailable Unavailable Included
Internet of Things Connectivity (AMQP, MQTT) Unavailable Unavailable Unavailable Included
Accelerate Data Usage: + Operationalize Data Preparation; + Operationalize Data Preparation; + Operationalize Data Preparation
Server-based with role-based security Unavailable Included Included Included
Data discovery and profiling Unavailable Included Included Included
Cleansing, standardization and shaping Unavailable Included Included Included
Data enrichment and combination Unavailable Included Included Included
Share data preparations and datasets Unavailable Included Included Included
Smart sampling and full-runs Unavailable Included Included Included
Operationalize preparations into any data integration flow Unavailable Included Included Included
Design Faster & Scale Easily: 900+ Components & Connectors; + Continuous Delivery, testing, sharing, and debugging; + Repository Manager; + Repository Manager
On Demand Documentation Included Included Included Included
Business Modeler Included Included Included Included
Eclipse-based developer tooling Included Included Included Included
ETL & ELT support Included Included Included Included
Job designer Included Included Included Included
Versioning Included Included Included Included
Audit Unavailable Included Included Included
Automatic documentation Unavailable Included Included Included
Change data capture (CDC) Unavailable Included Included Included
Continuous Delivery Data Integration Unavailable Included Included Included
Drools business rule management system (BRMS) Unavailable Included Included Included
Distant run Unavailable Included Included Included
Dynamic schema Unavailable Included Included Included
Impact analysis Unavailable Included Included Included
Interactive data viewer Unavailable Included Included Included
Jobs compare Unavailable Included Included Included
Metadata Bridge Unavailable Included Included Included
Parallelization Unavailable Included Included Included
Reference projects Unavailable Included Included Included
Re-usable joblets Unavailable Included Included Included
Team collaboration with shared repository Unavailable Included Included Included
Testing, debugging and tuning Unavailable Included Included Included
Centralized metadata management Unavailable Included Included Included
Wizards Unavailable Included Included Included
Repository manager Unavailable Unavailable Included Included
Visual mapping for complex XML and EDI Unavailable Unavailable Included Included
Collaborate Better & Manage More: Unavailable; Manage Administration, Deployment, & Automate Tasks; + High Availability, Load Balancing, & Failover; + High Availability, Load Balancing, & Failover
Amazon EC2 lifecycle control Unavailable Included Included Included
Check points, error recovery Unavailable Included Included Included
Context management (dev, QA, prod) Unavailable Included Included Included
Deployment manager and team collaboration Unavailable Included Included Included
Execution plan, time and event-based scheduler Unavailable Included Included Included
Log server with dashboard Unavailable Included Included Included
Activity Monitoring Console Unavailable Included Included Included
Talend Administration Center Unavailable Included Included Included
High availability, load balancing, failover for Jobs Unavailable Unavailable Included Included
Increase Trust with Data Quality: Unavailable; Unavailable; Data Profiling, Cleansing, Matching, Masking & Stewardship; Data Profiling, Cleansing, Matching, Masking & Stewardship
Batch execution of analyses Unavailable Unavailable Included Included
Big data quality capabilities (parsing & matching) Unavailable Unavailable Included Included
Comprehensive survivorship Unavailable Unavailable Included Included
Data cleansing Unavailable Unavailable Included Included
Data masking Unavailable Unavailable Included Included
Data profiling Unavailable Unavailable Included Included
Data quality analytics with graphical charts and drilldown data Unavailable Unavailable Included Included
Data quality monitoring, reporting & dashboards Unavailable Unavailable Included Included
Data standardization Unavailable Unavailable Included Included
Data stewardship Unavailable Unavailable Included Included
Enrichment, fuzzy matching & de-duplication Unavailable Unavailable Included Included
Sampling Unavailable Unavailable Included Included
Semantic discovery Unavailable Unavailable Included Included
Cloud or on-premises third-party address validation services Unavailable Unavailable Optional Optional
Support: TalendForge Community, Help Center access; + Guaranteed Response Times, Web & Email Support, Optional 24/7; + Phone Support, Faster Response, Optional 24/7; + Phone Support, Faster Response, Optional 24/7
Indemnification/Warranty Unavailable Included Included Included

Why Talend?

The more connected the world becomes, the more quickly a business must adapt. By design, Talend integration software simplifies the development process, reduces the learning curve, and decreases total cost of ownership with a single platform for batch and real-time data integration, in the cloud and on-premises.

© 2016 Talend. All rights reserved.
