Big Data Integration

Get faster time to value from your big data projects

Talend simplifies and automates big data integration with graphical tools and wizards that generate native code. This means your team can start working with Apache Hadoop, Apache Spark, Spark Streaming and NoSQL databases today.

The Talend Big Data Integration platform delivers fast, high-scale, in-memory data processing as part of the Talend Data Fabric solution, so your enterprise can turn more data into real-time decisions.

  • Blazing fast speed and scale with Spark and Hadoop
  • Let anyone access and cleanse big data while governing its use
  • Optimize big data performance in the cloud
  • Protect your investments with a future-proof architecture

DOWNLOAD FREE TRIAL

Gartner Recognizes Talend as a Leader

Gartner Magic Quadrant for Data Integration

Get the 2016 Gartner Magic Quadrant for Data Integration report today.

Read the Report

Run 5x Faster for 1/5th the Price

Blazing fast speed and scale with Spark and Hadoop.

Only Talend Big Data Integration takes advantage of the massively parallel environment of Hadoop by generating native Spark and MapReduce code. Ingest, process, enrich, and cleanse data inside Hadoop to leverage Hadoop's power and scale. Run 5 times faster than MapReduce using Spark Batch and Spark Streaming in-memory data processing. Talend Studio provides over 900 connectors and components for broad connectivity, and its graphical drag-and-drop UI and wizards increase productivity and speed development. Automate routine tasks so your team can focus on the high-value jobs.

Get the White Paper:
Hadoop in the Enterprise

Let anyone access and cleanse big data while governing its use.

Empower any decision maker with self-service tools to curate, catalog, cleanse, and shape data from the data lake for use anywhere. Talend unifies data preparation, data stewardship and big data integration to transform how IT and business can turn data into insight. Your data experts design the integration rules, while IT governs the use of data and facilitates collaboration across the enterprise.

Optimize big data performance in the cloud.

Run big data processing when and where you need it—on-premises, hybrid or in the cloud—with the best response time, lowest latency, and most cost-effective use of resources. Build end-to-end big data integration workflows that easily integrate with Amazon Redshift, Elastic MapReduce (EMR), Amazon Kinesis, Microsoft Azure HDInsight, or Google BigQuery systems, so all your infrastructure runs in the cloud.

Protect big data investments with a future-proof architecture.

Unlike hand coding, Talend makes it easy to convert your integration jobs to the latest big data technology. Converting between MapReduce, Spark, and Spark Streaming is a breeze, so you can stay current with the latest innovation.

Subscription pricing at 1/5th the cost.

Talend significantly lowers the cost of ownership for integration solutions with a much lower up-front investment and a predictable spend over time. Talend charges per user with no fees per connector so you can start where you are and scale as you need to.

Talend Pricing Model

Speed up Your Big Data Integration Projects

Design Faster
Use Talend Studio to design batch, real-time and streaming integration jobs with a drag-and-drop user interface.
Collaborate Better
Improve collaboration with a shared repository, continuous delivery methods, and self-service data preparation.
Cleanse Earlier
Use native Hadoop data profiling, data matching, and machine learning to better reveal your data.
Manage More
Leverage big data consoles to centrally manage and monitor your projects.
Scale Easier
Achieve infinite scale with built-in streaming architecture and in-memory processing.

Right Size Your Real-Time Big Data Integration Solution

Choose a Talend Big Data Integration solution with the feature set and licensing options to best fit your project and budget.

 
| Feature | Open Studio for Big Data | Big Data | Big Data Platform | Real-Time Big Data Platform |
|---|---|---|---|---|
| License | Apache | Subscription | Subscription | Subscription |
| Big Data | Hadoop and NoSQL components | + Batch Processing (MapReduce, Spark), Native Hadoop Connectors | + Batch Processing (MapReduce, Spark), Native Hadoop Connectors | + Real-Time Processing (Spark Streaming), and Machine Learning, High-Speed Messaging, and IoT Connectivity |
| Big data components: HDFS, HBase, HCatalog, Hive, Pig, Sqoop | Included | Included | Included | Included |
| Hadoop job scheduler | Included | Included | Included | Included |
| Hadoop security for Kerberos | Included | Included | Included | Included |
| NoSQL connectivity | Included | Included | Included | Included |
| YARN support | Included | Included | Included | Included |
| Certified on Hadoop distributions (Amazon EMR, Azure HDInsight, Cloudera, Hortonworks, MapR, Pivotal) | Unavailable | Included | Included | Included |
| Spark and MapReduce job designer | Unavailable | Included | Included | Included |
| MapReduce visual code optimization | Unavailable | Included | Included | Included |
| Hadoop cleansing, profiling, parsing and matching | Unavailable | Unavailable | Included | Included |
| HDFS File Profiling | Unavailable | Unavailable | Included | Included |
| Spark batch | Unavailable | Included | Included | Included |
| Spark Streaming | Unavailable | Unavailable | Unavailable | Included |
| Spark machine learning | Unavailable | Unavailable | Included | Included |
| Complex data mapping on Spark | Unavailable | Unavailable | Included | Included |
| High-Speed Messaging Components (Kafka, Kinesis, Flume) | Unavailable | Unavailable | Unavailable | Included |
| Enterprise Messaging (JMS, ActiveMQ, AMQP) | Unavailable | Unavailable | Unavailable | Included |
| Internet of Things Connectivity (AMQP, MQTT) | Unavailable | Unavailable | Unavailable | Included |
| Accelerate Data Usage | | + Operationalize Data Preparation | + Operationalize Data Preparation | + Operationalize Data Preparation |
| Server-based with role-based security | Unavailable | Included | Included | Included |
| Data discovery and profiling | Unavailable | Included | Included | Included |
| Cleansing, standardization and shaping | Unavailable | Included | Included | Included |
| Data enrichment and combination | Unavailable | Included | Included | Included |
| Share data preparations and datasets | Unavailable | Included | Included | Included |
| Self-service data access | Unavailable | Included | Included | Included |
| Smart sampling and full-runs | Unavailable | Included | Included | Included |
| Operationalize preparations into any big data integration flow | Unavailable | Included | Included | Included |
| Design Faster & Scale Easily | 900+ Components & Connectors | + Continuous Delivery, testing, sharing, and debugging | + Repository Manager | + Repository Manager |
| On Demand Documentation | Included | Included | Included | Included |
| Business Modeler | Included | Included | Included | Included |
| Eclipse-based developer tooling | Included | Included | Included | Included |
| ETL & ELT support | Included | Included | Included | Included |
| Job designer | Included | Included | Included | Included |
| Versioning | Included | Included | Included | Included |
| Audit | Unavailable | Included | Included | Included |
| Automatic documentation | Unavailable | Included | Included | Included |
| Change data capture (CDC) | Unavailable | Included | Included | Included |
| Continuous Delivery Data Integration | Unavailable | Included | Included | Included |
| Drools business rule management system (BRMS) | Unavailable | Included | Included | Included |
| Distant run | Unavailable | Included | Included | Included |
| Dynamic schema | Unavailable | Included | Included | Included |
| Impact analysis | Unavailable | Included | Included | Included |
| Interactive data viewer | Unavailable | Included | Included | Included |
| Jobs compare | Unavailable | Included | Included | Included |
| Metadata Bridge | Unavailable | Included | Included | Included |
| Parallelization | Unavailable | Included | Included | Included |
| Reference projects | Unavailable | Included | Included | Included |
| Re-usable joblets | Unavailable | Included | Included | Included |
| Team collaboration with shared repository | Unavailable | Included | Included | Included |
| Testing, debugging and tuning | Unavailable | Included | Included | Included |
| Centralized metadata management | Unavailable | Included | Included | Included |
| Wizards | Unavailable | Included | Included | Included |
| Repository manager | Unavailable | Unavailable | Included | Included |
| Visual mapping for complex XML and EDI | Unavailable | Unavailable | Included | Included |
| Collaborate Better & Manage More | Unavailable | Manage Administration, Deployment, & Automate Tasks | + High Availability, Load Balancing, & Failover | + High Availability, Load Balancing, & Failover |
| Amazon EC2 lifecycle control | Unavailable | Included | Included | Included |
| Check points, error recovery | Unavailable | Included | Included | Included |
| Context management (dev, QA, prod) | Unavailable | Included | Included | Included |
| Deployment manager and team collaboration | Unavailable | Included | Included | Included |
| Execution plan, time and event-based scheduler | Unavailable | Included | Included | Included |
| Log server with dashboard | Unavailable | Included | Included | Included |
| Activity Monitoring Console | Unavailable | Included | Included | Included |
| Talend Administration Center | Unavailable | Included | Included | Included |
| High availability, load balancing, failover for Jobs | Unavailable | Unavailable | Included | Included |
| Increase Trust with Data Quality | Unavailable | Unavailable | Data Profiling, Cleansing, Matching, Masking & Stewardship | Data Profiling, Cleansing, Matching, Masking & Stewardship |
| Batch execution of analyses | Unavailable | Unavailable | Included | Included |
| Big data quality capabilities (parsing & matching) | Unavailable | Unavailable | Included | Included |
| Comprehensive survivorship | Unavailable | Unavailable | Included | Included |
| Data cleansing | Unavailable | Unavailable | Included | Included |
| Data masking | Unavailable | Unavailable | Included | Included |
| Data profiling | Unavailable | Unavailable | Included | Included |
| Data quality analytics with graphical charts and drilldown data | Unavailable | Unavailable | Included | Included |
| Data quality monitoring, reporting & dashboards | Unavailable | Unavailable | Included | Included |
| Data standardization | Unavailable | Unavailable | Included | Included |
| Data stewardship | Unavailable | Unavailable | Included | Included |
| Enrichment, fuzzy matching & de-duplication | Unavailable | Unavailable | Included | Included |
| Sampling | Unavailable | Unavailable | Included | Included |
| Semantic discovery | Unavailable | Unavailable | Included | Included |
| Cloud or on-premises third-party address validation services | Unavailable | Unavailable | Optional | Optional |
| Support | TalendForge Community, Help Center access | + Guaranteed Response Times, Web & Email Support, Optional 24/7 | + Phone Support, Faster Response, Optional 24/7 | + Phone Support, Faster Response, Optional 24/7 |
| Indemnification/Warranty | Unavailable | Included | Included | Included |

Why Talend

The more connected the world becomes, the more quickly a business must adapt. By design, Talend integration software simplifies the development process, reduces the learning curve, and decreases total cost of ownership with a single platform for batch and real-time data integration, in the cloud and on-premises.

Learn More