What’s New in
Talend Spring '18

Introducing Talend Cloud Data Stewardship

LEARN MORE

Enable Everyone to Work with Trusted Data

New cloud, big data, and governance enhancements will dramatically improve your team’s ability to deliver data-driven results.

Talend Cloud Data Stewardship 
Faster Big Data Integration
More Productive DevOps
Empower those who know the data best with a self-service data curation and validation app.Quickly build cloud data warehouses and data lakes with the latest technology.Make integration builds up to 50% faster with new continuous integration tools.

Introducing Talend Cloud Data Stewardship 

Increase trust in your data with Talend Cloud Data Stewardship, a team-based, self-service data curation and validation app that enables the people who know the data best to quickly identify, manage, and resolve any data integrity issue. Using a simple, web-based UI, you can define user roles, workflows, and priorities for data curation, then delegate tasks. Establish a single version of the truth, no matter which cloud or location your data is on. Nothing to install, just turn it on as a Talend Cloud service.(Data Stewardship is available as a Talend Cloud app or as Talend software you download and install)

data stewardship
fast big data integration

Faster Big Data Integration

Process more data more quickly across cloud data warehouses and data lakes. Gain rapid insight with new ELT push-down capabilities for Snowflake, new Spark and Spark Streaming support in Azure Data Lake Store, and enhanced data extraction capabilities for SAP.Talend now supports dynamic distribution for Cloudera, giving you instant access to the latest Cloudera features with no need to upgrade Talend, saving weeks to even months of admin time. Develop big data jobs once and deploy on-premises, on any cloud, or as a Talend Cloud-managed service.

More Productive DevOps

DevOps is supercharged through continuous integration improvements. Using the latest Maven standards, the integration build process is up to 50% faster! This release provides incremental builds in Studio, broader Git support, standard Maven commands for integration, and the ability to easily extend the build process with Maven Plugins. Talend Cloud testing and debugging goes from minutes to seconds with a free test engine and the ability to remotely debug big data jobs.

devops

Enhancements

This section lists new features in Talend Spring ’18 and Talend Winter ’18. To see what is in each release and product (downloadable software or Talend Cloud), visit help.talend.com

 
Big Data Integration

Improve the performance and productivity of your big data projects:

  • New dynamic distribution support—instantly add Hadoop distribution updates without upgrading Talend (initial support for Cloudera CDH)

  • Run Spark jobs in YARN cluster mode, removing the need for a job server on an edge node at runtime, simplifying and speeding up your deployment with no single point of failure

  • Dramatically increase your ability to extract data from SAP, at the application, database, and data warehouse level. New SAP bulk-extraction capabilities allow you to extract nearly unlimited amounts of data from SAP. Easily extract new or changed pre-packaged SAP data using the business content extractor with delta mode (Technical Preview). ELT push-down support for SAP enables processing natively within SAP, before moving data to the cloud

  • Snowflake component support is enhanced so you can do ELT push-down, where data processing and transformations are done on Snowflake clusters, leveraging the massive performance and scalability of Snowflake for faster analytics

  • Ingest into and query Cloudera Kudu, a Hadoop columnar storage manager used for rapid analytics on fast data scenarios like IoT, GDPR, and fraud detection. Advanced tuning options provide optimal performance

  • MapR-DB OJAI support, so you can perform advanced hierarchical transformations graphically and query MapR-DB OJAI from your job, delivering faster performance and greater flexibility for web, mobile, social, and IoT-based applications

  • Simplify AWS S3 security implementation by using IAM roles and secure token service for your job

  • Run your Talend workloads on Cloudera Altus on Azure (in addition to AWS today)

  • Process more data faster with Spark and Spark Streaming support for Microsoft Azure Data Lake Store

  • Track application IDs in Hive Query to better manage your Talend / Hive jobs

  • Get and set rowkeys in HBase, so you can leverage HBase best practices and work with time-series data

Data Integration

Improve your productivity and project security:

  • Job server security and productivity improvements including:

    • Role-based security: A Studio developer can only execute jobs belonging to a project which they have authorization

    • Enhanced job server data cleansing actions to ignore active running jobs and any linked dependencies or libraries

    • Scheduling and error handling improvements to restart tasks on unavailable job servers, and virtual job servers with weighted round-robin load balancing

    Talend Administration Center (TAC) improvements including:

    • Additional Single Sign-on (SSO) options, including support for Ping Identity PingFederate Server, and Microsoft Active Directory Federation Services

    • Greater visibility of what is happening through auditing and security logging, which traces all user interactions including access, modifications, and configuration changes

    • A new auditor role for configuring and accessing the audit log, providing a greater level of security

     

  • Talend Cloud reduces testing and debugging time from minutes to seconds with a free test engine and the ability to remotely debug big data jobs, and debug jobs on either Talend Cloud Engines or Remote Engines

  • Continuous integration updates including using Maven standards for incremental builds in Studio, broader Git support including Bitbucket Server 5.x, Nexus 3 support for the Talend Artifact Repository, standard Maven commands for data integration and application integration (technical preview), and the ability to easily extend the build process through Maven Plug-ins and custom Project Object Models (POMs)

  • Increase productivity by building custom Talend components. Develop once with the Talend Component Kit, then reuse across all Talend products and integration styles, batch to real-time, data integration to big data, on-premises to cloud

  • Save time by automatically matching similarly named columns with Smart tMap Fuzzy Auto-mapping, which uses data quality algorithms (Levenshtein, Jaccard) to do fuzzy matching

  • Increased flexibility and productivity in job design with the ability to change table names at runtime through ELTMap, and new routines to adapt to changing schemas

Data Quality

Increase the integrity of cloud and on-premises data as it flows through the business:

  • Improved survivorship rules with per column support so you have finer control of the master value you want to keep

  • New component tPatternMasking to define new types of masking patterns for privacy and security control

  • Import and export semantic types from the Dictionary Service UI, making it easier to manage the promotion of semantic types across environments

  • Talend Dictionary Service REST APIs are now publicly available and self-documented via Swagger. You can leverage Talend Dictionary Service in data/application integration scenarios and populate the Talend Dictionary Service programmatically

  • The Dictionary Service UI has been translated to French

Data Preparation

Deliver the best data preparation user experience at extreme scale:

  • With the Cloud Dictionary Service, you can define new business terms for your data to facilitate data understanding and usage by both people and machines

  • Expanded connectivity options with Redshift and Snowflake self-service connectors

  • Dynamic preparation selection in a Talend job, to improve maintenance and productivity

  • Improved flexibility with new data preparation functions: Basic deduplication, Standardization via data dictionaries, Fill from above, Generate a sequence, Percentages management

  • Support for custom enclosure and escape characters for CSV files makes it possible to handle non-standard or complex CSV files without the need to standardize the file outside Talend Data Preparation

  • UI now supports both French and Japanese

Data Stewardship

Quickly identify, manage, and resolve any data integrity issue:

  • Empower those who know the data best with Talend Cloud Data Stewardship, a team-based, self-service data curation and validation app where you quickly identify, manage, and resolve any data integrity issue

  • With the Cloud Dictionary Service, you can define new business terms for your data to facilitate data understanding and usage by others, both people and machines

  • Users can now import and export campaigns and data models directly from the Talend Data Stewardship UI. This makes it easier to comply with IT policies by managing the promotion of configuration across different environments (downloadable software only)

  • UI now supports French and Japanese

MDM

Design, ingest, author, curate, and update your master data faster:

  • License and identity management via Talend Administration Center for improved security

  • Single sign-on with Data Preparation and Data Stewardship saves time

  • REST API improvement (“IN” operator)

  • Survivorship rules per column in MDM integrated matching

  • Audit all user actions, including login/logout and configuration deployment, for security compliance

Talend Data Mapper

Increase the performance of your complex mappings:

  • tHMapRecord, in addition to receiving, can send complex hierarchical structures to queue outputs such as Kafka (tKafkaOutput), and Kinesis (tKinesisOutput)

  • tHMap can create multiple outputs from a single input improving productivity

  • New transformation and expression language functions including upper-case, lower-case, translate, and contains

  • Improved conversion between hierarchical data and flat records

Extend Your Data Integration Reach

supported systems

To see what components are in each Talend product, visit help.talend.com.

 

New and Updated Hadoop Distributions

  • Amazon EMR 5.8

  • Cloudera CDH 5.12, 5.13

  • MapR 6.0

  • Spark 2.2

New and Updated Components

  • Amazon S3

  • Cloudera Kudu

  • Couchbase

  • FTP

  • Hbase

  • Hive

  • MapR-DB OJAI

  • Marketo

  • Marklogic

  • Microsoft Azure Data Lake Store

  • Microsoft Dynamics CRM 2016 (on-premises)

  • MongoDB

  • Neo4J

  • Oracle Cloud

  • SAP Business Suite

  • SAP Hana

  • SAP s/4Hana

  • Snowflake

  • Sybase

  • Vertica