What’s New in Talend Fall '18.

Scale up so you can deliver more insights faster to the right people.

Delivering insight-ready data at scale.

Talend helps companies streamline and scale data collection, processing, and management.

Create a single source
of trusted data.

Talend Data Catalog makes 100x more data accessible.

Reduce data processing
costs up to 80%.

Run Spark workloads serverless in cloud via support for Databricks and Qubole Data Services.

Build APIs in days –
not months.

Easily automate API development with Talend Cloud API Services.

Trusted Sources Infographic

Create a single source
of trusted data.

Talend Data Catalog automatically discovers, profiles, organizes and documents your metadata and makes it easily searchable. Ensure quality and trust in your data with team-based data curation and data lineage tracking tools. Allow data consumers to get to the right data quickly.

Learn More

Build APIs in days – not months.

Talend Cloud API Services provides a faster and easier way to build APIs with full development lifecycle support from design to deployment. Create APIs without the need to learn complex OAS or RAML. Build consumer-friendly APIs with streamlined prototyping and automatic documentation generation.

Learn More

data integration

Reduce data processing costs up to 80%.

Get faster insight at a fraction of the cost with big data and Spark on demand. Run Spark workloads serverless in the cloud through support for Microsoft Azure Databricks and Qubole Data Services on AWS.

Minimize server management tasks and fast-track development and deployment projects with one click deployment from Talend Studio. Automatically scale up/down cloud resources to meet business demands with native cloud elasticity.

Learn More: Disrupting Big Data Economics with Serverless

Enhancements

Read about the many new features across all products in Talend Fall '18. To get more details on specifications, components and connectors for each release and product, visit https://help.talend.com/.

Data Integration
Big Data Integration
Data Quality
Data Catalog
Data Preparation
Data Stewardship
MDM
Cloud API Services
ESB
Talend Data Mapper
Data Integration
  • Scale your cloud data warehouse with native support for Snowflake on Microsoft Azure. Leverage the massive performance and scalability of Snowflake for faster analytics. Seamlessly integrate with other Azure data sources and easily offload on-premises data to Snowflake with Talend.
  • Updated Snowflake components for increased performance and productivity:
    • Create Snowflake tables in Talend jobs for easier data loading
    • Bulk loading performance improvements. Leverage compression and all Snowflake bulk loading options automatically
  • Updated SAP components for increased performance and productivity:
    • SAP HANA ELT “push-down” support enables you to leverage HANA’s parallel processing capabilities without needing to code in SAP
    • Easily bulk load data to SAP HANA for faster analytics
    • Secure file transfer (SFTP) support so you can securely extract large datasets from SAP
    • Easily extract large volumes of SAP business data in full or delta mode leveraging Operational Data Provisioning (ODP)
  • Easily integrate Oracle Autonomous Data Warehouse (ADW) for elastic scaling and fast query performance. Talend supports ELT-pushdown to leverage the parallel processing power of ADW
  • With one click in Talend Studio, build and push your job as a container to a Docker repository
  • Microsoft users can now store their Talend code in Azure DevOps Services (formally VSTS) and Team Foundation Server (TFS) Git repository
  • Support for JFrog Artifactory repository manager to store and manage Talend library and job packages
Big Data Integration
  • Reduce big data processing costs by running Spark workloads serverless in the cloud through support for Microsoft Azure Databricks, and Qubole Data Services on AWS.
  • Instantly add Hadoop distribution updates as they are released without upgrading Talend through dynamic distribution support for Cloudera CDH 6.0.x (tech preview) and Hortonworks 2.6.x and 2.5.x
  • Leverage recent MapR and Spark 2.2 improvements though support for MapR 6.0.1 and MEP 5.0
  • Improved HDFS components with support for WebHDFS & Azure Data Lake Store makes it easy to connect and switch file systems
  • Access latest Google BigQuery features with support for region awareness, standard SQL and cloud security to help comply with regulations like GDPR
  • More robust and resilient Hive support – set high-availability on the Hive metastore in Spark batch and Spark streaming jobs
  • Check big data integrity using the tSchemaComplianceCheck component on Spark. This ensures that the metadata is consistent in the Spark job with the schema. If the data is not right, it will be rejected for curation.
  • Support for MapR-DB OJAI 2.0 and Input components where you can read/write data for high-performance document processing, so you have instant querying and results.
Data Quality
  • Audit all user actions in the Dictionary service for security compliance, including login, logout, configuration update and deployment. (on-premises only)
  • Understand new data patterns, e.g. outliers detection, through improved profiling with support for word-based patterns
  • Import and export multiple semantic types at once improving productivity
  • Retrieve international phone numbers using the tGoogleAddressRow component.
  • Support for Denodo database expands connectivity options for data profiling
  • Japanese version includes new data matching, masking and standardization functions
  • Talend Data Quality now available in Chinese
Data Catalog

Talend Data Catalog (formally Talend Metadata Manager) adds smart profiling and machine learning capabilities. It automatically discovers, profiles, organizes and documents your metadata and makes it easily searchable.

  • Perform guided, real-time search on any facet of a data asset as a more efficient way to organize, find and consume data assets
  • Improve data accessibility by automatically profiling and documenting data sets
  • Smart and secure metadata discovery enriches metadata for better data protection, classification, accessibility, searchability and lineage
  • Better control data through social curation where authorized users and stewards can tag any metadata or relationships with warnings, endorse and certify relationships, and provide impact on search rankings
  • Data relationships in a data lake or across disparate environments are automatically captured and ranked by popularity, improving end-to-end data lineage, governance and compliance
  • Automatic crawling and discovery of data in the data lake including flat files, NoSQL and other common structures: CSV, XLSX, JSON, AVRO, Parquet, etc.
  • Automatic metadata documentation creation for any data asset (datasets, columns, reports, etc.) improves data accessibility and governance with enriched metadata.
  • 30+ new and updated metadata connectors for big data, cloud, analytics and enterprise apps reduces the costs for creating and maintaining the data inventory
Data Preparation
  • Improve security by auditing all user actions, including login, logout, and actions on preparations or datasets (on-premises only)
  • Understand new data patterns, e.g. outliers detection, through improved profiling with support for word-based patterns
  • Advanced data masking function for improved data privacy, allowing you to select the masking routine and perform repeatable, consistent masking
  • Build preparations faster through improved UI performance
  • New data quality functions and profiling capabilities for Japanese characters
  • Talend Data Preparation now available in Chinese
Data Stewardship
  • Improve security by auditing all user actions, including login, logout, and actions on campaigns and data model definitions (on-premises only)
  • Understand new data patterns, e.g. outliers detection, through improved profiling with support for word-based patterns
  • Get started quicker with predefined data models and campaigns for all campaign types
  • Find specific tasks and errors more quickly with global search functionality
  • Smarter survivorship functions to improve data trustworthiness
  • New data stewardship functions and profiling capabilities for Japanese characters
  • Talend Data Stewardship now available in Chinese
MDM
  • Find master data faster with fuzzy search on all attributes of a view and additional search operators on foreign keys
  • Export search results to CSV in addition to XLSX
  • Support adding complex fields in a data model without impact on the existing data.
  • Talend MDM now available in Japanese and Chinese
Cloud API Services

Talend Cloud API Services is a new offering that covers the full API development lifecycle (design, test, documentation, implementation and deployment), providing a significant time savings in building and maintaining APIs.

  • Cloud API Designer
    • Visual, contract-based design tool eliminates the need to learn complex standards (Open API Specification (OAS) / Swagger and RAML)
    • Inline mocking allows iterative prototyping making it easy to get API consumers to validate APIs
    • Automatically generates and hosts API documentation facilitating usage by others
    • Provides easy integration with API Gateways through support for OAS / Swagger and RAML
    • Share API definition with team members for collaborative feedback
  • Cloud API Tester
    • Visual tool to debug and discover APIs. Call any type of HTTP API (REST, SOAP...) and inspect responses
    • Use assertions with wizards to perform any type of check on your API
    • Easily create test and run scenarios composed of many API requests to simulate real-life usage
    • Share and collaborate API tests with your team
    • Provides API testing automation with standard Maven integration and Junit reporting support for DevOps best practices
  • Integrated Implementation and Deployment
    • Easy import into Talend Studio to visually add advanced routing, transforming and mediation steps to your API
    • Embed data quality to ensure the integrity of data sent between endpoints
    • Supports continuous integration / continuous development (CI/CD) best practices to speed development and deployment
    • Easily deploy on-premises on in the Cloud in a few clicks
ESB
  • Continuous integration / continuous delivery (CI/CD) updates using Maven standards for data services and routes
  • Kafka 1.0 and Camel 2.21.2 support so you can leverage the latest Kafka update
  • Logging performance improvements
  • JFrog Artifactory support
Talend Data Mapper
  • Data masking for hierarchical data (e.g. JSON)
  • Multi-input hierarchical mapping for data enrichment and joining

Extend your data integration reach

To get more details on specifications, components, and connectors for each release and product, visit https://help.talend.com/.

New and Updated Hadoop Distributions

  • Amazon EMR 5.15
  • Cloudera CDH 6.0
  • Hortonworks 2.6x
  • MapR 6.0.1 with MEP 5.0
  • Databricks 3.5 LTS
  • Qubole Data Services 1.0
  • Spark 2.3

New and Updated Components

  • Couchbase
  • Google BigQuery
  • Greenplum
  • MapR-DB OJAI
  • MariaDB
  • MarkLogic
  • MySQL
  • Oracle ADW
  • PostgreSQL
  • Salesforce
  • SAP HANA
  • Snowflake