Talend Data Integration Advanced

Discover how to use more advanced features of Talend Data Integration which provides an extensible, highly scalable set of tools to access, transform, and integrate data from any business system to meet both operational and analytical data integration needs.

Duration1 day (7 hours)
Target audienceAnyone who wants to use Talend Data Integration to perform data integration and management tasks: software developers and development managers
PrerequisitesCompletion of Talend Data Integration Basics and knowledge of computing, including familiarity with Java or another programming language, SQL, and general database concepts
Course objectives

After completing this course, you will be able to:

  • Start Talend Studio and connect it to a remote repository
  • Run a Job from Talend Studio on a remote JobServer
  • Use debugging tools that do not require deep Java development skills
  • Implement several methods of parallel execution in a Talend Job
  • Design a Job using the memorization logic
  • Create Joblets
  • Use custom Java codes in Talend Studio Jobs
  • Configure a database table to be monitored for changes in a separate change data capture (CDC) database
Course agenda

Using Git in Talend Studio

  • Using Git in Talend Studio

Remote Job execution

  • Creating and running a Job remotely

Basic debugging

  • Debugging Jobs using Debug Run

Parallel execution

  • Writing large files
  • Writing to databases
  • Automatic parallelization
  • Partitioning

Memorizing data

  • Preparing and memorizing a sequence of rows with the tMemorizeRows component
  • Reusing memorized rows in tMap

Joblets

  • Creating a Joblet from an existing Job
  • Creating a Joblet from scratch
  • Triggering Joblets

Using Custom Java Code

  • Using Java in Talend Studio
  • Executing simple instructions with tJava
  • Processing data with custom Java code using the tJavaRow and tJavaFlex components
  • Creating and using Java routines

Change data capture

  • Examining databases
  • Configuring the CDC database
  • Monitoring changes
  • Updating a warehouse
  • Resetting the database