Talend Data Integration Basics

Talend Studio for Data Integration dramatically improves the efficiency of data integration Job design through an easy-to-use graphical development environment. It enables rapid deployment and reduces maintenance costs with integrated connectors to source and target systems, with support for all types of data integration, data migration, and data synchronization.

This course helps you use the Talend Studio for Data Integration as quickly as possible. It focuses on the basic capabilities of the Studio and how you can be use it to build reliable, maintainable data integration tasks that solve practical problems, including extracting data from common database and file formats, transforming it, and integrating it into targets.

This course serves as a prerequisite for many other Talend courses, and the skills learned apply to most products.

Duration 2 days (14 hours)
Target audience Anyone who wants to use the Talend Studio to perform data integration tasks: software developers and development managers
Prerequisites Basic knowledge of computing, including familiarity with Java or another programming language, SQL, and general database concepts
Course objectives
After completing this course, you will be able to:
  • Create a project
  • Create and run a Job that reads, converts, and writes data
  • Merge data from several sources within a Job
  • Save a schema for repeated use
  • Create and use metadata and context variables within Jobs
  • Connect, read from, and write to a database from a Job
  • Access a public web service from a Job
  • Work with master Jobs and SubJobs
  • Build, export, and test run Jobs outside the Studio
  • Invoke basic error-handling techniques
  • Use best practices for Job and component naming, hints, and documentation
Course agenda

Getting started

  • Starting the Talend Studio
  • Creating a first Job
  • Running a Job

Working with files

  • Reading an input file
  • Transforming data
  • Running a Job
  • Combining columns
  • Duplicating a Job

Joining data sources

  • Creating metadata
  • Joining data sources
  • Capturing rejects
  • Correcting a lookup

Filtering data

  • Filtering output data
  • Using multiple filters

Using context variables

  • Understanding and using context variables
  • Using repository context variables

Error handling

  • Detecting and handling basic errors
  • Raising a warning

Generic schemas

  • Setting up sales data files
  • Creating customer metadata
  • Creating product metadata

Working with databases

  • Creating database metadata
  • Creating a customer table
  • Creating a product table
  • Setting up a sales table
  • Joining data
  • Finalizing a Job

Creating master Jobs

  • Controlling Job execution using a master Job

Working with web services

  • Accessing a web service
  • Capturing and viewing detailed XML output

Running Jobs standalone

  • Building a Job
  • Modifying a Job

Documenting a Job

  • Best practices for documenting a Job