Talend Open Studio

Talend Open Studio provides advanced capabilities that dramatically improve the productivity of data integration job design and proven scalability to ensure optimal execution.

Not sure if you need open source Talend Open Studio or Talend Integration Suite for data integration, data migration or data synchronization? Check out the features comparison matrix.

Want to learn more about Talend Open Studio for data integration, data migration or data synchronization? Then watch an online demo or check out our users' testimonials.

Business modeling

Talend Open Studio: Business Modeler

Talend Open Studio's Business Modeler leverages a top-down approach, allowing line-of-business stakeholders to get involved in the design of the integration processes and to monitor development progress. The Business Models are non-technical and business-oriented views built using the convenient library of shapes and links.

The Business Modeler also regroups all relevant documentation supporting the open source data integration, data migration and data synchronization processes in a business-friendly diagram. This is a very efficient way of monitoring the Jobs and performing impact analysis if a problem arises.

 

 

Graphical development

Talend Open Studio: Mapper

Talend Open Studio’s Job Designer provides both a graphical and a functional view of the actual integration processes using a graphical palette of open source components and connectors. Integration processes are built by simply dragging and dropping these open source components and connectors onto the workspace, drawing connections and relationships between them, and setting their properties.

Components and connectors cover all types of tasks and operations on the data itself or on the sequencing of the workflow. Connectors help access and read/write all data source and target systems for data integration, data migration and data synchronization. Properties are configured centrally in one view when selecting each component involved in the Job or can be inherited from the Metadata Manager. To maintain the readability of a Job design, the diagram can be divided into Subjobs, and then can be set out as child and parent Jobs to sequence their execution. A built-in console view lets users quickly monitor execution and track performance directly in the open source data integration tool.

 

Metadata-driven design

Talend Open Studio: File Wizard

Database Wizard

Talend Open Studio is an open source metadata-driven solution for data integration, data migration and data synchronization. All Metadata is stored and managed in a Metadata Manager- the Repository-shared by all the modules. The Repository centralizes all project information and ensures consistency across all integration processes.

Metadata related to source and target systems of the integration processes is easily loaded in the Metadata Repository through advanced database or file introspection facilitated by a number of wizards. The Metadata Repository is based on an open relational model, through which impact analysis can be performed to facilitate maintenance and identify Job dependencies in the data integration, data migration and data synchronization Jobs.

 

Advanced and versatile connectivity

Talend Open Studio: File Wizard

Talend Open Studio offers native technical and business open source connectors to all IT environments. This wide array of connectors is the key to the successful interoperability of applications and databases; it allows bridging diverse and heterogeneous data structures at unmatched performance rates. It is also continually expanding, enriching the features of the open source data integration, data migration and data synchronization solution.

Refer to http://www.talendforge.org/components for the complete list of supported connectors.

Talend Open Studio leverages industry-standard languages that include Java, Perl and SQL. This allows users to easily enrich existing components or to create their own. A dedicated view in Talend Open Studio - Talend Exchange - helps users plug these newly created open source components natively into the environment. Users can also write routines and other pieces of code for data migration or data synchronization and store this information centrally in the Repository for reuse.

 

Real-time debugging

Talend Open Studio: RealTime Debugging

Talend Open Studio: Debug Mode

Talend Open Studio includes powerful testing, debugging and tuning features that allow the real-time tracking of data flowing through the whole transformation processes, including execution statistics and an advanced trace mode.

When an integration job is executed through the open source Job Designer interface - in graphical mode - statistics are displayed in real-time, showing the number of processed rows and rejected rows, as well as the throughput (rows per second) - allowing you to immediately spot any bottleneck in the data integration, data migration and data synchronization job. It is also possible to activate a trace mode, which displays row-by-row behavior and shows the result of the transformations. Traditional debugging breakpoints and variables are also available

And, of course, all code generated by Talend Open Studio, regardless of the target language, is always visible and accessible from the design environment.

 

Deployment and maintenance

Advanced execution context management (test, staging, production, etc.) facilitates integration processes deployment for data integration, data migration and data synchronization. Implicit loading of context parameters directly in the open source job design helps develop the various execution environments and manage them easily. Deployment of processes across enterprise’s systems can easily be performed as data services or as data integration, data migration or data synchronization services via the convenient export tool.

Automated documentation generation provides complete and up-to-date technical reference documentation (in XML and HTML) that helps various users and stakeholders to maintain and update inherited processes.

The impact analysis feature helps users identify dependencies among integration processes developed within Talend Open Studio and simplifies the global update of the large number of processes stored centrally in the Repository.

 

Robust and scalable execution

Talend Open Studio: Job Designer

Unlike many data integration, data migration or data synchronization solutions which are based on a centralized integration server, or can only use RDBMS engines to process data, Talend Open Studio allows users to export processes into executable files that can be distributed across a grid of systems. These systems do not need to be dedicated to executing integration processes. Instead, Talend Open Studio leverages available resources, regardless of their nature.

Talend Open Studio leverages both the traditional ETL (Extract-Transform-Load) approach as well as the ELT (Extract-Load-Transform) approach. ELT leverages the power of the RDBMS engines to execute the data transformations inside the database, achieving unmatched performance for high volume batches. For each subset of a process, it is possible to choose the most suitable approach, and hence to obtain the highest level of performance and scalability for data integration, data migration and data synchronization.

This architecture design, which is especially suited to leverage grids of inexpensive servers, as well as high-range systems, enables data to be processed at a location closest to its source (thus decreasing data transfers), and maximizes the use rate of computing resources.