Talend Document Library http://www.talend.com/library/reflibrary.php Talend, the first provider of open source data integration software, leverages the open source model to make data integration available to all types of organizations, regardless of their size, level of expertise or budgetary constraints. Talend solutions connect to all source and target systems and they can be downloaded at no cost. lang <![CDATA[Matching Technology Improves Data Quality]]> Matching technology plays an important role in achieving a single view of customers, parts, transactions or almost any type of data. Often used to identify duplicates and near-duplicates, matching technology is vital to providing data that is fit-for-use in enterprise applications.

This white paper outlines the basic theories and strategies of record matching. It describes the nuances of deterministic and probabilistic matching and the algorithms used to identify relationships within records. It covers the processes you can use in conjunction with matching technology to transform raw data into powerful information that drives success in enterprise applications like CRM, data warehouse and ERP.

]]>
Thu, 29 Jul 2010 22:22:30
<![CDATA[Master Reference Data - Extract Value from your Most Common Data]]> Reference data is the lifeblood of an organization but inconsistencies across business units or systems can lead to inefficient processes and inaccurate analytics. Most organizations understand the importance of consistent reference data but struggle to ensure consistency and to police compliance to a central reference data standard. Master reference data makes operational data more accurate, simplifies synchronization and migration projects and improves effectiveness of MDM.

Creating master reference data can be very simple. Typically, an organization will start small and then grow the master as acceptance spreads throughout the organization. This white paper presents some key considerations for creating a reference data master.

]]>
Tue, 13 Jul 2010 00:13:35
<![CDATA[Open Source Data Management in the Public Sector]]> Historically, public administrations have been the leading supporters of open source technology, especially - but not only - in Europe. The economic nature of these products has been one of the main factors of adoption, but as they have matured, these products offered other benefits that positioned them as viable alternatives to proprietary technology.

This White Paper presents the logic and reasons behind the adoption of open source by the public sector. It analyzes its benefits and presents concrete examples from public institutions in different countries.

]]>
Thu, 01 Jul 2010 17:39:14
<![CDATA[Testing Talend MDM Enterprise Edition - an independent software review by IAIT]]> Master Data Management has become increasingly important for ensuring the consistency and quality of data across the various parts of the information system. Leveraging its open source solution, Talend is set to democratize the MDM market, which has been dominated by expensive products for large companies.

IAIT performed an in-depth review of Talend MDM Enterprise Edition. They studied what this solution can do in practice and how it performs with regard to daily tasks.

In the Test Lab of IAIT, the Institute for Analysis of IT, Dr. Götz Güttich simulates heterogeneous IT environments of small and medium-sized enterprises, to test hardware and software solutions for the enterprise.

]]>
Tue, 29 Jun 2010 12:05:04
<![CDATA[Unified Data Management - A Collaboration of Data Disciplines and Business Strategies - TDWI Best Practices Report]]> In most organizations today, data and other information are managed in isolated silos by independent teams using various data management tools such as data quality, data integration, master data management, etc. In response to this situation, some organizations are adopting unified data management (UDM), a practice that holistically coordinates teams and integrates tools.

The purpose of this report is to help organizations plan and execute effective UDM efforts. Many need the help, because UDM is a relatively new shift in best practices for data management. Toward that end, the report drills into the business initiatives that need UDM, the data management practices and tools that support it, and the organizational structures that enable the cross-functional collaboration that’s critical to UDM success. The report also provides practical recommendations for the success of unified data management projects.

TDWI Best Practices Reports are designed to educate technical and business professionals about new technologies, concepts, or approaches that address a significant problem or issue. Research for the reports is conducted via interviews and surveys of leading-edge user companies.

]]>
Wed, 12 May 2010 11:27:00
<![CDATA[Telesperience data sheet: driving down costs through better data integration]]> Data integration plays a vital role in the commercial agility and operational efficiency of enterprises. In this data sheet, UK-based analysts Telesperience summarise the findings from a primary research project into data integration in high transaction industries. Among other topics, this paper looks at what is driving data integration today, outlines the cost implications of poor data integration, examines how confident enterprises are that they can deliver data integration projects on time and to budget, and reveals what data integration goals are for 2010-11.

]]>
Wed, 28 Apr 2010 09:36:56
<![CDATA[Overcoming Objections to Data Governance]]> Data governance is an outstanding business strategy that leads your company toward greater efficiency, lower risk and increased revenue. Proper management of data underpins the success of many strategic initiatives. However, it is not always easy for others to see its value.

This white paper discusses the techniques that have been successful for data champions as they "sell" the importance of data governance to their company. It examines guidelines for optimal project selection, tracking the value of data governance and techniques for overcoming objections to a data governance program.

]]>
Fri, 26 Mar 2010 16:34:45
<![CDATA[Building a Robust Business Case for High Quality Master Data - An Information Difference White Paper]]> Projects addressing data quality or master data management frequently struggle to get approved by senior management, and only around 60% of such projects proceed with a proper business case - causing high rates of cancellation and failure.

This white paper explains the key elements for building a proper, quantified business case. It presents the measures that are favored by corporate finance departments, and helps develop a strong business case for data quality and MDM projects. A number of real-life examples with quantifiable benefits are also included.

Armed with the materials in this white paper, you will be in a good position to deliver a high quality business case for your data quality or MDM project!

The Information Difference is an analyst firm focusing on Master Data Management (MDM). Its founders are pioneers who helped shape the MDM industry, with in-depth MDM global project experience.

]]>
Wed, 10 Mar 2010 15:43:20
<![CDATA[Loading and Analyzing Web Data: Considerations and Recommendations - A Bloor Research White Paper]]> Founded in 1989, Bloor Research is one of the world's leading IT research, analysis and consultancy organizations.

In this Analyst White Paper, Research Director Philip Howard explains why web-based data, which includes not only clickstream data but also Web 2.0 user behavior information, is essential to analytic applications. He explains the complexity of incorporating this type of data in conventional analytic environments, and the technologies involved in optimizing the process.

The white paper also describes a number of real-life use cases and their constraints and analyzes which technologies provide required capabilities.

]]>
Tue, 09 Feb 2010 23:34:47
<![CDATA[Using data integration to drive down costs and increase profits in high transaction industries - A Telesperience issues paper, sponsored by Talend]]> Data integration (DI) is often seen as merely a technical discipline, but this ignores the vital role it plays in helping enterprises achieve their business goals. It is essential that business managers, as well as technical staff, understand how better DI can help deliver against key commercial goals, such as helping the organization become more efficient, agile, innovative and customer centric.

Telesperience research indicates a pattern of high and increasing demand for DI from the business, as well as a recognition that DI is a key strategy for lowering costs and improving business performance. However, most enterprises also report that their DI projects over-run on either cost or time (or both) and that DI costs are either too high or more than anticipated.

This paper explains why DI is an important weapon in an enterprise's competitive arsenal by analysing how poor DI inflates costs, by revealing the current DI drivers, and by explaining how DI can contribute to commercial success.

Telesperience's Research Director Teresa Cottam says: "Our research demonstrates that companies that are able to deliver DI projects quickly, reliably and at low cost will out-perform their rivals, because they will be able to exploit new technologies and insights to gain competitive advantage and reduce their costs."

]]>
Tue, 02 Feb 2010 18:58:41
<![CDATA[Open Source Master Data Management - The Time is Right]]> Many organizations consider high quality master data as a key strategy for accomplishing corporate objectives. Proper master data management (MDM) is indeed extremely valuable when available to enterprise business processes and analytics; however, master data projects are often politically challenging, architecturally complex, time intensive and expensive.

MDM is a natural extension to data integration and data quality. Open source MDM introduces a new, more accessible approach. It reduces implementation complexity, time to value and cost. In fact, as in many other markets, open source helps organizations overcome obstacles and realize their goals.

This White Paper provides an explanation of why the time is right for open source MDM, and predicts the effects of open source on the MDM marketplace.

]]>
Wed, 20 Jan 2010 21:03:35
<![CDATA[The 451 Group - Impact of Economic Conditions on Open Source Adoption]]> The 451 group is an independent technology-industry analyst company focused on the business of enterprise IT innovation. The company's analysts provide critical and timely insight into the market and competitive dynamics of innovation in emerging technology segments.

This report, titled Climate Change: User Perspectives on the Impact of Economic Conditions on Open Source Software Adoption, is based on a survey of more than 1,700 open source software users and customers, assessing their current attitudes on the key benefits of open source software, including cost and flexibility.

The report serves as a practical guide for understanding the financial benefits of open source. It also includes an updated version of The 451 Group's guide for calculating the financial benefits of open source in enterprise IT projects. It provides a basic financial analysis approach and calculator to identify and capture the costs and potential benefits of open source software.

Note: This report is made available to you through your relationship with Talend. It was researched and published by The 451 Group, an independent industry analyst company, and is part of The 451 Group's Commercial Adoption of Open Source service. The report is available to the named recipient only and must not be shared outside your organization.

]]>
Mon, 11 Jan 2010 17:37:27
<![CDATA[Practical Open Source Data Integration: Case Studies & Implementation Examples (Vol. 3)]]> Over the past few years, commercial open source vendors have been providing a real alternative to proprietary players. In the data integration space, enterprise grade open source solutions are adopted by many organizations for their data integration and data quality projects. These organizations span all industries, all continents, and all company sizes. More importantly, their projects range from ETL for business intelligence to operational data, data quality, master data management, etc.

This third volume of Practical Open Source Data Integration: Case Studies & Implementation Examples presents selected case studies, illustrating real-life implementations of open source data integration and its associated benefits.

]]>
Tue, 08 Dec 2009 15:20:07
<![CDATA[Master Data Management Projects in Practice - An Information Difference Research Study]]> Master Data Management (MDM) has received growing attention recently as an essential component of information management alongside data governance and data quality. More and more, organizations are turning to Master Data Management as a key enabler in improving the timeliness, quality and reliability of business intelligence with the ultimate goal of improving business performance. Increasing regulatory requirements and the recent financial crisis have ensured that Master Data Management is increasingly finding its way onto the business agenda.

Information Difference conducted a survey of both end-user organizations and systems integrators aimed at gaining deeper insight into MDM implementations and their success factors. This report summarizes and analyzes the results of that survey, and presents practical recommendations on the "dos and don'ts" of MDM. It also contains some enlightening findings on the use of open source and manual coding in MDM projects.

The Information Difference is an analyst firm focusing on Master Data Management (MDM). Its founders are pioneers who helped shape the MDM industry, with in-depth MDM global project experience.

]]>
Tue, 08 Dec 2009 13:50:37
<![CDATA[Leveraging Open Source Data Quality - Practical Examples]]> Poor-quality data affects all data-related projects. Data quality refers to the state of completeness, validity, consistency, timeliness and accuracy that makes data appropriate for a specific use.

This Technical White Paper presents the main challenges of data quality and drills into specific uses of data profiling and data cleansing. It highlights, through 32 distinct use cases, how open source data quality technology can be used to alleviate issues related to poor-quality data.

]]>
Wed, 30 Sep 2009 18:06:08
<![CDATA[High Performance for Integrating Massive Data Volumes - A Technical White Paper]]> Processing very large data sets imposes unique constraints, especially when the time windows available for this processing are shrinking.

This Technical White Paper presents a variety of technologies that are available for accelerating the processing of large data volumes, including massive parallelization, optimized data-set-mode processing based on MapReduce, grid deployment, and load balancing.

]]>
Fri, 11 Sep 2009 11:07:39
<![CDATA[The Top 10 Reasons for Choosing Open Source Data Integration]]> When choosing a data integration solution that best meets their needs, companies are confronted with many product choices, and multiple sources of information - including software vendors who will assure that their product is the one they need! In reality, the proposed solutions may or may not be appropriate.

To help you evaluate various solutions and technologies available on the market, and to compare open source data integration technology with other alternatives, this White Paper presents the top 10 reasons why organizations are choosing open source for their data integration needs. To support these top reasons, the White Paper includes proof points (31 in total) provided by actual users of open source data integration.

]]>
Fri, 14 Aug 2009 21:27:56
<![CDATA[Open Sesame: Why Open Source BI, Data Integration, and Data Warehousing Solutions are Gaining in Acceptance - by Dr Claudia Imhoff]]> In the beginning, open source solutions were the shiny playthings of the techno crowd. After all, they were fun, new, and most importantly, free. No one would have considered using them in any real-world corporate IT shops...

Today, open source solutions are not only being considered, they are being implemented by large and small enterprises -- at rates that are mind-boggling. This paper examines the challenges faced by organizations today regarding their BI, data integration, and data warehousing environments, why traditional solutions fall short, and the rise of upstart open source companies.

A thought leader, visionary, and practitioner in the rapidly growing fields of business intelligence and customer-focused strategy, Claudia Imhoff, Ph.D., is a popular and dynamic speaker and internationally recognized expert on analytical CRM, business intelligence, and the infrastructure to support these initiatives.

]]>
Mon, 15 Jun 2009 15:26:19
<![CDATA[Silver Bullet Report: Talend, Open Source Data Integration]]> The Silver Bullet Report provides an unbiased focus on Talend's technology and strategy. Focusing on twelve different angles, CohesionSG analyzes the strengths and weaknesses of the company, its products, and its go-to-market strategy. The report assesses not only the capabilities of the technology but also the market in which it is evolving, the future of the company and its ability to accelerate its growth and better serve its customers and its community.

CohesionSG is the producer of The Silver Bullet Report and is an independent strategic consulting and research firm that delivers a combination of expert market research, competitive intelligence, and technical product analysis and strategy recommendations for its clients.

]]>
Tue, 02 Jun 2009 14:07:35
<![CDATA[Operational Data Integration: A New Frontier for Data Management - TDWI Best Practices Report]]> The amount and diversity of work done by data integration specialists has exploded since the turn of the twenty-first century. A lot of the growth comes from the emerging practice of operational data integration, defined as "the exchange of data among operational applications, whether in one enterprise or across multiple ones." Operational data integration involves a long list of project types, but it usually manifests itself as projects for the migration, consolidation, collocation, and upgrade of operational databases.

The purpose of this report is to identify the best practices and common pitfalls involved in starting and sustaining a program for operational data integration. The report defines operational data integration in terms of its relationship to other data integration practices, as well as by its most common project types. Along the way, it also looks at staffing and other organizational issues, followed by a list of technical requirements and vendor products that apply to operational data integration projects. The report also provides practical recommendations for the success of operational data integration projects.

TDWI Best Practices Reports are designed to educate technical and business professionals about new technologies, concepts, or approaches that address a significant problem or issue. Research for the reports is conducted via interviews and surveys of leading-edge user companies.

]]>
Tue, 28 Apr 2009 10:51:48
<![CDATA[Bloor Research - Data Discovery Spotlight]]> Founded in 1989, Bloor Research is one of the world's leading IT research, analysis and consultancy organizations.

In this Spotlight Report, Research Director Philip Howard explains why data discovery is of fundamental importance to data integration, data quality, and many other projects ranging from business intelligence through master data management to data governance and data archival. Nevertheless, data discovery has not traditionally been treated as a market or requirement in its own right. As a result, it is time to consider the importance of data discovery, and its requirements.

"We believe that the ability to discover and understand the relationships that exist across your data, wherever it resides, is of fundamental importance to a number of IT disciplines." -- Philip Howard, Bloor Research

]]>
Thu, 19 Mar 2009 11:13:45
<![CDATA[The Role of Open Source Data Integration - An Analyst White Paper, by Mark Madsen]]> Most data integration processes have seen limited automation over the past decade despite many technology advances.

This White Paper by Industry Analyst Mark Madsen helps readers understand what data integration is, and especially the difference between Operational Data Integration and Analytic Data Integration. It highlights the benefits of open source for data integration, and provides a series of recommendations to make the integration job easier, repeatable and more productive.

Mark Madsen is president of Third Nature, a consulting and technology research firm focused on information management. Mark is an award-winning architect and former CTO whose work has been featured in numerous industry publications. He is an international speaker, a contributing editor at Intelligent Enterprise, and manages the open source channel at the Business Intelligence Network.

]]>
Fri, 06 Feb 2009 18:35:37
<![CDATA[Usage Landscape of Enterprise Open Source Data Integration]]> Enterprise data integration needs are growing exponentially over time, as is the interest in open source technologies and the adoption of open source solutions. And, as information systems are becoming more heterogeneous, the global economy is imposing cost controls on IT Managers, both in terms of staff and software. With this in mind Talend conducted a survey to define the usage landscape of open source data integration and to profile users of this technology.

Consolidating the open source experience of over 1000 respondents, this White Paper discusses the most commonly used data integration technologies, their benefits, and the associated decision criteria. It also highlights the types of projects open source data integration is used for, and how it complements other data integration technologies.

]]>
Thu, 22 Jan 2009 11:17:17
<![CDATA[Practical Open Source Data Integration: Case Studies & Implementation Examples (Vol. 2)]]> Over the past few years, open source has established itself as a key component of the overall data integration market. Many organizations have adopted this model for their data integration projects. These organizations span all industries, all continents, and all company sizes. More importantly, their projects range from ETL for data warehousing or business intelligence to operational data integration, data migration, data synchronization, etc.

This second volume of Practical Open Source Data Integration: Case Studies & Implementation Examples presents selected case studies, illustrating real-life implementations of open source data integration and its associated benefits.

]]>
Tue, 23 Dec 2008 01:27:53
<![CDATA[Integrating SAP data in the information system using open source data integration]]> All companies deploying SAP need to align all other applications and systems - whether internal or external - as a means of collaborating with partners and vendors, streamlining their operations, and maximizing their IT investments.

SAP is highly successful and popular, but it is also a large, unique, and complex system environment and presents many challenges to companies who want to integrate it with third-party systems.

This Technical White Paper explains how to fill a broad range of integration needs without expensive IT development costs.

]]>
Fri, 21 Nov 2008 12:21:55
<![CDATA[IDC White Paper - Talend Uses Open Source to Deliver Low-Cost, Easy-to-Use Enterprise Data Integration]]> IDC is the premier global provider of market intelligence, advisory services, and events for the information technology, telecommunications, and consumer technology markets.

In this IDC White Paper sponsored by Talend, analyst Carl Olofson discusses the data integration market, examines the approach taken by Talend, and shows how the combination of Talend's technical approach and its open source licensing overcomes key barriers to adoption of data integration.

It has been generally accepted that the open source model works well for delivering software support and other services on top of well-understood software technology in commodity markets such as operating systems and general-purpose relational DBMS.

Talend is demonstrating, however, that this model may also be applied as a go-to-market strategy to build a business based on new technology, such as that of the still rapidly evolving data integration market. Talend's product line, which is positioned to offer scalable, incremental enterprise data integration with components that are easy for non-technical staff to use, breaks the mold for open source, and its success could shake up the software business.

]]>
Thu, 11 Sep 2008 10:15:44
<![CDATA[Bloor Research - Data Integration Platforms Market Update]]> Founded in 1989, Bloor Research is one of the world's leading IT research, analysis and consultancy organizations.

In this Market Update, Research Director Philip Howard explains how the enterprise market for data integration is now less focused on traditional ETL and more centered on data integration as a broader concept. He looks at the requirements for enterprise-ready data integration platforms and examines the vendor landscape, including traditional players and disruptive technologies such as open source.

"In any market, open source represents a potentially disruptive approach. However, relatively few vendors are effectively organised to take advantage of such disruption. Talend is an exception." -- Philip Howard, Bloor Research

]]>
Wed, 02 Jul 2008 14:20:21
<![CDATA[The Return on Investment of Open Source Data Integration]]> With cost often viewed as one of the major reasons to adopt open source technologies, the return on investment (ROI) of open source must be determined precisely when looking at deploying data integration solutions. This ROI Study looks in detail at all project costs: license (development, runtime, maintenance...), training and ramp-up, development and maintenance time, hardware and operating system, IT operations, etc. It also considers many intangible elements such as reliability, predictability and time to market.

More than a theoretical report, the ROI Study provides not only hard numbers but also the tools IT organizations need to assess the return on investment of open source, and to compare this ROI with that of alternative options: manual coding and proprietary data integration solutions.

]]>
Wed, 02 Jul 2008 14:02:47
<![CDATA[Practical Open Source Data Integration: Case Studies & Implementation Examples]]> Over the past few years, open source has established itself as a key component of the overall data integration market. Many organizations have adopted this model for their data integration projects. These organizations span all industries, all continents, and all company sizes. More importantly, their projects range from ETL for data warehousing or business intelligence to operational data integration, data migration, data synchronization, etc.

This document presents a few selected case studies, illustrating real-life implementations of open source data integration and its associated benefits.

]]>
Mon, 12 May 2008 00:00:00
<![CDATA[The Evolution of Integration - A White Paper by Bill Inmon]]> As information systems grow in complexity and volume, the need for scalability and versatility of data integration increases. In this White Paper, Bill Inmon explains how data integration grows from simple data movement to complex transformation functions, and how modern, pervasive and scalable data integration technology enables organizations of all sizes to deploy data integration technology, without the usual restrictions imposed by the deployment costs of traditional integration products.

Bill Inmon, world-renowned expert, speaker and author on data warehousing, is widely recognized as the "Father of Data Warehousing". He was also named "One of the Ten IT People Who Mattered in the Past Forty Years" in the July 2007 issue of ComputerWorld magazine.

]]>
Fri, 12 Oct 2007 00:00:00
<![CDATA[Data Migration - A White Paper by Bloor Research]]> Founded in 1989, Bloor Research is one of the world's leading IT research, analysis and consultancy organisations.

Data migration projects frequently overrun their budgets, get delayed or, in extreme circumstances, get cancelled. A major reason behind these failures is that the techniques and disciplines of data migration are not treated seriously enough or are not well enough understood. This white paper highlights the major issues and complexities involved in data migration; it includes practical recommendations by the leading independent analyst firm for the success of migration projects.

"In our view, data migration has historically been under-valued, under-resourced and not treated with the attention it deserves." -- Philip Howard, Bloor Research

]]>
Wed, 12 Sep 2007 00:00:00
<![CDATA[Bloor InDetail: Talend Open Studio - an independent analyst review]]> Founded in 1989, Bloor Research is one of the world's leading IT research, analysis and consultancy organizations.

In this InDetail product review, Research Director Philip Howard takes a close look at the features of Talend Open Studio, its advantages and drawbacks. This 11-page analyst study will help IT staff and decision makers get an independent angle on the leading open source data integration solution.

"In our view, Talend is the most enterprise oriented of the open source data integration vendors." -- Philip Howard, Bloor Research

]]>
Sun, 12 Aug 2007 00:00:00
<![CDATA[Integrating Data in the Information System, an Open Source Approach]]> Today, many organizations are deploying new systems - RDBMS, business applications such as CRM or ERP, data warehouses, etc. Any new technology deployment in an enterprise information system requires the newly-deployed system to interoperate, in one form or another, with a number of other applications or databases. This interoperability, which is essentially based on the exchange of data between systems ("data integration"), guarantees the consistency of data in the overall information system.

This White Paper describes a number of real-life application and database interoperability scenarios and explains how an Open Source approach helps solve the interoperability challenge.

]]>
Sat, 12 May 2007 00:00:00