Talend Blog

Using Machine Learning for Data Quality

  In my last blog, I highlighted some of the Data Governance challenges in Big Data and how Data Quality (DQ) is a big part of Data Governance. In this blog, I want to focus on how Big Data is changing the DQ methodology. Big Data has made Machine Learning (ML) mainstream and just as […]

How DevOps Can Bring Innovation to IT through Cloud Integration

  One of my favorite new TV shows is APB, a story about a billionaire tech entrepreneur who goes into one of Chicago’s most crime-infested districts and changes things around. He roots out inefficient processes and legacy equipment by bringing in state-of-the-art technology such as drones, enhanced body armor, supercharged police cars, and the ability […]

Data Matching 101: What Tools Does Talend Have?

This blog is the second part of a three-part series looking at Data Matching. In the first part, we looked at the theory behind data matching. In this second part, we will look at the tools Talend provides in its suite to enable you to do Data Matching, and how the theory is put into […]

Unlocking Data Preparation for Business Intelligence (BI)

  We live in a world surrounded by data. From our daily grocery shopping to our mobile phone usage, fitness regime trackers, bank accounts and social media, practically everything we do is either driven by data or contributes to data volumes. In this blog I would like to reiterate the importance of data and data […]

How to Use Click Stream Analysis to Optimize your Company’s Social Outreach

  In this blog, I’ll be discussing how I expanded the recommendation demo provided in Talend’s Big Data Sandbox to influence my promotional Twitter campaign. Enterprises are now taking data-oriented approaches when defining their social strategy as they find new and interesting influencers around their business. It is critical to implement plans that utilize this […]

A First for Apache Beam

  At Talend, we like to be first. Back in 2014, we made a bet on Apache Spark for our Talend Data Fabric platform, which paid off beyond our expectations. Since then, most of our competitors have tried to catch up… Last year we announced that we were joining efforts with Google, PayPal, DataTorrent, data Artisans and Cloudera […]

Using Talend to Gather Data About Data

  This article was developed using the free, open source version of Talend Open Studio for Data Integration, which is available here. What’s more exciting than data? Data about data! Recently I had to assess the impact of data model changes within a transactional system feeding our data warehouse. I directed our SQL scripts and […]

What’s Blockchain and Can It Help You Trust Your Data?

  Blockchain first appeared in 2008 with the Bitcoin currency; this year, the technology reached the summit of Gartner’s “Hype Cycle.” While many economists and policy actors have expressed interest in using the technology (e.g. the governments of Honduras, Ghana and Georgia wish to secure their land titles in a Blockchain and, in the private […]