In this blog post, I want to go over how to set up and deploy a Talend Spark Streaming job into a new Elastic Stack instance. Spark is the engine of choice for near real-time processing, not only for Talend but also for many organizations that need large-scale, lightning-fast data processing. The Elastic Stack is a highly versatile and widely adopted suite of tools built for monitoring that works perfectly for this scenario....READ ARTICLE
Talend was recently recognized as a certified partner on the MapR Converged Data Platform. This is exciting news not only for Talend and MapR, but also for current and future customers who are looking at Talend and MapR as the solution to their big data challenges. Today we are going to look at how you can implement a real-time recommendation model usi...READ ARTICLE
Earlier this year, I finished an exciting Proof of Concept (POC) with one of the top Energy and Utility organizations using the Talend Big Data Platform. I thought I would write a quick blog post on getting started with self-service data in the enterprise, as it’s a common theme I have seen at many companies focusing on digital transformation. This company was moving an existing on-premises data warehouse...READ ARTICLE
At Talend, we like to be first. Back in 2014, we made a bet on Apache Spark for our Talend Data Fabric platform, which paid off beyond our expectations. Since then, most of our competitors have tried to catch up… Last year we announced that we were joining forces with Google, PayPal, DataTorrent, data Artisans and Cloudera to work on Apache Beam which si...READ ARTICLE
Well, let’s be specific here. Birds migrate either north or south. Data warehouses are only going in one direction: up, to the cloud. It’s a common trend we’re seeing across every vertical and across every region. Companies are moving their existing data warehouses to cloud environments like Amazon Redshift. And more often than not, unlike their feathered counterpart...READ ARTICLE
With the Euro 2016 tournament now drawing to a close, two kids of my own in a competitive soccer league, and even a French husband, I now live and breathe football (or soccer, depending on your frame of reference) almost every single day. Soccer, much like today’s businesses, has begun to embrace the world of big data. The key method of judging how well a player or team will perform throughout the course of a game or even throughout the season is beginning to sh...READ ARTICLE
As part of a POC of Talend v6.1 Big Data capabilities, I was asked by one of our long-time customers, a major e-commerce company, to present a solution for aggregating huge files of clickstream data on Hadoop. The input data was a giant clickstream file (larger than 100GB, or even terabytes) from a website. Our goal was to aggregate the file and add a session ID column. We needed to create session records for all u...READ ARTICLE