Talend Increases Big Data Integration Performance and Scalability by 45 Percent

Only Talend 5.5 allows developers to generate high-performance Hadoop code without needing to be experts in MapReduce or Pig

San Jose, Calif. (Hadoop Summit)

Talend, the global big data integration software leader, today announced the availability of Talend version 5.5, the latest release of the only integration platform optimized to deliver the highest performance on all leading Hadoop distributions.

Talend 5.5 enhances Talend’s performance and scalability on Hadoop by an average of 45 percent. Adoption of Hadoop is skyrocketing, and companies large and small are struggling to find enough knowledgeable Hadoop developers to meet this growing demand. Only Talend 5.5 allows any data integration developer to use a visual development environment to generate native, high-performance, highly scalable Hadoop code. This unlocks a large pool of development resources that can now contribute to big data projects. In addition, Talend is staying on the cutting edge of new developments in Hadoop that allow big data analytics projects to power real-time customer interactions.

Proven Performance and Scalability

It’s Talend’s mission to provide easy-to-use big data integration tools with the industry’s highest performing, most scalable integration code running natively on Hadoop. As a part of this mission, Talend puts every product release through a rigorous set of performance and scalability tests, including TPC-H, a performance benchmark developed by the Transaction Processing Performance Council. Across the 22 standard TPC-H tests, Talend's generated MapReduce code ran up to 67 percent faster, with an average improvement of 45 percent.

TPC-H testing was just the beginning. Talend also worked with one of the industry’s largest financial services companies to do real-world testing. “At Talend, we are making the fastest and the most predictable integration solutions on the market,” said Fabrice Bonan, chief product officer and co-founder of Talend. “Our ability to show true scalability and performance on a 1000-node Hadoop cluster, with such a large customer, is just one proof point.”

Talend has made numerous other performance improvements throughout the product suite. One example is the set of enhancements to the Talend Data Mapper, an advanced data mapping tool designed explicitly to handle complex data structures such as XML, EDI and Java objects. This kind of data mapping is particularly important for managing electronic data interchange (EDI) in the healthcare industry or for conveying financial information between banks using the Financial products Markup Language (FpML). In Talend 5.5, the Data Mapper now supports very large files, with the ability to stream multi-gigabyte documents into Hadoop clusters.

Igniting Real-Time Big Data

Talend 5.5 is also setting the bar for real-time or operational big data. Analytics is often just the first step in a company’s big data journey. The next step is delivering those analytics and recommendations to the right people at the right time. This is where operational big data comes in with its ability to handle thousands of simultaneous transactions in real time. 

In conjunction with Talend alliance partner Altic, Talend Labs is supporting the future of Hadoop by working with Apache Spark for fast, large-scale data processing. Spark is ideally suited to high-volume, high-speed data scenarios, such as fraud detection and sensor data processing. “Our support of Apache Spark is a great example of how Talend is helping the open source community and customers take advantage of the latest innovations in big data,” said Bonan. “Instead of becoming experts in every new Hadoop project, customers can use our visual designer and Talend generates optimized code for them. This allows IT organizations to stay focused on delivering business value while we keep them on the cutting edge.”

“Talend is answering the call from customers to support operational big data, providing users with the ability to process any data volume in real-time, and helping to drive and improve business performance,” said Charly Clairmont, chief technical officer of Altic. “We are proud to have been early supporters of Apache Spark for over two years. Now alongside the experts from Talend Labs, we are able to jointly offer Spark support, enabling the future of big data. Innovation in the big data space is clearly driven by open source vendors and Talend has always shown a strong commitment to driving value for its users.” 

Availability

Version 5.5 of all Talend open source products is available for immediate download from Talend’s website, www.talend.com. Experimental support for Spark code generation is also available immediately and can be downloaded from the Talend Exchange on Talendforge.org. Version 5.5 of the commercial subscription products will be available within three weeks and will be provided to all existing Talend customers as part of their subscription agreement. Products can also be procured through the usual Talend representatives and partners.

To learn more about how Talend 5.5 improves big data integration performance and scalability by an average of 45 percent, register here for our June 10 webinar.