Cloudera Distribution

The Cloudera distribution of Hadoop makes managing big data easier.

Cloudera is a Hadoop distribution designed to deliver the capabilities that enterprises need to succeed in the Hadoop architecture. Designed for mission-critical environments, the Cloudera distribution includes CDH, one of the most popular open-source Hadoop-based platforms, as well as advanced system management and data management tools. As a unified platform for big data, the Cloudera distribution gives enterprises a single place to store, process and analyze all their data.

But while the Cloudera distribution can certainly help enterprises use big data to drive business performance, few organizations have the optimal teams and infrastructure in place to implement and manage a Cloudera distribution. Most legacy integration tools can't adequately connect to the hundreds of new and emerging data sources that working with big data requires. Few organizations have the infrastructure to handle and manipulate massive data sets. And there simply aren't very many developers around who are skilled in big data, NoSQL and other technologies needed to effectively manage a Cloudera distribution.

That's why Talend provides a software solution that lets organizations work with big data in a Cloudera distribution, using the infrastructure and team they have in place today.

Talend simplifies management of a Cloudera distribution.

Talend's open source platform that makes it easy for developers to work with big data in a Cloudera distribution without needing to learn new skills or invest in new infrastructure. With a comprehensive set of easy-to-use tools, developers can easily integrate data sources, manipulate massive data sets, and manage a Cloudera distribution to deliver the big data analytics their organization needs.

Talend runs 100% natively on Hadoop and provides components that let developers work with all the components in the Hadoop family – HBase, Hive, Oozie, Sqoop, Pig, HCatalog and others. Talend also provides support for all leading Hadoop distributions, including Hortonworks Data Platform, Amazon EMR, IBM PureData, the MapR distribution and others.

Easy-to-use tools for a Cloudera distribution

With Talend, developers can work with their Cloudera distribution to:

  • Integrate data. More than 800 pre-built connectors simplify the task of connecting to any big data source. And developers can use a Hadoop connector to work with NoSQL databases, without having any previous NoSQL experience.
  • Transform data sets. Redefining the skillset it takes to manipulate big data, Talend lets developers perform complex transformations and analyses using the skills they already have.
  • Achieve massive scalability. By automatically generating the underlying code for data connectors as new clusters are added, Talend lets organizations take advantage of the scalability provided by distributions like Cloudera or Hortonworks for Hadoop.

Learn more about Talend’s big data solutions from the many resources on this web site, or download Talend Open Studio for Big Data today and start benefiting from the leading open source big data tool.

Download the Talend Big Data Jumpstat Sandbox
Watch the Talend v5.4 Big Data Features Webinar