Hadoop Framework

Working with the Hadoop framework requires new big data solutions.

Apache Hadoop is a critical technology for any enterprise wanting to take advantage of big data. The Hadoop framework delivers the processing power big data requires by distributing the processing of massive data sets across multiple computer clusters – scaling from one server to thousands as needed. It provides high availability by handling failure at the application layer, and continuing operations even when a single server or cluster fails. And the Hadoop framework provides the efficiency needed to process big data by not asking applications to move large volumes of data across a network.

But using the Hadoop framework to take advantage of the power of big data isn't an easy task for most organizations. Programming Hadoop, NoSQL and other big data technologies isn't part of the skillset for most developers today, and those few that excel in these new platforms command very high salaries. Additionally, legacy integration tools are almost obsolete when trying to connect to all the data sources that Hadoop can access.

For any organization that really wants to use the Hadoop framework to realize the benefits of big data, Talend provides a software solution that integrates data sources and manipulates massive data sets using the team and the infrastructure already in place.

Talend puts the Hadoop framework to work.

Talend solutions run 100% natively on the Hadoop framework, giving developers the tools to work with big data without need to learn new skills or write complicated code. With components for MapReduce 2.0 (YARN for big data) and other big data technologies like HBase, Oozie, Sqoop and Pig, Talend lets organizations begin immediately using the Hadoop framework to manipulate big data sets to make better business decisions.

Talend makes it easy to connect to big data sources, integrate disparate data and perform complex transformations of massive data sets. Developers can work with Talend's easy-to-use graphical environment to visually map data sources and targets without learning or writing complicated code. And with more than 800 pre-built Hadoop connector options, Talend lets one easily pull data from any data source in real-time or batch.

How Talend makes it easy to work with the Hadoop framework.

Talend leverages all the benefits of the Hadoop framework:

  • Massive scalability. Talend automatically generates the underlying code as new jobs get remotely deployed, natively on the Hadoop cluster.
  • Data from any source. Talend integrates almost any source, including NoSQL databases like Cassandra, CouchDB, HBase, Couchbase and MongoDB for Hadoop framework.
  • Quick manipulation. Using skills they already have, developers can quickly map, compare, filter, evaluate and group data, performing transformations and analyses with great speed.
  • Data quality. Talend takes advantage of the massively parallel environment of the Hadoop framework to provide clearer visibility into the completeness, integrity and accuracy of the data.

Learn more about Talend’s big data solutions from the many resources on this web site, or download Talend Open Studio for Big Data today and start benefiting from the leading open source big data tool.