YARN MapReduce

YARN (MapReduce 2.0) delivers greater scalability for big data.

YARN in Hadoop provides a processing platform for big data that is not constrained to MapReduce. Often referred to as MapReduce 2.0, YARN decouples resource management and scheduling from the data processing component in Hadoop, so that Hadoop environments no longer depend on the MapReduce engine alone. By removing this job-execution bottleneck, YARN/MapReduce enables organizations to achieve greater scalability, agility and processing power as they work with big data.
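The decoupling described above can be illustrated with a toy model. The sketch below is not the YARN API: `ResourceManager`, `run_app`, `word_count` and `longest_word` are hypothetical names invented for this illustration. It only shows the idea that a scheduler hands out containers without knowing what kind of processing runs in them, so a MapReduce-style job and a different kind of job can share the same cluster.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class ResourceManager:
    """Toy cluster-wide scheduler (hypothetical, not the real YARN API)."""
    total_containers: int
    allocated: Dict[str, int] = field(default_factory=dict)

    def free(self) -> int:
        # Containers not currently granted to any application.
        return self.total_containers - sum(self.allocated.values())

    def request(self, app_name: str, wanted: int) -> int:
        # Grant up to `wanted` containers; the workload itself is never inspected.
        granted = min(wanted, self.free())
        self.allocated[app_name] = self.allocated.get(app_name, 0) + granted
        return granted

    def release(self, app_name: str) -> None:
        # Return all of an application's containers to the pool.
        self.allocated.pop(app_name, None)


def run_app(rm: ResourceManager, name: str, wanted: int,
            work: Callable[[List[List[str]]], object], items: List[str]):
    """Ask the scheduler for containers, then run any processing model on them."""
    granted = rm.request(name, wanted)
    if granted == 0:
        raise RuntimeError("no free containers for " + name)
    # Spread the input across the granted containers.
    chunks = [items[i::granted] for i in range(granted)]
    try:
        return work(chunks)
    finally:
        rm.release(name)


def word_count(chunks: List[List[str]]) -> Dict[str, int]:
    """MapReduce-style job: count words per chunk and merge the counts."""
    totals: Dict[str, int] = {}
    for chunk in chunks:
        for word in chunk:
            totals[word] = totals.get(word, 0) + 1
    return totals


def longest_word(chunks: List[List[str]]) -> str:
    """A non-MapReduce job that runs on the same scheduler unchanged."""
    return max((w for chunk in chunks for w in chunk), key=len)


if __name__ == "__main__":
    rm = ResourceManager(total_containers=4)
    words = ["yarn", "hadoop", "yarn", "mapreduce"]
    print(run_app(rm, "mr-job", 3, word_count, words))
    print(run_app(rm, "other-job", 10, longest_word, words))
```

In this sketch the scheduler code does not change when a new kind of job is added, which is the point of separating resource management from data processing.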

To take advantage of the power and new capabilities of YARN/MapReduce, organizations need developers with strong skills in these technologies. But because YARN, MapReduce, Hadoop and other big data platforms are still maturing, relatively few developers are skilled in these frameworks. Most developers who know YARN/MapReduce well are highly paid and work for leading companies in the big data space. For organizations that want to work with big data but lack developers with experience in YARN/MapReduce, Talend's software for big data can help.

Talend redefines the skills it takes to use YARN/MapReduce.

Talend's software provides an open source platform that lets developers use the skills they have today to work with YARN, Hadoop MapReduce, NoSQL databases and other big data technologies. With Talend, developers can quickly integrate data from hundreds of sources into massive data sets, and easily transform them to deliver the business intelligence that organizations need to make better and faster decisions.

Talend runs 100% natively in Hadoop and lets developers work with components for HBase, Hive, Oozie, Sqoop, HCatalog, Pig, and other big data technologies. Talend is also certified to work with leading Hadoop distributions such as Amazon EMR, Cloudera, MapR, IBM PureData, and others.

Comprehensive tools for YARN/MapReduce.

Talend provides the tools developers need to work with YARN, big data and Hadoop without having to learn new skills for these technologies. With Talend, developers can:

  • Connect to any data source. Using more than 800 pre-built connectors, developers can load data from a wide range of sources, including NoSQL databases, without needing specific knowledge of YARN/MapReduce.
  • Integrate data quickly. Developers can visually map big data sources and targets in an easy-to-use graphical environment without needing to write complicated code.
  • Transform data easily. Talend provides tools that let developers extract, load and manipulate data quickly, performing complex transformations and analyses.
  • Scale as needed. Talend lets organizations take advantage of the massive scalability afforded by YARN/MapReduce. As new clusters are added, Talend automatically generates the underlying code for data connections on each new cluster.

Learn more about Talend's big data solutions from the many resources on this website, or download Talend Open Studio for Big Data today and start benefiting from the leading open source big data tool.