Yarn Big Data

YARN makes big data faster.

YARN for big data is a cluster management technology designed to improve the speed and efficiency of big data processing. By separating resource management and scheduling functionality from processing components, YARN MapReduce 2.0 eliminates a bottleneck in the original version of Hadoop MapReduce and enables organizations to utilize a broader range of applications and processing approaches. For organizations working in big data, YARN has the potential to deliver greater scalability and agility.

But while YARN's big data functionality is highly promising, many organizations won't immediately be able to take advantage of it for one specific reason: they lack developers with any experience in YARN, big data, MapReduce or Hadoop. These technologies are all quite new and the number of developers with deep experience in them is still small. Most developers with any experience in YARN for big data are already working for the major players in the space. That means organizations wanting to work with big data must either spend a lot of time and money to train developers, or find solutions that let their existing development staff use the skills they already have to work with YARN and big data. And that's exactly what Talend provides.

Talend lets any developer work with YARN for big data.

Talend software for big data provides easy-to-use tools that let developers without any specific experience in big data nevertheless work with leading big data platforms like YARN in Hadoop, the MapReduce program, HBase, HCatalog, Squoop, Oozie and Pig for Hadoop. With Talend, developers can use the skills they have today to load data from a wide variety of sources into a big data platform, and manipulate it with ease to help the organization improve business performance.

Running 100% natively on Hadoop, Talend has been tested and certified to work with leading Hadoop distributions like Cloudera, MapR, Hortonworks, Amazon EMR, IBM PureData, SAP HANA, and others.

Easy-to-use tools for YARN and big data.

Talend gives developers a broad set of tools to manage massive data sets and take advantage of YARN's big data processing power. These include:

  • Data integration tools, including more than 800 pre-built connectors, that let developers quickly pull data from any source in order to integrated and transform it in real-time or batch.
  • An easy-to-use graphical environment where developers can visually map big data sources and targets without requiring them to learn new skills or write complicated code.
  • Data manipulation tools that let developers perform complex transformations and analyses on massive sets of data in very little time, using the skills they have today.
  • Project governance and administration tools that include a simple, intuitive environment for scheduling, monitoring and deploying any big data job in YARN.

Learn more about Talend’s big data solutions from the many resources on this web site, or download Talend Open Studio for Big Data today and start benefiting from the leading open source big data tool.