MapR Distribution

The MapR distribution of Hadoop can help organizations exploit big data.

MapR is an enterprise-grade Hadoop distribution designed to deliver dependability, ease of use, and speed for Hadoop, NoSQL databases, and streaming applications. Built from the ground up for business-critical production applications, MapR packages more than a dozen projects from the Hadoop ecosystem to deliver a broad set of capabilities for working with enterprise data. Enterprises rely on the MapR distribution to achieve high availability, security, and disaster recovery, and to access Hadoop like traditional network-attached storage (NAS) with read-write capabilities.
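To make the network-attached-storage point concrete, here is a minimal sketch of what read-write file access looks like when cluster storage is exposed as an ordinary mounted directory. It uses only the Python standard library. The helper name `write_then_read` and the file path `demo/events.log` are hypothetical, and a temporary directory stands in for the real mount point, which depends on how a given cluster is configured.

```python
import tempfile
from pathlib import Path


def write_then_read(mount_root: Path, relative_path: str, text: str) -> str:
    """Write text to a file under mount_root, append a line, and read it back."""
    target = mount_root / relative_path
    # Create intermediate directories the same way as on a local disk.
    target.parent.mkdir(parents=True, exist_ok=True)
    # Initial write with ordinary file APIs, no Hadoop client library involved.
    target.write_text(text, encoding="utf-8")
    # Append in place, which is what "read-write" access implies here.
    with target.open("a", encoding="utf-8") as handle:
        handle.write("appended line\n")
    return target.read_text(encoding="utf-8")


# Assumption: a temporary directory stands in for the cluster's mount point.
# On a real system, replace it with the directory where the cluster is mounted.
mount_root = Path(tempfile.mkdtemp())
result = write_then_read(mount_root, "demo/events.log", "first line\n")
print(result)
```

The point of the sketch is that nothing in it is Hadoop-specific: existing scripts and tools that read and write files can work against a mounted cluster path without modification.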

While the MapR distribution offers significant advantages for any organization working with big data, implementing and managing it can present some challenges. Most organizations lack the infrastructure to support big data on the MapR distribution: legacy integration tools can't keep up with a growing number of data sources, existing architecture can't scale to handle the growing volume of big data sets, and few developers are skilled in Hadoop and the other technologies that are essential for managing big data, NoSQL databases, and the MapR distribution.

Talend: easy-to-use tools for a MapR distribution.

Talend provides an open source platform for big data that lets organizations implement a MapR distribution with the infrastructure and team they already have in place.

Talend provides easy-to-use tools that let developers apply the skills they have today to integrate data sources and transform massive data sets within a MapR distribution. Talend runs 100% natively on Hadoop and works with the components of the Hadoop ecosystem, including YARN, Hive, HBase, Pig, Sqoop, Oozie, HCatalog and more.

In addition to the MapR distribution, Talend is tested and certified to work with other Hadoop distributions including the Hortonworks Data Platform, Amazon EMR, IBM PureData, the Cloudera distribution, and others.

The benefits of using Talend for a MapR distribution.

With Talend, organizations can easily work with Hadoop and a MapR distribution (or other distributions such as the Hortonworks Data Platform) to:

  • Integrate big data sources. Talend provides more than 800 connectors out of the box that allow developers to quickly connect to big data sources. An easy-to-use graphical environment simplifies the task of visually mapping sources and targets, and pre-built NoSQL connectors enable developers to work with NoSQL databases like Cassandra, MongoDB or Neo4j without having any specific NoSQL experience.
  • Manipulate big data sets. With Talend, developers can use their existing skills to run complex transformations and analyses on massive data sets quickly.
  • Scale to meet big data requirements. Talend makes it easy to achieve the massive scalability available through a MapR distribution. Once a big data connection is configured, Talend automatically generates the underlying code, which can be deployed remotely as a job that runs natively, even as new clusters are added.

Learn more about Talend’s big data solutions from the many resources on this website, or download Talend Open Studio for Big Data today and start benefiting from the leading open source big data tool.

Download the "Taking the Headache out of Big Data with MapR" webinar.