Compare All Cloud Integration Products

 Cloud Integration - SaaSCloud Integration - HybridCloud Integration - Elastic
Start my free trial Start my free trial Start my free trial

License

User-based subscriptionUser-based subscriptionUser-based subscription
Subscription license with warranty and indemnification

Design & Productivity Tools

Generates native Spark Streaming code
Visual mapping for complex XML & EDI on Spark
Spark & MapReduce job designer
Generates native MapReduce & Spark batch code
Hadoop job scheduler with YARN
Hadoop security for Kerberos
Ingestion, loading and unloading data into a data lake
Eclipse-based developer tooling & job designer
Continuous deliveryÊintegration & team collaboration with shared repository
Audit, job compare, impact analysis, testing, debugging & tuning
Metadata bridge for metadata import/export & centralized metadata management
Dynamic schema, re-usable joblets & reference projects
ETL & ELT support
Wizards & interactive data viewer
Versioning
Change data capture (CDC)
Automatic documentation
Publish to the Cloud

Components

Hadoop components: HDFS, Hbase, Hive, Pig, Sqoop
File management: open, move, compress, decompress without scripting
Control and orchestrate data flows and data integrations with master jobs
Map, aggregate, sort, enrich & merge data

Connectors

Cloud: AWS, Microsoft Azure, Google Cloud Platform, and more
Supported big data distributions: Amazon EMR, Azure HDInsight, Cloudera, Google Dataproc, Hortonworks, MapR
Spark MLlib (classification, clustering, recommendation, regression)
NoSQL: Cassandra, Couchbase, DynamoDB, MongoDB, Neo4j, and more
RDBMS: Oracle, Teradata, Microsoft SQL server, and more
SaaS: Marketo, Salesforce, NetSuite, and more
Packaged Apps: SAP, Microsoft Dynamics, Sugar CRM, and more
Technologies: Dropbox, Box, SMTP, FTP/SFTP, LDAP, and more

Data Quality & Governance

Data profiling & analytics with graphical charts & drilldown data
Automate data quality error resolution and enforce rules
Data quality portal with monitoring, reporting & dashboards
Semantic discovery with automatic detection of patterns
Comprehensive survivorship
Enrichment, harmonization, fuzzy matching & de-duplication

Big Data Quality

Data cleansing, profiling, masking, parsing & matching on Spark & Hadoop
Machine learning for data matching and deduplication
Support for Cloudera Navigator & Apache Atlas
HDFS file profiling

Advanced Data Profiling

Fraud pattern detection using Benford Law
Advanced statistics with indicator thresholds
Column set analysis
Advanced matching analysis
Time column correlation analysis

Cloud Management

Monitor & manage
User administration & access controls
Connection management
Cloud engines
Remote engines (*Hybrid and Elastic only)
Exclusive cloud containers (*Elastic only)

Contact Sales