`

Compare All Cloud Integration Products

 Cloud Integration - SaaSCloud Integration - HybridCloud Integration - Elastic
Free TrialFree TrialFree Trial

License

User-based subscriptionUser-based subscriptionUser-based subscription
Subscription license with warranty and indemnification

Design & Productivity Tools

Generates native Spark Streaming code
Visual mapping for complex XML & EDI on Spark
Spark & MapReduce job designer
Generates native MapReduce & Spark batch code
Hadoop job scheduler with YARN
Hadoop security for Kerberos
Ingestion, loading and unloading data into a data lake
Eclipse-based developer tooling & job designer
Enterprise SDLC for cloud development, test & production
Continuous delivery integration & team collaboration with shared repository
Audit, job compare, impact analysis, testing, debugging & tuning
Metadata bridge for metadata import/export & centralized metadata management
Dynamic schema, re-usable joblets & reference projects
ETL & ELT support
Versioning
Change data capture (CDC)
Automatic documentation
Publish to the Cloud

Components

Hadoop components: HDFS, Hbase, Hive, Pig, Sqoop
File management: open, move, compress, decompress without scripting
Control and orchestrate data flows and data integrations with master jobs
Map, aggregate, sort, enrich & merge data
Internet of Things connectivity: AMQP, MQTT

Connectors

Cloud: AWS, Microsoft Azure, Google Cloud Platform, and more
Supported big data distributions: Amazon EMR, Azure HDInsight, Cloudera, Google Dataproc, Hortonworks, MapR
Spark MLlib (classification, clustering, recommendation, regression)
NoSQL: Cassandra, Couchbase, DynamoDB, MongoDB, Neo4j, and more
RDBMS: Oracle, Teradata, Microsoft SQL server, and more
SaaS: Marketo, Salesforce, NetSuite, and more
Packaged Apps: SAP, Microsoft Dynamics, Sugar CRM, and more
Technologies: Dropbox, Box, SMTP, FTP/SFTP, LDAP, and more
Cleansing, masking & error resolution
Optional 3rd-party address validation services
High-Speed messaging components (Kafka, Kinesis, Flume)

Data Quality & Governance

Data cleansing & masking
Automate data quality error resolution and enforce rules
Semantic discovery with automatic detection of patterns
Comprehensive survivorship
Enrichment, harmonization, fuzzy matching & de-duplication

Big Data Quality

Data cleansing, profiling, masking, parsing & matching on Spark & Hadoop
Machine learning for data matching and deduplication
Support for Cloudera Navigator & Apache Atlas
HDFS file profiling

Advanced Data Profiling

Fraud pattern detection using Benford Law
Advanced statistics with indicator thresholds
Column set analysis
Advanced matching analysis
Time column correlation analysis

Cloud Management

Monitor & manage
User administration & access controls
Connection management
Cloud engines
Remote engines
Exclusive cloud containers

Contact Sales