Compare All Data Integration Products

Free Trial

 Open Studio for Data QualityOpen Studio for Data IntegrationData Integration Entry-LevelData Management Platform
Product Details Product Details Product Details Product Details 

License

Free Open SourceFree Open SourceUser-based subscriptionUser-based subscription
Free open source Apache license
Subscription license with warranty and indemnification
Available as downloadable software
Available as cloud service and downloadable software
2 free Data Preparation and 2 free Data Stewardship licenses with any Talend subscription

Design and Productivity Tools

Graphical design environment
Team collaboration with shared repository
Continuous integration / Continous delivery
Visual mapping for complex JSON, XML and EDI
Audit, job compare, impact analysis, testing, debugging, and tuning
Metadata bridge for metadata import/export, and centralized metadata management
Distant run and parallelization
Dynamic schema, re-usable joblets, and reference projects
Repository manager
ETL and ELT support
Wizards and interactive data viewer
Versioning
Export and execute standalone jobs in runtime environments
Change data capture (CDC)
Automatic documentation
Customizable assessment
Pattern library

Components

File management: open, move, compress, decompress without scripting
Control and orchestrate data flows and data integrations with master jobs
Map, aggregate, sort, enrich, and merge data

Connectors

Cloud: AWS, Microsoft Azure, Google Cloud Platform, and more
RDBMS: Oracle, Teradata, Microsoft SQL server, and more
SaaS: Marketo, Salesforce, NetSuite, and more
Packaged Apps: SAP, Microsoft Dynamics, Sugar CRM, and more
Technologies: Dropbox, Box, SMTP, FTP/SFTP, LDAP, and more
Cleansing, masking and error resolution
Optional 3rd-party address validation services

Management and Monitoring

High availability, load balancing, failover for Jobs
Deployment manager and team collaboration
Manage users, groups, roles, projects, and licenses
Manage execution engines
Execution plan, time, and event-based scheduler for jobs
Check points, error recovery
Context management (dev, QA, prod)
Activity monitoring
Log collection and display
Engine clusters for Jobs
Static IP addresses
Job execution log history (2 months for Entry products, 3 months for Platforms)
Log export to S3 for Jobs
Environments (2 for Entry products, unlimited for Platforms)

Data Quality and Governance

Data profiling and analytics with graphical charts and drilldown data
Automate data quality error resolution and enforce rules
Data cleansing and masking
Data quality portal with monitoring, reporting, and dashboards
Semantic discovery with automatic detection of patterns
Comprehensive survivorship
Data sampling
Enrichment, harmonization, fuzzy matching, and de-duplication

Advanced Data Profiling

Fraud pattern detection using Benford Law
Advanced statistics with indicator thresholds
Column set analysis
Advanced matching analysis
Time column correlation analysis

Data Preparation and Stewardship

Import, export, and combine data from any database, Excel or CSV file
Import, export, and combine CSV, Parquet and AVRO files**
Export to Tableau
Self-service on-demand access to sanctioned datasets
Share data preparations and datasets
Operationalize preparations into any data or big data integration flow
Run preparations on Apache Beam**
Auto-discovery, profiling, smart suggestions, and data visualization
Customization of semantic type for auto-profiling and standardization
Smart and selective sampling and full-runs
Data tracking and masking with role-based security
Cleansing and enrichment functions
Data Stewardship App for data curation and certification
Define data models, data semantics and profile data accordingly. Define and apply rules (survivorship, mass updates)
Merge and match data, resolve data errors, and arbitrate on data (classification and certification)
Orchestrate and collaborate on activities in campaigns
Define user roles, workflows and priorities, assign and delegate tasks, tag and comment
Embed governance and stewardship in data integration flows and manage rejects
Embed human certification and error resolution into MDM processes
Take matching decisions that cannot be processed automatically.
De-duplicate data at scale with machine learning
Audit and track data error resolution actions. Monitor progress of campaigns. Undo/redo based on business need

Support

Self-serviceSelf-serviceWeb, emailWeb, email, phone

Contact Sales

For information about our collection and use of your personal information, our privacy and security practices and your data protection rights, please see our privacy policy.