Big Data Platform

Convierta los big data en información fiable.

Empiece a operar rápidamente con la principal herramienta de big data de código abierto

Talend Big Data Platform simplifica integraciones complejas para aprovechar Apache Spark, Databricks, Qubole, AWS, Microsoft Azure, Snowflake, Google Cloud Platform y NoSQL, y ofrece calidad de datos integrada para que su empresa pueda transformar big data en información fiable. Aproveche toda la potencia y escala de su framework de big data gracias a la plataforma líder de integración de datos basada en Spark para arquitecturas de cloud, híbridas y multicloud.

Big Data Platform - Características

License and Support

  • Subscription license with warranty and indemnification
  • 2 free Data Preparation and 2 free Data Stewardship licenses with any Talend subscription
  • Available as cloud service and downloadable software
+ MOSTRAR MÁS CARACTERÍSTICAS

Design and Productivity Tools

  • Generates native MapReduce and Spark batch code
  • Visual mapping for complex JSON, XML, and EDI on Spark
  • Spark and MapReduce job designer
  • Serverless Spark processing through Databricks and Qubole
  • Dynamic distribution support
  • Hadoop job scheduler with YARN
  • Hadoop security for Kerberos
  • Ingestion, loading, and unloading data into a data lake
  • Graphical design environment
  • Team collaboration with shared repository
  • Continuous integration / Continuous delivery
  • Visual mapping for complex JSON, XML, and EDI
  • Audit, job compare, impact analysis, testing, debugging, and tuning
  • Metadata bridge for metadata import/export and centralized metadata management
  • Distant run and parallelization
  • Dynamic schema, re-usable joblets, and reference projects
  • Repository manager
  • ETL and ELT support
  • Wizards and interactive data viewer
  • Versioning
  • Change data capture (CDC)
  • Automatic documentation
  • Customizable assessment
  • Pattern library
  • Cloud Pipeline Designer
+ MOSTRAR MÁS CARACTERÍSTICAS

Data Quality and Governance

  • Data profiling and analytics with graphical charts and drill-down data
  • Automated data standardization, cleansing, and rules enforcement
  • Data privacy with masking and encryption
  • Data quality portal with monitoring, reporting, and dashboards
  • Semantic discovery with automatic detection of patterns
  • Comprehensive survivorship
  • Data sampling
  • Enrichment, harmonization, fuzzy matching, and de-duplication
+ MOSTRAR MÁS CARACTERÍSTICAS

Connectors

  • Cloud: Amazon Web Services (AWS), Microsoft Azure, Google Cloud Platform, and more
  • Supported big data distributions: Amazon EMR, Azure HDInsight, Cloudera, Google Dataproc, Hortonworks, MapR
  • Serverless: Cloudera Altus, Databricks, Qubole
  • Spark MLlib (classification, clustering, recommendation, regression)
  • NoSQL: Cassandra, Couchbase, DynamoDB, MongoDB, Neo4j, and more
  • RDBMS: Oracle, Teradata, Microsoft SQL server, and more
  • SaaS: Marketo, Salesforce, NetSuite, and more
  • Packaged Apps: SAP, Microsoft Dynamics, Sugar CRM, and more
  • Technologies: Dropbox, Box, SMTP, FTP/SFTP, LDAP, and more
  • Optional 3rd-party address validation services
+ MOSTRAR MÁS CARACTERÍSTICAS

Components

  • Hadoop components: HDFS, Hbase, Hive, Pig, Sqoop
  • File management: open, move, compress, decompress without scripting
  • Control and orchestrate data flows and data integrations with master jobs
  • Map, aggregate, sort, enrich, and merge data
+ MOSTRAR MÁS CARACTERÍSTICAS

Data Preparation and Stewardship

  • 2 free licenses with subscription
  • Import, export, and combine data from any database, Excel or CSV file
  • Import, export and combine CSV, Parquet and AVRO files**
  • Export to Tableau
  • Self-service on-demand access to sanctioned datasets
  • Share data preparations and datasets
  • Operationalize preparations into any data or big data integration flow
  • Operationalize preparations into any cloud integration flow
  • Run preparations on Apache Beam*
  • Auto-discovery, standardization, auto-profiling, smart suggestions, and data visualization
  • Customization of semantic type for auto-profiling and standardization
  • Smart and selective sampling and full-runs
  • Data tracking and masking with role-based security
  • Cleansing and enrichment functions
  • Data Stewardship App for data curation and certification
  • Define data models, data semantics and profile data accordingly. Define and apply rules
  • Merge and match data, resolve data errors, and arbitrate on data (classification and certification)
  • Orchestrate and collaborate on activities in campaigns
  • Define user roles, workflows and priorities, assign and delegate tasks, tag and comment
  • Embed governance and stewardship in data integration flows and manage rejects
  • Embed human certification and error resolution into MDM processes
  • Take matching decisions that cannot be processed automatically
  • De-duplicate data at scale with machine learning
  • Audit and track data error resolution actions. Monitor progress of campaigns. Undo/redo based on business needs
+ MOSTRAR MÁS CARACTERÍSTICAS

Management and Monitoring

  • High availability, load balancing, failover for jobs
  • Deployment manager and team collaboration
  • Manage users, groups, roles, projects, and licenses
  • Manage execution engines
  • Single Sign-On (SSO) integration with several SSO providers
  • Execution plan, time, and event-based scheduler for jobs
  • Check points, error recovery
  • Context management (dev, QA, prod)
  • Log collection and display
  • Optional Admin user add-on*
  • Engine clusters for jobs*
  • Static IP addresses*
  • Job execution log history (2 months for Entry products, 3 months for Platforms)*
  • Environments (2 for Entry products, unlimited for Platforms)*
  • Cloud Security Information and Event Management (SIEM), Intrusion Detection System (IDS), Intrusion Prevention System (IPS) and Web Application Firewall (WAF)
+ MOSTRAR MÁS CARACTERÍSTICAS

Big Data Quality

  • Data cleansing, profiling, masking, parsing, and matching on Spark and Hadoop
  • Machine learning for data matching and deduplication
  • Support for Cloudera Navigator and Apache Atlas
  • HDFS file profiling
+ MOSTRAR MÁS CARACTERÍSTICAS

Advanced Data Profiling

  • Fraud pattern detection using Benford Law
  • Advanced statistics with indicator thresholds
  • Column set analysis
  • Advanced matching analysis
  • Time column correlation analysis
+ MOSTRAR MÁS CARACTERÍSTICAS

Controle los costes de sus proyectos de integración de datos

Talend keeps it flexible

Flexible

Mantenga recursos flexibles y costes previsibles con una suscripción mensual o anual.

Talend keeps it predictable

Predecible

Talend cobra por usuario, no por volúmenes de datos ni por conectores

Talend keeps it simple

Sencillo

un 50 % menos de coste total de propiedad con una única solución ejecutable desde la cloud

With Talend, we have improved our 48.8 million passenger’s experience and operation’s efficiency. And we have been recognized as Europe ‘s number One airport over 40 million passengers according to ACI World’s globally-established Airport Service Quality programme

Pietro Caminiti - Head of IT Solutions, Aeroporti di Roma

¿Listo para empezar a usar Talend?