+
+

Related Products

  • AnalyticsCreator
    46 Ratings
    Visit Website
  • Teradata VantageCloud
    1,105 Ratings
    Visit Website
  • dbt
    239 Ratings
    Visit Website
  • Docket
    58 Ratings
    Visit Website
  • Kamatera
    152 Ratings
    Visit Website
  • Comet Backup
    219 Ratings
    Visit Website
  • Yodeck
    7,501 Ratings
    Visit Website
  • SureSync
    13 Ratings
    Visit Website
  • RaimaDB
    12 Ratings
    Visit Website
  • DataBuck
    6 Ratings
    Visit Website

About

Delta Lake is an open-source storage layer that brings ACID transactions to Apache Spark™ and big data workloads. Data lakes typically have multiple data pipelines reading and writing data concurrently, and data engineers have to go through a tedious process to ensure data integrity, due to the lack of transactions. Delta Lake brings ACID transactions to your data lakes. It provides serializability, the strongest level of isolation level. Learn more at Diving into Delta Lake: Unpacking the Transaction Log. In big data, even the metadata itself can be "big data". Delta Lake treats metadata just like data, leveraging Spark's distributed processing power to handle all its metadata. As a result, Delta Lake can handle petabyte-scale tables with billions of partitions and files at ease. Delta Lake provides snapshots of data enabling developers to access and revert to earlier versions of data for audits, rollbacks or to reproduce experiments.

About

Open source data governance suite for databases and data lakes. Tokern is a simple to use toolkit to collect, organize and analyze data lake's metadata. Run as a command-line app for quick tasks. Run as a service for continuous collection of metadata. Analyze lineage, access control and PII datasets using reporting dashboards or programmatically in Jupyter notebooks. Tokern is an open source data governance suite for databases and data lakes. Improve ROI of your data, comply with regulations like HIPAA, CCPA and GDPR and protect critical data from insider threats with confidence. Centralized metadata management of users, datasets and jobs. Powers other data governance features. Track Column Level Data Lineage for Snowflake, AWS Redshift and BigQuery. Build lineage from query history or ETL scripts. Explore lineage using interactive graphs or programmatically using APIs or SDKs.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Companies looking for a storage layer software solution for big data workloads

Audience

Anyone who needs an open source data governance suite for databases and data lakes

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

No information available.
Free Version
Free Trial

Pricing

No information available.
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Delta Lake
Founded: 2019
United States
delta.io

Company Information

Tokern
Founded: 2019
United States
tokern.io

Alternatives

Apache Hudi

Apache Hudi

Apache Corporation

Alternatives

Apache Iceberg

Apache Iceberg

Apache Software Foundation
Kylo

Kylo

Teradata
Apache Kudu

Apache Kudu

The Apache Software Foundation

Categories

Categories

Integrations

AWS Glue
Acryl Data
Amazon Redshift
Amazon Web Services (AWS)
Amundsen
Apache Spark
Booz Allen MDR
Daft
DataHub
Dell AI-Ready Data Platform
Edmunds Financial Management
Google Cloud Dataproc
Google Cloud Storage
Hackolade
IBM StreamSets
McGraw-Hill Connect
PuppyGraph
Talend Data Fabric
eBay

Integrations

AWS Glue
Acryl Data
Amazon Redshift
Amazon Web Services (AWS)
Amundsen
Apache Spark
Booz Allen MDR
Daft
DataHub
Dell AI-Ready Data Platform
Edmunds Financial Management
Google Cloud Dataproc
Google Cloud Storage
Hackolade
IBM StreamSets
McGraw-Hill Connect
PuppyGraph
Talend Data Fabric
eBay
Claim Delta Lake and update features and information
Claim Delta Lake and update features and information
Claim Tokern and update features and information
Claim Tokern and update features and information