WANdisco LiveData
for Databricks
Continuous. Immediate. Accurate.
Automated Data and Metadata Replication

Under pressure to decrease time to insight?
Automated Migration for Cloud Analytics
Zero downtime. Zero data loss. 100% consistency.
LiveData for Databricks provides immediate analytic data access through continuous automated replication from on-premises Hadoop analytics to Spark based cloud analytics.


On-Premises to Cloud Migration
Built using WANdisco LiveData Platform's plugin architecture, LiveData for Databricks allows users to easily select data sets and begin migration. LiveData for Databricks provides automated migration from all major commercial Hadoop distributions to Delta Lake running on Databricks. Requiring just one pass of the source data, LiveData for Databricks performs the migration of the selected data sets along with any data changes that may have occurred.
Learn MoreActive-Active Replication
Leveraging LiveData Plane, LiveData for Databricks also supports active-active replication across multiple Databricks environments. Any changes made to content in any of the participating Databricks environments are replicated among the others. This ensures users get consistent business insights when using Databricks across multiple cloud regions or providers.
Download Solution Brief
LiveData for Databricks key features
Delta Lake brings key features to cloud storage that have challenged adopters of the cloud:
Hadoop & Object Storage
Works across a variety of big data source and target environments, including all major Hadoop distributions and object storage technologies such as AWS S3 and Azure Data Lake Storage.
Delta Lake Support
Replicates Hive content to cloud storage for Databricks to make Hive data and metadata available as Delta Lake tables.
Petabyte Scale
Migrates big data sets at any scale to cloud storage without needing to halt changes made to the data sets during migration.
Selective Migration
Administrators have full control over what data is migrated from source to target or replicated across multiple environments.
One Pass
With just a single pass through the source storage system, LiveData for Databricks is able to perform the migration of the selected data sets as well as any changes being made to the data.
Guaranteed Consistency
LiveData for Databricks leverages WANdisco’s patented coordination engine to ensure 100% data consistency between source and target. Applications are able to continue to modify the source system’s data during migration and still achieve guaranteed consistency.
Automatic Outage Recovery
LiveData for Databricks' guaranteed consistency eliminates the need for manual response to system failures, including network outages and other disruptions.
Bandwidth Management
Optimizes bandwidth use by eliminating the need for repeated transfer of data, and enforces limits on bandwidth use.
Rapid Availability
Minimizes the time required to bring workloads to the cloud by making each data location available for use as soon as bandwidth allows.
Seeing is Believing. Try WANdisco Now.
Fully-featured, self-service and automated. Start migrating data in minutes, at any scale to any cloud. With zero production system downtime and zero business disruption. Start with 5TB free. Seeing is indeed believing.
FREE TRIAL