WHY YOUR HDP NEEDS WANDISCO

Continuous availability with Maximum Performance and Scalability

100% Use of Compute Resources

Cluster Zoning

Multi-Data Center Ingest

Selective replication with global access

Continuous availability with Maximum Performance and Scalability

WANdisco’s patented technology removes the bottleneck of a single active NameNode in a single data center, balances workload across data centers, and enables LAN-speed read/write at any location without downtime or data loss – even during Hadoop upgrades and maintenance.

More...

WANdisco’s patented replication technology enables workload to be balanced across multiple active peer NameNodes that simultaneously support client requests and continually synchronize across any number of data centers any distance apart. Clients have LAN-speed read/write access to the same data at every location without downtime or data loss. Support for rolling upgrades eliminates downtime when new versions of HDP are deployed. These capabilities meet the most stringent regulatory and business requirements for data availability and disaster recovery.

100% Use of Compute Resources

Active-active architecture eliminates read-only backup servers by making all servers fully readable and writable, so you can take full advantage of the hardware at each location.

More...

With Non-Stop Hadoop, there are no passive standby servers. In WANdisco’s active/active architecture, all servers across all locations are fully readable and writeable. Money isn’t wasted on read-only backup servers and clusters that can’t be fully utilized. For even greater efficiency, features such as selective replication and asymmetric block replication allow different hardware footprints in each location.

Cluster Zoning

Delegate your most resource-intensive data load and in-memory applications to high-spec servers while running less critical batch applications on commodity servers. Maintain quality of service for all users and eliminate the need for costly hardware throughout the cluster.

More...

Cluster zoning enables a ‘virtual cluster’ or zone within a cluster to be deployed that isolates the most resource intensive data load and in-memory applications to the highest spec servers, while less critical batch applications run on low-end commodity servers. This offers significant cost savings by eliminating the need to deploy high-end servers throughout a cluster, while maintaining quality of service for all users.

Multi-Data Center Ingest

Ingest data at any number of locations simultaneously, automatically replicate where you choose, and analyze from anywhere with no single point of failure. No admin overhead, no loss of data.

More...

WANdisco’s active/active architecture enables data ingest at any number of locations simultaneously, automatically replicates it to any number of locations, and allows it to be analyzed anywhere without any single point of failure. This eliminates the administrator overhead and risk of error involved in using DistCp and other tools to copy data from its source to a central location for analysis. This capability is essential in large global organizations where data is generated from widely distributed sources and timeliness and accuracy are critical.

Selective replication with global access

WANdisco makes Hadoop data center-aware so you can perform global, company-wide roll-up analysis from anywhere regardless of how your data is distributed.

More...

In large deployments where selective replication is used, data may not exist at the location where analysis is run. WANdisco overcomes this by enabling Hadoop to be data center aware, making global company-wide roll-up analysis possible regardless of how data is distributed across an organization.


Key features of Non-Stop Hadoop for Hortonworks

Non-Stop Hadoop for Hortonworks offers flexible, automated features that reduce manual processes and eliminate the need for third-party backup and recovery solutions.

Single Hadoop cluster over a WAN

A single Hadoop cluster guarantees complete synchronization, eliminates single points of failure and enables operation across data centers any distance apart, subject to the selective replication policies you implement.

Automatic WAN backup and failover

Automatic hot backup, failover and recovery prevent data loss and downtime whether a single server or an entire data center goes offline.

Optimal hardware utilization

There are no passive standby servers. All available servers are active and fully readable and writable at all times, making maximum use of your compute power. Cluster zoning enables support for the most demanding applications without the need for high-cost, high-spec hardware across the entire deployment.

Unlimited performance and scalability

Enables 100% Hadoop uptime for business continuity with unmatched scalability and performance over a WAN across data centers, or LAN within each data center.

No disruption to applications

Mission-critical applications run flawlessly without any change in behavior. Support for rolling upgrades enables applications to continue running while new versions of Hadoop are being deployed.

Full support for HBase database

Builds on Non-Stop Hadoop's multiple active NameNode architecture to support a single HBase instance with multiple active master servers and multiple active region servers for each region.


Unparalleled availability and performance

Unlike other high availability solutions that only work within a single data center over a LAN, Non-Stop Hadoop is optimized for use between data centers thousands of miles apart to deliver global business results.

Gain greater efficiency

One Hadoop cluster can span geographic locations for better load balancing.

Eliminate manual processes

Clusters are kept continuously in sync without admin intervention.

0% data loss

Failover and recovery are automatic both within and across data centers.


The impact of Non-Stop Hadoop across the enterprise

Non-Stop Hadoop benefits for users

Access the data you need, when you need it

Experience LAN-speed performance with access to the same data at every location.

Analyze data anywhere

With automatic replication and global access, you can perform analysis from anywhere regardless of how the data is distributed.

Use the tools you’re familiar with

No learning curve — work with the same tools and protocols you always have.

Non-Stop Hadoop benefits for administrators

Easy installation

Intuitive browser interface makes configuration simple. Multiple sites can be up and running in no time.

100% hardware utilization

There are no passive standby servers. All available servers are active and fully readable and writable at all times, reducing cost and delivering maximum compute power.

Support for rolling upgrades

Non-Stop Hadoop’s active-active architecture keeps your deployment up and running during planned and unplanned maintenance, including upgrades to Hadoop.

Automated features for easy administration
  • Automatic disaster recovery with built-in self-healing capabilities. No administrator intervention required.
  • Automatic synchronization for new data centers, or existing data centers brought back online after an outage.
Centralized administration

All sites can be administered from a single location.

Cluster zoning

Cluster zoning offers significant cost savings by eliminating the need to deploy high-end servers throughout a cluster, while maintaining quality of service for all users.

Non-Stop Hadoop benefits for managers

No retraining

No change to Hadoop functionality. Teams continue to use the applications and tools they're familiar with.

100% hardware utilization

There are no passive standby servers. All available servers are active and fully readable and writable at all times, reducing cost and delivering maximum compute power.

Cluster zoning

Cluster zoning offers significant cost savings by eliminating the need to deploy high-end servers throughout a cluster, while maintaining quality of service for all users.

Peace of mind

No downtime and data loss with automatic recovery and failover – no human intervention or additional third party backup and recovery solutions required.

Meets stringent requirements for data availability and disaster recovery

Non-Stop Hadoop’s active-active architecture guarantees consistency across peer NameNodes in data centers any distance apart, so when data center outages occur due to hardware or network failure, applications keep running. When servers come back online, they resynchronize automatically.



Contact a Specialist

* Required field