Hadoop without Limits
Hadoop without Limits
WANdisco’s patented technology removes the bottleneck of a single active NameNode in a single data center, balances workload across data centers, and enables LAN-speed read/write at any location without downtime or data loss – even during Hadoop upgrades and maintenance.
WANdisco’s patented replication technology enables workload to be balanced across multiple active peer NameNodes that simultaneously support client requests and continually synchronize across any number of data centers any distance apart. Clients have LAN-speed read/write access to the same data at every location without downtime or data loss. Support for rolling upgrades eliminates downtime when new versions of CDH are deployed. These capabilities meet the most stringent regulatory and business requirements for data availability and disaster recovery.
Active-active Hadoop replication architecture eliminates read-only backup servers by making all servers fully readable and writable, so you can take full advantage of the hardware at each location.
With Non-Stop Hadoop, there are no passive standby servers. In WANdisco’s active/active architecture, all servers across all locations are fully readable and writeable. Money isn’t wasted on read-only backup servers and clusters that can’t be fully utilized. For even greater efficiency, features such as selective replication and asymmetric block replication allow different hardware footprints in each location.
Delegate your most resource-intensive data load and in-memory applications to high-spec servers while running less critical batch applications on commodity servers. Maintain quality of service for all users and eliminate the need for costly hardware throughout the cluster.
Cluster zoning enables a ‘virtual cluster’ or zone within a cluster to be deployed that isolates the most resource intensive data load and in-memory applications to the highest spec servers, while less critical batch applications run on low-end commodity servers. This offers significant cost savings by eliminating the need to deploy high-end servers throughout a cluster, while maintaining quality of service for all users.
Ingest data at any number of locations simultaneously, automatically replicate where you choose, and analyze from anywhere with no single point of failure. No admin overhead, no loss of data.
WANdisco’s active/active architecture enables data ingest at any number of locations simultaneously, automatically replicates it to any number of locations, and allows it to be analyzed anywhere without any single point of failure. This eliminates the administrator overhead and risk of error involved in using DistCp and other tools to copy data from its source to a central location for analysis. This capability is essential in large global organizations where data is generated from widely distributed sources and timeliness and accuracy are critical.
WANdisco makes Hadoop data center-aware so you can perform global, company-wide roll-up analysis from anywhere regardless of how your data is distributed. Control data location and access with granular rules and unique WAN-read on-demand replication.
WANdisco's selective replication gives you full control over data location, including the unique ability to pull data on demand. Selective replication lets you use network bandwidth efficiently while still presenting a unified view of data for roll-up analysis for authorized users and physically isolating sensitive data to secure clusters.
Non-Stop Hadoop is the only solution that guarantees global business continuity for distributed Hadoop and HBase deployments, turning your investment into a non-stop engine.
100% uptime guaranteed - whether a single server or entire site goes down
100% hardware utilization reduces cost and delivers maximum compute power
No change to application performance or behavior
100% sync - every location has LAN-speed read and write access to the same data
Full support for HBase, the NoSQL database for Hadoop
Monitor and administer servers from a single location
"Global organizations require big data solutions that can meet stringent data and availability needs across wide geographies. We believe the integration of WANdisco's active-active replication technology with the Hortonworks enterprise Apache Hadoop distribution makes sense for SAP customers looking to leverage big data."
Irfan Khan, Senior VP and GM, SAP Big Data
The Culture of Big Data describes the cultural challenges that accompany efforts to create and sustain big data initiatives in an evolving world rooted firmly in data warehouse architectures.
This white paper places Hadoop in the context of enterprise IT and helps those managing Hadoop platforms make it responsive to the enterprise's data governance policies and processes.