Bigdataappliance datasheet-1883358

6
ORACLE DATA SHEET ORACLE BIG DATA APPLIANCE X4-2 BIG DATA FOR THE ENTERPRISE OPEN, SECURE AND INTEGRATED KEY FEATURES Massively scalable, open infrastructure to store and manage big data Industry-leading security, performance and the most comprehensive big data tool set on the market all bundled in an easy to deploy appliance. Big Data Connectors delivers load rates of up to 15TB per hour between Big Data Appliance and Oracle Exadata Cloudera’s comprehensive software suite including Cloudera Distribution including Apache Hadoop (CDH 4.x and 5.x) and Apache Spark Oracle Enterprise Manager combined with Cloudera Manager simplifies management of the entire Big Data Appliance Advanced analytics with Oracle R directly interacting with data stored in HDFS Handle low-latency unstructured workloads with the pre-installed and configured Oracle NoSQL Database Community Edition InfiniBand connectivity between nodes and across appliances as well as to Oracle Exadata and Oracle Exalytics Flexible configuration choices for optimizing both floor space and growth path for Hadoop and Oracle NoSQL Database Oracle Big Data Appliance X4-2 is a comprehensive Big Data platform, engineered for secure data processing with a low overall total cost of ownership. It is optimized for both batch and real-time processing utilizing Cloudera’s Distribution for Apache Hadoop (versions 4.x and 5.x) supporting YARN and MR2, Oracle NoSQL Database, Apache Spark, Cloudera Impala and Cloudera Search to satisfy diverse computing requirements. Built using industry-standard hardware from Sun, Big Data Appliance X4-2 delivers the perfect balance between compute power, I/O bandwidth and memory footprint offering 33% more storage capacity than the previous generation appliance. Big Data Appliance X4-2 provides a highly optimized platform with integrated management capabilities that allows you to derive value quickly with lower risk. Comprehensive Big Data Platform Oracle Big Data Appliance is an open, multi-purpose big data platform. It is optimized to run a diverse set of workloads including batch processing jobs as well as interactive applications. Apache Hadoop’s MapReduce framework (including YARN and MR2) powers the batch capabilities processing massive volumes of data with linear scalability. There are several options for interactive applications each with their own unique properties. Oracle NoSQL Database is a distributed key-value database. It is designed to be highly available and extremely scalable with predictable levels of throughput and latency. Cloudera Impala provides real-time SQL queries over data stored in HDFS enabling business intelligence tools to access data in Hadoop without requiring MapReduce processing. Apache Spark enables processing of large data sets in memory. Finally, Cloudera Search offers full-text interactive search over data stored in HDFS with results delivered using a faceted navigation model. In addition to providing the full Cloudera software platform, Big Data Appliance utilizes Oracle Big Data Connectors to simplify data integration and analytics. Big Data Connectors provide high speed access to data in Hadoop from Oracle Exadata and Oracle Database with data transfer rates in the order of 15 TB/hour. Big Data Connectors also enable integrated, highly scalable analytics to run on Big Data Appliance providing native access to Hadoop data and parallel processing using Oracle R Distribution. Finally, Oracle XQuery for Hadoop is a new capability that enables standard XQuery operations to process and transform documents in various formats (JSON, XML, Avro and others), executing in parallel across the Hadoop cluster. The big data domain is marked by continuous innovation; Big Data Appliance embraces these innovations by providing an open environment without compromising tight integration and enterprise-level support. Organizations are free to deploy external software to support new functionality such as graph analytics, natural language processing and fraud detection to meet the needs of the application. Support for non-Oracle components is delivered by their respective support channels and not by Oracle. Big Data Appliance X4-2 Software Integrated Software Oracle Linux 6.4 with Unbreakable Enterprise Kernel

description

 

Transcript of Bigdataappliance datasheet-1883358

Page 1: Bigdataappliance datasheet-1883358

ORACLE DATA SHEET

ORACLE BIG DATA APPLIANCE X4-2

BIG DATA FOR THE ENTERPRISE

OPEN, SECURE AND INTEGRATED

KEY FEATURES

Massively scalable, open infrastructure

to store and manage big data

Industry-leading security, performance

and the most comprehensive big data

tool set on the market all bundled in an

easy to deploy appliance.

Big Data Connectors delivers load rates

of up to 15TB per hour between Big

Data Appliance and Oracle Exadata

Cloudera’s comprehensive software

suite including Cloudera Distribution

including Apache Hadoop (CDH 4.x and

5.x) and Apache Spark

Oracle Enterprise Manager combined

with Cloudera Manager simplifies

management of the entire Big Data

Appliance

Advanced analytics with Oracle R

directly interacting with data stored in

HDFS

Handle low-latency unstructured

workloads with the pre-installed and

configured Oracle NoSQL Database

Community Edition

InfiniBand connectivity between nodes

and across appliances as well as to

Oracle Exadata and Oracle Exalytics

Flexible configuration choices for

optimizing both floor space and growth

path for Hadoop and Oracle NoSQL

Database

Oracle Big Data Appliance X4-2 is a comprehensive Big Data platform,

engineered for secure data processing with a low overall total cost of

ownership. It is optimized for both batch and real-time processing – utilizing

Cloudera’s Distribution for Apache Hadoop (versions 4.x and 5.x) supporting

YARN and MR2, Oracle NoSQL Database, Apache Spark, Cloudera Impala and

Cloudera Search to satisfy diverse computing requirements. Built using

industry-standard hardware from Sun, Big Data Appliance X4-2 delivers the

perfect balance between compute power, I/O bandwidth and memory footprint –

offering 33% more storage capacity than the previous generation appliance.

Big Data Appliance X4-2 provides a highly optimized platform with integrated

management capabilities that allows you to derive value quickly with lower risk.

Comprehensive Big Data Platform

Oracle Big Data Appliance is an open, multi-purpose big data platform. It is optimized to run

a diverse set of workloads – including batch processing jobs as well as interactive

applications. Apache Hadoop’s MapReduce framework (including YARN and MR2) powers

the batch capabilities – processing massive volumes of data with linear scalability. There are

several options for interactive applications – each with their own unique properties. Oracle

NoSQL Database is a distributed key-value database. It is designed to be highly available and

extremely scalable with predictable levels of throughput and latency. Cloudera Impala

provides real-time SQL queries over data stored in HDFS – enabling business intelligence

tools to access data in Hadoop without requiring MapReduce processing. Apache Spark

enables processing of large data sets in memory. Finally, Cloudera Search offers full-text

interactive search over data stored in HDFS – with results delivered using a faceted navigation

model.

In addition to providing the full Cloudera software platform, Big Data Appliance utilizes

Oracle Big Data Connectors to simplify data integration and analytics. Big Data Connectors

provide high speed access to data in Hadoop from Oracle Exadata and Oracle Database – with

data transfer rates in the order of 15 TB/hour. Big Data Connectors also enable integrated,

highly scalable analytics to run on Big Data Appliance – providing native access to Hadoop

data and parallel processing using Oracle R Distribution. Finally, Oracle XQuery for Hadoop

is a new capability that enables standard XQuery operations to process and transform

documents in various formats (JSON, XML, Avro and others), executing in parallel across the

Hadoop cluster.

The big data domain is marked by continuous innovation; Big Data Appliance embraces these

innovations by providing an open environment without compromising tight integration and

enterprise-level support. Organizations are free to deploy external software to support new

functionality – such as graph analytics, natural language processing and fraud detection –to

meet the needs of the application. Support for non-Oracle components is delivered by their

respective support channels and not by Oracle.

Big Data Appliance X4-2 Software

Integrated Software

Oracle Linux 6.4 with Unbreakable Enterprise Kernel

Page 2: Bigdataappliance datasheet-1883358

ORACLE DATA SHEET

2

KEY BENEFITS

Optimized, Complete and Secure Big

Data Solution

Most comprehensive big data tool set

integrated in a single appliance

Integrated with Oracle Exadata to

analyze all your data

Risk-free installation and rapid time to

value

Simplified operations, updates and

patch management though a single

command utility of the entire stack (OS,

Java, Oracle NoSQL Database and the

Cloudera stack)

Single Management Console integrating

Big Data Appliance hardware and

software monitoring through Oracle

Enterprise Manager

Single-vendor support for your entire big

data solution covering both hardware

and software

Oracle Java – JDK 7

Cloudera Software 4.x and 5.x

Cloudera’s Distribution including Apache Hadoop (CDH) with support for YARN* and MR2*

Cloudera Impala

HBase (as well as support for Accumulo)

Cloudera Search

Apache Spark*

Cloudera Manager including:

Cloudera Back-up and Disaster Recovery (BDR)

Cloudera Navigator

Oracle R Distribution

Oracle NoSQL Database Community Edition**

Oracle Big Data Appliance Enterprise Manager Plug-In

Optional Software (separately licensed)

Oracle Big Data Connectors

Oracle SQL Connector for Hadoop

Oracle Loader for Hadoop

Oracle XQuery for Hadoop

Oracle R Advanced Analytics for Hadoop

Oracle Data Integrator Application Adapter for Hadoop

Oracle Audit Vault and Database Firewall for Hadoop Auditing

Oracle Data Integrator

Oracle NoSQL Database Enterprise Edition

* Apache Spark, YARN and MR2 are features included with and supported on CDH 5.x

** Support for Oracle NoSQL Database Community Edition is not a part of Big Data Appliance. It is a separately

purchased component

Lower TCO than Do-it-Yourself Hadoop

Oracle Big Data Appliance lowers the total cost of ownership of a big data platform when

compared to a DIY system. Not only are the costs of an initial deployment lower with Big

Data Appliance, but more significantly, so are the ongoing costs of maintenance, optimization

and system growth.

Big Data Appliance provides unique pricing to dramatically reduce the three to four year TCO

when compared to a DIY big data platform. Big Data Appliance bundles the hardware

(servers, high-speed networking, power distribution units and peripherals), OS support and

subscription costs for the Cloudera software into a single price for the life of the system. A

single support license covers both the hardware and the integrated software.

Organizations do not want to spend valuable intellectual capital assembling and tuning an

optimized Hadoop/NoSQL infrastructure, especially when these resources can be applied to

delivering high value business solutions. Big Data Appliance delivers a pre-configured, highly

tuned environment out-of-box for Apache Hadoop and Oracle NoSQL Database. This

optimized environment enables companies to focus their resources on developing compelling

business applications – lowering the risk for the solution. Additionally, the pre-tuned

environment avoids extensive ramp-up time for new applications due to performance and

production issues.

Simplified Operations

Oracle Enterprise Manager provides a single entry point for managing the entire system – both

hardware and software – providing continuity across other Oracle products in the

organization. To provide deep management capabilities for Hadoop, Enterprise Manager

enables a context-aware integration with Cloudera Manager.

Page 3: Bigdataappliance datasheet-1883358

ORACLE DATA SHEET

3

RELATED PRODUCTS AND SERVICES

Oracle Big Data Appliance brings a low

risk, highly scalable big data platform to

the enterprise.

RELATED PRODUCTS

The following are related products

available from Oracle:

Oracle Exadata

Oracle Big Data Connectors

Oracle NoSQL Database

Oracle Exalytics

Oracle Business Intelligence Enterprise

Edition

Oracle Endeca Information Discovery

Oracle Data Integrator

Oracle Enterprise Manager

RELATED SERVICES

The following services are available from

Oracle Support Services:

Advanced Customer Services

Product Support Services

Consulting Services

Oracle University Courses

Big Data Appliance simplifies day-to-day operations by providing a simple one-command

installation, update, patch and expansion utility – Mammoth – which enables rapid

deployment updates (typically quarterly) to the frequently evolving Hadoop stack without

incurring significant downtime. Mammoth also enables Oracle-tested, seamless upgrades

between Hadoop versions and automated service management to ensure the best balance

between Hadoop Master Nodes and Data Nodes.

Big Data Appliance is supported by Oracle, giving organizations a single point of support for

their hardware, all integrated software (including all Cloudera software) and any additional

Oracle software installed.

Comprehensive Security

Securing data is critical to Big Data solutions in the enterprise; Big Data Appliance provides

strong authentication, authorization and auditing of data in Hadoop out of the box.

Strong authentication is provided using Kerberos. This ensures that all users are who they

claim to be – and that rogue services are not added to the system.

Big Data Appliance leverages Apache Sentry (an open-source project of which Oracle is a

founding member) to authorize SQL access via tools like Hive and Impala. By delivering and

developing Sentry, Oracle delivers Big Data Appliance with the highest data security levels

currently available for Hadoop.

Both encryption of data-at-rest and network encryption are capabilities included with Oracle

Big Data Appliance and supported by Oracle. Network encryption prevents network sniffing

from capturing protected data. Encryption of data-at-Rest ensures that data on removed

physical disks is unreadable without the passphrase or trusted platform module, which is used

to encrypt the data. Both encryption features are transparent to the applications running over

HDFS and can be applied upon install or at a later time.

To ensure security and data access compliance, Big Data Appliance integrates with Oracle

Audit Vault and Database Firewall. An Oracle Audit Vault agent is pre-installed on Big Data

Appliance to track and audit data access on the Hadoop system. By leveraging Oracle Audit

Vault and Database Firewall, all auditing across the organization is consolidated into a single

audit repository ensuring a comprehensive view across all data.

Flexible Configurations

Big Data Appliance is designed to expand as your data and requirements grow. Initial big data

implementations may start with Big Data Appliance Starter Rack. This six server rack comes

fully equipped with a complete set of switches and power distribution units (PDU) required

for a full rack. This allows the appliance to easily and efficiently expand in six node hardware

increments to larger configurations using the Oracle Big Data Appliance In-Rack Expansion.

In addition to upgrading within a rack, multiple racks can be connected using the

integrated InfiniBand fabric to form even larger configurations; up to 18 racks can be

connected in a non-blocking manner by connecting InfiniBand cables without the need for any

external switches. Larger non-blocking configurations are supported with additional external

InfiniBand switches, larger blocking network configurations can be supported without

additional switches.

Big Data Appliance is multitenant; it can be configured as a single cluster or as a set of

clusters. This provides the flexibility customers need when deploying development, test and

production clusters.

Page 4: Bigdataappliance datasheet-1883358

ORACLE DATA SHEET

4

Big Data Appliance X4-2 Hardware

Full Rack Starter Rack In-Rack Expansion

18 x compute/storage

nodes

6 x compute/storage

nodes

6 x compute/storage

nodes

Per Node:

2 x Eight-Core Intel ® Xeon ® E5-2650 V2 Processors

64 GB Memory (individual nodes can be expanded to 512 GB)

Disk Controller HBA with 512MB Battery backed write cache

12 x 4TB 7,200 RPM High Capacity SAS Disks

2 x QDR (40Gb/s) Ports

4 x 10 Gb Ethernet Ports

1 x ILOM Ethernet Port

2 x 32 Port QDR InfiniBand Switch

32 x InfiniBand ports

8 x 10Gb Ethernet ports

Leverages the leaf

switches from the Starter

Rack

1 x 36 Port QDR InfiniBand Switch

36 x InfiniBand Ports

Leverages the spine

switch from the Starter

Rack

Additional Hardware Components included:

Ethernet Administration Switch

2 x Redundant Power Distributions Units

(PDUs)

42U rack packaging

Leverages the

administration switch,

PDUs and base rack from

the Starter Rack

Spares Kit Included:

2 x 4 TB High Capacity SAS disk

InfiniBand cables

Leverages the spares kit

from the Starter Rack

Big Data Appliance X4-2 Expansions

In-Rack Expansion Multi-Rack Connection

Upgradeability:

Field upgrade leveraging either a single (6

nodes) or two (2 x 6 nodes) In-Rack

Expansions. Expansion supports multiple

generations of hardware

Additional hardware include with each

In-Rack Expansion:

6 x Compute node with direct

attached storage as shown earlier

InfiniBand and Ethernet cables to

connect all of the components

Up to 18 racks can be connected without

requiring additional InfiniBand switches

InfiniBand cables to connect 3 racks are

included in the rack Spares Kits

Additional optical InfiniBand cables

required when connecting 4 or more racks

Memory Expansions

Expand the memory in any individual node or any number of nodes from 64GB per node

to 512GB per node.

Page 5: Bigdataappliance datasheet-1883358

ORACLE DATA SHEET

5

Big Data Appliance X4-2 – Environmental Specificaions

Physical Dimensions

Height

Width

Depth

42U, 78.66” - 1998 mm

23.62” - 600mm

47.24” - 1200 mm

Weight

Starter Rack 1037 Lbs

Starter Rack + In-

Rack Expansion 1400 Lbs

Full Rack 1800 Lbs

Power

Starter Rack Maximum 4.2 KW

Typical 1 3.0 KW

Starter Rack + In-

Rack Expansion

Maximum 7.7 KW

Typical 1 5.4 KW

Full Rack Maximum 10.0KW

Typical 1 7.0 KW

Cooling

Starter Rack Maximum 14,052 BTU/hour

Typical 9,836 BTU/hour

Starter Rack + In-

Rack Expansion

Maximum 26,411 BTU/hour

Typical 18,487 BTU/hour

Full Rack Maximum 34,142 BTU/hour

Typical 23,940 BTU/hour

Airflow 2

Starter Rack Maximum 676 CFM

Typical 473 CFM

Starter Rack + In-

Rack Expansion

Maximum 1223 CFM

Typical 856 CFM

Full Rack Maximum 1,573 CFM

Typical 1,103 CFM

Further Environmental Specifications

Operating temperature/humidity: 5 ºC to 32 ºC (41 ºF to 89.6 ºF), 10% to 90% relative

humidity, non-condensing

Altitude Operating: Up to 3,048 m, max. ambient temperature is de-rated by 1° C per

300 m above 900 m

Regulations 3

Safety: UL 60950-1 2nd Ed, EN60950-1:2006 2nd Ed, CB Scheme with all country

differences

RFI/EMI: FCC CFR 47 Part 15 Subpart B Class A, EN 55022:2006+A1:2007 Class A,

EN 61000-3-11:2000, EN 61000-3-12:2005, ETSI EN 300 386 V1.4.1 (2008)

Immunity: EN 55024:1998+A1:2001:+A2:2003

Page 6: Bigdataappliance datasheet-1883358

ORACLE DATA SHEET

6

Contact Us

For more information about Oracle Big Data Appliance, visit oracle.com or call +1.800.ORACLE1 to speak to an Oracle representative.

Copyright © 2013, Oracle and/or its affiliates. All rights reserved.

This document is provided for information purposes only and the contents hereof are subject to change without notice. This document is not warranted to be error-free, nor subject

to any other warranties or conditions, whether expressed orally or implied in law, including implied warranties and conditions of merchantability or fitness for a particular purpose.

We specifically disclaim any liability with respect to this document and no contractual obligations are formed either directly or indirectly by this document. This document may not

be reproduced or transmitted in any form or by any means, electronic or mechanical, for any purpose, without our prior written permission.

Oracle and Java are registered trademarks of Oracle and/or its affiliates. Other names may be trademarks of their respective owners. Cloudera, Cloudera CDH, and Cloudera

Manager, Cloudera Navigator and Cloudera BDR are registered and unregistered trademarks of Cloudera, Inc.,

Intel and Intel Xeon are trademarks or registered trademarks of Intel Corporation. All SPARC trademarks are used under license and are trademarks or registered trademarks of

SPARC International, Inc. AMD, Opteron, the AMD logo, and the AMD Opteron logo are trademarks or registered trademarks of Advanced Micro Devices. UNIX is a registered

trademark licensed through X/Open Company, Ltd. 0611

Certifications 3

Safety: UL/cUL, CE, BSMI, GOST R, S-Mark, CSA C22.2 No. 60950-1-07 2nd Ed, CCC

EMC: CE, FCC, VCCI, ICES, KCC, GOST R, BSMI Class A, AS/NZ 3548, CCC

Other: Complies with WEEE Directive (2002/96/EC) and RoHS Directive (2002/95/EC)

1 Typical power usage varies by application workload

2 Airflow must be front to back

3 In some cases, as applicable, regulatory and certification compliance were obtained at

the component level

Big Data Appliance Support Services

Hardware Warranty: 1 year with a 4 hour web/phone response during normal

business hours (Mon-Fri 8AM-5PM), with 2 business day on-site

response/Parts Exchange

Oracle Premier Support for Systems: Oracle Linux and integrated software

support and 24x7 with 2 hour on-site hardware service response (subject to

proximity to service center)

Oracle Premier Support for Operating Systems

Oracle Customer Data and Device Retention

System Installation Services

Software Configuration Services

System Expansion Support Services including hardware installation and

software configuration

Quarterly on-site patch deployment service

Oracle Automatic Service Request (ASR)