XtremeData Inc. dbX Analytics System Introduction · XtremeData's dbX cost-effectively enables...
Transcript of XtremeData Inc. dbX Analytics System Introduction · XtremeData's dbX cost-effectively enables...
![Page 1: XtremeData Inc. dbX Analytics System Introduction · XtremeData's dbX cost-effectively enables wide, deep, and unrestricted querying of structured data. Price An Appliance priced](https://reader033.fdocuments.net/reader033/viewer/2022060415/5f12fb02c91b8b5958510aba/html5/thumbnails/1.jpg)
XtremeData Inc.
dbX Analytics System Introduction
![Page 2: XtremeData Inc. dbX Analytics System Introduction · XtremeData's dbX cost-effectively enables wide, deep, and unrestricted querying of structured data. Price An Appliance priced](https://reader033.fdocuments.net/reader033/viewer/2022060415/5f12fb02c91b8b5958510aba/html5/thumbnails/2.jpg)
Company
Founded in 2004
Locations: Schaumburg, IL (HQ / Engineering/ Sales); Bangalore, India (Engineering); St. Louis, MO (Engineering); South Bend, IN (Sales)
Current business and products
Patented “In-Socket Accelerator” (ISA) that enables companies to create higher performance, lower power appliances. Branded as XD2000F and XD2000I in the High-Performance Computing market.
Management:
Ravi Chandran; CEO and Founder
Faisal Shah; Chief Technologist, Database Appliance Former Cofounder Knightsbridge Solutions LLC
Jay Desai; Sr. VP of Strategy Former Cofounder Knightsbridge Solutions LLC
Susan Clarke; VP of Finance
Geno Valente; VP of Sales and Marketing
2
![Page 3: XtremeData Inc. dbX Analytics System Introduction · XtremeData's dbX cost-effectively enables wide, deep, and unrestricted querying of structured data. Price An Appliance priced](https://reader033.fdocuments.net/reader033/viewer/2022060415/5f12fb02c91b8b5958510aba/html5/thumbnails/3.jpg)
Analytic queries - an unmet need
Extended
Enterprise
Archived
Data
Normative
Data
Operational
Systems
Data Sources Enterprise Information Continuum
FunctionalReporting
Decision Support
OLAP ROLAP
Historical ReportingApplications
Real-time Applications
Enterprise Data Feeds
Major Trends Driving BI Consequences
Becoming more real time & pervasive & even Mission Critical
Managing service levels for BI applications is increasingly becoming a challenge
Enterprise wide in scope with increasing data volume and data access needs
Educated users want more data - want ad hoc access/unconstrained exploration of larger data sets
Customer/partner focused - business process differentiation; allowing outside users access to previously internal data
Pre-Defined queries no longer work; outside users can not or will not tell you how they want to use/access the data. They just want to use it.
Information Hub
Operational Data Store
Data Mart
Data Mart
Data Mart
Data Warehouse
3
![Page 4: XtremeData Inc. dbX Analytics System Introduction · XtremeData's dbX cost-effectively enables wide, deep, and unrestricted querying of structured data. Price An Appliance priced](https://reader033.fdocuments.net/reader033/viewer/2022060415/5f12fb02c91b8b5958510aba/html5/thumbnails/4.jpg)
Addressing Key Market Problems
Solutions today “restrict” access to data based on pre-defined query patterns to get performance.
Analysts and Data researchers are demanding “unrestricted” ad hoc access to find the business value inside their data – reference “Competing on Analytics”
Customers are looking for cost take-out and low TCO. Major vendors today are Expensive.
Customers are looking to bring longer time-series, archived data, and low-information density data into play for analytics, at a justifiable ROI.
4
![Page 5: XtremeData Inc. dbX Analytics System Introduction · XtremeData's dbX cost-effectively enables wide, deep, and unrestricted querying of structured data. Price An Appliance priced](https://reader033.fdocuments.net/reader033/viewer/2022060415/5f12fb02c91b8b5958510aba/html5/thumbnails/5.jpg)
What’s missing?
Legacy solutions are not designed for analytic (ad hoc) access to large scale data
Existing large-scale BI solutions are too expensive (>$100K/TB)
Cheaper solutions (<$50K/TB) cannot perform analytics with acceptable execution speed
5
![Page 6: XtremeData Inc. dbX Analytics System Introduction · XtremeData's dbX cost-effectively enables wide, deep, and unrestricted querying of structured data. Price An Appliance priced](https://reader033.fdocuments.net/reader033/viewer/2022060415/5f12fb02c91b8b5958510aba/html5/thumbnails/6.jpg)
Key Difference
Only XtremeData offers an appliance purpose built for ad hoc analysis of large data sets.
6
![Page 7: XtremeData Inc. dbX Analytics System Introduction · XtremeData's dbX cost-effectively enables wide, deep, and unrestricted querying of structured data. Price An Appliance priced](https://reader033.fdocuments.net/reader033/viewer/2022060415/5f12fb02c91b8b5958510aba/html5/thumbnails/7.jpg)
XtremeData Approach
Start with a commodity Linux Cluster
Add direct-attached distributed storage
Add high-speed interconnect network (InfiniBand)
Add a re-engineered open-source DB engine (postgreSQL) that:
Supports a shared-nothing, parallel query execution model
Supports hardware acceleration via “SQL on a Chip”
Price
Performance
7
![Page 8: XtremeData Inc. dbX Analytics System Introduction · XtremeData's dbX cost-effectively enables wide, deep, and unrestricted querying of structured data. Price An Appliance priced](https://reader033.fdocuments.net/reader033/viewer/2022060415/5f12fb02c91b8b5958510aba/html5/thumbnails/8.jpg)
XtremeData Approach(cont.)
…and it morphs into an SQL Super-Computer delivering:
Scalable, shared-nothing MPP architecture
Hardware accelerated parallel SQL processing
Efficient, zero-copy, data exchange on high bandwidth network
Dynamic load balancing at run-time
8
![Page 9: XtremeData Inc. dbX Analytics System Introduction · XtremeData's dbX cost-effectively enables wide, deep, and unrestricted querying of structured data. Price An Appliance priced](https://reader033.fdocuments.net/reader033/viewer/2022060415/5f12fb02c91b8b5958510aba/html5/thumbnails/9.jpg)
Performance Results
Financial institution, household credit database: large tables, N-way Joins, Aggregations.
Result: On 8xNode system (MSRP $600,000): 33 min 42 sec. ~16x faster than existing solution, with estimated MSRP of $1,500,000.
National Laboratory, Graph traversal to discover linkages: single narrow long table, N-way self-Joins.
Result: On 8xNode system (MSRP $600,000): 1 hour 25 mins. ~ 5x faster than closest competitor, system configuration and MSRP unknown.
National Laboratory, Web page ranking (PageRank): rank computation, iterative traversal with updates.
Result: On 8xNode system (MSRP $600,000); 2x-8x faster than closest competitor, with estimated MSRP of $1,200,000.
Sampling of performance results from recent Proof-of-Concept engagements:
9
![Page 10: XtremeData Inc. dbX Analytics System Introduction · XtremeData's dbX cost-effectively enables wide, deep, and unrestricted querying of structured data. Price An Appliance priced](https://reader033.fdocuments.net/reader033/viewer/2022060415/5f12fb02c91b8b5958510aba/html5/thumbnails/10.jpg)
Differentiation
Utility
Only XtremeData's dbX cost-effectively enables wide, deep, and unrestricted querying of structured data.
Price
Only XtremeData's dbX enables Petabyte-scale analytic solutions that were previously cost-prohibitive - An appliance priced at $20K/Usable TB
Performance
Only XtremeData's dbX delivers 4-30x faster results than competing products at 1/3 the investment
Green
Only XtremeData's dbX provides power and cooling advantages with an unmatched carbon footprint.
10
![Page 11: XtremeData Inc. dbX Analytics System Introduction · XtremeData's dbX cost-effectively enables wide, deep, and unrestricted querying of structured data. Price An Appliance priced](https://reader033.fdocuments.net/reader033/viewer/2022060415/5f12fb02c91b8b5958510aba/html5/thumbnails/11.jpg)
DW/BI Industry trends
Unique among all vendors in the market today: the dbX architecture leverages the best of all industry trends...We can sustain our price/performance advantage into the future
Time TimeTime
Storage
DAS:
Shared-nothing
MPP...
SAN
Cost Performance
COTS Servers
Moore’s
Law – x86 CPU
Speed
NetworkPerformance
RDBMS Engine
Moore’s Law: H/W
density:
x86 CPU and FPGA
S/W performance:
In-efficient Usage of
Multi-core CPUs
x86 +FPGA
Acceleration
Time
Hardware Software
Network
InfiniBand
Ethernet
FC
The four components of a data analysis system are:
Hardware (3x): Storage, Computing (Servers) , Interconnect Network
Software (1x): Relational DataBase Management System (RDBMS)
11
![Page 12: XtremeData Inc. dbX Analytics System Introduction · XtremeData's dbX cost-effectively enables wide, deep, and unrestricted querying of structured data. Price An Appliance priced](https://reader033.fdocuments.net/reader033/viewer/2022060415/5f12fb02c91b8b5958510aba/html5/thumbnails/12.jpg)
Architecture
Engineered from first principles for large database analytics
Breakthrough price/performance (16xNodes): $20K/TB at 0.4TB/min query processing
Provides clear value vs. competitive solutions (Teradata, Netezza, Oracle, GreenPlum)
Query performance (SQL processing)
Load performance (1-4 TB/hour)
Processing density (storage & performance/sq. foot)
Carbon footprint (performance/watt)
12
![Page 13: XtremeData Inc. dbX Analytics System Introduction · XtremeData's dbX cost-effectively enables wide, deep, and unrestricted querying of structured data. Price An Appliance priced](https://reader033.fdocuments.net/reader033/viewer/2022060415/5f12fb02c91b8b5958510aba/html5/thumbnails/13.jpg)
Node from Appliance
FPGA In-Socket Accelerator (ISA) from Node
Under the hood
dbX System Data Node Features SQL on a Chip
Head Node HP DL385 with SAS Storage Approved HP Accelerator
IB Switch Quad-Core CPU; IB Connection Contains SQL in Hardware
Data Nodes Patented In-Socket Accelerator Plugs into CPU Socket
dbX: DB Appliance
13
Head
Data
x4
Data
x4
![Page 14: XtremeData Inc. dbX Analytics System Introduction · XtremeData's dbX cost-effectively enables wide, deep, and unrestricted querying of structured data. Price An Appliance priced](https://reader033.fdocuments.net/reader033/viewer/2022060415/5f12fb02c91b8b5958510aba/html5/thumbnails/14.jpg)
Patented “SQL on a Chip”
Replaces CPU in COTS Tier 1 Servers... Qualified by HP (Accelerator Program)
SQL operations accelerated “under the hood” in silicon Big table data movement: Loads/Unloads/Scans
Big operators: Joins, Sorts, GroupBy, OrderBy,...
10x the Performance at 1/3rd the power of CPU
Leverages rapidly advancing cutting edge FPGA technology in patented way
14
![Page 15: XtremeData Inc. dbX Analytics System Introduction · XtremeData's dbX cost-effectively enables wide, deep, and unrestricted querying of structured data. Price An Appliance priced](https://reader033.fdocuments.net/reader033/viewer/2022060415/5f12fb02c91b8b5958510aba/html5/thumbnails/15.jpg)
Configurations
Model dbX 1008 dbX 1016 dbX 1028 dbX1060
Rack (Standard 24x40) 1 (50%) 1 2 4
Data Nodes
(Opteron + SQL on a Chip)8 16 28 60
User Data (1TB drives) 30TB 60TB 105TB 225TB
15
![Page 16: XtremeData Inc. dbX Analytics System Introduction · XtremeData's dbX cost-effectively enables wide, deep, and unrestricted querying of structured data. Price An Appliance priced](https://reader033.fdocuments.net/reader033/viewer/2022060415/5f12fb02c91b8b5958510aba/html5/thumbnails/16.jpg)
Summary: Applications for dbX
Enterprise Information Continuum
Research/Profiling Data Analysis Long Running Queries Ad Hoc / Analytic Queries
Adjunct to existing environments
Complement legacy investments in DW infrastructure
Shift analytic query workload away from the current environment -better leverage legacy environment for operational BI applications
“Sand box” for research, analysis, ad hoc…..
SQL based analytics
Information Hub
Operational Data Store
Data Mart
Data Mart
Data Mart
Data Warehouse
16
![Page 17: XtremeData Inc. dbX Analytics System Introduction · XtremeData's dbX cost-effectively enables wide, deep, and unrestricted querying of structured data. Price An Appliance priced](https://reader033.fdocuments.net/reader033/viewer/2022060415/5f12fb02c91b8b5958510aba/html5/thumbnails/17.jpg)
Filling clear market needs for Usability
XtremeData's dbX cost-effectively enables wide, deep, and unrestricted querying of structured data.
Price
An Appliance priced at $20TB/Usable TB, only XtremeData's dbXenables Petabyte-scale analytic solutions that were previously cost-prohibitive
Performance
XtremeData's dbX delivers 10x faster results than competing products at 1/3 the investment
Environment (energy-efficiency)
XtremeData's dbX provides power and cooling advantages with an unmatched competitive carbon footprint.
17