GOOGLE BIGTABLE


Hans Vatne Hansen

Typical traits

● Avoid join operations

● Scale horizontally

● No ACID guarantees

Other NoSQL Systems

● Apache's HBase: modeled after Bigtable

● Apache's Cassandra: used by Facebook, Digg, Reddit, Rackspace (ISP), and Cloudkick

● LinkedIn's Project Voldemort

● Amazon's Dynamo

Comparison of NoSQL Systems

Bigtable

● Similar to a database, but not a full relational data model

● Data is indexed using row and column names

● Treats data as uninterpreted strings

● Clients can control the locality of their data

● Built on GFS

● Uses MapReduce

● Used by over 60 Google products

Servers

Data Model

● A cluster is a set of machines with Bigtable processes

● Each Bigtable cluster serves a set of tables

● A table is a sparse, distributed, persistent multidimensional sorted map

● The data in the tables is organized into three dimensions:

● Rows, Columns, Timestamps

● (row:string, column:string, time:int64) → string

● A cell is the storage referenced by a particular row key, column key and timestamp
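As a rough illustration of the (row, column, time) → string mapping, the table can be pictured as a Python dictionary keyed by a three-part tuple. This is only a toy model under simplifying assumptions: the names `table`, `put` and `get_latest`, and the sample row and column keys, are made up for this sketch and are not Bigtable's API.

```python
# Toy model of the Bigtable data model: (row, column, timestamp) -> string.
# All names and sample data here are hypothetical; this is not Bigtable's API.

table = {}  # sparse: only cells that were actually written take up space


def put(row, column, timestamp, value):
    """Store one version of a cell."""
    table[(row, column, timestamp)] = value


def get_latest(row, column):
    """Return the value with the highest timestamp for a cell, or None."""
    versions = [(ts, value) for (r, c, ts), value in table.items()
                if r == row and c == column]
    if not versions:
        return None
    # Timestamps are unique per cell, so max() picks the newest version.
    return max(versions, key=lambda pair: pair[0])[1]


put("com.cnn.www", "contents:", 3, "<html>v3</html>")
put("com.cnn.www", "contents:", 5, "<html>v5</html>")
put("com.cnn.www", "language:", 5, "EN")

print(get_latest("com.cnn.www", "contents:"))  # <html>v5</html>
```

A real table is sorted and spread over many machines; the dictionary only captures the shape of the key and the fact that values are uninterpreted strings.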

Example Slice of a Table

Rows

● Bigtable maintains data in lexicographic order by row key

● The row keys in a table are arbitrary strings

● Rows are the unit of transactional consistency

● Contiguous ranges of rows are grouped into tablets, which are the unit of distribution; rows in the same tablet are stored close to each other

● Reads of short row ranges are efficient and typically require communication with only one or a few machines

● Remember the reversed URIs in the example?
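The effect of reversed URIs on locality can be sketched in a few lines of Python. Because row keys are kept in sorted order, reversing the host name places pages from the same domain next to each other, so a short range scan covers them. The helper names `reverse_host` and `scan_prefix` and the sample host names are assumptions made for this sketch.

```python
from bisect import bisect_left


def reverse_host(host):
    """Turn 'maps.google.com' into 'com.google.maps'."""
    return ".".join(reversed(host.split(".")))


def scan_prefix(sorted_keys, prefix):
    """Return all keys that start with prefix, using the sort order."""
    start = bisect_left(sorted_keys, prefix)
    result = []
    for key in sorted_keys[start:]:
        if not key.startswith(prefix):
            break  # keys are sorted, so no later key can match
        result.append(key)
    return result


# Hypothetical sample hosts, stored under their reversed names.
hosts = ["maps.google.com", "www.cnn.com", "mail.google.com", "www.uio.no"]
row_keys = sorted(reverse_host(h) for h in hosts)

print(row_keys)
# ['com.cnn.www', 'com.google.mail', 'com.google.maps', 'no.uio.www']
print(scan_prefix(row_keys, "com.google."))
# ['com.google.mail', 'com.google.maps']
```

With the original host names as keys, `mail.google.com` and `maps.google.com` would be separated by every other host starting with "m", and a domain-wide read would touch many more rows.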

Columns

● Column keys are grouped into column families

● A column key is named with syntax → family:qualifier

● Data stored in a column family is usually of the same type

● An example column family for our previous example is language, which stores the language in which a web page was written.
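The family:qualifier naming can be illustrated with a small Python sketch that splits column keys and groups them by family. The function name `split_column_key` and the sample keys are assumptions for illustration; the `language` family is shown with an empty qualifier, and `anchor` is a second, hypothetical family with one qualifier per referring site.

```python
def split_column_key(column_key):
    """Split 'family:qualifier' into (family, qualifier).

    The qualifier may be empty, as in 'language:'.
    """
    family, _, qualifier = column_key.partition(":")
    return family, qualifier


# Hypothetical column keys for a web-page table.
column_keys = ["language:", "anchor:cnnsi.com", "anchor:my.look.ca"]

by_family = {}
for key in column_keys:
    family, qualifier = split_column_key(key)
    by_family.setdefault(family, []).append(qualifier)

print(by_family)
# {'language': [''], 'anchor': ['cnnsi.com', 'my.look.ca']}
```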

Timestamps

● A cell can hold multiple versions of the data

● Timestamps can be set by Bigtable or client applications

● Versions are stored in decreasing timestamp order, so the most recent version is read first

● Garbage-collection

● Based on items (last x versions)

● Based on time (last seven days)

● In the example, three versions of a web page were kept
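The two garbage-collection policies can be sketched as simple filters over the versions of one cell. This is a toy Python illustration, not Bigtable's implementation: the function names `keep_last_n` and `keep_newer_than` are hypothetical, and timestamps are plain day numbers to keep the example readable.

```python
def keep_last_n(versions, n):
    """Item-based policy: keep only the n newest (timestamp, value) pairs."""
    newest_first = sorted(versions, key=lambda pair: pair[0], reverse=True)
    return newest_first[:n]


def keep_newer_than(versions, now, max_age):
    """Time-based policy: keep versions written within the last max_age units."""
    recent = [(ts, value) for ts, value in versions if now - ts <= max_age]
    return sorted(recent, key=lambda pair: pair[0], reverse=True)


# Hypothetical cell history; timestamps are day numbers in this sketch.
versions = [(1, "v1"), (2, "v2"), (3, "v3"), (9, "v9")]

print(keep_last_n(versions, 3))
# [(9, 'v9'), (3, 'v3'), (2, 'v2')]
print(keep_newer_than(versions, now=10, max_age=7))
# [(9, 'v9'), (3, 'v3')]
```

Both functions return the surviving versions newest first, matching the order in which versions of a cell are read.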

Application Programming Interface

● Create and delete tables and column families

● Change cluster, table, and column family meta-data (such as access control rights)

● Write or delete values

● Look up values from individual rows

● Iterate over a subset of the data in a table

● Data transformation (filtering)

● I/O of MapReduce jobs

● Quick summary of GFS and this presentation

● More about the building blocks of Bigtable

● SSTable

● Tablet locations and assignments

● Compaction

● Shrink memory usage

● Reduce data in commit log

● Compression

● Zippy

● BMDiff

● Caching
