Scaling Cloud Web & Data Technologies

Post on 23-Jan-2018

455 views 0 download

Transcript of Scaling Cloud Web & Data Technologies

www.anant.us | solutions@anant.us | 202.905.28181010 Wisconsin Ave, NW | Suite 250 | Washington, DC 20007

Streamline Organize UnifyBusiness Information

Scaling Cloud Web & Data TechnologiesDistributed Platform:

Spark, Cassandra, Kafka, Docker

• Who are these guys? (Intro)• Why Distribute Everything?• Spark: Distributed Computing• Cassandra: Distributed Data• Kafka: Distributed Queues• Docker: Simplified Code Distribution• Questions ?

Agenda

Who am I. What I did. What I do.

Built a Hosting CompanyWeb Application Dev/HostingIT Consulting w/ Firms/Agencies……………………………………...……...Management Consulting for Internet Teams + Difficult Data

What we do well.

Streamline, Organize & Unify

Business Information

What is Business Information

Internal vs. External (Public vs. Private)Strategic / Tactical / OperationalData / Information / KnowledgeDocuments / Records / Metrics

Project

InformationClient Service

Information

Corporate

Guides

Collaborative

Documents

Assets

& Files

Corporate

Assets

Phases of Business Technology

Business

Silos

Standardize

d Platform

Optimized

Core

Business

Modularity

Your Business Cloud is Disconnected

… so you and your team spend waste time ...

Bring it all together in One Place

Project

InformationClient Service

Information

Corporate

Guides

Collaborative

Documents

Assets

& Files

Corporate

Assets

Konotree

We will help you connect and organize your data.

find. analyze. act.

FIND

DISCOVER

FILTER

ANALYZE

VISUALIZE

MEASURE

ACT

USE

SHARE

Why Distribute Everything?

• Pros – Scale– Speed– Concurrency– Volume– Modularity

• Cons – Complexity– Latency– Parity

Systems Amateurity

• Gonna host a ..:

– Project Management System ?

– Chat Server ?

– Frontend / Backend Framework ?

– Blog System / CMS / eCommerce ?

– Social Network ?

Systems Amateurity

• Equivalent to restaurants making their own pots and pans

• Common in many teams - small to large

• “I can SELF-HOST EVERYTHING”

Kafka

• 20-30 Times More Throughput

• PubSub or Message Queue

• Persistent w/ Offset (Replays)

• Messaging / ESB / Integration

• Seen in real world ~ 15TB / Day

Spark

• Extremely Fast Processing

• Program in Java, Scala, Python, R, or C#

• Sophisticated Algorithms Available: PageRank in 5 lines.

• Real Time Stream Processing

• Awesome Integration with Hadoop or Other Data Sources

Cassandra

• Massive Amounts of Data Quickly

• Distributed Architecture - Master - Master

• Tunable Consistency - CAP

• CQL ~ SQL - Familiar / Similar Syntax

• Easily Scale DB with Additional Nodes

• Easily add Replicas in additional DCs

• Hybrid of Schema / Schema-less

• Low Barrier to Entry

Docker

• Derivable Images

• Immutable Containers

• Same Docker works Everywhere

• Same Compose works Everywhere

• Industry Wide Adoption

• Thousands of Docker Images Available

• Thousands of Compose Files Available

• Runs on Linux or Windows (Natively)

• Tested with * on Linux

• Tested with DotNet core on Linux• Tested with Legacy .NET/MSSQL on Windows

Docker Makes it Easy to Self Host

Docker Makes it Easy to Self Host

Docker Makes it Easy to Self Host

Appleseed Framework

What we do well.

Streamline, Organize & Unify

Business Information

Ongoing Subscription + Kickstart

KICKSTART*- Discovery*- Workshop- Architecture*- Review- Engineering*- Delivery

SUBSCRIPTION- Support- Advisory / Guidance- Maintenance*- Config / Customization

- Managed*- Setup / Train / Admin

*service optionsFree Consultation

www.anant.us | solutions@anant.us | 202.905.28181010 Wisconsin Ave, NW | Suite 250 | Washington, DC 20007

Streamlined Data

Integration / Data Pipelines

Organized Knowledge

Search / Data Warehouses

Unified Interfaces

Portals / Dashboards / Mobile

Free Consultation