Parallella: The Most Energy Efficient Supercomputer on the Planet

Post on 15-Apr-2017

678 views 1 download

Transcript of Parallella: The Most Energy Efficient Supercomputer on the Planet

Madison+ Ruby Madison, WI, USA August 21-22, 2015

The Most Energy Efficient Supercomputer on the Planet

Moore’s Law: 2x every 18 months

Moore’s Law: 2x every 18 months

Moore’s Law 1993 - 2013 http://www.washingtonpost.com/blogs/innovations/wp/2015/04/14/10-images-that-explain-the-incredible-power-of-moores-law/

https://www.parallella.org/board/

RayHightower.com

RJ-45

Power

μUSB

μHDMI

μSD

RISC

ARM

18 cores: 2 ARM + 16 RISC

Watts => Dollars.

17.8 million watts $17.8 million per year

5 watts for Parallella?

http://rayhightower.com/blog/2014/09/09/solar-powered-parallella/

http://rayhightower.com/blog/2014/09/09/solar-powered-parallella/

http://rayhightower.com/blog/2014/09/09/solar-powered-parallella/

5 volts 1 amp 5 watts

Solar!

If one ox could not do the job they did not try to grow a bigger ox, but used two oxen. When we need greater computer power, the answer is not to get a bigger computer, but to build systems of computers and operate them in parallel.

-Grace Hopper

Reduced Instruction Set Computer

RISC

ARM A9

RISC

ARM

18 cores: 2 ARM + 16 RISC

Hello Epiphany

host.c

epiphany.c

Find all primes up to 16,000,000. Serial on Parallella.

1 #include <stdio.h> 2 #include <stdlib.h> 3 #include <math.h> 4 #include <time.h> 5 6 #define DEFAULT_MAX_TESTS 16000000 7 8 inline int isprime(unsigned long number) 9 {10 unsigned long i;11 unsigned long s = sqrt(number);12 for(i=3;i<=s;i+=2)13 {14 if(number % i == 0)15 return 0;16 }17 return 1;18 }

/* Copyright (c) Adapteva, contributed by M. Thompson with modifications by T. Malthouse. */

8 inline int isprime(unsigned long number) 9 {10 unsigned long i;11 unsigned long s = sqrt(number);12 for(i=3;i<=s;i+=2)13 {14 if(number % i == 0)15 return 0;16 }17 return 1;18 }

/* Copyright (c) Adapteva, contributed by M. Thompson with modifications by T. Malthouse. */

Find all primes up to 16,000,000. Serial on Mac OS X.

Same serial code, written in C.

Build it on OS X.

Find all primes up to 16,000,000. Parallel on Parallella.

27 #include <e-hal.h>28 29 // Default max number of prime tests per core30 // Used if a limit it not provided in argv[1]31 #define DEFAULT_MAX_TESTS 50000032 33 int main(int argc, char *argv[])34 {35 unsigned row, col, coreid, i, j;36 e_platform_t platform;37 e_epiphany_t dev;38

/* Copyright (c) Adapteva, contributed by M. Thompson with modifications by T. Malthouse.*/

Serial/Parallella:

Serial/Mac:

Parallel/Parallella:

237.1 sec

14.4 sec

18.6 sec

Summary: Finding Primes

18 cores (using one). Up to 1GHz. $150.00

4 cores (using one). 2.7GHz. $2,000.00

18 cores. Up to 1GHz. $150.00

Mandelbrot Set

Why?

Finite Element Analysis

http://www.ce.ncsu.edu/news/article/21550/making-bridges-more-robust-to-earthquakes/

break?

wear out?

work?

Finite Element Analysis

Will it{

Free Body Diagram

FF appliedfriction

Fgravity

Fnormal

Finite Element Analysis

http://www.ce.ncsu.edu/news/article/21550/making-bridges-more-robust-to-earthquakes/

https://www.e-education.psu.edu/worldofweather/node/2029

Grid spacing influences accuracy.

Weather Prediction

Parallella vs GPU

384 cores

MacBook Pro Video

Specialized vs. General

Thanks!RayHightower.com