Vldb PhD workshop 2016

77
Method to My Madness Tim Kraska

Transcript of Vldb PhD workshop 2016

Page 1: Vldb PhD workshop 2016

MethodtoMyMadnessTimKraska

Page 2: Vldb PhD workshop 2016

TakeeverythingIsaywithagrainofsalt

Disclaimer

Page 3: Vldb PhD workshop 2016

TopicsIworkedon

• XML• CloudComputing• Visualization• TransactionProcessing• Geo-Replication• ModernNetworks• GPUs• Probabilistic PricingModels forMarketPlaces

• DataIntegration• Crowd-Sourcing• SystemsforMachineLearning• GraphProcessing• …

Page 4: Vldb PhD workshop 2016

TopicsIworkedon

• XML• CloudComputing• Visualization• TransactionProcessing• Geo-Replication• ModernNetworks• GPUs• Probabilistic PricingModels forMarketPlaces

• DataIntegration• Crowd-Sourcing• SystemsforMachineLearning• Graphprocessing• …

TopicsIdidNOTworkon• SkylineProcessing

Page 5: Vldb PhD workshop 2016

TopicsIworkedon

• XML• CloudComputing• Visualization• TransactionProcessing• Geo-Replication• ModernNetworks• GPUs• Probabilistic PricingModels forMarketPlaces

• DataIntegration• Crowd-Sourcing• SystemsforMachineLearning• Graphprocessing• …

TopicsIdidNOTworkon• SkylineProcessing

Page 6: Vldb PhD workshop 2016

Furthermore

• IwantedtodoaPhDineconomics

Page 7: Vldb PhD workshop 2016

Furthermore

• IwantedtodoaPhDineconomics

à GladlyIdidn’t

Page 8: Vldb PhD workshop 2016

Furthermore

• IwantedtodoaPhDineconomics

• IneverwantedtoliveintheUS

à GladlyIdidn’t

Page 9: Vldb PhD workshop 2016

Furthermore

• IwantedtodoaPhDineconomics

• IneverwantedtoliveintheUS

à GladlyIdidn’t

à NowIamthereforalmost7years

Page 10: Vldb PhD workshop 2016

Furthermore

• IwantedtodoaPhDineconomics

• IneverwantedtoliveintheUS

• Ineverwantedtobeaprofessor

à GladlyIdidn’t

à NowIamthereforalmost7years

Page 11: Vldb PhD workshop 2016

Furthermore

• IwantedtodoaPhDineconomics

• IneverwantedtoliveintheUS

• Ineverwantedtobeaprofessor

à GladlyIdidn’t

à NowIamthereforalmost7years

àOK,thatoneIamstillevaluating

Page 12: Vldb PhD workshop 2016

Furthermore

• IwantedtodoaPhDineconomics

• IneverwantedtoliveintheUS

• Ineverwantedtobeaprofessor

• Ididn’tconsiderviz-relatedworkasdeep

à GladlyIdidn’t

à NowIamthereforalmost7years

àOK,thatoneIamstillevaluating

Page 13: Vldb PhD workshop 2016

Furthermore

• IwantedtodoaPhDineconomics

• IneverwantedtoliveintheUS

• Ineverwantedtobeaprofessor

• Ididn’tconsiderviz-relatedworkasdeep

à GladlyIdidn’t

à NowIamthereforalmost7years

àOK,thatoneIamstillevaluating

à Now,wedoalotofworkinthatarea

Page 14: Vldb PhD workshop 2016

Furthermore

Notuncommonforanacademiccareerà Impossibletopredict

Page 15: Vldb PhD workshop 2016

Whythoseplaces?

Page 16: Vldb PhD workshop 2016

Outline

• ChapterI:Finding/ChangingaResearchAgenda• ChapterII:Preparinganbeingonthejobmarket• ChapterIII:Managingstudents• ChapterIV:Work/Lifebalance• ChapterV:Whybeingaprofessoris(oneof)thebestjobsintheworld

Page 17: Vldb PhD workshop 2016

Finding(Changing)aResearchAgenda

ChapterI

• ForyourPhD• ForyourPostDoc• ForyourTenureCase• …

Page 18: Vldb PhD workshop 2016

WhyXML

Page 19: Vldb PhD workshop 2016

MyFirstWork

XML

Page 20: Vldb PhD workshop 2016

Ievenjoined

Page 21: Vldb PhD workshop 2016

Mythesiswasplannedout

Page 22: Vldb PhD workshop 2016
Page 23: Vldb PhD workshop 2016
Page 24: Vldb PhD workshop 2016

Keepinmindthatwas2007

• Itwasstillnotclearifcloudisreallyathing• RememberLarryEllison:• "MaybeI'manidiot,butIhavenoideawhatanyoneistalkingabout.Whatisit?It'scompletegibberish.It'sinsane.Whenisthisidiocygoingtostop?”• “Uh,Idon'tunderstandwhatwewoulddodifferentlyinthelightofcloudcomputing,otherthanmarket...youknow,changethewordingonsomeofourads.”

….itwasthebestdecisionImade

Page 25: Vldb PhD workshop 2016

ManmussFeste feiern wie sie fallen*(Celebratewhenthechance isthereoronemustmakehaywhenthesun shines)

*DonaldKossmann

Page 26: Vldb PhD workshop 2016

ManmussFeste feiern wie sie fallen(Celebratewhenthechance isthereoronemustmakehaywhenthesun shines)

LessonI:Beopportunistic

Page 27: Vldb PhD workshop 2016

TwoMainOpportunitiesItook

• Cloudcomputing

• CollaborationwithAndyvanDamatBrownà InteractiveDataExploration

ManyothersIrejected(sometimesforgood,sometimesstupidly,e.g.,Spark)

Page 28: Vldb PhD workshop 2016

Currentopportunisticopportunities

• Modernhardware(GPUs,fastnetworks,newtypesofmemory,…)• Cyborgidea(seeAnand Rajaraman’s keynote)• Deeplearning???• Betterintegrationofhumans• …

Page 29: Vldb PhD workshop 2016

Thatdoesnotmeanyoushouldrunaftereverynewcarrot

Page 30: Vldb PhD workshop 2016

LessonII:Focusontheffuunnddaammeennttaall iissssuueess

Page 31: Vldb PhD workshop 2016

Example:UnicornHardware

NVM

Page 32: Vldb PhD workshop 2016

FFiirrssttRRoouunnddooffRReesseeaarrcchh

Fundamentalresearchonhowthehardwarewouldchangetheworld

SSeeccoonnddRRoouunnddooffRReesseeaarrcchh

DB+NVM+CrowdDB+NVM+SkylineDB+NVM+Cloud+Crowd

Page 33: Vldb PhD workshop 2016

FFiirrssttRRoouunndd::RRaaddiiooaaccttiivviittyy

Page 34: Vldb PhD workshop 2016

SSeeccoonnddRRoouunndd::RRaaddiiooaaccttiivviittyy

Page 35: Vldb PhD workshop 2016

BDS3Howtousecloud-basedstorageasadisk

SIGMOD2008

MMyyoowwnneexxaammpplleess

Page 36: Vldb PhD workshop 2016

Page

1

Page

2

Page

3

Page

4

Page

5

Page

N

BuildingaDatabaseonS3

RecordManager

PageManager

Application

Client 1/EC2

RecordManager

PageManager

Application

ClientM/EC2

........

........

........

CouldbeexecutedonEC2orcompletelyonthe

client

Page

6

Page 37: Vldb PhD workshop 2016

BDS3Howtousecloud-basedstorageasadisk

Fundamentalquestion:Whatdoesconsistencymeanforcostandscalability

SIGMOD2008

ConsistencyRationingVLDB2009

BasicIdeausedbyseveralstartups(sonotthatstupidafterall)

MMyyoowwnneexxaammpplleess

Page 38: Vldb PhD workshop 2016

LessonIII:Donotoverthinkandstartquicklyandwithwhatyouknow

Page 39: Vldb PhD workshop 2016

SSttaarrttWWiitthhWWhhaattYYoouuKKnnooww

Examples•SQLforCrowdsourcing•UseS3asadiskfordatabases

Page 40: Vldb PhD workshop 2016

Doitfast

Asaruleformystudents:foranewtopicourgoalisoftentowritethefirstinitialpaperin3-9month• Thiscanbeavisionpaperforaworkshop,CIDR,etc.orinrarecasesafullpaper

àHelpstoarticulatethegoalsàHelpstogetearlyfeedback

Page 41: Vldb PhD workshop 2016

Whatreactiondoyouwant?

1) Yeah,thatmakessense2) Peoplearguing,fightingagainstyou,….

ReactionsonCrowdDB:

Best IdeaEver Youdidn’tciteme Whataretheysmoking inBerkeley

Page 42: Vldb PhD workshop 2016

Whatreactiondoyouwant?

1) Yeah,thatmakessense2) Peoplearguing,fightingagainstyou,….

ReactionsonCrowdDB:

Best IdeaEver Youdidn’tciteme Whataretheysmoking inBerkeley

Page 43: Vldb PhD workshop 2016

• Ifyoujumpstartanewproject,itwillnotbeperfectanditwilltaketimetorefineit.• Example:

RecalllessonII:Focusontheffuunnddaammeennttaall iissssuueess

Page 44: Vldb PhD workshop 2016

CrowdDb

MetaD

ata

Statistic

sParser

Optimizer

Executor

FileAccessMethods

WorkerRelationshipManager

UICreation

UITemplateManager

FormEditor

HITManager

Disk2

Disk1

ResultCrowdSQLSQL

44

Page 45: Vldb PhD workshop 2016

Howtointegratecrowdworkers inqueryprocessing

Fundamentalquestion:Whatisthequerysemanticsintheopenworld?

BestpaperawardatICDE2013

VLDB2011

Howtousecrowdsourcingfordatacleaning

SeveralpapersatVLDBandSIGMOD

Page 46: Vldb PhD workshop 2016

LLeessssoonnIIVV::SSiimmpplliicciittyyiissccoorreeffoorraarreesseeaarrcchhaaggeennddaa• Youshouldbeabletodescribetheprobleminlessthan30seconds.• Ideally,inasinglesentence.

Page 47: Vldb PhD workshop 2016
Page 48: Vldb PhD workshop 2016

LLeessssoonnVV::TThheerreeiissaattiimmeettoorreeaaddppaappeerrssaannddrreellaatteeddwwoorrkkbbuuttnnoottiinntthheebbeeggiinnnniinngg

Page 49: Vldb PhD workshop 2016

LLeessssoonnVVII::DDoonnoottuunnddeerr--eessttiimmaatteeooppeenn--ssoouurrcceeaanndd““mmaarrkkeettiinngg””

Page 50: Vldb PhD workshop 2016

LLeessssoonnVVIIII::PPaappeerrssaarreennootteevveerryytthhiinngg,,bbuuttaanneecceessssiittyyDisclaimer:Verycontroversial

TheadviseIgotfrommyadvisor,andhegotfromhisadvisor,is:• TheonlythingthatreallycountsareSIGMODand

VLDBpaper• Havingacertainamountisanecessitybutnot

sufficienttolandagoodacademicjob

Goalwithmystudents:1top-tierpaperperyearstartingfromyear2.

Page 51: Vldb PhD workshop 2016

LLeessssoonnVVIIIIII::DDoowwhhaattiissffuunn

Page 52: Vldb PhD workshop 2016

Preparingandbeingonthejobmarket

ChapterII

• ForyourPhD• ForyourPostDoc• ForyourProfessorship• Partiallyalsoforlandingyourdreamindustryjob

Page 53: Vldb PhD workshop 2016

UnderstandtheAcademicMarket

•USA• FixcyclestartinginDecember• Tenure-trackmodel• Differentlong-termresources• Moreflexebility

• Europe(Germany,Switzerland,UK)• Ad-hod• Veryslowprocess• Extremelyhard• Faketenure-trackmodel

Icanonlycommentonthosetwomarkets

Page 54: Vldb PhD workshop 2016

Decidingwheretoapply?

Hardconstraints

Page 55: Vldb PhD workshop 2016

LessonIX:Donotunder-estimatetheimportanceofnetworking

• Itisimportantthatyouknowpeopleinthecommunity• Evenmoreimportantisit,thattherightpeopleknowyou

Page 56: Vldb PhD workshop 2016

NetworkingTips• 30s/2min/5minrule• Donotspendthetimewithyourcolleaguesataconference• Visitcompanies/universitiesoutsideoftheconferencesandasktogiveatalk• Activelyintroduceyourself(theeasiestistoaskaquestionafteratalk)• Identifythepeople,whoshouldknowyou• Professorsatcertainplaces,companies,etc.• PhDstudentsatcertainuniversities…(yourPhDfriendsmightbecomeyourcollegues)

Don’ttrytohardà itshouldcomenaturalandwithrealinterestItalsohelpstobefriendswithveryoutgoingpeople.

Page 57: Vldb PhD workshop 2016

MostConnectionsAreMadeOutsidetheVenue

Page 58: Vldb PhD workshop 2016

LessonX:Rankingsmatter,butonlytoacertaindegreeIntheUS,everythingiscenteredaroundtheUSNewsRankingWhydoesitmatter:students!!!!But,imho:

Page 59: Vldb PhD workshop 2016

LessonX:Rankingsmatter,butonlytoacertaindegreeIntheUS,everythingiscenteredaroundtheUSNewsRankingWhydoesitmatter:students!!!!But,imho:1. Yourresearchgroupcountsmorethanthe

institution

Page 60: Vldb PhD workshop 2016

LessonX:Rankingsmatter,butonlytoacertaindegreeIntheUS,everythingiscenteredaroundtheUSNewsRankingWhydoesitmatter:students!!!!But,imho:1. Yourresearchgroupcountsmorethanthe

institution2. Youradvisor countsmorethantheresearch

group

Page 61: Vldb PhD workshop 2016

LessonX:Rankingsmatter,butonlytoacertaindegreeIntheUS,everythingiscenteredaroundtheUSNewsRankingWhydoesitmatter:students!!!!But,imho:1. Yourresearchgroupcountsmorethanyour

institution2. Youradvisor countsmorethanyourresearch

group3. Yourhappiness countsmorethanyouradvisor

Page 62: Vldb PhD workshop 2016

LessonX:Rankingsmatter,butonlytoacertaindegreeIntheUS,everythingiscenteredaroundtheUSNewsRankingWhydoesitmatter:students!!!!But,imho:1. Yourresearchgroupcountsmorethanyour

institution2. Youradvisor countsmorethanyourresearch

group3. Yourhappiness countsmorethanyouradvisorMyexample:

vs.

Page 63: Vldb PhD workshop 2016

LessonXI:LookfortherightCollaborators• You,asasingleperson,donotscale• Onewaytoscaleis,havealotofcollaboratorsyoucantrust• Important:Ithastobeamutualcollaboration• Ifyouevaluateaplace,considerpeoplewhoyoucouldcollaboratewith.Especially,lookfor• Theycollaborations inthepast• Trytotalktobothparties• Doyougetalong?Doesitclick?

• Butkeepinmind:intheend successfulcollaborationshappenbecauseyoulikeeachotherandtheyarenormallynevertheonesyouplannedtohave.

Page 64: Vldb PhD workshop 2016

Students,students,students,…

ChapterIII

• Managingbachelor students• Managingmaster students• ManagingPhD students• Managinginterns

Page 65: Vldb PhD workshop 2016

LessonXII:Studentsareyourmostvaluableasset• PhDstudents,Masterstudents,bachelorstudents,…• Carefullyselectthem• Spendtimeonthepromisingonesandlessontheothers (note,thatthisisverycontroversial)• Trytofindyourbalancebetweenhands-onandhands-off

Page 66: Vldb PhD workshop 2016

Dangeroftoohands-on

• Theymightgetfrustrated• Ittakesupalotofyourtime

Page 67: Vldb PhD workshop 2016

Dangeroftoohands-off• Ittakesmuchlonger• itendsupinengineeringratherthanresearch• ittakesmoreiterations(e.g.,re-submissions)• Thetopicderails• Ifthetopicderailstoomuch,itmightnotfitwithyourfundinganymore

Example:Letsassumeyougiveastudentatasktodesignanactionfigurewaterpistol

Page 68: Vldb PhD workshop 2016

AcademicJobRamp-Up

Mysuggestion:alwayspostponetostartyouracademicjobinspring notfall.• YoucanbeinvolvedinthePhDhiring(happeninginspring).• Youminimizethetimewithoutstudents(fallsemester)• Youcanfinishsomeofyourpreviousworkwithoutstartingthetenureclock orsimplytaketimeofftogetreadyforbeingaprofessor

Page 69: Vldb PhD workshop 2016

Work/LifeBalance

ChapterIV

Page 70: Vldb PhD workshop 2016

NNoonn--LLeessssoonn::IIssttiillllddoonnoottkknnoowwhhoowwttooddooiitt

Page 71: Vldb PhD workshop 2016

Whybeingaprofessoris(oneof)thebestjobs

ChapterIV

Page 72: Vldb PhD workshop 2016

Flexibility

Page 73: Vldb PhD workshop 2016

Freedom

Page 74: Vldb PhD workshop 2016

Travel

http://kingofwallpapers.com/travel/travel-022.jpg

Page 75: Vldb PhD workshop 2016

Community

Page 76: Vldb PhD workshop 2016

Startups

Page 77: Vldb PhD workshop 2016

ThankYouTimKraska<[email protected]>

Wedohaveseveralintern(paid6-12month),PhD,andpostdocpositionsavailable