OPTO 6124 Perception, Scott Stevenson: Image Segmentation
What is really behind so many perception demos? Perception demos show us that our visual understanding of the world involves a lot of “filling in” of information in order to reach knowledge of objects and their relationships. Often, visual information is sparse or ambiguous. The demos are surprising to us because we don’t often realize how much guesswork or “top down processes” are involved. The process of finding objects in an image is called Image Segmentation.

Perception is a hard problem to solve! In 1966, Artificial Intelligence pioneer Marvin Minsky at MIT asked his undergraduate student Gerald Jay Sussman to “spend the summer linking a camera to a computer and getting the computer to describe what it saw” (Szeliski 2009, Computer Vision). This turns out to be one of the hardest problems in AI.

As an example, compare the ability of humans and computers to match faces across age. Andy Adler of Carleton University in Ottawa, Canada ran an experiment on humans and machines to see which did better at matching faces.
Humans used a web interface to say “same person” or “different person” for a large set of faces. Several computer programs made the same comparisons.
This plot shows the error rates for humans and machines. Lower numbers mean better performance. Humans (data shown by the dots) generally made few errors. Each solid curve is a different program, and the newest programs make fewer errors. Software is just now catching up with humans at this basic, everyday task.
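The slide does not say exactly which error measure was plotted. As a rough sketch, assuming the two error rates commonly reported for "same person / different person" judgments (the false match rate and the false non-match rate), the arithmetic looks like this. The function name `error_rates` and the judgments listed are hypothetical, made up only for illustration; they are not data from the experiment.

```python
def error_rates(judgments):
    """Compute (false match rate, false non-match rate).

    `judgments` is a list of (truly_same, said_same) pairs of booleans,
    one pair per face comparison.
    """
    said_for_same = [said for truth, said in judgments if truth]
    said_for_diff = [said for truth, said in judgments if not truth]
    # False non-match: the faces were the same person, but the answer was "different".
    false_non_match = sum(1 for said in said_for_same if not said) / len(said_for_same)
    # False match: the faces were different people, but the answer was "same".
    false_match = sum(1 for said in said_for_diff if said) / len(said_for_diff)
    return false_match, false_non_match


# Hypothetical judgments (illustration only, not data from the experiment).
judgments = [
    (True, True), (True, True), (True, True), (True, False),
    (False, False), (False, False), (False, False), (False, False), (False, True),
]
fmr, fnmr = error_rates(judgments)
print(f"false match rate: {fmr:.2f}, false non-match rate: {fnmr:.2f}")
```

With these made-up judgments, one of five different-person pairs is called "same" and one of four same-person pairs is called "different". Lower values on both rates mean better performance, for an observer or for a program.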
Image Segmentation

Image segmentation refers to the problem of sorting raw image data into distinct objects: seeing “things” instead of just areas of color and orientation. In computer vision, this is recognized as an extremely difficult problem. For humans, it is almost effortless. Consider this image randomly pulled from the Internet using the search phrase “kitchen drawer clutter”:
https://oldworldgardenfarms.files.wordpress.com/2014/09/kitchen-drawer.jpg

If I ask you to reach for the slotted spatula, after a few moments of scanning you will identify the object, then orient your hand to grab the handle. Robots on assembly lines have to solve this problem when reaching for a part from a bin full of jumbled items. It is an extremely difficult problem for artificial intelligence.
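To get a feel for why clutter is so hard for a machine, here is a minimal sketch of about the simplest segmentation scheme one could try: call every pixel brighter than a threshold "object", then group touching object pixels into regions (connected-component labeling). This is an illustration under stated assumptions, not a method from the lecture or from any particular vision system. The function `label_regions` and the two tiny images are hypothetical.

```python
from collections import deque


def label_regions(image, threshold=0.5):
    """Label 4-connected regions of pixels brighter than `threshold`.

    Returns (labels, count). `labels` has the same shape as `image`,
    with 0 for background and 1..count marking each region found.
    """
    rows, cols = len(image), len(image[0])
    labels = [[0] * cols for _ in range(rows)]
    count = 0
    for r in range(rows):
        for c in range(cols):
            if image[r][c] <= threshold or labels[r][c]:
                continue  # background pixel, or already assigned to a region
            count += 1
            labels[r][c] = count
            queue = deque([(r, c)])
            while queue:  # flood fill outward from the seed pixel
                y, x = queue.popleft()
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and image[ny][nx] > threshold and not labels[ny][nx]):
                        labels[ny][nx] = count
                        queue.append((ny, nx))
    return labels, count


# Hypothetical toy images: two bright "objects" on a dark background.
separate = [
    [1, 1, 0, 0, 0],
    [1, 1, 0, 0, 0],
    [0, 0, 0, 1, 1],
    [0, 0, 0, 1, 1],
]
# The same two objects, now touching, as items in a cluttered drawer would.
touching = [
    [1, 1, 0, 0, 0],
    [1, 1, 1, 0, 0],
    [0, 0, 1, 1, 1],
    [0, 0, 0, 1, 1],
]
_, n_separate = label_regions(separate)
_, n_touching = label_regions(touching)
print(n_separate, n_touching)
```

The scheme finds two regions when the objects are apart and only one when they touch. It has no idea where one object ends and the next begins, which is exactly what you work out almost effortlessly when you look into the drawer.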
Perception doesn’t always get it right: The following images demonstrate how your own visual system sometimes makes “errors” or has conflicting answers or has difficulty with this problem. Sometimes, we see familiar shapes in random configurations, a phenomenon called pareidolia, meaning “wrong image”. The Rorschach Test uses pareidolia to probe the psyche of patients in psychoanalysis.
Rorschach’s original image 3
Meow! (from Dr. Bedell, source unknown)
Google image search, “Jesus in a tortilla”
We see organization in repeating patterns. These can shift and reorganize dynamically.
Source unknown
We see shapes and ignore backgrounds, even if those backgrounds are shapes. Zoom in on the image, look between the pillars.
This is a set of actual pillars at the Exploratorium in San Francisco
In situations with very poor information, it can take a long time to reach an organized perception. Do you see the animal?
“Dalmatian” from Marr 1982
Sometimes we fill in missing information, like edges that are probably there even though they have no contrast in the image. These are called “illusory contours.” Many people see a solid white triangle with distinct edges, lying in front of the outlined triangle and discs. The solid white triangle looks a little brighter than the background.
“Kanizsa Triangle” by Gaetano Kanizsa, 1955
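You can check the "no contrast" claim for yourself with a small sketch. The code below draws a simplified Kanizsa-style figure (three black discs with wedges cut out, on a white background, leaving out the outlined triangle) and measures the contrast across one side of the implied triangle. The image size, vertex positions, and disc radius are arbitrary assumptions chosen for illustration, and the helper names are hypothetical.

```python
def side(p, a, b):
    """Signed area test: tells which side of the line a->b the point p is on."""
    return (b[0] - a[0]) * (p[1] - a[1]) - (b[1] - a[1]) * (p[0] - a[0])


def inside_triangle(p, a, b, c):
    """True if p lies inside (or on the border of) triangle a, b, c."""
    d = (side(p, a, b), side(p, b, c), side(p, c, a))
    return all(v <= 0 for v in d) or all(v >= 0 for v in d)


def kanizsa(size, vertices, radius):
    """White image (1) with a black disc (0) at each triangle vertex.

    The part of each disc that falls inside the triangle is left white,
    which produces the "pac-man" shapes of the Kanizsa figure.
    """
    img = [[1] * size for _ in range(size)]
    for y in range(size):
        for x in range(size):
            in_disc = any((x - vx) ** 2 + (y - vy) ** 2 <= radius ** 2
                          for vx, vy in vertices)
            if in_disc and not inside_triangle((x, y), *vertices):
                img[y][x] = 0
    return img


# Arbitrary layout (x, y) with y increasing downward; the bottom side is row 32.
A, B, C = (8, 32), (32, 32), (20, 8)
img = kanizsa(41, (A, B, C), 5)

# Contrast across the bottom side: one row inside the triangle (y = 31)
# compared with one row outside it (y = 33), column by column.
contrast = {x: abs(img[33][x] - img[31][x]) for x in range(A[0], B[0] + 1)}
print(contrast[10], contrast[20])
```

Near a disc (column 10) there is a real luminance step across the side of the triangle. Midway between the discs (column 20) the pixels on both sides are the same white, so a purely contrast-based edge detector would find nothing there. The edge you see along that stretch is supplied by your visual system.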
The inferences made about an image have a strong influence on how missing information gets filled in. Is the wireframe cube in front of black discs? Or is it behind a set of holes?
Kanizsa Necker Cube, source unknown.
Impossible figures, like those made famous by M. C. Escher, pit local solutions to image segmentation against global solutions. In this figure, the missing information is shading, stereoscopic 3D, and motion parallax. Without these, the brain makes a best guess at the three-dimensional shapes based on edges. The artist has cleverly created ambiguity from one end of the object to the other.
Source unknown. From a Google image search on “Multistable Perception”