Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci...

41
http://www.ics.uci.edu/~dramanan/teaching/cs116_fall 08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout Computational Photography and Vision (CS116) Deva Ramanan [email protected] Slides adapted from Alyosha Efros, Rick Szeliski and Steve Seitz

Transcript of Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci...

Page 1: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

• http://www.ics.uci.edu/~dramanan/teaching/cs116_fall08/index.html

• Check web page often• T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140• Course intro handout

Computational Photography and Vision (CS116)

Deva [email protected]

Slides adapted from Alyosha Efros, Rick Szeliski and Steve Seitz

Page 2: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

Agenda• Intros• Computational photography/vision overview• Course overview• Image processing – let’s dive in

Readings (due next class)• Online Book (link from course website): 

Richard Szeliski, Computer Vision: Algorithms and Applications – Intro: Ch 1.0 & 2.1

Ungraded HW (due next class)• MATLAB tutorial (link from course website)

Page 3: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

About me

Deva Ramanan

Relatively new faculty (2cnd class ever!)

My research focus is on computer vision

Part of new Computational Vision Lab

http://vision.ics.uci.edu/

Page 4: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

What is computational photography?Convergence of image processing, computer

vision, computer graphics and photography

Digital photography:• Simply replaces traditional sensors and recording by digital

technology• Involves only simple image processing

Computational photography• More elaborate image manipulation, more computation• New types of media (panorama, 3D, etc.)• Camera design that take computation into account

Page 5: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

GRAPHICS

What is computer graphics? (3D->2D)

3D geometry

physics

Simulation

projection

Page 6: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

What is computer vision? (2D->3D)

3D geometry

physics

Estimation

Page 7: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

What is computer vision?

Terminator 2

Page 8: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

Every picture tells a story

Goal of computer vision is to write computer programs that can interpret images

Page 9: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

Can computers match (or beat) human vision?

Yes and no (but mostly no!)• humans are much better at “hard” things• computers can be better at “easy” things

Page 12: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

Current state of the art

The next slides show some examples of what current vision systems can do

Page 13: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

Earth viewers (3D modeling)

Image from Microsoft’s Virtual Earth(see also: Google Earth)

Page 14: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

Photo Tourism

http://phototour.cs.washington.edu/http://labs.live.com/photosynth/

Page 15: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

Optical character recognition (OCR)

Digit recognition, AT&T labshttp://www.research.att.com/~yann/

Technology to convert scanned docs to text• If you have a scanner, it probably came with OCR software

License plate readershttp://en.wikipedia.org/wiki/Automatic_number_plate_recognition

Page 16: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

Face detection

Many new digital cameras now detect faces• Canon, Sony, Fuji, …

Page 18: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

Object recognition (in supermarkets)

LaneHawk by EvolutionRobotics“A smart camera is flush-mounted in the checkout lane, continuously watching for items. When an item is detected and recognized, the cashier verifies the quantity of items that were found under the basket, and continues to close the transaction. The item can remain under the basket, and with LaneHawk,you are assured to get paid for it… “

Page 19: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

Face recognition

Who is she?

Page 20: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

Vision-based biometrics

“How the Afghan Girl was Identified by Her Iris Patterns” Read the story

Page 21: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

Login without a password…

Fingerprint scanners on many new laptops,

other devices

Face recognition systems now beginning to appear more widely

http://www.sensiblevision.com/

Page 22: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

Object recognition (in mobile phones)

This is becoming real:• Microsoft Research• Point & Find, Nokia

Page 23: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

The Matrix movies, ESC Entertainment, XYZRGB, NRC

Special effects: shape capture

Page 24: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

Pirates of the Carribean, Industrial Light and Magic

Special effects: motion capture

Page 25: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

Special effects: image-based rendering

Page 26: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

Sports

Sportvision first down lineNice explanation on www.howstuffworks.com

Page 27: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

Smart cars

Mobileye• Vision systems currently in high-end BMW, GM, Volvo models • By 2010: 70% of car manufacturers.

Slide content courtesy of Amnon Shashua

Page 28: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

Vision-based interaction (and games)

Nintendo Wii has camera-based IRtracking built in. See Lee’s work atCMU on clever tricks on using it tocreate a multi-touch display!

Digimask: put your face on a 3D avatar.

“Game turns moviegoers into Human Joysticks”, CNETCamera tracking a crowd, based on this work.

Page 29: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

Robotics

http://www.robocup.org/NASA’s Mars Spirit Roverhttp://en.wikipedia.org/wiki/Spirit_rover

Page 30: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

Medical imaging

Image guided surgeryGrimson et al., MIT

3D imagingMRI, CT

Page 31: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

Current state of the artYou just saw examples of current systems.

• Many of these are less than 5 years old

This is a very active research area, and rapidly changing• Many new apps in the next 5 years

To learn more about vision applications and companies• David Lowe maintains an excellent overview of vision

companies– http://www.cs.ubc.ca/spider/lowe/vision.html

Page 32: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

This coursehttp://www.ics.uci.edu/~dramanan/teaching/cs116_fall08/

index.html

Prerequisites• Calculus, linear algebra + probability helpful• Interest in playing with images

Emphasis on programming projects!• Best way to learn is to build something from scratch

• MATLAB has a low learning curve

• 5 projects (15% of grad) + final exam (25%)

• Project due every 2 weeks

• For larger projects, “part 1” due first week

Page 33: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

Project 1: Demosaicing

1) Get feet wet with MATLAB2) Turn raw output of digital camera into a color image

Page 34: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

Project 2: hole-filling and blendingThe fun stuff!

Tools: bayesian modelling, differential equations

Page 35: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

Project 3: Image re-targeting

Click on video

Tools: combinatorial optimization, dynamic programming

Page 36: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

Project 4: Automatic mosaicing

Tools: linear algebra, signal processing

Page 37: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

Project 5: Face detection & recognition

Tools: probabilistic modeling

Page 38: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

CamerasDon’t need for class, but really cool

Digital SLRs are ideal

Point–and-shoots still nice and not too expensive (<$200)

e.g. Canon A550

Page 39: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

ReferencesThere is no required text. Various course notes and papers will be made available.  We will often use an online draft of an upcoming book:

Richard Szeliski, Computer Vision: Algorithms and Applications

There is a number of other fine texts that you can use for general reference:

Computer Vision: The Modern Approach, Forsyth and PonceVision Science: Photons to Phenomenology, Stephen Palmer Multiple View Geometry in Computer Vision, Hartley & Zisserman The Computer Image, Watt and Policarpo Linear Algebra and its Applications, Gilbert Strang

Page 40: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

A little bit of teaching philosophy…

1) Prefer discussion vs lectures – ask questions!

2) Readings before class help ‘set the stage’

3) We learn best by doing it ourselves – progamming projects important!

4) Don’t like powerpoint (makes students sleep)

-But visual aides are nice

-Slides will be available online *after* class

-I encourage you to take notes during class

Page 41: Http://dramanan/teaching/cs116_fall08/index.html Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.

Favor to ask…

Need to boost enrollment - spread the gospel about this cool class!