Computer Vision - Lec01 Intro

55

Click here to load reader

description

Computer VisionLecture 1 - Introduction

Transcript of Computer Vision - Lec01 Intro

Page 1: Computer Vision - Lec01 Intro

Computer Vision

Page 2: Computer Vision - Lec01 Intro

Basic Info •  Instructor: Paolo Favaro ([email protected]) •  Teaching Assistant: Thoma Papadhimitri

([email protected])

•  Course webpage: http://cvg.unibe.ch/teaching.html

•  Course material acknowledgements: –  Svetlana Lazebnik – University of Illinois –  Steve Seitz – University of Washington –  Jan Koenderink – University of Leuven –  Kristen Grauman – University of Texas at Austin –  Andrea Vedaldi – University of Oxford –  Srinivas Narasimhan – Carnegie Mellon University

Page 3: Computer Vision - Lec01 Intro

Textbooks •  Forsyth & Ponce, Computer

Vision: A Modern Approach

•  Richard Szeliski, Computer Vision: Algorithms and Applications (available online)

•  Kristen Grauman and Bastian Leibe, Visual Object Recognition (pdf available online)

Page 4: Computer Vision - Lec01 Intro

Course requirements

•  3 Assignments –  Deadlines specified in the next slide –  Each assignment has a maximum score of 100 and a pass is 60 –  To register for the exam one needs at least a pass on each assignment –  Each assignment to be made available in Ilias –  Assignments will require use of MATLAB (tutorial will be provided)

•  Exercises –  Weekly –  Aim is exam preparation

•  Exam –  There will be a final exam on 11 February 2014 (duration 120 mins)

•  Final Mark –  70% Exam and 30% Assignments

Page 5: Computer Vision - Lec01 Intro

Assignments

•  First assignment: Photometric Stereo –  Available on 8/10 –  Due on 29/10

•  Second assignment: Uncalibrated Stereo –  Available on 29/10 –  Due on 19/11

•  Third assignment: Object detection via Bag-of-Words –  Available on 19/11 –  Due on 10/12

Page 6: Computer Vision - Lec01 Intro

Academic integrity policy

•  Feel free to discuss assignments with each other, but coding must be done individually

•  Feel free to incorporate code or tips you find on the Web, provided this doesn’t make the assignment trivial and you explicitly acknowledge your sources

•  Remember: we can Google too!

Page 7: Computer Vision - Lec01 Intro

The goal of computer vision

•  To extract “meaning” from pixels

What we see What a computer sees Source: S. Narasimhan

Page 8: Computer Vision - Lec01 Intro

The goal of computer vision

•  To extract “meaning” from pixels

Source: “80 million tiny images” by Torralba et al.

Humans are remarkably good at this…

Page 9: Computer Vision - Lec01 Intro

What  kind  of  informa.on  can  be  extracted  from  an  image?  

Geometric  informa.on  Seman,c  informa.on  

building  

person  trashcan   car   car  

ground  

tree   tree  

sky  

door  window  

building  

roof  

chimney  

Outdoor  scene  City   European  

…  

Page 10: Computer Vision - Lec01 Intro

Why study computer vision? •  Vision is useful

•  Vision is interesting

•  Vision is difficult •  Half of primate cerebral cortex is devoted to visual

processing •  Achieving human-level visual perception is probably

“AI-complete”

Page 11: Computer Vision - Lec01 Intro

Successes of computer vision to date

Page 12: Computer Vision - Lec01 Intro

Optical character recognition (OCR)

Source: S. Seitz, N. Snavely

Digit recognition yann.lecun.com

License  plate  readers  h<p://en.wikipedia.org/wiki/Automa.c_number_plate_recogni.on  

 

Sudoku  grabber  h<p://sudokugrab.blogspot.com/  

Automa.c  check  processing  

Page 13: Computer Vision - Lec01 Intro

Biometrics

Fingerprint scanners on many new laptops,

other devices

Face recognition systems now beginning to appear more widely

http://www.sensiblevision.com/

Source: S. Seitz

Page 14: Computer Vision - Lec01 Intro

Biometrics

How the Afghan Girl was Identified by Her Iris Patterns

Source: S. Seitz

Page 15: Computer Vision - Lec01 Intro

Face detection

Many consumer digital cameras now detect faces

Source: S. Seitz

Page 16: Computer Vision - Lec01 Intro

Smile detection

Sony Cyber-shot® T70 Digital Still Camera Source: S. Seitz

Page 17: Computer Vision - Lec01 Intro

Face recognition: Apple iPhoto software

http://www.apple.com/ilife/iphoto/

Page 18: Computer Vision - Lec01 Intro

Mobile visual search: Google Goggles

Page 19: Computer Vision - Lec01 Intro

Automotive safety

Mobileye: Vision systems in high-end BMW, GM, Volvo models •  Pedestrian collision warning •  Forward collision warning •  Lane departure warning •  Headway monitoring and warning Source: A. Shashua, S. Seitz

Page 20: Computer Vision - Lec01 Intro

Google self-driving cars

Page 21: Computer Vision - Lec01 Intro

Vision-based interaction: Xbox Kinect

Page 22: Computer Vision - Lec01 Intro

3D Reconstruction: Kinect Fusion

YouTube Video

Page 23: Computer Vision - Lec01 Intro

3D Reconstruction: Multi-View Stereo

YouTube Video

Page 24: Computer Vision - Lec01 Intro

Google Maps Photo Tours

http://maps.google.com/phototours

Page 25: Computer Vision - Lec01 Intro

Special effects: shape and motion capture

Source: S. Seitz

Page 26: Computer Vision - Lec01 Intro

Vision for robotics, space exploration

NASA'S Curiosity Rover has a system consisting of 17 cameras

Page 27: Computer Vision - Lec01 Intro

Why is computer vision difficult?

Page 28: Computer Vision - Lec01 Intro

Challenges: viewpoint variation

Michelangelo 1475-1564 slide credit: Fei-Fei, Fergus & Torralba

Page 29: Computer Vision - Lec01 Intro

Challenges: illumination

image credit: J. Koenderink

Page 30: Computer Vision - Lec01 Intro

Challenges: scale

slide credit: Fei-Fei, Fergus & Torralba

Page 31: Computer Vision - Lec01 Intro

Challenges: deformation

Xu, Beihong 1943

slide credit: Fei-Fei, Fergus & Torralba

Page 32: Computer Vision - Lec01 Intro

Challenges: occlusion, clutter

Image source: National Geographic

Page 33: Computer Vision - Lec01 Intro

Challenges: Motion

Page 34: Computer Vision - Lec01 Intro

Challenges: object intra-class variation

slide credit: Fei-Fei, Fergus & Torralba

Page 35: Computer Vision - Lec01 Intro

Challenges: local ambiguity

slide credit: Fei-Fei, Fergus & Torralba

Page 36: Computer Vision - Lec01 Intro

Challenges: local ambiguity

Source: Rob Fergus and Antonio Torralba

Page 37: Computer Vision - Lec01 Intro

Challenges: local ambiguity

Source: Rob Fergus and Antonio Torralba

Page 38: Computer Vision - Lec01 Intro

Challenges: Inherent ambiguity

•  Many different 3D scenes could have given rise to a particular 2D picture

Page 39: Computer Vision - Lec01 Intro

Challenges or opportunities?

•  Images are confusing, but they also reveal the structure of the world through numerous cues

•  Our job is to interpret the cues!

Image source: J. Koenderink

Page 40: Computer Vision - Lec01 Intro

Depth cues: Linear perspective

Page 41: Computer Vision - Lec01 Intro

Depth cues: Aerial perspective

Page 42: Computer Vision - Lec01 Intro

Depth ordering cues: Occlusion

Source: J. Koenderink

Page 43: Computer Vision - Lec01 Intro

Shape cues: Texture gradient

Page 44: Computer Vision - Lec01 Intro

Shape and lighting cues: Shading

Page 45: Computer Vision - Lec01 Intro

Position and lighting cues: Cast shadows

Source: J. Koenderink

Page 46: Computer Vision - Lec01 Intro

Grouping cues: Similarity (color, texture, proximity)

Page 47: Computer Vision - Lec01 Intro

Grouping cues: “Common fate”

Image credit: Arthus-Bertrand (via F. Durand)

Page 48: Computer Vision - Lec01 Intro

Origins of computer vision

L. G. Roberts, Machine Perception of Three Dimensional Solids, Ph.D. thesis, MIT Department of Electrical Engineering, 1963.

Page 49: Computer Vision - Lec01 Intro

Connections to other disciplines

Computer Vision

Image Processing

Machine Learning

Artificial Intelligence

Robotics

Cognitive science Neuroscience

Computer Graphics

Page 50: Computer Vision - Lec01 Intro

The computer vision industry

•  A list of companies here: http://www.cs.ubc.ca/spider/lowe/vision.html

Page 51: Computer Vision - Lec01 Intro

Course overview

I.  Image formation: Camera and projection models II.  Radiometry and shading

I.  Calibrated/Uncalibrated photometric stereo

III.  Early vision: Image filtering, edges, interest points, denoising and deblurring

IV.  Epipolar geometry, stereo and structure from motion V.  Tracking, optical flow and registration VI.  Mid-level vision: Clustering and segmentation VII.  Recognition

I.  Bag of Features and Support Vector Machines

VIII. Deformable parts models IX.  Special topics

Page 52: Computer Vision - Lec01 Intro

Early vision

Cameras and sensors Light and color

Linear filtering Edge detection

* =

Feature extraction: corner and blob detection

•  Basic image formation and processing

Page 53: Computer Vision - Lec01 Intro

Mid-level vision

Fitting: Least squares RANSAC

Alignment

•  Fitting and grouping

Page 54: Computer Vision - Lec01 Intro

Multi-view geometry

Structure from motion

Stereo Epipolar geometry

3D Photography

Page 55: Computer Vision - Lec01 Intro

Recognition

Instance recognition, large-scale alignment Image classification

Sliding window detection Part-based models