L’intelligenza robotica Introduzione alla Realtà Virtuale

35
http:\\homes.dsi.unimi.it\ borghese 1/70 A.A. 2003-2004 L’intelligenza robotica Introduzione alla Realtà Virtuale Alberto Borghese Università degli Studi di Milano Laboratorio di Motion Analysis and Virtual Reality (MAVR) Dipartimento di Scienze dell’Informazione [email protected] http:\\homes.dsi.unimi.it\ borghese 2/70 A.A. 2003-2004 Which is real, which is virtual?

Transcript of L’intelligenza robotica Introduzione alla Realtà Virtuale

Page 1: L’intelligenza robotica Introduzione alla Realtà Virtuale

1

http:\\homes.dsi.unimi.it\∼borghese1/70A.A. 2003-2004

L’intelligenza roboticaIntroduzione alla Realtà Virtuale

Alberto BorgheseUniversità degli Studi di Milano

Laboratorio di Motion Analysis and Virtual Reality (MAVR)Dipartimento di Scienze dell’Informazione

[email protected]

http:\\homes.dsi.unimi.it\∼borghese2/70A.A. 2003-2004

Which is real, which is virtual?

Page 2: L’intelligenza robotica Introduzione alla Realtà Virtuale

2

http:\\homes.dsi.unimi.it\∼borghese3/70A.A. 2003-2004

Historical Perspective

• Virtual Worlds or Synthetic Environments• Philosophical and Technologial origin.

Philosophical background

Ontology and Gnoseology.

• Plato (world of the ideas) 428-348 a.C.• Berkeley (sensorial experience is too limited) 1685-1753.• Hegel (“what is rational is real..”) 1770-1831.• New age.

http:\\homes.dsi.unimi.it\∼borghese4/70A.A. 2003-2004

Historical Perspective (II)

Technological background

• Philco HMD, 1961.

•“Ultimate display”, Sutherland, 1970.

•Data Glove, VPL Research, 1988.

Page 3: L’intelligenza robotica Introduzione alla Realtà Virtuale

3

http:\\homes.dsi.unimi.it\∼borghese5/70A.A. 2003-2004

Virtual Reality Systems

Key characteristics are:Immersivity.Interactivity.

VR should be able to stimulate the human sensorial systemsIn a coordinated way.

http:\\homes.dsi.unimi.it\∼borghese6/70A.A. 2003-2004

A typical VR system

VR systems are constituted of:

• Input systems (measure the position in the environment and force over the environment.

• World generators (provides a realistic virtual world in which to act.

• Graphical engine (computes the output, given the input and the virtual world).

• Output systems (outputs sensorial stimuli on the subject. Vision, sound, force … are generated as if they were provided by the virtual environment.

Page 4: L’intelligenza robotica Introduzione alla Realtà Virtuale

4

http:\\homes.dsi.unimi.it\∼borghese7/70A.A. 2003-2004

Components of a VR system

• Input systems.

• World generators.

• Graphical engine.

• Output systems.

http:\\homes.dsi.unimi.it\∼borghese8/70A.A. 2003-2004

Input systems

Measure human actions on the virtual environment.

•Position. Measure the position of the body segments inside the virtual environment.

• Force. Measure the force exerted by the body segments when in contact with a virtual object.

• Estimate the motor output of the human muscle-skeleton system.

Page 5: L’intelligenza robotica Introduzione alla Realtà Virtuale

5

http:\\homes.dsi.unimi.it\∼borghese9/70A.A. 2003-2004

Position systems

• Motion capture (batch, complete information on the movement).• Real-time trackers (real- time position).• Gloves (specialized for hands).• Gaze trackers.

Adopted technology

• Opteolectronics•Marker based•Computer vision.

• Magnetical• Acoustical• Mechanical

•Measure the position of the body segments inside the virtual environment.

http:\\homes.dsi.unimi.it\∼borghese10/70A.A. 2003-2004

Edga

r Muy

brid

ge(1

896)

Page 6: L’intelligenza robotica Introduzione alla Realtà Virtuale

6

http:\\homes.dsi.unimi.it\∼borghese11/70A.A. 2003-2004

Optical systems (computer vision)

•Advantage: complete freedom of motion to the subjects.The scene is surveyed by standard videocameras.

•Disadvantage: ill-posed problems (high sensitivity to limited resolution, noise and lighting conditions).

• Solution: hierarchical multi-stage processing.

http:\\homes.dsi.unimi.it\∼borghese12/70A.A. 2003-2004

Pipe-line of processing in CV systems

•First level: Features detection.• Background subtraction (Sturman and Zelter, 1994; DiBernardo et al., 1995);• Optical flow (Barron et al., 1995);• Template matching (Borghese et al., 1990; Tomasi and Kanade, 1991);

Second level: Features matching.(Xu and Ahuja, 1994; Shashua, 1999, Weng, 2000, Gruen, 1985);

Reference: Cipolla and Pentland eds., Computer Vision for Human-Machine Interaction, Cambridge University Press, 1998.

Page 7: L’intelligenza robotica Introduzione alla Realtà Virtuale

7

http:\\homes.dsi.unimi.it\∼borghese13/70A.A. 2003-2004

Pipe-line of processing in CV systems (II)

•Third level: 3D Reconstruction.

Fourth level: Model matching.• Silhouette matching (Moezzi et al., 1996);• 3D polygonal structures

• Marching cube (Lorensen and Cline, 1987);• Snakes (Kass et al., 1988);

• Matching 3D structures• Facial models (Parke, 1996);• Superquadrics (Metaxis and Terzopoulos, 1991);

http:\\homes.dsi.unimi.it\∼borghese14/70A.A. 2003-2004

Motion Capture live (Jain et al.)

Page 8: L’intelligenza robotica Introduzione alla Realtà Virtuale

8

http:\\homes.dsi.unimi.it\∼borghese15/70A.A. 2003-2004

Optical systems – marker basedThey are based on modeling human body as a skeleton (Pedotti, 1977).

Markered subject 3D model Stick diagram Hidden model

http:\\homes.dsi.unimi.it\∼borghese16/70A.A. 2003-2004

Passive optical markers - processing

First step. Detection of the 2D position of the markers.Thresholding (Vicon, Motion Analysis, MacReflex)Correlation (Elite)

Second step. Matching the same marker on the different cameras.Third step. Reconstruction of the 3D position of the marker. Fourth step. Classification of the markers according to the model of the subject.

Page 9: L’intelligenza robotica Introduzione alla Realtà Virtuale

9

http:\\homes.dsi.unimi.it\∼borghese17/70A.A. 2003-2004

Motion capture based on markers

http:\\homes.dsi.unimi.it\∼borghese18/70A.A. 2003-2004

Optical systems – marker based (II)

Advantage: High reliability in the identification of the markers (joints).Disadvantages: Markers have to be attached to the subject before the motion. Wires carried by the subject in case of active markers.

•Active markers – LED, or magnets, with wires, time multiplexing, high sampling frequency, with few markers, minimal processing.•Passive markers – Small pieces of retro-reflective paper, Videocameras (video rates), complex data processing from image processing to 3D reconstruction.

Active vs. Passive markers technology

Page 10: L’intelligenza robotica Introduzione alla Realtà Virtuale

10

http:\\homes.dsi.unimi.it\∼borghese19/70A.A. 2003-2004

Active markers

Magnetic trackers• Electromagnetic induction. Magnetic material which is moved inside an electric field, with variable frequency. Isotrack, FastTrack and Flock of birds.• A DSP is incorporated for time filtering. • Maximum range: 1m.

Problems• Distortions and linearity. • Interference of metallic materials.

Optoelectronics active markers• LED – Selspot, Watsmart, Optotrack.

http:\\homes.dsi.unimi.it\∼borghese20/70A.A. 2003-2004

Motion capture for animation

•Motion capture•Definition of a 3D model.•Mapping of the motion onto the 3D model.•Animation.

Page 11: L’intelligenza robotica Introduzione alla Realtà Virtuale

11

http:\\homes.dsi.unimi.it\∼borghese21/70A.A. 2003-2004

Video by Superfluo

http:\\homes.dsi.unimi.it\∼borghese22/70A.A. 2003-2004

Gloves

Monitor fingers position and force.

Problems with the motion of the fingers:• overlap.• fine movements.• fast movements.• rich repertoire.

Page 12: L’intelligenza robotica Introduzione alla Realtà Virtuale

12

http:\\homes.dsi.unimi.it\∼borghese23/70A.A. 2003-2004

Sayre glove (1976)

http:\\homes.dsi.unimi.it\∼borghese24/70A.A. 2003-2004

MIT glove (1977)

Page 13: L’intelligenza robotica Introduzione alla Realtà Virtuale

13

http:\\homes.dsi.unimi.it\∼borghese25/70A.A. 2003-2004

Dig

ital

Dat

a E

ntry

Glo

ve (1

983)

http:\\homes.dsi.unimi.it\∼borghese26/70A.A. 2003-2004

Data Glove (1987)

Page 14: L’intelligenza robotica Introduzione alla Realtà Virtuale

14

http:\\homes.dsi.unimi.it\∼borghese27/70A.A. 2003-2004

Power Glove (1990)

http:\\homes.dsi.unimi.it\∼borghese28/70A.A. 2003-2004

Cyber Glove (1995)

Page 15: L’intelligenza robotica Introduzione alla Realtà Virtuale

15

http:\\homes.dsi.unimi.it\∼borghese29/70A.A. 2003-2004

Calibration

Estimate of the geometrical parameters in the transformation operated by the sensors (e.g. the perspective transformation operated by a video-camera).

Estimate of the parameters, which describe distortions introduced by the measurement system.

Measurement of a known pattern. From its distortion, the parameters can be computed.

Algorithms adopted: polynomial, local correction (neural networks, fuzzy).

http:\\homes.dsi.unimi.it\∼borghese30/70A.A. 2003-2004

Haptic displays

Convey to the subject the sensorial information generated in theinteraction with the virtual objects: force, material texture…

Measure the force exerted by the subject on the virtual environment.

Aptic displays provide a mechanical interface for Virtual Reality applications.

Most important developments have been made in the robotics field.

Page 16: L’intelligenza robotica Introduzione alla Realtà Virtuale

16

http:\\homes.dsi.unimi.it\∼borghese31/70A.A. 2003-2004

Requirements of Haptic displays

• Large bandwidth.• Low intertial and viscosity.

Technological solutions (oggetto intermediario):• Direct drive manipulandum (Yoshikawa, 1990), Phantom (2000).• Parallel manipulandum (Millman and Colgate, 1991; Buttolo and Hannaford, 1995). • Magnetic levitation devices (Salcudean and Yan, 1994; Gomi and Kawato, 1996).• Gloves (Bergamasco, 1993).

http:\\homes.dsi.unimi.it\∼borghese32/70A.A. 2003-2004

Direct drive manipulandum (phantom)

Page 17: L’intelligenza robotica Introduzione alla Realtà Virtuale

17

http:\\homes.dsi.unimi.it\∼borghese33/70A.A. 2003-2004

Parallel manipulandum (schema)

http:\\homes.dsi.unimi.it\∼borghese34/70A.A. 2003-2004

Pen Haptic display

Page 18: L’intelligenza robotica Introduzione alla Realtà Virtuale

18

http:\\homes.dsi.unimi.it\∼borghese35/70A.A. 2003-2004

Gloves (Blackfinger, 2000)

http:\\homes.dsi.unimi.it\∼borghese36/70A.A. 2003-2004

Percro gloves (Begamasco, 1993)

Page 19: L’intelligenza robotica Introduzione alla Realtà Virtuale

19

http:\\homes.dsi.unimi.it\∼borghese37/70A.A. 2003-2004

Gaze input

•Contact lenses carrying magnetic coils.•Tvcameras aligned with an IR LED source.•Stereoscopic eye-wear.

• The direction of gaze is decided by measuring the shape of the spot reflectedby the frontal portion of the cornea (Ohshima et al., 1996).

http:\\homes.dsi.unimi.it\∼borghese38/70A.A. 2003-2004

Components of a VR system

• Input systems.

• World generators.

• Graphical engine.

• Output systems.

Page 20: L’intelligenza robotica Introduzione alla Realtà Virtuale

20

http:\\homes.dsi.unimi.it\∼borghese39/70A.A. 2003-2004

World generators

Integrated systems for 3D CAD and Animation:•Maya (ex-Alias/Wavefront)•XSI (ex-Softimage)•3D Studio Max.

• 3D Structure.• Colour and Texture• Motion (animation)• Rendering (lights, shadows)

http:\\homes.dsi.unimi.it\∼borghese40/70A.A. 2003-2004

3D structure

Solid modeling•3D geometric solids: cubes, cylinders, cones…• Superquadrics (Terzopoulos and Metaxas, 1991): global parameters + local parameters.• Revolution surfaces.• NURBS (Piegle, 1993). CAD, high interactivity.•Subdivision surfaces (Schroeder, 1999).

Surface fitting to range data• Snakes (Kass et al., 1988). Energy based approach. Best curves. • Kohonen maps (1990).• Radial Basis Functions Networks (Poggio and Girosi, 1995; Borghese and Ferrari, 1998).

Page 21: L’intelligenza robotica Introduzione alla Realtà Virtuale

21

http:\\homes.dsi.unimi.it\∼borghese41/70A.A. 2003-2004

3D structure (II)

Linear approximation (mesh):•Delauney triangulation (Watson, 1981; Fang and Piegl, 1992). Direct tessellation.• Alpha shapes, ball pivoting (Bernardini et al., 2000). Post processing to regularize a Delauney tessellation.• Polymesh models (Singh et al., 1995).

Finite element models• It is a class per sé. Local modeling. Mechnical modeling.• Largely used for animation in medicine (facial animation, deformation of tissue during surgery). Multi- layer modeling.

LODModelli a dettaglio e risoluzione diversi.

http:\\homes.dsi.unimi.it\∼borghese42/70A.A. 2003-2004

Models from range data

Cyb

erw

are

who

le b

ody

scan

ner,

WB

4

Page 22: L’intelligenza robotica Introduzione alla Realtà Virtuale

22

http:\\homes.dsi.unimi.it\∼borghese43/70A.A. 2003-2004

Models from range data (II)

Cyberware smaller model3030

http:\\homes.dsi.unimi.it\∼borghese44/70A.A. 2003-2004

3D structure from range data (III)

Polhemus hand held laser scanner

Page 23: L’intelligenza robotica Introduzione alla Realtà Virtuale

23

http:\\homes.dsi.unimi.it\∼borghese45/70A.A. 2003-2004

Models from range data (IV)

Digibot II.•Platform rotates•Scanner line translates.

http:\\homes.dsi.unimi.it\∼borghese46/70A.A. 2003-2004

Components of a VR system

• Input systems.

• World generators.

• Graphical engine.

• Output systems.

Page 24: L’intelligenza robotica Introduzione alla Realtà Virtuale

24

http:\\homes.dsi.unimi.it\∼borghese47/70A.A. 2003-2004

Graphical representation

Graphical engines represent triangles => Every shape istransformed into triangles.

•The models created by the scanners are ensembles of triangles(milions of). •Much more than required by applications.

Mesh compression. Representation of the same geometry/pictorial attributes, with a reduced set of triangles.

http:\\homes.dsi.unimi.it\∼borghese48/70A.A. 2003-2004

Gaze directed rendering

Page 25: L’intelligenza robotica Introduzione alla Realtà Virtuale

25

http:\\homes.dsi.unimi.it\∼borghese49/70A.A. 2003-2004

The graphical engine (visual computing)

Parallelization (graphical boards, SIMD architectures on Pentium IV).

Multiple cache levels.

Pipelining (graphical and computational).

Look-ahead code optimization (compiler optimization).

Hardware acceleration of graphical operations (OpenGL, texture mapping…).

Double buffering (for real-time visualization of 3D models).

http:\\homes.dsi.unimi.it\∼borghese50/70A.A. 2003-2004

Collision detection

Computational demanding (On2EF).Use of multiresolution models.Hierarchical detection.

Geometry semplification (axes aligned faces).Check for common volumes.Extraction of the faces belonging to these volumes.Octree of the pairs of candidate faces.Check for intersection.

Page 26: L’intelligenza robotica Introduzione alla Realtà Virtuale

26

http:\\homes.dsi.unimi.it\∼borghese51/70A.A. 2003-2004

Animation •Key-frame animation.•Motion capture.•Dynamic animation.

http:\\homes.dsi.unimi.it\∼borghese52/70A.A. 2003-2004

Components of a VR system

• Input systems.

• World generators.

• Graphical engine.

• Output systems.

Page 27: L’intelligenza robotica Introduzione alla Realtà Virtuale

27

http:\\homes.dsi.unimi.it\∼borghese53/70A.A. 2003-2004

Output systems

Requirements for the monitor:• Large field of view (180o x 150o).•High spatial resolution (35 pixels/degree, equivalent to 12,000x12,000 pixels for a 19" display positioned at 70cmfrom the viewer).

Requirements for the world generator:•Stereoscopic vision for objects with D < 10m.•Monocular cues for objects with D > 10m.

• - Occlusions.• - Geometrical perspective and a-priori model knowledge.• - Shading.• - Motion.

http:\\homes.dsi.unimi.it\∼borghese54/70A.A. 2003-2004

The human eye

Its behavior is very similar to that of a photocamera

Page 28: L’intelligenza robotica Introduzione alla Realtà Virtuale

28

http:\\homes.dsi.unimi.it\∼borghese55/70A.A. 2003-2004

Stereo-disparity

Points further away are projected on points closer to the image center.

Vergence and focusing are strictly connected.

http:\\homes.dsi.unimi.it\∼borghese56/70A.A. 2003-2004

Passive stereo

Page 29: L’intelligenza robotica Introduzione alla Realtà Virtuale

29

http:\\homes.dsi.unimi.it\∼borghese57/70A.A. 2003-2004

Stereo image for passive stereo

http:\\homes.dsi.unimi.it\∼borghese58/70A.A. 2003-2004

Output devices (eye-glasses)

Semi-immersive: Eye-glasses (video accuracy, but user is not allowed to move, lateral vision is permitted, which limits virtual realism).

Page 30: L’intelligenza robotica Introduzione alla Realtà Virtuale

30

http:\\homes.dsi.unimi.it\∼borghese59/70A.A. 2003-2004

HMD (n-vision)

Up to 1280 x 1024, 180Hz. Time multiplexing.

http:\\homes.dsi.unimi.it\∼borghese60/70A.A. 2003-2004

Output devices (BOOM HMD)

Up to 1280 x 1024 pixels / eye CRT TechnologyHead tracking is integrated.

Page 31: L’intelligenza robotica Introduzione alla Realtà Virtuale

31

http:\\homes.dsi.unimi.it\∼borghese61/70A.A. 2003-2004

I-glasses (games)

http:\\homes.dsi.unimi.it\∼borghese62/70A.A. 2003-2004

Responsive work-bench (Strauss et al., 1995)

Virtual 3D objects are positioned on a working table. They are created projecting the stereo images over the table surface.

Page 32: L’intelligenza robotica Introduzione alla Realtà Virtuale

32

http:\\homes.dsi.unimi.it\∼borghese63/70A.A. 2003-2004

CAVE

Room 2.5m x 2.5m with Virtual images (steoscopic) projected onto its walls.

More people andComplete immersivity.

http:\\homes.dsi.unimi.it\∼borghese64/70A.A. 2003-2004

Large screen displaysLarge screen displays

Workwall

Page 33: L’intelligenza robotica Introduzione alla Realtà Virtuale

33

http:\\homes.dsi.unimi.it\∼borghese65/70A.A. 2003-2004

Wearable devices

(a) HMD – 320x240 VGA (b) Keyboard on cloth

Characteristics: mobile, context sensitive, augmented reality.

http:\\homes.dsi.unimi.it\∼borghese66/70A.A. 2003-2004

Physiological problems

• SIMM and VR sickness limit the exposure time.

• Size and distances misperception.

• Limited range in extrapersonal space.

Page 34: L’intelligenza robotica Introduzione alla Realtà Virtuale

34

http:\\homes.dsi.unimi.it\∼borghese67/70A.A. 2003-2004

Other output devices

Audio – Stereo, sound spatialization.

Force – Same devices which measure the force exerted by the subject.

http:\\homes.dsi.unimi.it\∼borghese68/70A.A. 2003-2004

Applications

• Army• Medicine• Industry (inspection, virtual prototyping)• Chemistry and Physics• Virtual theaters and theme parks• Enterteinment• Comunication• Engineering, Ergonomics and Architecture.• History.

Page 35: L’intelligenza robotica Introduzione alla Realtà Virtuale

35

http:\\homes.dsi.unimi.it\∼borghese69/70A.A. 2003-2004

La città di Giotto

http:\\homes.dsi.unimi.it\∼borghese70/70A.A. 2003-2004

La tomba di Nefertari