L’intelligenza robotica Introduzione alla Realtà Virtuale
Transcript of L’intelligenza robotica Introduzione alla Realtà Virtuale
1
http:\\homes.dsi.unimi.it\∼borghese1/70A.A. 2003-2004
L’intelligenza roboticaIntroduzione alla Realtà Virtuale
Alberto BorgheseUniversità degli Studi di Milano
Laboratorio di Motion Analysis and Virtual Reality (MAVR)Dipartimento di Scienze dell’Informazione
http:\\homes.dsi.unimi.it\∼borghese2/70A.A. 2003-2004
Which is real, which is virtual?
2
http:\\homes.dsi.unimi.it\∼borghese3/70A.A. 2003-2004
Historical Perspective
• Virtual Worlds or Synthetic Environments• Philosophical and Technologial origin.
Philosophical background
Ontology and Gnoseology.
• Plato (world of the ideas) 428-348 a.C.• Berkeley (sensorial experience is too limited) 1685-1753.• Hegel (“what is rational is real..”) 1770-1831.• New age.
http:\\homes.dsi.unimi.it\∼borghese4/70A.A. 2003-2004
Historical Perspective (II)
Technological background
• Philco HMD, 1961.
•“Ultimate display”, Sutherland, 1970.
•Data Glove, VPL Research, 1988.
3
http:\\homes.dsi.unimi.it\∼borghese5/70A.A. 2003-2004
Virtual Reality Systems
Key characteristics are:Immersivity.Interactivity.
VR should be able to stimulate the human sensorial systemsIn a coordinated way.
http:\\homes.dsi.unimi.it\∼borghese6/70A.A. 2003-2004
A typical VR system
VR systems are constituted of:
• Input systems (measure the position in the environment and force over the environment.
• World generators (provides a realistic virtual world in which to act.
• Graphical engine (computes the output, given the input and the virtual world).
• Output systems (outputs sensorial stimuli on the subject. Vision, sound, force … are generated as if they were provided by the virtual environment.
4
http:\\homes.dsi.unimi.it\∼borghese7/70A.A. 2003-2004
Components of a VR system
• Input systems.
• World generators.
• Graphical engine.
• Output systems.
http:\\homes.dsi.unimi.it\∼borghese8/70A.A. 2003-2004
Input systems
Measure human actions on the virtual environment.
•Position. Measure the position of the body segments inside the virtual environment.
• Force. Measure the force exerted by the body segments when in contact with a virtual object.
• Estimate the motor output of the human muscle-skeleton system.
5
http:\\homes.dsi.unimi.it\∼borghese9/70A.A. 2003-2004
Position systems
• Motion capture (batch, complete information on the movement).• Real-time trackers (real- time position).• Gloves (specialized for hands).• Gaze trackers.
Adopted technology
• Opteolectronics•Marker based•Computer vision.
• Magnetical• Acoustical• Mechanical
•Measure the position of the body segments inside the virtual environment.
http:\\homes.dsi.unimi.it\∼borghese10/70A.A. 2003-2004
Edga
r Muy
brid
ge(1
896)
6
http:\\homes.dsi.unimi.it\∼borghese11/70A.A. 2003-2004
Optical systems (computer vision)
•Advantage: complete freedom of motion to the subjects.The scene is surveyed by standard videocameras.
•Disadvantage: ill-posed problems (high sensitivity to limited resolution, noise and lighting conditions).
• Solution: hierarchical multi-stage processing.
http:\\homes.dsi.unimi.it\∼borghese12/70A.A. 2003-2004
Pipe-line of processing in CV systems
•First level: Features detection.• Background subtraction (Sturman and Zelter, 1994; DiBernardo et al., 1995);• Optical flow (Barron et al., 1995);• Template matching (Borghese et al., 1990; Tomasi and Kanade, 1991);
Second level: Features matching.(Xu and Ahuja, 1994; Shashua, 1999, Weng, 2000, Gruen, 1985);
Reference: Cipolla and Pentland eds., Computer Vision for Human-Machine Interaction, Cambridge University Press, 1998.
7
http:\\homes.dsi.unimi.it\∼borghese13/70A.A. 2003-2004
Pipe-line of processing in CV systems (II)
•Third level: 3D Reconstruction.
Fourth level: Model matching.• Silhouette matching (Moezzi et al., 1996);• 3D polygonal structures
• Marching cube (Lorensen and Cline, 1987);• Snakes (Kass et al., 1988);
• Matching 3D structures• Facial models (Parke, 1996);• Superquadrics (Metaxis and Terzopoulos, 1991);
http:\\homes.dsi.unimi.it\∼borghese14/70A.A. 2003-2004
Motion Capture live (Jain et al.)
8
http:\\homes.dsi.unimi.it\∼borghese15/70A.A. 2003-2004
Optical systems – marker basedThey are based on modeling human body as a skeleton (Pedotti, 1977).
Markered subject 3D model Stick diagram Hidden model
http:\\homes.dsi.unimi.it\∼borghese16/70A.A. 2003-2004
Passive optical markers - processing
First step. Detection of the 2D position of the markers.Thresholding (Vicon, Motion Analysis, MacReflex)Correlation (Elite)
Second step. Matching the same marker on the different cameras.Third step. Reconstruction of the 3D position of the marker. Fourth step. Classification of the markers according to the model of the subject.
9
http:\\homes.dsi.unimi.it\∼borghese17/70A.A. 2003-2004
Motion capture based on markers
http:\\homes.dsi.unimi.it\∼borghese18/70A.A. 2003-2004
Optical systems – marker based (II)
Advantage: High reliability in the identification of the markers (joints).Disadvantages: Markers have to be attached to the subject before the motion. Wires carried by the subject in case of active markers.
•Active markers – LED, or magnets, with wires, time multiplexing, high sampling frequency, with few markers, minimal processing.•Passive markers – Small pieces of retro-reflective paper, Videocameras (video rates), complex data processing from image processing to 3D reconstruction.
Active vs. Passive markers technology
10
http:\\homes.dsi.unimi.it\∼borghese19/70A.A. 2003-2004
Active markers
Magnetic trackers• Electromagnetic induction. Magnetic material which is moved inside an electric field, with variable frequency. Isotrack, FastTrack and Flock of birds.• A DSP is incorporated for time filtering. • Maximum range: 1m.
Problems• Distortions and linearity. • Interference of metallic materials.
Optoelectronics active markers• LED – Selspot, Watsmart, Optotrack.
http:\\homes.dsi.unimi.it\∼borghese20/70A.A. 2003-2004
Motion capture for animation
•Motion capture•Definition of a 3D model.•Mapping of the motion onto the 3D model.•Animation.
11
http:\\homes.dsi.unimi.it\∼borghese21/70A.A. 2003-2004
Video by Superfluo
http:\\homes.dsi.unimi.it\∼borghese22/70A.A. 2003-2004
Gloves
Monitor fingers position and force.
Problems with the motion of the fingers:• overlap.• fine movements.• fast movements.• rich repertoire.
12
http:\\homes.dsi.unimi.it\∼borghese23/70A.A. 2003-2004
Sayre glove (1976)
http:\\homes.dsi.unimi.it\∼borghese24/70A.A. 2003-2004
MIT glove (1977)
13
http:\\homes.dsi.unimi.it\∼borghese25/70A.A. 2003-2004
Dig
ital
Dat
a E
ntry
Glo
ve (1
983)
http:\\homes.dsi.unimi.it\∼borghese26/70A.A. 2003-2004
Data Glove (1987)
14
http:\\homes.dsi.unimi.it\∼borghese27/70A.A. 2003-2004
Power Glove (1990)
http:\\homes.dsi.unimi.it\∼borghese28/70A.A. 2003-2004
Cyber Glove (1995)
15
http:\\homes.dsi.unimi.it\∼borghese29/70A.A. 2003-2004
Calibration
Estimate of the geometrical parameters in the transformation operated by the sensors (e.g. the perspective transformation operated by a video-camera).
Estimate of the parameters, which describe distortions introduced by the measurement system.
Measurement of a known pattern. From its distortion, the parameters can be computed.
Algorithms adopted: polynomial, local correction (neural networks, fuzzy).
http:\\homes.dsi.unimi.it\∼borghese30/70A.A. 2003-2004
Haptic displays
Convey to the subject the sensorial information generated in theinteraction with the virtual objects: force, material texture…
Measure the force exerted by the subject on the virtual environment.
Aptic displays provide a mechanical interface for Virtual Reality applications.
Most important developments have been made in the robotics field.
16
http:\\homes.dsi.unimi.it\∼borghese31/70A.A. 2003-2004
Requirements of Haptic displays
• Large bandwidth.• Low intertial and viscosity.
Technological solutions (oggetto intermediario):• Direct drive manipulandum (Yoshikawa, 1990), Phantom (2000).• Parallel manipulandum (Millman and Colgate, 1991; Buttolo and Hannaford, 1995). • Magnetic levitation devices (Salcudean and Yan, 1994; Gomi and Kawato, 1996).• Gloves (Bergamasco, 1993).
http:\\homes.dsi.unimi.it\∼borghese32/70A.A. 2003-2004
Direct drive manipulandum (phantom)
17
http:\\homes.dsi.unimi.it\∼borghese33/70A.A. 2003-2004
Parallel manipulandum (schema)
http:\\homes.dsi.unimi.it\∼borghese34/70A.A. 2003-2004
Pen Haptic display
18
http:\\homes.dsi.unimi.it\∼borghese35/70A.A. 2003-2004
Gloves (Blackfinger, 2000)
http:\\homes.dsi.unimi.it\∼borghese36/70A.A. 2003-2004
Percro gloves (Begamasco, 1993)
19
http:\\homes.dsi.unimi.it\∼borghese37/70A.A. 2003-2004
Gaze input
•Contact lenses carrying magnetic coils.•Tvcameras aligned with an IR LED source.•Stereoscopic eye-wear.
• The direction of gaze is decided by measuring the shape of the spot reflectedby the frontal portion of the cornea (Ohshima et al., 1996).
http:\\homes.dsi.unimi.it\∼borghese38/70A.A. 2003-2004
Components of a VR system
• Input systems.
• World generators.
• Graphical engine.
• Output systems.
20
http:\\homes.dsi.unimi.it\∼borghese39/70A.A. 2003-2004
World generators
Integrated systems for 3D CAD and Animation:•Maya (ex-Alias/Wavefront)•XSI (ex-Softimage)•3D Studio Max.
• 3D Structure.• Colour and Texture• Motion (animation)• Rendering (lights, shadows)
http:\\homes.dsi.unimi.it\∼borghese40/70A.A. 2003-2004
3D structure
Solid modeling•3D geometric solids: cubes, cylinders, cones…• Superquadrics (Terzopoulos and Metaxas, 1991): global parameters + local parameters.• Revolution surfaces.• NURBS (Piegle, 1993). CAD, high interactivity.•Subdivision surfaces (Schroeder, 1999).
Surface fitting to range data• Snakes (Kass et al., 1988). Energy based approach. Best curves. • Kohonen maps (1990).• Radial Basis Functions Networks (Poggio and Girosi, 1995; Borghese and Ferrari, 1998).
21
http:\\homes.dsi.unimi.it\∼borghese41/70A.A. 2003-2004
3D structure (II)
Linear approximation (mesh):•Delauney triangulation (Watson, 1981; Fang and Piegl, 1992). Direct tessellation.• Alpha shapes, ball pivoting (Bernardini et al., 2000). Post processing to regularize a Delauney tessellation.• Polymesh models (Singh et al., 1995).
Finite element models• It is a class per sé. Local modeling. Mechnical modeling.• Largely used for animation in medicine (facial animation, deformation of tissue during surgery). Multi- layer modeling.
LODModelli a dettaglio e risoluzione diversi.
http:\\homes.dsi.unimi.it\∼borghese42/70A.A. 2003-2004
Models from range data
Cyb
erw
are
who
le b
ody
scan
ner,
WB
4
22
http:\\homes.dsi.unimi.it\∼borghese43/70A.A. 2003-2004
Models from range data (II)
Cyberware smaller model3030
http:\\homes.dsi.unimi.it\∼borghese44/70A.A. 2003-2004
3D structure from range data (III)
Polhemus hand held laser scanner
23
http:\\homes.dsi.unimi.it\∼borghese45/70A.A. 2003-2004
Models from range data (IV)
Digibot II.•Platform rotates•Scanner line translates.
http:\\homes.dsi.unimi.it\∼borghese46/70A.A. 2003-2004
Components of a VR system
• Input systems.
• World generators.
• Graphical engine.
• Output systems.
24
http:\\homes.dsi.unimi.it\∼borghese47/70A.A. 2003-2004
Graphical representation
Graphical engines represent triangles => Every shape istransformed into triangles.
•The models created by the scanners are ensembles of triangles(milions of). •Much more than required by applications.
⇓
Mesh compression. Representation of the same geometry/pictorial attributes, with a reduced set of triangles.
http:\\homes.dsi.unimi.it\∼borghese48/70A.A. 2003-2004
Gaze directed rendering
25
http:\\homes.dsi.unimi.it\∼borghese49/70A.A. 2003-2004
The graphical engine (visual computing)
Parallelization (graphical boards, SIMD architectures on Pentium IV).
Multiple cache levels.
Pipelining (graphical and computational).
Look-ahead code optimization (compiler optimization).
Hardware acceleration of graphical operations (OpenGL, texture mapping…).
Double buffering (for real-time visualization of 3D models).
http:\\homes.dsi.unimi.it\∼borghese50/70A.A. 2003-2004
Collision detection
Computational demanding (On2EF).Use of multiresolution models.Hierarchical detection.
Geometry semplification (axes aligned faces).Check for common volumes.Extraction of the faces belonging to these volumes.Octree of the pairs of candidate faces.Check for intersection.
26
http:\\homes.dsi.unimi.it\∼borghese51/70A.A. 2003-2004
Animation •Key-frame animation.•Motion capture.•Dynamic animation.
http:\\homes.dsi.unimi.it\∼borghese52/70A.A. 2003-2004
Components of a VR system
• Input systems.
• World generators.
• Graphical engine.
• Output systems.
27
http:\\homes.dsi.unimi.it\∼borghese53/70A.A. 2003-2004
Output systems
Requirements for the monitor:• Large field of view (180o x 150o).•High spatial resolution (35 pixels/degree, equivalent to 12,000x12,000 pixels for a 19" display positioned at 70cmfrom the viewer).
Requirements for the world generator:•Stereoscopic vision for objects with D < 10m.•Monocular cues for objects with D > 10m.
• - Occlusions.• - Geometrical perspective and a-priori model knowledge.• - Shading.• - Motion.
http:\\homes.dsi.unimi.it\∼borghese54/70A.A. 2003-2004
The human eye
Its behavior is very similar to that of a photocamera
28
http:\\homes.dsi.unimi.it\∼borghese55/70A.A. 2003-2004
Stereo-disparity
Points further away are projected on points closer to the image center.
Vergence and focusing are strictly connected.
http:\\homes.dsi.unimi.it\∼borghese56/70A.A. 2003-2004
Passive stereo
29
http:\\homes.dsi.unimi.it\∼borghese57/70A.A. 2003-2004
Stereo image for passive stereo
http:\\homes.dsi.unimi.it\∼borghese58/70A.A. 2003-2004
Output devices (eye-glasses)
Semi-immersive: Eye-glasses (video accuracy, but user is not allowed to move, lateral vision is permitted, which limits virtual realism).
30
http:\\homes.dsi.unimi.it\∼borghese59/70A.A. 2003-2004
HMD (n-vision)
Up to 1280 x 1024, 180Hz. Time multiplexing.
http:\\homes.dsi.unimi.it\∼borghese60/70A.A. 2003-2004
Output devices (BOOM HMD)
Up to 1280 x 1024 pixels / eye CRT TechnologyHead tracking is integrated.
31
http:\\homes.dsi.unimi.it\∼borghese61/70A.A. 2003-2004
I-glasses (games)
http:\\homes.dsi.unimi.it\∼borghese62/70A.A. 2003-2004
Responsive work-bench (Strauss et al., 1995)
Virtual 3D objects are positioned on a working table. They are created projecting the stereo images over the table surface.
32
http:\\homes.dsi.unimi.it\∼borghese63/70A.A. 2003-2004
CAVE
Room 2.5m x 2.5m with Virtual images (steoscopic) projected onto its walls.
More people andComplete immersivity.
http:\\homes.dsi.unimi.it\∼borghese64/70A.A. 2003-2004
Large screen displaysLarge screen displays
Workwall
33
http:\\homes.dsi.unimi.it\∼borghese65/70A.A. 2003-2004
Wearable devices
(a) HMD – 320x240 VGA (b) Keyboard on cloth
Characteristics: mobile, context sensitive, augmented reality.
http:\\homes.dsi.unimi.it\∼borghese66/70A.A. 2003-2004
Physiological problems
• SIMM and VR sickness limit the exposure time.
• Size and distances misperception.
• Limited range in extrapersonal space.
34
http:\\homes.dsi.unimi.it\∼borghese67/70A.A. 2003-2004
Other output devices
Audio – Stereo, sound spatialization.
Force – Same devices which measure the force exerted by the subject.
http:\\homes.dsi.unimi.it\∼borghese68/70A.A. 2003-2004
Applications
• Army• Medicine• Industry (inspection, virtual prototyping)• Chemistry and Physics• Virtual theaters and theme parks• Enterteinment• Comunication• Engineering, Ergonomics and Architecture.• History.
35
http:\\homes.dsi.unimi.it\∼borghese69/70A.A. 2003-2004
La città di Giotto
http:\\homes.dsi.unimi.it\∼borghese70/70A.A. 2003-2004
La tomba di Nefertari