METR 4202: Robotics & Automationrobotics.itee.uq.edu.au/~metr4202/lectures/L12... · Reference...

Guest Lecture

METR 4202: Robotics & Automation

Dr Surya Singh -- Lecture # 12 (Week 13) October 21, 2019

metr4202-staff@itee.uq.edu.au http://robotics.itee.uq.edu.au/~metr4202/ [http://metr4202.com]

Probabilistic Robotics: Control (LQR, Value Functions, Q-Learning, Deep Reinforcement Learning, etc.)

& The Future of Robotics/Automation (Open Challenges)

METR 4202: Robotics & Automation

Dr Surya Singh -- Lecture # 12 (Week 13) October 21, 2019

metr4202-staff@itee.uq.edu.au http://robotics.itee.uq.edu.au/~metr4202/ [http://metr4202.com]

Reference Material

UQ Library / Online (PDF)

UQ Library(TJ211.4 .L38 1991)

October 21, 2019 -METR 4202: Robotics 3

Probabilistic RoboticsWhat about variation?

Inherent Sources of UncertaintyThere are some common sources of uncertainty:• Inherent stochasticity in the system being modeled

• Incomplete modeling

• Incomplete observability

• Incompetent control (or action)

Two Views of Probability

Frequentist probability• Relates directly to the rates at

which events occur• Basically, things that are

directly “repeatable”• e.g. Drawing a hand of cards

Belief (Bayesian) probability• Degree of a Belief• Qualitative levels of uncertainty• For more details about why a

small set of common sense• Ramsey (1926) –

The same set of axioms control both kinds of probability

II.1 Basic Probability Theory

• Probability Theory– Given a data generating process, what are the properties of

the outcome?

• Statistical Inference– Given the outcome, what can we say about the process that

generated the data?– How can we generalize these observations and make

predictions about future outcomes?

October 21, 2019 -METR 4202: Robotics II.7

In Particular: Bayes’ Rule

• What does Bayes’ Formula helps to find?– Helps us to find:

– By having already known:

( )ABP |

( )|P A B

Example: Inference / Generative Models

Robot ControlsControls Example I: Cart & Pole

(& Obstacles)

Classic State-Space Problem: Cart & Pole• Equations of State:

(" +$) '̈ + $()̈cos) − $()̇/sin) = 3$('̈cos) + $(/)̈ − $4(sin) = 0

• Solution:

'̈ = 63 + $sin)(()̇/ − 4cos)" +$sin/)

)̈ = −3cos) − $()̇/sin)cos) + (" +$)4sin))((" +$sin/)

7̇ ='̇'̈)̇)̈=

'̇63 + $sin)(()̇/ − 4cos)

" +$sin/))̇

−3cos) − $()̇/sin)cos) + (" +$)4sin))((" +$sin/)

'̇3 − $)4

−3 + (" +$)4)("

0 1 0 00 0 −$4" 00 0 0 10 0 (" +$)4

''̇))̇+

− 1("

See Also: Tutorial 12: Cart-Pole Inverted Pendulum (State-Space)

Motion Planning & Control: a Unified Design Tool?

Atkeson, C. & Stephens, B.“Random sampling of states in dynamic programming” IEEE T. Syst. Man Cybern, 2008

Maeda, Singh, Durrant-Whyte, ICRA 2011

GPS tree

Belief tree

Seiler, et al. ICRA, May 2015, 2290-2297

⇡⇤(b) = argmax

s2SR(s, a)b(s)ds+ E

�tR(st, at)|⌧(b, a,O), ⇡⇤

#!POMDP Framework:

12October 21, 2019 -METR 4202: Robotics 12

Why? Bellman’s Principle of Optimality• Suppose we are given a DT ODE

Δ" = $ ", &" ∈ ℝ), & ∈ ℝ*

• Optimal Control:Find &: 0, - → ℝ* that minimizes the cost (or said more optimistically: maximizes the reward):

/ ", & = 0 -, " - + 2345

678ℒ :, " : , & :

• Bellman’s insight (1957):* the optimal control at time s depends only on ; <

(i.e. not any prior states!)∴ It’s an MDP!

Final cost

“Running” cost

Motion Planning & Control: Trajectory Generation with Constraints

Steering/Feedback ControlMethods:

Motion Planning Methods:

Feedback controller

Motion planner

Parameterized control policy(PID, LQR, …)

One Approach: Gain-Scheduled RRTCore Idea: • Basic approach: decoupling à tractable• Integrated approach: use feedback to shortcut the planning phase

xinit* Maeda, G.J; Singh, S.P.N; Durrant-Whyte, H.

“Feedback Motion Planning Approach for Nonlinear Control using Gain Scheduled RRTs”, IROS 2010

Gain-Scheduled RRT: Algorithm

Initialize tree(s)

Design feedback at the goal

Estimate the RoA

Extend tree

Try to reach RoA

Finish

Gain-Scheduled RRT

Holonomic case

Under differential constraints

xinit s(m)

xgoalxrand

Core Idea: • Basic approach: decoupling à tractable• Integrated approach: use feedback to shortcut the planning phase

* Maeda, G.J; Singh, S.P.N; Durrant-Whyte, H. “Feedback Motion Planning Approach for Nonlinear Control using Gain Scheduled RRTs”, IROS 2010

A RRT solution rarely reaches the goal (or connect the two trees) with zero error

How large?

Gain-Scheduled RRT: RRT Connection Gap

Connection gap

Region searchRRT connection is relaxed

Gain-Scheduled RRT: Search

Backwards tree

Forward tree

Single state search:Extensive exploration

Feedback system

Sum of squares relaxation

GS-RRT: RoA & Verification• Find a candidate

• Maximize candidate ( ρ )

• Verify candidate **R. Tedrake, “LQR-Trees: Feedback Motion Planning on Sparse Randomized Trees”, RSS 2009

V(x) =In the LQR case: J: optimal cost-to-go S: Algebraic Ricatti Eq.

Gain-Scheduled RRT:

Automatic Robot Controls

Controls Example II: Underactuated Robotics

The Jitterbug Problem

Solving the Jitterbug Problem: Continuous Action Deep Reinforcement Learning

Solving the Jitterbug Problem:

Future of RoboticsSome Research Examples J

Computer Aided Surgery: R/C Toolholders?

èMove in tandem with heart: Cardiac procedures without stopping it• Unstructured environment (patient) makes this harder

• Biomechanics approach: Predict expected tissue trajectories• (Stochastic) Robot Motion Planning / Control Methods!

Modern (Tele)Surgical Robotics:

ARC DP160100714

Computer Aided Surgery: “Soft” is “Hard”!

Research: Incorporating Stiffness (Haptics): Visual Deformable Object Analysis

Dansereau, Singh, Leitner, ICRA 2016

Iceberg to Titanic: Take Advantage of Information

• 30 Min/Day Talking on Phone – 5.5 days/year of audio samples

– Track this (notably the pauses) over time to detect onset of dimentia

• 150 Photos/Month– Time history for detecting

precursors – Skin cancer monitoring

• More Signals • Stochastic Processing(Think TAPIR!)

Robotics & Health: A Friendly Touch!

∴ Planning to Inform “Novel Robot” Design Strategy

Morley-Drabble & Singh, AIM 2018 Ham, Singh & Lucey, WACV 2017

THE Future of Robotics…

You! J

It’s all up to you!

If you want to build a ship, don't drum up the men to gather wood, divide the work and give orders. Instead, teach them to yearn for the vast and endless sea.

Antoine de Saint-Exupery, "The Wisdom of the Sands"

UQ Robotics: Dynamic Systems in Motion

Decision-Making/Planning

Mechanicsof motion

Aerial Systems

Systems

Pauline Pounds (ANU/Yale)Surya Singh (Stanford/Syd)

Diverse international research group

METR 4202: Robotics & Automationrobotics.itee.uq.edu.au/~metr4202/lectures/L12... · Reference...

Documents

Transcript of METR 4202: Robotics & Automationrobotics.itee.uq.edu.au/~metr4202/lectures/L12... · Reference...

Сақшы №36 (4202)

4202 - RIONEGRO.GOV.AR

Ver Sentencia (4202)

Grupo 4202

Configuração Inicial ONU 4202

Perception & Vision - METR 4202 Advanced Control & Robotics L7-Perception-Vision

[4202] - 326

Mark Brown (UQ), Len Coote (UQ), Spring Sampson(UQ) and

METR 4202: Advanced Control & Roboticsrobotics.itee.uq.edu.au/~metr4202/2012/L7-Perception-Vision.pdf · METR 4202: Advanced Control & Robotics ... 5 20-Aug Obstacle Avoidance & Motion

METR 130: Lecture 3 - sjsu.edu

5. Resumen Metr, Sustento Metrados

BTI 4202: Anonymity - Grothoff

Chapter 5 . Dynamics - METR 4202: Advanced Controls & Robotics 5... · 2015. 9. 21. · Chapter 5 . Dynamics 5.1 Introduction The study of manipulator dynamics is essential for both

Robot Dynamics II: Trajectories & Motionrobotics.itee.uq.edu.au/~metr4202/2013/lectures/L5... · •As θ2 0 (or π): it becomes singular (s2 0) 1 2 METR 4202: Robotics 30 August

METR 4202: Advanced Controls & Robotics - Schedule of Events · 2016. 10. 12. · Week Date Lecture (W: 12:05-1:50, 50-N202) 1 27-Jul Introduction 2 3-Aug Representing Position &

Robot Sensing & Perception · Dr Surya Singh -- Lecture # 6 September 3, 2014 ... METR 4202: Robotics 3 September 2014 -31 Camera calibration – approaches • Possible approaches:

METR 4202: Roboticsrobotics.itee.uq.edu.au/~metr4202/2013/lectures/L1... · 1 Introduction to Robotics METR 4202: Advanced Control & Robotics Dr Surya Singh – UQ Robotics Design

Metr 415/715

Sensors & Measurement...1 Sensors & Measurement “Seeing is forgetting the name of what one sees” – L. Weschler METR 4202: Advanced Control & Robotics Dr Surya Singh Lecture #

METR 4202: Robotics 2 August 2013 - 2robotics.itee.uq.edu.au/.../lectures/L3-Kinematics.v1.pdf3 Continuing from Last Week… METR 4202: Robotics 2 August 2013 - 5 Coordinate Transformations