Scaling Sampling-based Motion Planning to Humanoid Robots · 2016. 10. 28. · Lower body movement...

Scaling Sampling-based Motion Planning to Humanoid Robots

Yiming Yang, Vladimir Ivan, Wolfgang Merkt, Sethu Vijayakumar

Abstract— Planning balanced and collision-free motion forhumanoid robots is non-trivial, especially when they are oper-ated in complex environments, such as reaching targets behindobstacles or through narrow passages. Research has been donein particular to plan such complex motion on humanoids,however, these approaches are typically restricted to particularrobot platforms and environments, which can not be easilyreplicated nor applied. We propose a method that allows usto apply existing sampling-based algorithms directly to plantrajectories for humanoids by utilizing a customized statespace representation, biased sampling strategies, and a steeringfunction based on a robust inverse kinematics solver. Ourapproach requires no prior offline computation, thus one caneasily transfer the work to new robot platforms. We tested theproposed method by solving practical reaching tasks on a 38degrees-of-freedom humanoid robot, NASA Valkyrie, showingthat our method is able to generate valid motion plans that canbe executed on advanced full-size humanoid robots.

I. INTRODUCTIONHumanoid robots are highly redundant systems that are de-

signed for accomplishing a variety of tasks in environmentsdesigned for people. However, humanoids have a large num-ber of degrees-of-freedom (DoF) which makes motion plan-ning extremely challenging. In general, optimization-basedalgorithms are suitable for searching for optimal solutionseven in high dimensional systems [1] [2], but it is non-trivialto generate optimal collision-free trajectories for humanoidsusing optimization approaches within timeframes acceptablefor online planning, especially in complex environments.This is mainly due to the highly non-linear map between therobot and the collision environment. This mapping can belearned [3] [4] [5] or modelled in abstract spaces [6] [7] forlow dimensional problems, but is too difficult for high DoFhumanoids due to the curse of dimensionality and it oftencauses local minima problems. Additionally, solving locomo-tion and whole-body manipulation in complex environmentsas one combined problem requires searching through a largespace of possible actions. Instead, it is more effective tofirst generate robust walking plans to move the robot to adesired standing location, and then generate collision-freemotion with stationary feet [8]. Although assuming fixedfeet position may be viewed as restrictive, we argue thata large variety of whole body manipulation tasks can stillbe executed as a series of locomotion and manipulationsubtasks. In this work, we propose an extension to a familyof sampling based motion planning algorithms that will allowus to plan collision-free whole-body motions on floating-basesystems which require active balancing.

All authors are with School of Informatics, University of Edinburgh(Informatics Forum, 10 Crichton Street, Edinburgh, EH8 9AB, UnitedKingdom). email: [email protected]

Fig. 1: Collision-free and balanced full-body motion exe-cuted on the 38 DoF NASA Valkyrie robot. Lower bodymovement is not shown for clarity.

Sampling-based planning (SBP) algorithms, such as RRT[9] and PRM [10], are capable of efficiently generatingglobally valid collision-free trajectories due to their sim-plicity. In the past two decades, SBP algorithms have beenapplied to countless problems with a variety of derivatives,such as RRT-Connect [11], Expansive Space Trees (EST)[12], RRT*/PRM* [13], Kinematic Planning by Interior-Exterior Cell Exploration (KPIECE) [14], and many others[15]. However, since the SBP algorithms were originallydesigned for mobile robots and low DoF robotic arms, usingthem on high DoF systems requiring active balancing is stillchallenging. We will call a robot pose statically balancedif the controller can achieve an equilibrium in this statewhile achieving zero velocity and acceleration (e.g. whenthe projection of centre of mass lies within the support poly-gon). The subset of robot configurations with this propertyforms a low dimensional manifold defined by the balanceconstraint. In practice, the rejection rate of random samples isprohibitively high without the explicit or implicit knowledgeof the manifold.

Approaches have been proposed to address this particularproblem of using SBP algorithms for humanoid robots.Kuffner et al. [16] use a customized RRT-Connect algo-rithm to plan whole body motion for humanoids, wherethey only sample from a pre-calculated pool of posturesfor which the robot is in balance. Hauser at al. [17] in-troduce motion primitives into SBP algorithms where the

sethuText BoxIn Proceedings of IEEE International Conference on Robotics and Biomimetics, Dec. 3–7, 2016, Qingdao, China

sampler only samples states around a set of pre-stored motionprimitives. These approaches share the common idea ofusing an offline generated sample set to bootstrap onlineprocesses, thus allowing algorithms to bypass the expensiveonline generation of balanced samples [16] [17]. By storingsample configurations with different lower body postures,one can also generate whole body motion that consists ofcoordinated locomotion and upper body movement [18].However, this leads to the problem where one has to store asignificant number of samples to densely cover the balancemanifold, otherwise the algorithms may fail while validsolutions exist but were not stored during offline processing.Another direction for solving humanoid motion planningproblem is constraint sampling. Dalibard et al. [19] replacethe steering/interpolation component in RRT-Connect with aconstraint connect function to ensure the new nodes addedto the tree are balanced and collision free. Fumio et al. [20]split the whole body into several kinematic chains by fixingthe base height. Such separation reduces the processingtime for whole body kinematics, however, one loses theredundancy of the full kinematic structure. Most of theexisting approaches are normally platform specific, whichmakes it difficult and time consuming to transfer the work toother robot platforms for generic humanoid motion planningproblems. Sampling-based planning methods for humanoidsare by no means new concepts in robotics. However, thesemethods are often customized from basic algorithms suchas RRT-Connect for particular robots and environments, itis non-trivial to reuse or apply these methods on generichumanoid robots for solving generic problems.

To this end, instead of developing new SBP algorithmsspecifically for humanoids, we focus on enabling the stan-dard SBP algorithms to solve humanoids motion planningproblems by modifying the underlying key componentsof generic SBP approaches, such as space representation,sampling strategies and interpolation functions. In order tomake the method generic for any humanoid platforms, ratherthan store balanced samples during offline processing, weuse a non-linear optimization based [21] whole-body inversekinematics (IK) solver to generate balanced samples on-the-fly. Thus, the proposed method can be easily applied todifferent humanoid robot platforms without extensive pre-processing and setup. We evaluate the proposed method ona 36 DoF Boston Dynamics Atlas and a 38 DoF NASAValkyrie humanoid robots, to show that our method is capa-ble of generating reliable collision-free whole-body motionfor a generic humanoid. We also evaluate the differencebetween sampling in end-effector and configuration spacesfor different scenarios, and compare the planning time andtrajectory length to find an optimal trade off between ef-ficiency and optimality. In particular, we apply our workto solve practical reaching tasks on the Valkyrie robot, ashighlighted in Fig. 1, showing that the proposed method cangenerate reliable whole-body motion that can be executed onfull-size humanoid robots.

II. PROBLEM FORMULATION

Let C ∈ RN+6 be a robot’s configuration space, whereN is the number of articulated joints and the additional 6DoF represents the unactuated virtual joint that connects therobot’s pelvis (Tpelvis ) and the worldW ∈ SE(3). Let q ∈ Cbe the robot configuration state, Cbalance ⊂ C the manifoldof statically balanced configurations, Cfree ⊂ C the manifoldof collision free configurations and Cvalid ≡ Cbalance ∩ Cfreethe valid configuration manifold.

For humanoid robots, valid trajectories can only containstates from valid configuration manifold, i.e. q[0:T ] ⊂ Cvalid .Generating collision free samples is straightforward by usingrandom sample generators and standard collision checkinglibraries. However, generating balanced samples is non-trivial, where a random sampling technique is incapableof efficiently finding balanced samples on the low dimen-sional manifold Cbalance by sampling in high dimensionalconfiguration space C. Guided sampling or pre-samplingprocess is required for efficient valid sample generation.In our approach, a whole-body inverse kinematic solver isemployed to produce statically balanced samples. The staticbalance constraint is a combination of constraints on feet andCoM poses, i.e. the static balance constraint is consideredas satisfied when the robot’s feet have stable contact withthe ground and the CoM ground projection stays within thesupport polygon spanned by the foot contact points. In ourcase, we only consider scenarios where both feet are incontact with the ground, however, as long as the contactinformation is given, the method stays the same for whetheronly one or multiple end-effectors are in contact with theenvironment.

A. Whole-body Inverse Kinematics

Given a seed configuration qseed and nominal config-uration qnominal and a set of constraints C, an outputconfiguration that satisfies all the constraints can be generallyformulated as:

q∗ = IK (qseed ,qnominal ,C) (1)

The set of constraints for a whole-body humanoid robotmay include single joint constraints, such as position andvelocity limits for articulated joints, it may also includeworkspace pose constraints, e.g. end-effector poses, centre-of-mass position, etc. In the rest of the paper, unless specifiedotherwise, we assume the quasi-static balance constraintand joint limits constraints are included in C by default.We formulate the IK problem as a non-linear optimizationproblem (NLP) of form:

q∗ = arg minq∈RN+6 ‖q− qnominal‖2Qq (2)subject to bl ≤ q ≤ bu (3)

ci(q) ≤ 0, ci ∈ C (4)

where Qq � 0 is the weighting matrix, bl and bu arethe lower and upper joint bounds. We use a randomlysampled state as the seed pose qseed . We then use thispose as the initial value for the SQP solver [22]. Depending

𝒙"#$%#𝒙&'$%'"#

𝒙$

𝒙( 𝒙)*$+

𝒙%$&,*-

𝒙&'.

Motion evaluation

Interpolation

Sampling

Samplingspace𝒳

Fig. 2: Instead of developing new algorithms, we mod-ify those underlying components in SBP solvers to makestandard algorithms be capable of solving motion planningproblems for humanoid robots.

on the implementation of the SBP algorithm, we eitherchoose qnominal to be the current robot state or one of theneighbouring poses drawn from the pool of candidate posesalready explored by the SBP algorithm.

B. Sampling-based Motion Planning

Let x ∈ X be the space where the sampling is carried out.The planning problem can be formulated as

q[0:T ] = Planning(Rob,Env,x0,xT ) (5)

where Rob is the robot model and Env is the environmentinstance in which this planning problem is defined. x0 andxT are the initial and desired states.

In order for SBP algorithms to be able to plan motionsfor humanoid robots, we need to modify the followingcomponents that are involved in most algorithms as shownin Fig. 2: the space X where the sampling is carried out;the strategies to draw random samples; and the interpolationfunction which is normally used in steering and motionevaluation steps. In the next section, we will discuss thedetails of modifications we applied on those components forscaling standard SBP algorithms to humanoids.

III. SAMPLING-BASED PLANNING FOR HUMANOIDS

We separate the work into two parts, configuration spacesampling and end-effector space sampling. In configurationspace sampling approach, the state is represented in RN+6space with joint limits and maximum allowed base movementas the bounds, the sampling state is identical to robot config-uration, i.e. x = q ∈ C. For reaching and grasping problems,one might be interested in biasing the sampling in the end-effector related constraints, e.g. to encourage shorter end-effector traverse distance. The end-effector space approachsamples in SE(3) space with a region of interests aroundthe robot as the bounds, the state is equivalent to the end-effector’s forward kinematics, i.e. x = Φ(q) ∈ W whereΦ(·) is the forward kinematics mapping. However, the finaltrajectories are represented in configuration space, thus weassociate a corresponding configuration for each end-effector

Algorithm 1 Humanoid Configuration Space SBPsampleUniform()

1: succeed = False2: while not succeed do3: q̄rand = RandomConfiguration()4: qrand , succeed = IK (q̄rand , q̄rand ,C)

return qrand

sampleUniformNear(qnear , d)1: succeed = False2: while not succeed do3: A← Zeros(N + 6)4: while not succeed do5: q̄rand = RandomNear(qnear , d)6: Set constraint ‖qrand − q̄rand‖W < A7: qrand , succeed = IK (q̄rand ,qnear ,C)8: Increase A9: if distance(qrand ,qnear ) > d then

10: succeed = Falsereturn qrand

interpolate(qa,qb, d)1: q̄int = InterpolateConfigurationSpace(qa,qb, d)2: succeed = False3: A← Zeros(N + 6)4: while not succeed do5: Set constraint ‖qint − q̄int‖W < A6: qint , succeed = IK (q̄int ,qa,C)7: Increase A

return qint

space state to avoid ambiguity and duplicated calls to the IKsolver.

A. Configuration Space Sampling

Algorithm 1 highlights the components’ modificationsrequired for sampling in configuration space:

1) Sampling Strategies: For sampleUniform(), we firstgenerate random samples from X and then use full-body IKsolver to process the random samples to generate samplesfrom the balanced manifold Xbalance

qrand = IK (q̄rand , q̄rand ,C) (6)

where q̄rand ∈ X is a uniform random configuration andqrand ∈ Xbalance is random sample from the balanced man-ifold. We use q̄rand as nominal pose since we want to gen-erate random postures rather than postures close to other al-ready existing samples. This is to indirectly encourage explo-ration of the null-space of the task. The constraint set C con-tains the static balance and joint limit constraints. When sam-pling around a given state (sampleUniformNear(qnear , d)),we first get a random state q̄rand that is close to qnearwithin distance d. The IK solver is invoked with q̄rand asthe seed pose, and qnear as the nominal pose. An additional

configuration space constraint is added to the constraint set

‖qrand − q̄rand‖W ≤ A (7)

where A ∈ RN+6 is a tolerance vector initially set tozero. In most cases the system will be over constrained,in which case we need to increase the tolerance to ensurebalance. Normally, the lower-body joints are neglected first,i.e. increasing corresponding wi, meaning that we allow thelower-body joints to deviate from q̄rand in order to keep thefeet on the ground and maintain balance. We use xnear asthe nominal pose since later on the random state is likely tobe appended to qnear , and one wants the random state beclose to the near state. The new sample is discarded if thedistance between qnear and qrand exceeds the limit d.

2) Interpolation: In order to find a balanced state inter-polated along two balanced end-point states, we first find theinterpolated, likely to be un-balanced state

q̄int = qa + d‖qb − qa‖. (8)

A similar configuration space constraint to (7) is applied toconstrain the balanced interpolated state qint close to q̄int

|qint − q̄int‖ ≤ A (9)

The two end-point states qa and qb are valid samplesgenerated using our sampling strategies. Due to the convexformulation of the balance constraint, a valid interpolatedstate is guaranteed to be found. It is worth mentioning thatin some cases the interpolation distance equation no longerholds after increasing the tolerance, i.e. ‖xint−xa‖‖xb−xa‖ 6= d.However, this is a necessary step to ensure that the balanceconstraint are satisfied.

B. End-Effector Space Sampling

Algorithm 2 highlights the components’ modificationsrequired for sampling in end-effector space:

1) Sampling Strategies: It is straight forward to sample inSE(3) space, however, it is non-trivial to sample balancedsamples from the Xbalance manifold. For sampleUniform(),we first randomly generate SE(3) state x̄rand within a regionof interest in front of the robot. The whole-body IK isinvoked with an additional end-effector pose constraint

‖x̄rand − Φ(qrand)‖ ≤ 0 (10)

The sampler keeps drawing new random states x̄rand untilthe SQP solver returns a valid output q∗. The valid ran-dom state xrand can be calculated using forward kinemat-ics, e.g. xrand = Φ (q∗). The same procedure applies tosampleNear(xnear , d), but using xnear as the seed configu-ration.

2) Interpolation: Similar to sampling near a given state,for interpolation in end-effector space, we first find theinterpolated state x̄int ∈ SE(3) and add the following terminto constraint set

‖x̄int − Φ(q)‖ ≤ B (11)

where B ∈ R6 is a tolerance vector initially set to zero.If the system is over constrained after adding end-effector

Algorithm 2 Humanoid End-Effector Space SBPsampleUniform()

1: succeed = False2: while not succeed do3: x̄rand = RandomSE3 ()4: Set constraint ‖x̄rand − Φ(qrand)‖ ≤ 05: qrand , succeed = IK (q̄rand , q̄rand ,C)

6: xrand = Φ(qrand)return xrand ,qrand

sampleUniformNear(xnear , d)1: succeed = False2: while not succeed do3: x̄rand = RandomNearSE3 (xnear , d)4: Set constraint ‖x̄rand − Φ(qrand)‖ ≤ 05: qrand , succeed = IK (qrand ,qnear ,C)

6: xrand = x̄randreturn xrand ,qrand

interpolate(xa,xb, d)1: x̄int = InterpolateSE3 (xa,xb, d)2: succeed = False3: B ← Zeros(SE3)4: while not succeed do5: Set constraint ‖x̄int − Φ(qint)‖ < B6: qint , succeed = IK (qa,qa,C)7: Increase B8: xint = Φ(qint)

return xint ,qint

pose constraint, we need to selectively relax the tolerancefor different dimensions (x, y, z, roll, pitch, yaw) until theIK solver succeeds. Then we reassign the interpolated stateusing forward kinematics, xint = Φ(qint).

3) Multi-Endeffector Motion Planning: Some tasks re-quire coordinated motion involving multiple end-effectors,e.g. bi-manual manipulation and multi-contact motion. Itis obvious that, from a configuration space point of view,there is no difference as long as the desired configurationis specified. It is also possible for the end-effector spacesampling approach to plan motion with multiple end-effectorconstraints. Let y∗k ∈ SE(3) be the desired pose constraintsfor end-effector k ∈ {1, . . . ,K}. A meta end-effector spaceX ∈ R6×K can be constructed to represent the samplingspace for all end-effectors. Similar sampling and interpo-lation functions can be implemented by constructing extraconstraints for each end-effector k.

IV. EVALUATION

We aim to generalize the common components ofsampling-based motion planning algorithms for humanoidrobots so that existing algorithms can be used withoutextra modification. We implemented our approach in theEXOTica motion planning and optimization framework [23]

TABLE I: Planning time (in sec) of empty space reachingproblem utilising different algorithms, in seconds. The resultis averaged over 100 trails.

Algorithms Sampling SpaceEnd-Effector Space Configuration SpaceRRT 25.863± 22.894 1.4129± 1.4466PRM 4.2606± 3.0322 0.5912± 0.5912EST 28.055± 18.270 0.3112± 0.3112BKPIECE 5.3989± 5.9470 0.1781± 0.0332SBL 3.0602± 0.9859 0.2804± 0.0480RRT-Connect 2.8228± 0.3412 0.1853± 0.0450

Fig. 3: Evaluation tasks, from left to right: task 1, targetclose to robot; task 2, target far away from robot; and task3, target behind bar obstacle.

as a humanoid motion planning solver, which internallyinvokes the SBP planners from the Open Motion PlanningLibrary (OMPL, [24]). We have set up the system withour customized components, and evaluated our approach onthe following six representative algorithms: RRT [9], RRT-Connect [11], PRM [10], BKPIECE [14], EST [12]) and SBL[25]. The evaluations were performed in a single thread witha 4.0 GHz Intel Core i7-6700K CPU.

A. Empty Space Reaching

In the first experiment, we have the robot reach a targetpose in front of the robot in free space, where only self-collision and balance constraints are considered. This is asanity check to show that the proposed method can be usedrobustly across different planning algorithms to generatetrajectories for humanoid robots. We solve the reachingproblem using the six testing algorithms in two differentsampling spaces, each across 100 trials. The results areshown in Table I. Although the planning time varies acrossdifferent algorithms and sampling spaces, the result showsthat standard planning algorithms are able to generate motionplans for humanoid robots using our method. As expected,bi-directional algorithms are more efficient than their unidi-rectional variants. Also, sampling in configuration space ismuch more efficient than in end-effector space due to thehigher number of IK calls for the later case.

B. Collision-free Reaching

We setup three different scenarios, from easy to hard, asillustrated in Fig. 3, to evaluate the performance of differentalgorithms in different sampling spaces. Unfortunately, theevaluation suggests that standard unidirectional algorithmsare unable to solve these problems (within a time limit of 100seconds). Without bi-directional search, the high dimensional

(a) Trajectories generated using configuration space sampling.

(b) Trajectories generated using end-effector space sampling.

Fig. 4: Whole-body motion plans generated using differentsampling spaces. The task is identical for each column. Ingeneral, configuration space sampling leads to shorter tra-jectory length; end-effector space sampling leads to shorterend-effector traverse distance.

humanoid configuration space is too complex for sampling-based methods to explore. Table II highlights the resultsusing four different bidirectional approaches. Note that whensampling in end-effector space, only RRT-Connect is able tofind a valid solution in the given time, other bidirectionalsearch algorithms like BKPIECE and SBL are also unable tofind valid trajectories. The result indicates that RRT-Connectsampling in configuration space is the most efficient andthe most robust approach for solving humanoid whole-bodymotion planning problems. It requires the least exploration,thus bypassing expensive online IK queries. Algorithms likeBKPIECE and SBL use low-dimensional projections to biasthe sampling, however, the default projections which aretuned for mobile robots and robotic arms do not scale upto high DoF humanoid robots, which leads to long planningtime and trajectories with high costs. This can be improvedby better projection bias, but it is non-trivial to find a suitablebias without fine tuning. Also, the trajectories generatedusing RRT-Connect are shorter, meaning that the motion ismore stable and robust. It is worth mentioning that RRT-Connect takes longer time to plan when sampling in theend-effector space than it does in the configuration space,but the planned trajectories have shorter end-effector andCoM traverse distances. In some scenarios where planningtime is not critical, one can choose to use RRT-Connectin end-effector space to generate trajectories with shorterend-effector traverse distance. These results also suggestthat the whole-body IK computation dominates the planningtime. This is in contrast with classical SBP problems wherecollision-detection is the most time consuming component.However, the IK solver is necessary for maintaining balance,as shown in Fig. 5, where the trajectories’ CoM projections

TABLE II: Evaluation of whole-body collision-free motion planning. RRT-Connecte sampling in end-effector space, allother methods sampling in configuration space. C cost is the configuration space trajectory length,W cost is the end-effectortraverse distance in workspace, CoM cost is the CoM traverse distance in workspace. No. evaluation shows the number ofstate evaluation calls, i.e. evaluate if a sampled/interpolated state is valid. No. IK indicates the number of online whole-bodyIK calls, and IK time is the total time required for solving those IK, which is the most time consuming element. The resultis averaged over 100 trails.

Tasks Algorithms Planning time (s) C cost (rad.) W cost (m) CoM cost (m) No. evaluation No. IK IK time (s)

Task 1

BKPIECEc 42.5 ± 26.4 7.37 ± 2.43 2.10 ± 0.80 0.24 ± 0.10 1946 ± 1207 2598 ± 1582 41.4 ± 25.7SBLc 27.8 ± 8.59 6.25 ± 1.06 2.14 ± 0.71 0.23 ± 0.06 1313 ± 418 1508 ± 445 27.0 ± 8.33RRT-Connecte 9.91 ± 4.80 2.93 ± 0.96 0.58 ± 0.11 0.07 ± 0.02 597 ± 354 727 ± 387 9.51 ± 4.58RRT-Connectc 1.53 ± 0.80 2.71 ± 0.68 0.99 ± 0.23 0.11 ± 0.03 95 ± 54 118 ± 64 1.48 ± 0.77

Task 2


Task 3


(a) Reaching motion on the NASA Valkyrie robot. (b) Reaching motion on the Boston Dynamics Atlas robot.

Fig. 5: Collision-free whole-body motion generated in different scenarios with different robot models. The correspondingCoM trajectories are illustrated in the second row (red dots). The framework is setup so that one can easily switch to newrobot platforms without extensive preparing procedures.

are within the support polygon.In more complex scenarios, such as reaching through

narrow passages and bi-manual tasks, most algorithms failto generate valid trajectories apart from RRT-Connect. Asmentioned, some algorithms’ performance depends on thebiasing methods, e.g. projection bias and sampling bias.However, it is non-trivial to find the appropriate bias forhumanoids that would generalize across different tasks.Fig. 5 highlights some examples of reaching motion in morecomplex scenarios with different robot models. As statedearlier, this work focuses on generalising SBP algorithmsfor humanoids, where as one can easily setup the systemon new platforms as long as the robot model is given. Forinstance, one can easily switch from Valkyrie (Fig. 5a) toAtlas (Fig. 5b) in minutes without extensive pre-processingand setup procedures.

In order to test the reliability and robustness of theproposed method, we applied our work on the Valkyrierobot accomplishing reaching and grasping tasks in differentscenarios, as highlighted in Fig 6. During practical experi-ments, the collision environment is sensed by the on–boardsensor and represented as an octomap [26]. The experimentresults show that our method is able to generate collision-free

whole-body motion plans that can be executed on full-sizehumanoid robot to realise practical tasks such as reaching andgrasping. A supplementary video of the experiment resultscan be found at https://youtu.be/AZQY_QOX0Pw.

V. CONCLUSION

In this paper we generalise the key components requiredby sampling-based algorithms for generating collision-freeand balanced whole-body trajectories for humanoid robots.We show that by using the proposed methods, standard SBPalgorithms can be invoked to directly plan for humanoidrobots. We also evaluate the performance of different al-gorithms on solving planning problems for humanoids, andpoint out the limitations of some algorithms. A variety ofdifferent scenarios are tested showing that the proposedmethod can generate reliable motion for humanoid robotsin different environments. This work can be transferred todifferent humanoid robot models with easy setup procedurethat can be done in very a short period of time, withoutextensive pre-processing for adapting the existing algorithmsto different robots, as we have tested on the 36 DoF BostonDynamics Atlas and the 38 DoF NASA Valkyrie robots.In particular, we applied this work on the Valkyrie robot

Fig. 6: Collision-free whole-body motion execution on the NASA Valkyrie humanoid robot. In each row, the first figurehighlights the motion plan, followed by execution snapshots.

accomplishing different tasks, showing that the proposedmethod can generate robust whole-body motion that can beexecuted on real robots.

The whole-body inverse kinematics is crucial in terms ofguaranteeing balance and smoothness, however, the resultin Table II shows that the IK solver dominates over 95%of the online computation time. Although it depends onthe implementation and underlying algorithms of the IKsolver which is not the focus of this paper, we intend toinvestigate faster IK implementation to bootstrap samplingand interpolation. This will make the state space explorationmore efficient, so that other standard algorithms may be ableto find valid solutions within the same time window.

REFERENCES

[1] K. Rawlik, M. Toussaint, and S. Vijayakumar, “On stochastic optimalcontrol and reinforcement learning by approximate inference,” RSS,2012.

[2] N. Ratliff, M. Zucker, J. A. Bagnell, and S. Srinivasa, “CHOMP:Gradient optimization techniques for efficient motion planning,” inICRA, IEEE, 2009.

[3] M. Howard, S. Klanke, M. Gienger, C. Goerick, and S. Vijayakumar,“A novel method for learning policies from variable constraint data,”Autonomous Robots, 2009.

[4] J. Nakanishi, A. Radulescu, and S. Vijayakumar, “Spatio-temporaloptimization of multi-phase movements: Dealing with contacts andswitching dynamics,” in IEEE/RSJ IROS, 2013.

[5] H.-C. Lin, M. Howard, and S. Vijayakumar, “Learning null spaceprojections,” in IEEE ICRA, 2015.

[6] Y. Yang, V. Ivan, and S. Vijayakumar, “Real-time motion adaptationusing relative distance space representation,” in ICAR, IEEE, 2015.

[7] V. Ivan, D. Zarubin, M. Toussaint, T. Komura, and S. Vijayakumar,“Topology-based representations for motion planning and generaliza-tion in dynamic environments with interactions,” IJRR, 2013.

[8] Y. Yang, V. Ivan, Z. Li, M. Fallon, and S. Vijayakumar, “iDRM:Humanoid Motion Planning with Real-Time End-Pose Selection inComplex Environments,” in Humanoids, 2016.

[9] S. M. Lavalle, “Rapidly-Exploring Random Trees: A New Tool forPath Planning,” 1998.

[10] L. E. Kavraki, P. Svestka, J. C. Latombe, and M. H. Overmars, “Prob-abilistic roadmaps for path planning in high-dimensional configurationspaces,” ICRA, 1996.

[11] J. J. Kuffner and S. M. LaValle, “RRT-connect: An efficient approachto single-query path planning,” in ICRA, IEEE, 2000.

[12] D. Hsu, J. C. Latombe, and R. Motwani, “Path planning in expansiveconfiguration spaces,” in ICRA, IEEE, 1997.

[13] S. Karaman and E. Frazzoli, “Sampling-based algorithms for optimalmotion planning,” IJRR, pp. 846–894.

[14] I. A. Şucan and L. E. Kavraki, “Kinodynamic motion planningby interior-exterior cell exploration,” in Algorithmic Foundation ofRobotics, 2009.

[15] M. Elbanhawi and M. Simic, “Sampling-Based Robot Motion Plan-ning: A Review,” IEEE Access, 2014.

[16] J. Kuffner, K. Nishiwaki, S. Kagami, M. Inaba, and H. Inoue, “Motionplanning for humanoid robots,” in ISRR, 2005.

[17] K. Hauser, T. Bretl, K. Harada, and J.-C. Latombe, “Using mo-tion primitives in probabilistic sample-based planning for humanoidrobots,” in Algorithmic foundation of robotics, 2008.

[18] J. J. Kuffner, S. Kagami, K. Nishiwaki, M. Inaba, and H. In-oue, “Dynamically-stable motion planning for humanoid robots,”Autonomous Robots, 2002.

[19] S. Dalibard, A. Nakhaei, F. Lamiraux, and J. P. Laumond, “Whole-body task planning for a humanoid robot: a way to integrate collisionavoidance,” in Humanoids, 2009.

[20] F. Kanehiro, E. Yoshida, and K. Yokoi, “Efficient reaching motionplanning method for low-level autonomy of teleoperated humanoidrobots,” Advanced Robotics, 2014.

[21] R. Tedrake, “Drake: A planning, control, and analysis toolbox fornonlinear dynamical systems,” 2014.

[22] P. E. Gill, W. Murray, and M. A. Saunders, “SNOPT: An SQPAlgorithm for Large-Scale Constrained Optimization,” SIAM, 2005.

[23] V. Ivan, Y. Yang, M. Camilleri, and S. Vijayakumar, “EXOTica:a library for easy creation of tools for optimisation and planning,https://github.com/openhumanoids/exotica.”

[24] I. A. Şucan, M. Moll, and L. E. Kavraki, “The Open Motion PlanningLibrary,” Robotics Automation Magazine, IEEE, 2012.

[25] G. Sánchez and J.-C. Latombe, “A single-query bi-directional prob-abilistic roadmap planner with lazy collision checking,” in RoboticsResearch, 2003.

[26] A. Hornung, K. M. Wurm, M. Bennewitz, C. Stachniss, and W. Bur-gard, “OctoMap: An Efficient Probabilistic 3D Mapping FrameworkBased on Octrees,” Autonomous Robots, 2013.

Scaling Sampling-based Motion Planning to Humanoid Robots · 2016. 10. 28. · Lower body movement...

Documents

Transcript of Scaling Sampling-based Motion Planning to Humanoid Robots · 2016. 10. 28. · Lower body movement...