Genetic Algorithms Applied to Machine Learning

Robot Control (in English)

Robot Control using Genetic Algorithms

Summary

• Introduction
  – Robot Control
  – Khepera Simulator

• Genetic Model for Path Planning
  – Chromosome Representation
  – Evaluation Function
  – Case Studies

• Conclusions

The Robot Controller Problem

• Given a robot and a description of an environment, provide commands (motor speeds) to the robot, in order to achieve a path between two specified locations, which is collision-free and satisfies certain optimisation criteria.

[Figure: path from the initial location (xi, yi) to the final location (xf, yf)]

Optimisation Criteria

• Robot should:
  – attempt near-optimal paths
  – avoid obstacles
  – perform straight motion

• Controller should be independent of:
  – the robot’s environment
  – target location

The Khepera Simulator

• Freeware mobile robot simulator (designed by Olivier Michel, University of Nice Sophia-Antipolis)

• User designed worlds

• Control algorithms can be written in C/C++

• Robot’s position and angle reading

• 8 sensors (S0-S7): [0, 1023]

• 2 motors (M1, M2): [-10, +10]

[Figure: Khepera robot in the Robot’s World, with sensors S1, S2 and S3 identifying its front]

Simulator Readings: sensors, position and angle

• S0-S7: [0, 1023], from obstacle not detected (0) to obstacle very close (1023)

• α: [-π, π], the angle of the robot with the world

Control Mode

• To evolve the robot’s attitudes as it interacts with the environment

• Each robot action determines:
  – how well the controller performs with respect to a given task;
  – the next input stimuli to the controller.

• The controller should learn as the robot interacts with the environment

Controller Model

• Genetic Algorithm: evolves the robot’s attitudes
• Inputs to the controller: Sensors, Position, Robot’s Angle, Goal Location
• Outputs to the Khepera Simulator: Motor 1, Motor 2

Proposed Model based on human behavior

IF obstacle detected
THEN avoid collision, forget target
ELSE go straight to the target according to the target direction

Sensors Reading Simplification

[Figure: robot with the sensor groups Sleft and Sright marked]

Sleft = ( S0 + S1 + S2 ) / 3

Sright = ( S3 + S4 + S5 ) / 3

Sback = ( S6 + S7 ) / 2
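The three averages above can be sketched in a few lines of Python. This is only an illustration: the function name `simplify_sensors` and the list layout (index 0 for S0 up to index 7 for S7) are assumptions, not part of the simulator's API.

```python
def simplify_sensors(s):
    """Collapse the 8 raw readings S0..S7 into (Sleft, Sright, Sback)."""
    if len(s) != 8:
        raise ValueError("expected 8 sensor readings S0..S7")
    s_left = (s[0] + s[1] + s[2]) / 3   # Sleft = (S0 + S1 + S2) / 3
    s_right = (s[3] + s[4] + s[5]) / 3  # Sright = (S3 + S4 + S5) / 3
    s_back = (s[6] + s[7]) / 2          # Sback = (S6 + S7) / 2
    return s_left, s_right, s_back


# Hypothetical readings in the simulator's [0, 1023] range.
readings = [0, 300, 600, 900, 1023, 1023, 0, 100]
print(simplify_sensors(readings))  # (300.0, 982.0, 50.0)
```

Averaging reduces eight inputs to three, which keeps the number of situations the controller has to distinguish small.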

Determining the Target Direction

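Robot position: (x, y); robot angle: α (in the figure, α = π/2)

Arrival position: (xf, yf)

β = tan⁻¹[(yf - y)/(xf - x)]

Target direction: β - α

[Figure: circle of β - α values, marked at π/2, −π/4, −π/2 and −3π/4, split into four sectors]

• Goal point in front
• Goal point to left side
• Goal point to right side
• Goal point behind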
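A minimal Python sketch of this classification follows. Several details are assumptions rather than facts from the slides: `math.atan2` replaces the plain tan⁻¹ so that the quadrant of β is resolved, the difference β - α is wrapped into [-π, π), the sector limits are taken as ±π/4 and ±3π/4, and positive differences are read as "left". The function name `target_direction` is hypothetical, and "back" stands for "goal point behind".

```python
import math


def target_direction(x, y, alpha, xf, yf):
    """Classify the goal as front / left / right / back relative to the robot."""
    beta = math.atan2(yf - y, xf - x)  # angle of the goal as seen from (x, y)
    diff = beta - alpha
    # Wrap into [-pi, pi) so the sector test below is unambiguous (assumption).
    diff = (diff + math.pi) % (2 * math.pi) - math.pi
    q = math.pi / 4
    if -q <= diff <= q:
        return "front"
    if q < diff <= 3 * q:
        return "left"   # assumed: positive angles are to the robot's left
    if -3 * q <= diff < -q:
        return "right"
    return "back"


# Robot at the origin facing "up" (alpha = pi/2), as in the slide's figure.
print(target_direction(0, 0, math.pi / 2, 0, 5))   # front
print(target_direction(0, 0, math.pi / 2, 5, 0))   # right
```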

Direction =

IF (Sleft > L) or (Sright > L) or (Sback > L)
THEN obstacle detected: avoid collision, forget target
     Proximity-sensor = highest value (Sleft, Sright, Sback)
ELSE obstacle not detected (collision-free): go straight to the target
     Target direction = β - α

L = collision threshold = 900
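The rule above could be written as the following Python sketch. The names `classify_situation` and `COLLISION_THRESHOLD`, and the `(kind, side)` tuple it returns, are illustrative assumptions; the target direction is passed in as a ready-made label.

```python
COLLISION_THRESHOLD = 900  # L in the slides


def classify_situation(s_left, s_right, s_back, target_direction):
    """Return ("obstacle", side) or ("free", target_direction)."""
    sensors = {"left": s_left, "right": s_right, "back": s_back}
    if any(value > COLLISION_THRESHOLD for value in sensors.values()):
        # Obstacle detected: forget the target, react to the strongest reading.
        proximity_sensor = max(sensors, key=sensors.get)
        return ("obstacle", proximity_sensor)
    # Collision-free: the situation is the target direction itself.
    return ("free", target_direction)


print(classify_situation(300.0, 982.0, 50.0, "front"))  # ('obstacle', 'right')
print(classify_situation(100.0, 200.0, 50.0, "left"))   # ('free', 'left')
```

Each returned pair names one situation, so it can later be used to look up the attitude (motor speeds) that applies.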

Genetic Algorithm Modelling

• Problem

• Chromosome Representation

• Evaluation Function

• Genetic Operators

• Techniques

• Parameters

Chromosome Representation

Attitudes (Genes)

Gene   Situation                                   Attitude
1      Target direction (collision free): Front    M1, M2
2      Target direction (collision free): Left     M1, M2
3      Target direction (collision free): Right    M1, M2
4      Target direction (collision free): Back     M1, M2
5      Obstacle detected: Left                     M1, M2
6      Obstacle detected: Right                    M1, M2
7      Obstacle detected: Back                     M1, M2

Which speed should be imposed on each motor in each situation the robot is in?
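One way to picture this chromosome is as a list of seven (M1, M2) pairs, one per situation. The Python sketch below assumes that layout; `SITUATIONS`, `random_chromosome` and `motor_speeds` are hypothetical names, and the integer range [-10, +10] is the motor range given for the simulator.

```python
import random

# Gene order assumed from the slide: four collision-free target directions,
# then three obstacle positions.
SITUATIONS = [
    ("free", "front"), ("free", "left"), ("free", "right"), ("free", "back"),
    ("obstacle", "left"), ("obstacle", "right"), ("obstacle", "back"),
]
M_MIN, M_MAX = -10, 10  # motor speed range of the simulator


def random_chromosome(rng):
    """One (M1, M2) integer pair per situation."""
    return [(rng.randint(M_MIN, M_MAX), rng.randint(M_MIN, M_MAX))
            for _ in SITUATIONS]


def motor_speeds(chromosome, situation):
    """Look up the attitude (M1, M2) that the chromosome prescribes."""
    return chromosome[SITUATIONS.index(situation)]


rng = random.Random(0)
chromosome = random_chromosome(rng)
print(motor_speeds(chromosome, ("obstacle", "left")))
```

The controller is then a lookup table: the genetic algorithm only has to find good entries for it.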

Evaluation Function

$$F = \sum_{i=1}^{7} V_i \, D_i \, A_i$$

$$0 \le V_i \le 1; \quad 0 \le D_i \le 1; \quad 0 \le A_i \le 1$$

• Main objectives:
  – (V) speed: as high as possible
  – (D) straight motion: same motor speed for M1 and M2
  – (A) action: reach a target and avoid obstacles

• Calculated based on the contribution for each gene [1,7], at each step.

$$V_i = \frac{|M_1| + |M_2|}{2 \cdot M_{\max}}$$

• Normalised sum of the absolute values of the motor speeds

• Vi increases as both speeds increase

• Whatever the robot does, it does quickly.
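As a quick check of the speed term, here is a small Python sketch of the normalised sum of absolute motor speeds, (|M1| + |M2|) / (2 · Mmax). The function name `speed_term` is hypothetical, and Mmax = 10 is assumed from the simulator's motor range [-10, +10].

```python
M_MAX = 10  # assumed from the motor range [-10, +10]


def speed_term(m1, m2, m_max=M_MAX):
    """V: normalised sum of the absolute motor speeds, in [0, 1]."""
    return (abs(m1) + abs(m2)) / (2 * m_max)


print(speed_term(10, 10))   # 1.0  (full speed forward)
print(speed_term(-10, 10))  # 1.0  (spinning in place scores just as high)
print(speed_term(5, 0))     # 0.25
```

The second call shows why V alone is not enough: it rewards fast motion of any kind, so the other two terms have to steer it.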

Straight Motion

$$D_i = \begin{cases} 1 - \left( \dfrac{M_1 - M_2}{2 \cdot M_{\max}} \right)^2, & i = [1,5,6,7] \\ 1, & i = [2,3,4] \end{cases}$$

• It favours high positive speeds to both motors

• When the robot is not oriented to the target (2,3,4), D=1 avoids contradictory learning
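The sketch below illustrates the straight-motion term in Python. It assumes one reading of the slide's formula, D = 1 - ((M1 - M2) / (2 · Mmax))² for genes 1, 5, 6 and 7 and D = 1 for genes 2, 3 and 4; the exact form of the original expression is uncertain, so treat this as an assumption. The function name `straight_motion_term` is hypothetical.

```python
def straight_motion_term(gene, m1, m2, m_max=10):
    """D: 1 when both motors agree, dropping as their speeds diverge.

    Assumed form: 1 - ((M1 - M2) / (2 * Mmax)) ** 2 for genes 1, 5, 6, 7.
    Genes 2, 3, 4 (target not in front) always get D = 1.
    """
    if gene in (2, 3, 4):
        return 1.0  # turning is expected here, so it is not penalised
    return 1.0 - ((m1 - m2) / (2 * m_max)) ** 2


print(straight_motion_term(1, 10, 10))   # 1.0
print(straight_motion_term(1, 10, -10))  # 0.0
print(straight_motion_term(3, 10, -10))  # 1.0
print(straight_motion_term(5, 10, 0))    # 0.75
```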

Action

• It considers the benefit of each gene regarding:
  – obstacle avoidance
  – target closeness

$$AA_i = \begin{cases} \dfrac{\Delta d_i}{d_{\max}}, & \text{if } (i = 1) \text{ and } (\Delta d_i \le 0), \text{ else } AA_i = 0 \\ \dfrac{\Delta g_i}{g_{\max}}, & \text{if } (i = [2,3,4]) \text{ and } (\Delta g_i \le 0), \text{ else } AA_i = 0 \\ 1 - \dfrac{\dots}{\max}, & i = [5,6,7] \end{cases}$$

$$A_i = \frac{\sum_{t=1}^{TP_i} AA_i}{TP_i}$$

TPi = total of steps executed by attitude i

AAi = action’s fitness at step t of attitude i

• For collision free/front (i = 1), AAi rates the distance variation to the target between two consecutive steps against the maximum distance in one step.

• For collision free left, right, back (i = [2,3,4]), AAi rates the angle variation between two consecutive steps against the maximum angle in one step.

• For obstacle detected (i = [5,6,7]), AAi increases as the distance to the proximity-sensor increases in the step.

Improving the Target Direction Model

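First Target Direction Model: 4 possible target directions

[Figure: circle of Â2 - Â1 values divided at π/4, 3π/4, -π/4 and -3π/4 into Target in FRONT, Target at LEFT, Target Behind and Target at RIGHT]

Improved Target Direction Model: 8 possible target directions

[Figure: the same circle, with limits at -3π/4 and -π/4 marked, divided into 8 sectors]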
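A possible Python sketch of the 8-direction model is shown below. The slides do not give the sector limits for this model, so the code assumes that each of the four original sectors is simply split in half (at 0, ±π/2 and ±π) and that positive angle differences are to the left. The labels follow the 11-gene chromosome (front left, front right, left front, and so on); the function name `target_direction_8` is hypothetical.

```python
import math


def target_direction_8(diff):
    """Map an angle difference (target angle minus robot angle) to 8 sectors."""
    # Wrap into [-pi, pi) first (assumption, to keep the tests below simple).
    diff = (diff + math.pi) % (2 * math.pi) - math.pi
    q = math.pi / 4
    if 0 <= diff <= q:
        return "front-left"
    if -q <= diff < 0:
        return "front-right"
    if q < diff <= 2 * q:
        return "left-front"
    if 2 * q < diff <= 3 * q:
        return "left-back"
    if -2 * q <= diff < -q:
        return "right-front"
    if -3 * q <= diff < -2 * q:
        return "right-back"
    if diff > 3 * q:
        return "back-left"
    return "back-right"


print(target_direction_8(0.1))          # front-left
print(target_direction_8(math.pi / 3))  # left-front
print(target_direction_8(-2.0))         # right-back
```

With finer sectors the robot can distinguish a target that is slightly off-axis from one that needs a sharp turn, which is the motivation for moving from 4 to 8 directions.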

Chromosome Representations

Each state corresponds to a single attitude (a pair of speeds M1 and M2), and each attitude corresponds to one gene of the chromosome.

Chromosome with 7 attitude genes

Situation                            Genes (one M1, M2 pair each)
Target direction (collision free)    Front, Left, Right, Back
Obstacle detected                    Left, Right, Back

Chromosome with 11 attitude genes

Situation                            Genes (one M1, M2 pair each)
Target direction (collision free)    Front left, Front right, Left front, Left back, Right front, Right back, Back left, Back right
Obstacle detected                    Left, Right, Back

Genetic Algorithm

• Integer chromosome
• Population Size = 100
• Generations = 50
• Crossover Rate = 80%
• Mutation Rate = 4%
• Roulette Wheel Reproduction
• Elitism
• Linear scaling of fitness
• 300 Evaluation Steps for each chromosome
• Average of 25 Experiments
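The parameters above can be put together in a compact Python sketch of the evolutionary loop. Only the numeric settings come from the slides. The one-point crossover, the per-integer mutation, the scaling constant `c = 2.0` and all function names are assumptions, and `toy_fitness` is a hypothetical stand-in for the 300-step simulator evaluation (it simply rewards high motor values and must stay non-negative for the roulette wheel to work).

```python
import random

POP_SIZE, GENERATIONS = 100, 50             # from the slides
CROSSOVER_RATE, MUTATION_RATE = 0.80, 0.04  # from the slides
N_GENES, M_MIN, M_MAX = 7, -10, 10          # 7 attitudes, motors in [-10, +10]


def random_chromosome(rng):
    # Flat integer chromosome: M1, M2 of gene 1, then M1, M2 of gene 2, ...
    return [rng.randint(M_MIN, M_MAX) for _ in range(2 * N_GENES)]


def linear_scaling(fits, c=2.0):
    # Assumed scheme: f' = a*f + b keeps the average and maps the best to c*average.
    avg, best = sum(fits) / len(fits), max(fits)
    if best == avg:
        return [1.0] * len(fits)  # flat population: select uniformly
    a = (c - 1.0) * avg / (best - avg)
    b = avg * (1.0 - a)
    return [max(a * f + b, 0.0) for f in fits]  # clamp negatives to zero


def evolve(fitness_fn, rng, pop_size=POP_SIZE, generations=GENERATIONS):
    pop = [random_chromosome(rng) for _ in range(pop_size)]
    for _ in range(generations):
        fits = [fitness_fn(ch) for ch in pop]
        elite = pop[fits.index(max(fits))]
        weights = linear_scaling(fits)
        nxt = [elite[:]]  # elitism: the best chromosome survives unchanged
        while len(nxt) < pop_size:
            # Roulette wheel: pick parents with probability proportional to weight.
            p1, p2 = rng.choices(pop, weights=weights, k=2)
            c1, c2 = p1[:], p2[:]
            if rng.random() < CROSSOVER_RATE:
                cut = rng.randint(1, len(p1) - 1)  # one-point crossover (assumed)
                c1, c2 = p1[:cut] + p2[cut:], p2[:cut] + p1[cut:]
            for child in (c1, c2):
                for k in range(len(child)):
                    if rng.random() < MUTATION_RATE:
                        child[k] = rng.randint(M_MIN, M_MAX)
                nxt.append(child)
        pop = nxt[:pop_size]
    fits = [fitness_fn(ch) for ch in pop]
    return pop[fits.index(max(fits))], max(fits)


def toy_fitness(ch):
    # Hypothetical stand-in for the simulator: mean motor value mapped to [0, 1].
    return sum(v - M_MIN for v in ch) / (len(ch) * (M_MAX - M_MIN))


best, fit = evolve(toy_fitness, random.Random(0))
print(best, round(fit, 3))
```

In the setting described by the slides, `fitness_fn` would instead run the chromosome in the simulator for 300 steps, and the whole run would be repeated 25 times to average the results.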

Genetic Algorithm Performance: 7 Genes Chromosome

[Plots: best chromosomes in 1 experiment; average of best chromosomes in 25 experiments. x-axis: number of generations (1 to 49)]

Genetic Algorithm Performance: 11 Genes Chromosome

[Plots: best chromosomes in 1 experiment; average of best chromosomes in 25 experiments]

Paths Achieved in World 1: Case Study 1

[Figure: paths achieved with the 7 Genes Chromosome and with the 11 Genes Chromosome]

Speed Comparison

[Chart: speed per case study (x-axis: 1 to 5) for the 7 Genes Chromosome and the 11 Genes Chromosome]

Speed Comparison (%)

[Chart: 7 Genes Chromosome vs 11 Genes Chromosome; values shown: 84.62%, 40.00%, 214.29%]

Speed Comparison – World 3

                7 Genes Chromosome     11 Genes Chromosome
Case Study 1    target not reached     729
Case Study 2    1124                   618
Case Study 3    target not reached     target not reached
Case Study 4    target not reached     1655

Conclusions

• A simple GA was able to gradually evolve the robot control

• The robot achieved a near-optimal path towards the goal, avoiding obstacles

• Retraining is not necessary when the environment changes

• Controller performance improved with the 11 genes model

• The robot has no memory of previous unsuccessful paths and may get lost

• Other tasks can be included in the model (e.g. energy supply)

• The chromosome encoding is limited to a few robot situations
