Autonomous inverted helicopter flight via reinforcement learning

Similar documents
CS 188: Artificial Intelligence

CS 188: Artificial Intelligence Fall Announcements

Announcements. CS 188: Artificial Intelligence Fall Motivating Example. Today. Autonomous Helicopter Flight. Autonomous Helicopter Setup

Announcements. CS 188: Artificial Intelligence Fall So Far: Foundational Methods. Now: Advanced Applications.

CS 188: Artificial Intelligence Fall Announcements

FLYING CAR NANODEGREE SYLLABUS

Innovating the future of disaster relief

52 BACKYARDFLYER.COM FLY

Electric VTOL Aircraft

Exploration 4: Rotorcraft Flight and Lift

German Aerospace Center Flight Operations

How Do Helicopters Fly? An Introduction to Rotor Aeromechanics

Super Squadron technical paper for. International Aerial Robotics Competition Team Reconnaissance. C. Aasish (M.

INTEGRATED SCHEDULING OF DRAYAGE AND LONG-HAUL TRANSPORT

UAV KF-1 helicopter. CopterCam UAV KF-1 helicopter specification

a Challenge for Lift-Based, Rigid Wing AWE Systems

Enhancing Wheelchair Mobility Through Dynamics Mimicking

Designing evtol for the Mission NDARC NASA Design and Analysis of Rotorcraft. Wayne Johnson From VTOL to evtol Workshop May 24, 2018

Autonomous Mobile Robot Design

UAVs using Bayesian Optimization to Locate WiFi Devices

3rd International Conference on Material, Mechanical and Manufacturing Engineering (IC3ME 2015)

Multirotor UAV propeller development using Mecaflux Heliciel

How to use the Multirotor Motor Performance Data Charts

AVIATOR REMOTE CONTROL HELICOPTER

Skycar Flight Control System Overview By Bruce Calkins August 14, 2012

INTRODUCTION TO HELICOPTER FLYING

London calling (probably)

SERVICE INFORMATION LETTER # 0121 Addition A. Subject; Tracking the Enstrom Rotor System using the Chadwick 2000 balance system.

International Journal of Scientific & Engineering Research, Volume 4, Issue 7, July ISSN BY B.MADHAN KUMAR

TAROT ZYX-S2 Three-axis Gyro User Manual

On the role of AI in autonomous driving: prospects and challenges

Designing for Reliability and Robustness with MATLAB

Design and Navigation of Flying Robots

EXPERIMENTAL FLYING AUTONOMOUS VEHICLE

Y. Lemmens, T. Benoit, J. de Boer, T. Olbrechts LMS, A Siemens Business. Real-time Mechanism and System Simulation To Support Flight Simulators

AUTONOMOUS CARS: TECHNIQUES AND CHALLENGES

Implementing Dynamic Retail Electricity Prices

Analysis of Big Data Streams to Obtain Braking Reliability Information July 2013, for 2017 Train Protection 1 / 25

Quadrotor Using Minimal Sensing For Autonomous Indoor Flight

Drones Demystified! Topic: Propulsion Systems

for Unmanned Aircraft

Optimized Electric Drive. Systems

Mercury VTOL suas Testing and Measurement Plan

Friday, 27 June Realizing a small UAV for medical transport in developing countries Master thesis: Ferdinand Peters. Dr.One

Unit 1 Introduction to VEX and Robotics

Control of Mobile Robots

AT-10 Electric/HF Hybrid VTOL UAS

VERT 1 VERTICAL TAKE OFF / LANDING RC PLANE

Predicting Solutions to the Optimal Power Flow Problem

Five Cool Things You Can Do With Powertrain Blockset The MathWorks, Inc. 1

Integrated System Models Graph Trace Analysis Distributed Engineering Workstation

Smart Control of Low Voltage Grids

CONTENTS. Introduction 1. Features 1. Specification 1. Contents 2. Tools And Items 3. Assembly of the front landing gears 4

THE KIDDIE HAWK AIR ACADEMY AND THE LIVING LEGENDS OF AVIATION

Electric Drive - Magnetic Suspension Rotorcraft Technologies

A brief History of Unmanned Aircraft

Professor Dr. Gholamreza Nakhaeizadeh. Professor Dr. Gholamreza Nakhaeizadeh

CHAPTER 11 FLIGHT CONTROLS

HVE Vehicle Accelerometers: Validation and Sensitivity

MIPRover: A Two-Wheeled Dynamically Balancing Mobile Inverted Pendulum Robot

REU: Improving Straight Line Travel in a Miniature Wheeled Robot

Inventory Routing for Bike Sharing Systems

Energy-efficient Autonomous Four-rotor Flying Robot Controlled at 1 khz

Seventh Framework Programme THEME: AAT Breakthrough and emerging technologies Call: FP7-AAT-2012-RTD-L0 AGEN

Segway with Human Control and Wireless Control

A practical investigation of the factors affecting lift produced by multi-rotor aircraft. Aaron Bonnell-Kangas

Shock tube based dynamic calibration of pressure sensors

In recent years, multirotor helicopter type autonomous UAVs are being used for aerial photography and aerial survey. In addition, various

Wheeled Mobile Robots

Better Performance Starts with Better Technology THE BLR ADVANTAGE

Intelligent Energy Management System Simulator for PHEVs at a Municipal Parking Deck in a Smart Grid Environment

Learning to Fly: Design and Construction of an Autonomous Airplane

XIV.C. Flight Principles Engine Inoperative

Fuzzy Architecture of Safety- Relevant Vehicle Systems

SPEED AND TORQUE CONTROL OF AN INDUCTION MOTOR WITH ANN BASED DTC

AGENT-BASED MODELING, SIMULATION, AND CONTROL SOME APPLICATIONS IN TRANSPORTATION

Le développement technique des véhicules autonomes

ELECTRIC VEHICLES DRIVE CONTROL THEORY AND PRACTICE

Caution Notes. Features. Specifications. Installation. A3 3-axis Gyro & Stabilizer User Manual V1.0

Design, Fabrication & Simulation of a Semi-Rigid Helicopter Swashplate Control Mechanisms

INTELLIGENT ENERGY MANAGEMENT IN A TWO POWER-BUS VEHICLE SYSTEM. DISTRIBUTION STATEMENT A. Approved for public release; distribution is unlimited.

Multi-agent systems and smart grid modeling. Valentin Robu Heriot-Watt University, Edinburgh, Scotland, UK

NUmERiCAL STUdY Of HELiCOPTER fuselage AEROdYNAmiC CHARACTERiSTiCS WiTH influence Of main ROTOR

INTELLIGENT ENERGY MANAGEMENT IN A TWO POWER-BUS VEHICLE SYSTEM

Identification of a driver s preview steering control behaviour using data from a driving simulator and a randomly curved road path

Prop effects (Why we need right thrust) Torque reaction Spiraling Slipstream Asymmetric Loading of the Propeller (P-Factor) Gyroscopic Precession

Automated Driving - Object Perception at 120 KPH Chris Mansley

Vehicle Steering Control with Human-in-the-Loop

Test of. Boeing MH-47G Chinook. Produced by Area-51 Simulations

Active Driver Assistance for Vehicle Lanekeeping

Regulating Highly Automated Robot Ecologies: Insights from Three User Studies

Autonomous Quadrotor for the 2014 International Aerial Robotics Competition

Rotary Wing Micro Air Vehicle Endurance

The following slideshow and talk were presented at the Uber Elevate Summit on April 25 th, The text included here is an approximate transcript

AUTOPILOT SELECTION GUIDE & PRICE LIST. The NEW

THE INVESTIGATION OF CYCLOGYRO DESIGN AND THE PERFORMANCE

Accelerating the Development of Expandable Liner Hanger Systems using Abaqus

Preface. Acknowledgments. List of Tables. Nomenclature: organizations. Nomenclature: acronyms. Nomenclature: main symbols. Nomenclature: Greek symbols

Bild : Bernhard Mühr German Aerospace Center Flight Operations

Prediction Model of Driving Behavior Based on Traffic Conditions and Driver Types

Transcription:

Autonomous inverted helicopter flight via reinforcement learning Andrew Y. Ng, Adam Coates, Mark Diel, Varun Ganapathi, Jamie Schulte, Ben Tse, Eric Berger, and Eric Liang By Varun Grover

Outline! Helicopter flying! Objective! Setup! Model Identification! Controller Design! Experiments! Decision Making! Strength and Weaknesses! Applicability to my project

Helicopter flying! From How stuff works, Helicopters can do three things that an airplane cannot. " Fly Backward " Rotate in the air " Hover! Well Helicopters can do one more thing " Inverted Hover

Helicopter flying! Main Rotor rotating wing assembly.! Tail Rotor - produces thrust just like an airplane's propeller does

Helicopter flying! If you give the main rotor wings a slight angle of attack on the shaft and spin the shaft, the wings start to develop lift.! In order to actually control the machine, both the main rotor and the tail rotor need to be adjustable.

Helicopter flying! Hovering in a helicopter requires experience and skill. The pilot adjusts the cyclic to maintain the helicopter's position over a point on the ground. The pilot adjusts the collective to maintain a fixed altitude (especially important when close to the ground). The pilot adjusts the foot pedals to maintain the direction that the helicopter is pointing.! Hovering an inverted helicopter is quite challenging

Objective! Perform an autonomous inverted hovering.! Stochastic, Non-linear! The problem is high-dimensional: " X,Y,Z coordinates for position and velocity " Main rotor " Tail rotor

Helicopter setup! Modified Bergen industrial twin helicopter http://www.bergenrc.com/industrialtwin.as p! Also equipped with " PC104 flight computer " Inertial Science ISIS-IMU accelerometers and turning-rate gyroscopes " Novatel GPS unit " MicroStrain 3d magnetic compass

Machine Learning for controller design! Flown using four controls: " a[1] and a[2] for forward/backward or sideways motion " a[3] pitch angle which changes the angel of the main rotor blades. " a[4] tail rotor pitch Pitch control rods shown in orange

Model Identification! To learn the dynamics of the helicopter, collect data while a human flies the helicopter upside-down.! Collect information about position (x,y,z), orientation velocity and angular velocity! A total of 391 such states were collected.

Model Identification! These 12 dimensional state is reduced into 8 dimensional state using body coordinates!we can represent a state only using 6 dimensions

Model Identification! Time difference between s t and s t+1 = 0.1 seconds! Used linear regression to learn to predict! Errors in one-step prediction were modeled as Gaussian.! Estimated noise variance via maximum likelihood.

Model Identification! Using the above model create a simulator! Human pilot tests the model by flying the helicopter in the simulation! Simulation used to test the controller

Controller design! Used Reinforcement learning! Reward function: punishment for deviation of desired position and orientation

Controller design! Policy is represented as a neural network! Once a policy is defined, calculate! Choose the gains for the controller so that we obtain a policy which maximizes the above value

Controller design! Use Monte Carlo method on the simulated model to calculate the state transition probabilities.! Use these transitional probabilities to calculate! Repeat m times to get an average.

Controller design! But since the Monte Carlo method works with stochastic values, estimating the best policy is hard because of noise in the result! Use the PEGASUS method to convert the stochastic problem into deterministic problem

PEGASUS! Stands for Policy Evaluation-of-Goodness And Search Using Scenarios! we can reduce the problem of policy search in an arbitrary POMDP to one in which all the transitions are deterministic! This reduction is achieved by transforming the original POMDP into an equivalent one that has only deterministic transitions.! For this paper they fixed the random number sequence used by the simulator.

Controller design! Now apply hill-climbing algorithm to search for the best policy Function Hill-Climbing(problem) { inputs: problem local variables: current_node, next_node loop do next_node = a highest-valued successor of current_node if valueof(next_node) < valueof(current_node) return current_node current_node = next_node end do }

Hill-Climbing Algorithm properties! Moves in the direction of increasing value.! Only need to store the current node and its evaluation! Hill-climbing can get stuck in local maxima Once in a local maxima the algorithm will halt even though the solution may be far from satisfactory! In a plateau the hill-climbing algorithm conducts a random walk.! Why use hill-climbing is not clear.

Experiments! Were able to learn how to hover an inverted helicopter! It took 72 hours to design and demonstrate a stable inverted flight controller

Decision Making! Choosing probability distribution models for modeling error! Number of iterations to average over to pick the best policy! Given a state which action will maximize reward

Strengths and Weaknesses! + Solves a very complex problem! + Fast solution (72 hours)! - Lots of high level information, not many concrete details! - Model constructed does not take whether conditions into account! - Using hill-climbing

Applicability to my project! Improving performance of the MTS problem by modeling opponent! Liuyang and I! We are estimating the target movements through a linear function! Similar to Policy Iteration, we can use hill-climb parameter setting to find optimal parameter values.

References! [Ng, Coates, Diel, Ganapathi, Schulte, Tse, Berger, Liang 2004] Inverted autonomous helicopter flight via reinforcement learning. International Symposium on Experimental Robotics.! [http://travel.howstuffworks.com/helicopter.htm]! [Ng, Jordan 2000] PEGASUS: A policy search method for large MDPs and POMDPs. In Uncertainty in Artificial Intelligence, Proceedings of Sixteenth Conference, pages 406-415.! [Russell, Norvig 1995] Artificial Intelligence: A Modern Approach, Prentice Hall Series in Artificial Intelligence. Englewood Cliffs, New Jersey! [Ng, Kim, Jordan, Sastry 2004] Autonomous helicopter flight via reinforcement learning. In Neural Information Processing Systems 16.