A Bayesian Approach to Safe Imitation Learning For AIs and Robots

In his wonderful 1943 short story Q.U.R., science fiction writer Anthony Boucher writes about robots that are able to learn by viewing hundreds of images:

"I got one of those new electronic cameras - you know, one thousand exposures per second... So we took pictures of Guzub making a Three Planets, and I could construct this one to do it exactly right down to the thousandth of a second. The proper proportion of vuzd, in case you're interested, works out to three-point-six-five-four-seven eight-two-three drops. It's done with a flip of the third joint of the tentacle on the down beat. It didn't seem right to use Guzub to make a robot that would compete with him and probably drive him out of business, so we've promised him a generous pension from the royalties on usuform barkeeps."
(Read more about Anthony Boucher's usuform robot bartender)

Now, doughty researchers are working to improve imitation learning:

DropoutDAgger: A Bayesian Approach to Safe Imitation Learning

While imitation learning is becoming common practice in robotics, this approach often suffers from data mismatch and compounding errors. DAgger is an iterative algorithm that addresses these issues by continually aggregating training data from both the expert and novice policies, but does not consider the impact of safety. We present a probabilistic extension to DAgger, which uses the distribution over actions provided by the novice policy, for a given observation. Our method, which we call DropoutDAgger, uses dropout to train the novice as a Bayesian neural network that provides insight to its confidence. Using the distribution over the novice's actions, we estimate a probabilistic measure of safety with respect to the expert action, tuned to balance exploration and exploitation. The utility of this approach is evaluated on the MuJoCo HalfCheetah and in a simple driving experiment, demonstrating improved performance and safety compared to other DAgger variants and classic imitation learning.

Modern-day roboticists might be interested in Boucher's distinction between manlike robots and usuform robots.

Scroll down for more stories in the same category. (Story submitted 10/6/2017)

Follow this kind of news @Technovelgy.

| Email | RSS | Blog It | Stumble | del.icio.us | Digg | Reddit |

Would you like to contribute a story tip? It's easy:
Get the URL of the story, and the related sf author, and add it here.

Comment/Join discussion ( 0 )

Related News Stories - (" Artificial Intelligence ")

LawGeex AI Beats 20 Top Lawyers
'The Law Society has strict rules on the use of pseudo-intelligent software - terrified of putting... its members out of work.' - Greg Egan, 1991.

Still Wondering If You'd Work For A Robot Boss?
'This is all coming to you courtesy of the simstim unit wired into your deck, of course.'

CIMON Space Sidekick For Weary Astronauts
I welcome our floating robotic assistants.

Biomind AI Doctor Mops Floor With Human Doctors
'My aim was just not to lose by too much.' - Human Physician participant.

 

Google
  Web TechNovelgy.com   

Technovelgy (that's tech-novel-gee!) is devoted to the creative science inventions and ideas of sf authors. Look for the Invention Category that interests you, the Glossary, the Invention Timeline, or see what's New.

 

 

 

 

 

Current News

Wound Healing With Wearable Nanogenerators
'... forcing the energy transfer which allowed him to ... erase the other internal-external damage.'

Flying Dragon Robot Transforms In Mid-Air
Terrific prototype video.

Negative Matter Fluid Theorized In New Paper
'Of course, being negative matter, when you push it, it comes toward you..'

Grow Structures Upon Planetfall - Myco-Architecture
'They'll also start pulling in gases and liquids from the local atmosphere...'

MXene Hydrogel Skin For Robots Flexes And Senses
'The plastex swam and whirled like boiling toothpaste...'

EXPLORER, The First Total-Body Scanner
'The object is built up of an infinite series of plane layers, at the focus of the ray...'

UK Police AI To Stop Criminals Before They Strike
'... the computing mechanisms that studied and restructured the incoming material.'

Sonitus Audio Interface Positioned Beyond The Noise
'... an instrument having relatively small bit pieces adapted to be gripped between the teeth.'

Volvo's Self-Driving Mining Trucks
'A procession of automatic ore carts was racing over the bleak slag'

Audi Pop.Up Autonomous Electric Flying Car
'The cab was an egg-shaped bubble of light metals and plastics...'

Music Not Impossible (MNI) Vibrotactile Wearable Experience
Don't you want to experience the 'feely' effects?

Chinese Face Recognition Mistakes Bus Ad For Jaywalker
'... the imprint of her image on the telephoto cell.'

A Look Back At Apollo's Emergency Escape Vehicle
'A simple mechanism... it drove the iron ball through space like a ship.'

InMotion Glide 3 Electric Unicycle For The Last Mile
'...gyro-stabilized on a single wheel.'

China's Social Credit System - A Facebook-1984 Mashup
'Prestige, face, mana, repute, glory: the Sirenese word is strakh.'

Musk Declares Tesla Supercharger Capacity Will Double By Next Year
'Recharge the batteries... in almost every town and village...'

More SF in the News Stories

More Beyond Technovelgy science news stories

Home | Glossary | Invention Timeline | Category | New | Contact Us | FAQ | Advertise |
Technovelgy.com - where science meets fiction™

Copyright© Technovelgy LLC; all rights reserved.