A Bayesian Approach to Safe Imitation Learning For AIs and Robots

In his wonderful 1943 short story Q.U.R., science fiction writer Anthony Boucher describes a robot built to copy a bartender's technique from high-speed photographs:

"I got one of those new electronic cameras - you know, one thousand exposures per second... So we took pictures of Guzub making a Three Planets, and I could construct this one to do it exactly right down to the thousandth of a second. The proper proportion of vuzd, in case you're interested, works out to three-point-six-five-four-seven eight-two-three drops. It's done with a flip of the third joint of the tentacle on the down beat. It didn't seem right to use Guzub to make a robot that would compete with him and probably drive him out of business, so we've promised him a generous pension from the royalties on usuform barkeeps."

Now, doughty researchers are working to improve imitation learning:

DropoutDAgger: A Bayesian Approach to Safe Imitation Learning

While imitation learning is becoming common practice in robotics, this approach often suffers from data mismatch and compounding errors. DAgger is an iterative algorithm that addresses these issues by continually aggregating training data from both the expert and novice policies, but does not consider the impact of safety. We present a probabilistic extension to DAgger, which uses the distribution over actions provided by the novice policy, for a given observation. Our method, which we call DropoutDAgger, uses dropout to train the novice as a Bayesian neural network that provides insight to its confidence. Using the distribution over the novice's actions, we estimate a probabilistic measure of safety with respect to the expert action, tuned to balance exploration and exploitation. The utility of this approach is evaluated on the MuJoCo HalfCheetah and in a simple driving experiment, demonstrating improved performance and safety compared to other DAgger variants and classic imitation learning.
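For readers who want a feel for the idea in the abstract, here is a minimal sketch in plain Python. It is not the authors' implementation. The tiny one-layer network, the function names (`sample_novice_actions`, `choose_action`), and the parameters `tau` (a distance tolerance) and `p` (a required fraction of samples) are all assumptions made for illustration. The sketch keeps dropout switched on at decision time to draw many candidate novice actions, then lets the novice act only if enough of those samples land close to the expert's action.

```python
import math
import random


def sample_novice_actions(obs, w1, w2, rng, n_samples=100, keep_prob=0.9):
    """Draw action samples from a toy network by re-running it with random dropout masks.

    w1 holds one weight row per hidden unit; w2 holds one weight row per action dimension.
    Both layouts are hypothetical, chosen only to keep the example short.
    """
    hidden = [math.tanh(sum(o * w for o, w in zip(obs, row))) for row in w1]
    samples = []
    for _ in range(n_samples):
        # Inverted dropout: zero each hidden unit with probability 1 - keep_prob.
        dropped = [h / keep_prob if rng.random() < keep_prob else 0.0 for h in hidden]
        samples.append([sum(d * w for d, w in zip(dropped, row)) for row in w2])
    return samples


def choose_action(novice_samples, expert_action, tau, p):
    """Assumed decision rule: trust the novice only if at least a fraction p of its
    sampled actions fall within distance tau of the expert's action."""
    close = [math.dist(s, expert_action) <= tau for s in novice_samples]
    safe_fraction = sum(close) / len(close)
    if safe_fraction >= p:
        dims = len(expert_action)
        mean_action = [
            sum(s[i] for s in novice_samples) / len(novice_samples) for i in range(dims)
        ]
        return mean_action, "novice", safe_fraction
    return expert_action, "expert", safe_fraction


if __name__ == "__main__":
    rng = random.Random(0)
    obs = [0.5, -1.0, 2.0]
    w1 = [[0.1, 0.2, 0.3], [0.0, -0.1, 0.2]]  # two hidden units
    w2 = [[1.0, -1.0]]                        # one action dimension
    samples = sample_novice_actions(obs, w1, w2, rng)
    print(choose_action(samples, expert_action=[0.1], tau=0.5, p=0.8))
```

In this sketch, raising `p` or shrinking `tau` hands control to the expert more often, while loosening them lets the novice explore more. That is one way to read the abstract's phrase about balancing exploration and exploitation, under the assumptions stated above.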

Modern-day roboticists might be interested in Boucher's distinction between manlike robots and usuform robots.

(Story submitted 10/6/2017)

Technovelgy.com - where science meets fiction™

Copyright© Technovelgy LLC; all rights reserved.