A Bayesian Approach to Safe Imitation Learning For AIs and Robots
In his wonderful 1943 short story Q.U.R., science fiction writer Anthony Boucher writes about a robot bartender built to imitate an expert from high-speed photographs of him at work:
"I got one of those new electronic cameras - you know, one thousand exposures per second... So we took pictures of Guzub making a Three Planets, and I could construct this one to do it exactly right down to the thousandth of a second. The proper proportion of vuzd, in case you're interested, works out to three-point-six-five-four-seven eight-two-three drops. It's done with a flip of the third joint of the tentacle on the down beat. It didn't seem right to use Guzub to make a robot that would compete with him and probably drive him out of business, so we've promised him a generous pension from the royalties on usuform barkeeps."
(Read more about Anthony Boucher's usuform robot bartender)
Now, doughty researchers are working to improve imitation learning:
DropoutDAgger: A Bayesian Approach to Safe Imitation Learning
While imitation learning is becoming common practice in robotics, this approach often suffers from data mismatch and compounding errors. DAgger is an iterative algorithm that addresses these issues by continually aggregating training data from both the expert and novice policies, but does not consider the impact of safety. We present a probabilistic extension to DAgger, which uses the distribution over actions provided by the novice policy, for a given observation. Our method, which we call DropoutDAgger, uses dropout to train the novice as a Bayesian neural network that provides insight to its confidence. Using the distribution over the novice's actions, we estimate a probabilistic measure of safety with respect to the expert action, tuned to balance exploration and exploitation. The utility of this approach is evaluated on the MuJoCo HalfCheetah and in a simple driving experiment, demonstrating improved performance and safety compared to other DAgger variants and classic imitation learning.
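For readers who like to see the idea in code, here is a minimal sketch of the decision step the abstract describes, not the authors' implementation. It assumes a toy one-hidden-layer novice network, keeps dropout switched on at query time to sample a spread of novice actions, and lets the novice act only when enough of those samples land close to the expert's action. The names `tau` (distance threshold) and `p` (required fraction), the network shape, and every function name below are illustrative assumptions.

```python
import math
import random


def novice_forward(obs, weights, drop_rate, rng):
    """One stochastic forward pass of a toy network with dropout left on."""
    w1, w2 = weights
    hidden = []
    for row in w1:
        h = max(0.0, sum(w * x for w, x in zip(row, obs)))  # ReLU unit
        keep = rng.random() >= drop_rate                     # dropout mask
        hidden.append(h / (1.0 - drop_rate) if keep else 0.0)
    return [sum(w * h for w, h in zip(row, hidden)) for row in w2]


def sample_novice_actions(obs, weights, n_samples, drop_rate, seed):
    """Repeated dropout passes approximate a distribution over novice actions."""
    rng = random.Random(seed)
    return [novice_forward(obs, weights, drop_rate, rng) for _ in range(n_samples)]


def safe_fraction(samples, expert_action, tau):
    """Fraction of sampled novice actions within distance tau of the expert action."""
    def dist(a, b):
        return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))
    return sum(1 for a in samples if dist(a, expert_action) <= tau) / len(samples)


def choose_action(obs, weights, expert_action, tau=0.5, p=0.8,
                  n_samples=50, drop_rate=0.1, seed=0):
    """Assumed rule: the novice acts only if at least a fraction p of its
    sampled actions fall within tau of the expert; otherwise the expert acts."""
    samples = sample_novice_actions(obs, weights, n_samples, drop_rate, seed)
    frac = safe_fraction(samples, expert_action, tau)
    if frac >= p:
        mean = [sum(col) / len(samples) for col in zip(*samples)]
        return mean, "novice", frac
    return list(expert_action), "expert", frac


if __name__ == "__main__":
    # Hypothetical weights and observation, chosen only for illustration.
    weights = ([[1.0, 0.0], [0.0, 1.0]], [[1.0, 1.0]])
    action, who, frac = choose_action([1.0, 2.0], weights, expert_action=[3.0])
    print(who, action, frac)
```

In a DAgger-style loop, each visited observation would still be labeled with the expert's action and added to the training set, whichever policy was actually in control. Raising `p` or shrinking `tau` hands control to the expert more often, which is one way to read the abstract's balance between exploration and exploitation.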
Modern-day roboticists might be interested in Boucher's distinction between manlike robots and usuform robots.
(Story submitted 10/6/2017)