A Bayesian Approach to Safe Imitation Learning For AIs and Robots
In his wonderful 1943 short story Q.U.R., science fiction writer Anthony Boucher writes about robots that are able to learn by viewing hundreds of images:
"I got one of those new electronic cameras - you know, one thousand exposures per second... So we took pictures of Guzub making a Three Planets, and I could construct this one to do it exactly right down to the thousandth of a second. The proper proportion of vuzd, in case you're interested, works out to three-point-six-five-four-seven eight-two-three drops. It's done with a flip of the third joint of the tentacle on the down beat. It didn't seem right to use Guzub to make a robot that would compete with him and probably drive him out of business, so we've promised him a generous pension from the royalties on usuform barkeeps."
(Read more about Anthony Boucher's usuform robot bartender)
Now, doughty researchers are working to improve imitation learning:
DropoutDAgger: A Bayesian Approach to Safe Imitation Learning
While imitation learning is becoming common practice in robotics, this approach often suffers from data mismatch and compounding errors. DAgger is an iterative algorithm that addresses these issues by continually aggregating training data from both the expert and novice policies, but does not consider the impact of safety. We present a probabilistic extension to DAgger, which uses the distribution over actions provided by the novice policy, for a given observation. Our method, which we call DropoutDAgger, uses dropout to train the novice as a Bayesian neural network that provides insight to its confidence. Using the distribution over the novice's actions, we estimate a probabilistic measure of safety with respect to the expert action, tuned to balance exploration and exploitation. The utility of this approach is evaluated on the MuJoCo HalfCheetah and in a simple driving experiment, demonstrating improved performance and safety compared to other DAgger variants and classic imitation learning.
Modern-day roboticists might be interested in Boucher's distinction between manlike robots and usuform robots.
Scroll down for more stories in the same category. (Story submitted 10/6/2017)
Follow this kind of news @Technovelgy.
| Email | RSS | Blog It | Stumble | del.icio.us | Digg | Reddit |
you like to contribute a story tip?
Get the URL of the story, and the related sf author, and add
Comment/Join discussion ( 0 )
Related News Stories -
Legal Profession Now Fairly Bristling With AI
'The virtual counsel appeared to be about forty-five years old and prosperous.' - Greg Bear, 2007
Venezuelans Teaching Your Self-Driving Car
‘She wouldn't stop until Antar had told her everything he knew...’ Amitav Ghosh, 1995.
Pun Generation Via Neural Nets
'You said you wanted him to be able to distinguish between laugh-power in different gags...' - William Tenn, 1951.
Can We Comprehend Deep Learning Systems?
'You’ve nothing remotely like it, so I can’t describe it to you.' - Lewis Padgett, 1943.
Technovelgy (that's tech-novel-gee!)
is devoted to the creative science inventions and ideas of sf authors. Look for
the Invention Category that interests
you, the Glossary, the Invention
Timeline, or see what's New.
Orbital Display's Low Earth Orbit Advertisements
'A vast circle of scarlet stars came up into the greenish desert dusk.'
Neuromorphic Computing Hardare
'He had constructed an organ, a brain, of metal, entirely inorganic and lifeless...'
Vascularized Human Skin 3D Printed
Trillionaires Still Earth-Bound
'I shall never forget the sight... when the yellow gleam of the precious metal appeared under the star dust.'
Digit V2 Bipedal Robot From Agility Robotics
Oh, and now I suppose someone will develop the robotic porch pirate.
3D Printed Dubai Building Is World's Largest
'This thing will start at one end of ...a house and build it complete to the other end, following drawings only.'
Grow Plants On Moon Or Mars!
'In contrast to the airless desolation outside, the interior of this five-acre greenhouse was the one most desirable place to be.'
California Gets Shockwave Rider-Style Avoidance Zones
'It was cheaper to pay the refugees to go without up-to-the-minute equipment.'
Microbot Interstellar von Neumann Explorers
'Evidently they have never had a planet of their own...'
Hail SmartCan! Your Trash Bin Takes Itself Out
'...a waste can twenty feet away stirred into life.'
Finally! Microsoft Surface Neo And Surface Duo Implement Excellent Courier Idea
'Runcible, whose pages were thicker and more densely packed with computational machinery...'
Tap Strap 2 Now With Air Mouse
'He waved his hand and the circuit switched abruptly.'
Legal Profession Now Fairly Bristling With AI
'The virtual counsel appeared to be about forty-five years old and prosperous.'
Entire Planet Modeled In New MS Flight Sim
'CIC uses [it] to keep track of every bit of spatial information that it owns...'
FlyZoo Robot Hotel By Alibaba
'... hotels that specialized in non-human service.'
Implanted Memories Provide Songs To Birds
Finches can't tell the difference.
More SF in the News Stories
More Beyond Technovelgy science news stories