A Bayesian Approach to Safe Imitation Learning For AIs and Robots

In his wonderful 1943 short story Q.U.R., science fiction writer Anthony Boucher writes about robots that are able to learn by viewing hundreds of images:

"I got one of those new electronic cameras - you know, one thousand exposures per second... So we took pictures of Guzub making a Three Planets, and I could construct this one to do it exactly right down to the thousandth of a second. The proper proportion of vuzd, in case you're interested, works out to three-point-six-five-four-seven eight-two-three drops. It's done with a flip of the third joint of the tentacle on the down beat. It didn't seem right to use Guzub to make a robot that would compete with him and probably drive him out of business, so we've promised him a generous pension from the royalties on usuform barkeeps."
(Read more about Anthony Boucher's usuform robot bartender)

Now, doughty researchers are working to improve imitation learning:

DropoutDAgger: A Bayesian Approach to Safe Imitation Learning

While imitation learning is becoming common practice in robotics, this approach often suffers from data mismatch and compounding errors. DAgger is an iterative algorithm that addresses these issues by continually aggregating training data from both the expert and novice policies, but does not consider the impact of safety. We present a probabilistic extension to DAgger, which uses the distribution over actions provided by the novice policy, for a given observation. Our method, which we call DropoutDAgger, uses dropout to train the novice as a Bayesian neural network that provides insight to its confidence. Using the distribution over the novice's actions, we estimate a probabilistic measure of safety with respect to the expert action, tuned to balance exploration and exploitation. The utility of this approach is evaluated on the MuJoCo HalfCheetah and in a simple driving experiment, demonstrating improved performance and safety compared to other DAgger variants and classic imitation learning.

Modern-day roboticists might be interested in Boucher's distinction between manlike robots and usuform robots.

Scroll down for more stories in the same category. (Story submitted 10/6/2017)

Follow this kind of news @Technovelgy.

| Email | RSS | Blog It | Stumble | del.icio.us | Digg | Reddit |

Would you like to contribute a story tip? It's easy:
Get the URL of the story, and the related sf author, and add it here.

Comment/Join discussion ( 0 )

Related News Stories - (" Artificial Intelligence ")

Datagrid Model Generation Perfect For Eternal Cities Of Science Fiction
'... there was enough flexibility to allow for wide variation. - Arthur C. Clarke, 1956.

Amazon Echo And Google Home Should Have Morality Software
'The Dwoskin Morality Rating-Computer could 'spot the slightest tendency to deviation' from the social norm...' - Kendall Foster Crossen, 1953.

Deepfakes From OpenAI GPT-2 Algorithm
'How can you compete with an IBM heavy-duty logomatic analogue?' - JG Ballard, 1971.

Fishy Facial Recognition Now Possible
'Palenkis can identify random line patterns better than any other species in the universe.' - Frank Herbert, 1969.

 

Google
  Web TechNovelgy.com   

Technovelgy (that's tech-novel-gee!) is devoted to the creative science inventions and ideas of sf authors. Look for the Invention Category that interests you, the Glossary, the Invention Timeline, or see what's New.

 

 

 

 

 

Current News

Datagrid Model Generation Perfect For Eternal Cities Of Science Fiction
'... there was enough flexibility to allow for wide variation.

Kazahk Ironist Protester Arrested For Blank Sign Protest
'...a man carried a white rectangular sign, blank on both sides.'

Bitcoin Surges Again, To $7,000
'... electronic, private cash, unbacked by any government, untraceable, completely anonymous.'

China Develops Taste Testing Robots
'Install taste buds in the end of one tentacle...'

North Sea Stone Age Reconstruction And Philip K Dick
'Your Dip digs back into antiquity. Rome. Greece. Dust and old volumes.'

Tesla Robotaxis Will Automatically Recharge Themselves
'Then it appeared to make up its mind, and trundled over to a wall socket...'

New Lifelike Material Powered By Artificial Metabolism
'... The biological robots were not living creatures.'

Husqvarna Automower 435X AWD
'Gramp Stevens sat in a lawn chair, watching the mower at work...'

Elon Musk Foretells Tesla Sans Steering Wheel
'How about the steering wheel... I don't need one.'

Adversarial Patches Trick Computer Vision
'The surveillance cameras can all see it, but then they forget they’ve seen it.'

Amazon Warehouse Computer Can Fire People Now
'The system has already fired five people...'

BrainEx Restores Some Activity To Severed Pig Head
'... they placed the brain in a special solution, having all the properties of Nursing the brain cells.'

Yes, But Do Astrobees Have Lasers For Lightsaber Training?
'... Ancient weapons are no match for a good blaster at your side, kid.'

'Young Razorbacks Before Their Katanas Grow In'
'Twin robotic arms with gleaming three-foot sword blades unfolded from the forward hydraulic assemblies...'

A New Way To Run Into Things
'He made an adjustment, pointed the tube at the wall beside Etzwane, and projected a cone of light.'

'Metallic Wood' Strong Like Titanium, Floats In Water
'A metal... light as cork and stronger than steel...'

More SF in the News Stories

More Beyond Technovelgy science news stories

Home | Glossary | Invention Timeline | Category | New | Contact Us | FAQ | Advertise |
Technovelgy.com - where science meets fiction™

Copyright© Technovelgy LLC; all rights reserved.