A Bayesian Approach to Safe Imitation Learning For AIs and Robots

In his wonderful 1943 short story Q.U.R., science fiction writer Anthony Boucher writes about robots that are able to learn by viewing thousands of images:

"I got one of those new electronic cameras - you know, one thousand exposures per second... So we took pictures of Guzub making a Three Planets, and I could construct this one to do it exactly right down to the thousandth of a second. The proper proportion of vuzd, in case you're interested, works out to three-point-six-five-four-seven eight-two-three drops. It's done with a flip of the third joint of the tentacle on the down beat. It didn't seem right to use Guzub to make a robot that would compete with him and probably drive him out of business, so we've promised him a generous pension from the royalties on usuform barkeeps."

Now, doughty researchers are working to improve imitation learning:

DropoutDAgger: A Bayesian Approach to Safe Imitation Learning

While imitation learning is becoming common practice in robotics, this approach often suffers from data mismatch and compounding errors. DAgger is an iterative algorithm that addresses these issues by continually aggregating training data from both the expert and novice policies, but does not consider the impact of safety. We present a probabilistic extension to DAgger, which uses the distribution over actions provided by the novice policy, for a given observation. Our method, which we call DropoutDAgger, uses dropout to train the novice as a Bayesian neural network that provides insight to its confidence. Using the distribution over the novice's actions, we estimate a probabilistic measure of safety with respect to the expert action, tuned to balance exploration and exploitation. The utility of this approach is evaluated on the MuJoCo HalfCheetah and in a simple driving experiment, demonstrating improved performance and safety compared to other DAgger variants and classic imitation learning.
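The decision rule hinted at in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' implementation: the one-layer linear "policy", the threshold names `tau` and `p`, and the choice to execute the mean of the sampled actions are all hypothetical stand-ins. The idea shown is that dropout is left on at decision time, so repeated passes give a spread of novice actions, and the novice is allowed to act only when enough of those samples land close to what the expert would do.

```python
import random


def novice_samples(obs, weights, n_samples=100, drop_prob=0.2, rng=None):
    """Monte Carlo dropout on a toy linear policy (a stand-in for a real network).

    Each pass randomly drops input features and rescales the survivors,
    so repeated passes yield a distribution over scalar actions.
    """
    rng = rng or random.Random(0)
    keep = 1.0 - drop_prob
    samples = []
    for _ in range(n_samples):
        action = 0.0
        for w, x in zip(weights, obs):
            if rng.random() < keep:  # feature survives this pass
                action += w * x / keep
        samples.append(action)
    return samples


def choose_action(expert_action, samples, tau=0.5, p=0.8):
    """Pick who acts. tau and p are hypothetical tuning knobs.

    The novice acts (using the mean of its samples) only if at least a
    fraction p of the samples lie within tau of the expert's action;
    otherwise the expert's action is used.
    """
    close = sum(1 for a in samples if abs(a - expert_action) <= tau)
    if close / len(samples) >= p:
        return sum(samples) / len(samples), "novice"
    return expert_action, "expert"


if __name__ == "__main__":
    obs = [1.0, 2.0]
    samples = novice_samples(obs, weights=[0.5, 0.25])
    action, who = choose_action(expert_action=1.0, samples=samples)
    print(who, round(action, 3))
```

In a DAgger-style loop, the observation would be stored with the expert's action whichever policy ends up acting, and the novice would be retrained on the growing dataset. Raising `p` or lowering `tau` in this sketch hands control to the expert more often, which is one way to read the abstract's trade-off between exploration and exploitation.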

Modern-day roboticists might be interested in Boucher's distinction between manlike robots and usuform robots.

(Story submitted 10/6/2017)

Technovelgy.com - where science meets fiction™

Copyright© Technovelgy LLC; all rights reserved.