Deep Speech 2: Mandarin and English Recognized
End-to-end deep learning presents the opportunity to improve speech recognition systems continually with increases in data and computation. Indeed, this paper proves that transcription performance can be vastly improved using the same approach for very different languages.
We show that an end-to-end deep learning approach can be used to recognize either English or Mandarin Chinese speech--two vastly different languages. Because it replaces entire pipelines of hand-engineered components with neural networks, end-to-end learning allows us to handle a diverse variety of speech including noisy environments, accents and different languages. Key to our approach is our application of HPC techniques, resulting in a 7x speedup over our previous system. Because of this efficiency, experiments that previously took weeks now run in days. This enables us to iterate more quickly to identify superior architectures and algorithms. As a result, in several cases, our system is competitive with the transcription of human workers when benchmarked on standard datasets. Finally, using a technique called Batch Dispatch with GPUs in the data center, we show that our system can be inexpensively deployed in an online setting, delivering low latency when serving users at scale.
As far as I know, the first sf reference to the idea of real, automated translation of language is the translatophone from a story of the same name by Frank Stockton - in 1901!
One of the most successful of these various contrivances, and the one, indeed, in which I was most deeply interested, was a small machine very much resembling in appearance the tube, with a mouth-piece at one end and an ear-piece at the other, frequently used by deaf persons, but very different in its construction and action. In the ordinary instrument the words spoken into the mouth-piece are carried through the tube to the ear, and are then heard exactly as they are spoken. When I used my instrument the person spoke into the mouth-piece exactly as if it were an ordinary tube, but the result was very different, for the great feature of my invention was that, no matter what language was spoken by the person at the mouth-piece, be it Greek, Choctaw, or Chinese, the words came to the ear in perfect English...
Via Deep Speech 2: End-to-End Speech Recognition in English and Mandarin.
Scroll down for more stories in the same category. (Story submitted 12/20/2015)
Follow this kind of news @Technovelgy.
| Email | RSS | Blog It | Stumble | del.icio.us | Digg | Reddit |
you like to contribute a story tip?
Get the URL of the story, and the related sf author, and add
Comment/Join discussion ( 0 )
Related News Stories -
Hurdl PIXL Wearable Helps Fans Connect With Stars
Like Macross Plus!
Advertising Drones Hover Over Traffic In Mexico
'Blurbflies are allowd to travel the streets, buzzing their adverts alive and direct...' - Jeff Noon, 2000.
Audiobooks - Fastest Growing Format In Publishing
'The public preferred lectons...' - Stanislaw Lem, 1961.
Douglas Adams Your Babel Fish Is Ready - The Pilot By Waverly
'You'll need to have this fish in your ear.' - Douglas Adams, 1979.
Technovelgy (that's tech-novel-gee!)
is devoted to the creative science inventions and ideas of sf authors. Look for
the Invention Category that interests
you, the Glossary, the Invention
Timeline, or see what's New.
Bat Bot Robotic Flapping-Wing Drone
'The dark birdforms dotted the mountaintops like statues of prehistoric beasts, wings outspread...'
NASA's Astronaut Rescue Ball
'Ball and closely-prisoned man plummeted downward..'
ARM Wants To Build Brain Chips
'Slivers of microsoft, angular fragments of colored silicon...'
Sky Fence - A Drone-Proof Shield Created Over Prison
'There’s still a protective field over the whole thing. It volatilizes anything that tries to get through.'
Geoengineering The Atmosphere For Climate Change
'...a uniform temperature for each degree of latitude the year round.'
Archinaut Orbiting Robotic Factory
'mass-produced only in the orbiting factories...'
Cryonic Preservation - The Last Perk You'll Ever Need
'Is there not also a law providing for voluntary suspension of animation?'
Computers Understand Humans By Watching And Modeling Them
Soon, your computer will be watching you... and judging you.
NASA Asks For Moon To Earth Delivery Ideas
'Authority's 3-g catapult was almost one hundred kilometers long...'
Musk Tunnels Wisely Restrict Drivers
Too many robots.
Robot Swarms Controlled With Augmented Reality
'You're not thinking in enough dimensions...'
MIT's C-LEARN Helps Robots Transfer Learning To Other Robots
'Talk Between Robots radio...'
Mini-Brains In A Dish
'Cultured brains on a slab.'
Rapid Automated Search For Habitable Planets Needed
'I was near enough it now to set my automatic astronomical instruments to searching it for a habitable planet.'
WatchSense Perfect For Fat-Fingered Smartwatch Owners
'Now all you had to do was wave your hand in the general direction of the components...'
Digital Construction Platform Robot 3D Prints A Building
'It extrudes material like a spider.'
More SF in the News Stories
More Beyond Technovelgy science news stories