Deep Speech 2: Mandarin and English Recognized
End-to-end deep learning presents the opportunity to improve speech recognition systems continually with increases in data and computation. Indeed, this paper proves that transcription performance can be vastly improved using the same approach for very different languages.
We show that an end-to-end deep learning approach can be used to recognize either English or Mandarin Chinese speech--two vastly different languages. Because it replaces entire pipelines of hand-engineered components with neural networks, end-to-end learning allows us to handle a diverse variety of speech including noisy environments, accents and different languages. Key to our approach is our application of HPC techniques, resulting in a 7x speedup over our previous system. Because of this efficiency, experiments that previously took weeks now run in days. This enables us to iterate more quickly to identify superior architectures and algorithms. As a result, in several cases, our system is competitive with the transcription of human workers when benchmarked on standard datasets. Finally, using a technique called Batch Dispatch with GPUs in the data center, we show that our system can be inexpensively deployed in an online setting, delivering low latency when serving users at scale.
As far as I know, the first sf reference to the idea of real, automated translation of language is the translatophone from a story of the same name by Frank Stockton - in 1901!
One of the most successful of these various contrivances, and the one, indeed, in which I was most deeply interested, was a small machine very much resembling in appearance the tube, with a mouth-piece at one end and an ear-piece at the other, frequently used by deaf persons, but very different in its construction and action. In the ordinary instrument the words spoken into the mouth-piece are carried through the tube to the ear, and are then heard exactly as they are spoken. When I used my instrument the person spoke into the mouth-piece exactly as if it were an ordinary tube, but the result was very different, for the great feature of my invention was that, no matter what language was spoken by the person at the mouth-piece, be it Greek, Choctaw, or Chinese, the words came to the ear in perfect English...
Via Deep Speech 2: End-to-End Speech Recognition in English and Mandarin.
Scroll down for more stories in the same category. (Story submitted 12/20/2015)
Follow this kind of news @Technovelgy.
| Email | RSS | Blog It | Stumble | del.icio.us | Digg | Reddit |
you like to contribute a story tip?
Get the URL of the story, and the related sf author, and add
Comment/Join discussion ( 0 )
Related News Stories -
FlexPai Foldable Phone By Royole
'...A paper thin polycarbon screen unfurled.' - William Gibson, 1986.
BrainNet Social Network Of Brains
'I used my implant to tell MILLIE what we wanted and she took care of it' - Pournelle and Niven, 1981.
Messaging Extraterrestrial Intelligence (METI) Workshop
SF writers have thought about this since the 19th century.
Burner Generates Temporary Phone Numbers
'Interesting phone system he's got, by the way...' - John Varley, 1984.
Technovelgy (that's tech-novel-gee!)
is devoted to the creative science inventions and ideas of sf authors. Look for
the Invention Category that interests
you, the Glossary, the Invention
Timeline, or see what's New.
Amazing Kepler Space Telescope Decommissioned By NASA
'Thus it came about that the search for a planetiferous sun... was not unduly prolonged...'
ODYSSEUS Solar-Powered Stratospheric Plane Flies Forever
'The planes flew continuously, twenty-four hours a day...'
Augmented and-or Virtual Reality Shoes From Google
'The auto-treadmill's bumps and gullies matched whatever terrain the goggles showed me...'
Soon, Your Tesla Will Follow You Like A Pet
'... follow him as faithfully as a well-trained hound.'
Chinese Watrix Gait Recognition Watching You Always
'... those pesky gait-recognition cameras.'
FlexPai Foldable Phone By Royole
'...A paper thin polycarbon screen unfurled.'
Oh Yes, We're Building The Rotating Tower In Dubai
'Give me an old-fashioned tetragon on a central pivot every time.'
Bioreactor Helps Legless Frogs Get Their Jump Back
'An alien drug... Used by an insect race... It can repair bones and organs. It can grow new tissue."
Xinhua AI Anchor Puts CGI Face To Automated News
'...a congeries of software agents.'
Wirewax Watching You Watch, Adjusting Your Experience
'He adjusted the n, the r and b knobs, and hopefully anticipated a turn for the better...'
LawGeex AI Beats 20 Top Lawyers
'The Law Society has strict rules on the use of pseudo-intelligent software - terrified of putting... its members out of work.'
ROAM Robotics Skiing Exoskeleton
'The real genius in the design is that you don't have to control the suit; you just wear it...'
MIT Headset Lets You Communicate Without Speaking
'The subvocal read nerve signals, letting her enter words by just beginning to will them...'
Exploring Oceans Across The Solar System
'Black liquid flashed past the turbotís infrared eyes.'
SWEEPER Robot Peter Piper Picking Peppers
'... little machines, that went from plant to plant, apparently on caterpillar tracks, cutting off the ripe fruit.'
Oil from Algae - Can It Be Done?
'We dump everything that's waste into the tanks, pump the oil off the top.'
More SF in the News Stories
More Beyond Technovelgy science news stories