Deep Speech 2: Mandarin and English Recognized
End-to-end deep learning presents the opportunity to improve speech recognition systems continually with increases in data and computation. Indeed, this paper shows that transcription performance can be vastly improved using the same approach for very different languages.
We show that an end-to-end deep learning approach can be used to recognize either English or Mandarin Chinese speech--two vastly different languages. Because it replaces entire pipelines of hand-engineered components with neural networks, end-to-end learning allows us to handle a diverse variety of speech including noisy environments, accents and different languages. Key to our approach is our application of HPC techniques, resulting in a 7x speedup over our previous system. Because of this efficiency, experiments that previously took weeks now run in days. This enables us to iterate more quickly to identify superior architectures and algorithms. As a result, in several cases, our system is competitive with the transcription of human workers when benchmarked on standard datasets. Finally, using a technique called Batch Dispatch with GPUs in the data center, we show that our system can be inexpensively deployed in an online setting, delivering low latency when serving users at scale.
As far as I know, the first sf reference to the idea of real, automated translation of language is the translatophone, from a story of the same name by Frank Stockton, in 1901!
One of the most successful of these various contrivances, and the one, indeed, in which I was most deeply interested, was a small machine very much resembling in appearance the tube, with a mouth-piece at one end and an ear-piece at the other, frequently used by deaf persons, but very different in its construction and action. In the ordinary instrument the words spoken into the mouth-piece are carried through the tube to the ear, and are then heard exactly as they are spoken. When I used my instrument the person spoke into the mouth-piece exactly as if it were an ordinary tube, but the result was very different, for the great feature of my invention was that, no matter what language was spoken by the person at the mouth-piece, be it Greek, Choctaw, or Chinese, the words came to the ear in perfect English...
Via Deep Speech 2: End-to-End Speech Recognition in English and Mandarin.
(Story submitted 12/20/2015)