Deep Speech 2: Mandarin and English Recognized
End-to-end deep learning presents the opportunity to improve speech recognition systems continually with increases in data and computation. Indeed, this paper proves that transcription performance can be vastly improved using the same approach for very different languages.
We show that an end-to-end deep learning approach can be used to recognize either English or Mandarin Chinese speech--two vastly different languages. Because it replaces entire pipelines of hand-engineered components with neural networks, end-to-end learning allows us to handle a diverse variety of speech including noisy environments, accents and different languages. Key to our approach is our application of HPC techniques, resulting in a 7x speedup over our previous system. Because of this efficiency, experiments that previously took weeks now run in days. This enables us to iterate more quickly to identify superior architectures and algorithms. As a result, in several cases, our system is competitive with the transcription of human workers when benchmarked on standard datasets. Finally, using a technique called Batch Dispatch with GPUs in the data center, we show that our system can be inexpensively deployed in an online setting, delivering low latency when serving users at scale.
As far as I know, the first sf reference to the idea of real, automated translation of language is the translatophone from a story of the same name by Frank Stockton - in 1901!
One of the most successful of these various contrivances, and the one, indeed, in which I was most deeply interested, was a small machine very much resembling in appearance the tube, with a mouth-piece at one end and an ear-piece at the other, frequently used by deaf persons, but very different in its construction and action. In the ordinary instrument the words spoken into the mouth-piece are carried through the tube to the ear, and are then heard exactly as they are spoken. When I used my instrument the person spoke into the mouth-piece exactly as if it were an ordinary tube, but the result was very different, for the great feature of my invention was that, no matter what language was spoken by the person at the mouth-piece, be it Greek, Choctaw, or Chinese, the words came to the ear in perfect English...
Via Deep Speech 2: End-to-End Speech Recognition in English and Mandarin.
Scroll down for more stories in the same category. (Story submitted 12/20/2015)
Follow this kind of news @Technovelgy.
| Email | RSS | Blog It | Stumble | del.icio.us | Digg | Reddit |
you like to contribute a story tip?
Get the URL of the story, and the related sf author, and add
Comment/Join discussion ( 0 )
Related News Stories -
I Want My 1928 Telestereo Hologram Now
'Instantly there appeared standing upon the disk, the image of a man...' - Edmond Hamilton, 1928.
Realistic Translation With The Waverly Labs Ambassador
'The speech patterns you actually hear decode the brainwave matrix which has been fed into your mind by your Babel fish.' - By Douglas Adams, 1979.
Soli Gesture Tech Will Be In Google Pixel 4
'I enjoy watching this way, but - He waved his hand and the circuit switched abruptly.' - Philip K Dick, 1955.
Lost Language Meanings Found By Machine Learning
'The autopilot would need data before it could begin a translation...' - Larry Niven, 1970.
Technovelgy (that's tech-novel-gee!)
is devoted to the creative science inventions and ideas of sf authors. Look for
the Invention Category that interests
you, the Glossary, the Invention
Timeline, or see what's New.
I Want My 1928 Telestereo Hologram Now
'Instantly there appeared standing upon the disk, the image of a man...'
Memes Now Come From Neural Nets
'Your order said for him to be able to be able to work out twists on the gags in the file...'
Robot Dog Learns To Be Doggy From Real Dogs
'So we took pictures of Guzub making a Three Planets, and I could construct this one to do it exactly right down to the thousandth of a second.'
Unwanted Cruise Ships Huddle Together Out At Sea
'On the screen they passed in an endless, boundaryless flood of green specks...'
Sono Sion Electric Car Charges As You Drive
'It drew its power from six square yards of sunpower screens on its low curved roof.'
News Mood Filter Web Extension
'He adjusted the n, the r and b knobs, and hopefully anticipated a turn for the better...'
Fetal Lamb Rests In Artificial Womb
'... stewing warm on their cushion of peritoneum and gorged with blood-surrogate and hormones, the foetuses grew and grew...'
MIT Wants To Catch Interstellar Visitors
'INVESTIGATE MYSTERIOUS OBJECT ENTERING NEW CALEDONIA SYSTEM FROM NORMAL SPACE'
AutoX Sets Up Asia's Largest Robotaxi Center
'The robot cab seemed to know where it was going and, no doubt, the master machine from which it received its signals knew.'
E - Ink's Automatic Self Styling Color-Changing Dress
'The racks of gowns itched and quivered, their colors running into blurred pools.'
Soft Robots Use Kirigami Piezoelectric Sensor Skin
'A worthy opponent was the golem.'
Bosch Smartglasses Laser Paints AR Image On Your Retina
'Soon we'll be testing a system that projects directly on the retina of the eye.'
Maybe We Could Hibernate Until The Covid-19 Pandemic's End
'Cold-rest was a common last resort therapy for functional psychoses.'
Workplace Monitoring Hell, I Mean, Tool For Safe Distancing
'And here is the weirdest part -- I never see another employee the entire day.'
Patent Office Says AIs Cannot Be Inventors
'The real smart ones are as smart as the Turing heat is willing to let 'em get.'
Starlink Orbital Network Like Coruscant Traffic Jam
Vermin of the Sky? Or Internet access for all?
More SF in the News Stories
More Beyond Technovelgy science news stories