Deep Speech 2: Mandarin and English Recognized
End-to-end deep learning presents the opportunity to improve speech recognition systems continually with increases in data and computation. Indeed, this paper proves that transcription performance can be vastly improved using the same approach for very different languages.
We show that an end-to-end deep learning approach can be used to recognize either English or Mandarin Chinese speech--two vastly different languages. Because it replaces entire pipelines of hand-engineered components with neural networks, end-to-end learning allows us to handle a diverse variety of speech including noisy environments, accents and different languages. Key to our approach is our application of HPC techniques, resulting in a 7x speedup over our previous system. Because of this efficiency, experiments that previously took weeks now run in days. This enables us to iterate more quickly to identify superior architectures and algorithms. As a result, in several cases, our system is competitive with the transcription of human workers when benchmarked on standard datasets. Finally, using a technique called Batch Dispatch with GPUs in the data center, we show that our system can be inexpensively deployed in an online setting, delivering low latency when serving users at scale.
As far as I know, the first sf reference to the idea of real, automated translation of language is the translatophone from a story of the same name by Frank Stockton - in 1901!
One of the most successful of these various contrivances, and the one, indeed, in which I was most deeply interested, was a small machine very much resembling in appearance the tube, with a mouth-piece at one end and an ear-piece at the other, frequently used by deaf persons, but very different in its construction and action. In the ordinary instrument the words spoken into the mouth-piece are carried through the tube to the ear, and are then heard exactly as they are spoken. When I used my instrument the person spoke into the mouth-piece exactly as if it were an ordinary tube, but the result was very different, for the great feature of my invention was that, no matter what language was spoken by the person at the mouth-piece, be it Greek, Choctaw, or Chinese, the words came to the ear in perfect English...
Via Deep Speech 2: End-to-End Speech Recognition in English and Mandarin.
Scroll down for more stories in the same category. (Story submitted 12/20/2015)
Follow this kind of news @Technovelgy.
| Email | RSS | Blog It | Stumble | del.icio.us | Digg | Reddit |
you like to contribute a story tip?
Get the URL of the story, and the related sf author, and add
Comment/Join discussion ( 0 )
Related News Stories -
Publishing Technologies In Science Fiction
In response to a reader question, a set of links related to publishing technologies in science fiction
Hurdl PIXL Wearable Helps Fans Connect With Stars
Like Macross Plus!
Advertising Drones Hover Over Traffic In Mexico
'Blurbflies are allowd to travel the streets, buzzing their adverts alive and direct...' - Jeff Noon, 2000.
Audiobooks - Fastest Growing Format In Publishing
'The public preferred lectons...' - Stanislaw Lem, 1961.
Technovelgy (that's tech-novel-gee!)
is devoted to the creative science inventions and ideas of sf authors. Look for
the Invention Category that interests
you, the Glossary, the Invention
Timeline, or see what's New.
Desktopography Makes Virtual Desktops Real
'Ender doodled on his desk, drawing contour maps of mountainous islands and then telling his desk to display them in three dimensions...'
LaWS Laser Can Take Out Rogue Drones
Looks like a weapon for the Runaway squad!
Moon Express Lunar Robot Mining: Shine On, Harvest Moon
'The bulldozer moved through the lunar strip mine... '
Liquid Body Armor For TALOS Exoskeleton
'... instantly became rigid all over when something struck it...'
Hyperloop One Video Shows It Works!
'Complete evacuation of the interior of the tubes [and] a wave that provides the new propulsive energy for the cars...'
Chairless Chair Exoskeleton By Sapetti
'Earth's scientists... devised rigid metallic clothing...'
Publishing Technologies In Science Fiction
Well, this should be enough references to start...
Russia Working On Military Exoskeletons
'Вы похожи на большую стальную гориллу...'
3D Printed Bionic Chinese Skin
Your skin is ready!
Flexup Tire Design Good For Tumblebugs
'Each spoke telescopes into five sections.'
3D Printed Graphene Aerogel - So Light!
'... light as cork and stronger than steel...'
Asteroid Deflection With DART
'This obelisk is one huge deflector mechanism...'
Translate One2One From IBM's Watson Your Communication Solution
'It then excretes into the mind of its carrier a telepathic matrix...'
News Now Philip K. Dick's Bailiwick
'A vast complex electronic organism... responsible to no one...'
Autonomous BADGER Robot Drilling Machine
'The compacted matter... makes a better tunnel lining than concrete, don't you think?'
TALOS Exoskeleton Development Proceeding
'Suited up, you look like a big steel gorilla...'
More SF in the News Stories
More Beyond Technovelgy science news stories