Deep Speech 2: Mandarin and English Recognized

End-to-end deep learning presents the opportunity to improve speech recognition systems continually with increases in data and computation. Indeed, this paper proves that transcription performance can be vastly improved using the same approach for very different languages.

We show that an end-to-end deep learning approach can be used to recognize either English or Mandarin Chinese speech--two vastly different languages. Because it replaces entire pipelines of hand-engineered components with neural networks, end-to-end learning allows us to handle a diverse variety of speech including noisy environments, accents and different languages. Key to our approach is our application of HPC techniques, resulting in a 7x speedup over our previous system. Because of this efficiency, experiments that previously took weeks now run in days. This enables us to iterate more quickly to identify superior architectures and algorithms. As a result, in several cases, our system is competitive with the transcription of human workers when benchmarked on standard datasets. Finally, using a technique called Batch Dispatch with GPUs in the data center, we show that our system can be inexpensively deployed in an online setting, delivering low latency when serving users at scale.

As far as I know, the first sf reference to the idea of real, automated translation of language is the translatophone from a story of the same name by Frank Stockton - in 1901!

One of the most successful of these various contrivances, and the one, indeed, in which I was most deeply interested, was a small machine very much resembling in appearance the tube, with a mouth-piece at one end and an ear-piece at the other, frequently used by deaf persons, but very different in its construction and action. In the ordinary instrument the words spoken into the mouth-piece are carried through the tube to the ear, and are then heard exactly as they are spoken. When I used my instrument the person spoke into the mouth-piece exactly as if it were an ordinary tube, but the result was very different, for the great feature of my invention was that, no matter what language was spoken by the person at the mouth-piece, be it Greek, Choctaw, or Chinese, the words came to the ear in perfect English...

Via Deep Speech 2: End-to-End Speech Recognition in English and Mandarin.

Scroll down for more stories in the same category. (Story submitted 12/20/2015)

Follow this kind of news @Technovelgy.

| Email | RSS | Blog It | Stumble | del.icio.us | Digg | Reddit |

Would you like to contribute a story tip? It's easy:
Get the URL of the story, and the related sf author, and add it here.

Comment/Join discussion ( 0 )

Related News Stories - (" Communication ")

Soli Gesture Tech Will Be In Google Pixel 4
'I enjoy watching this way, but - He waved his hand and the circuit switched abruptly.' - Philip K Dick, 1955.

Lost Language Meanings Found By Machine Learning
'The autopilot would need data before it could begin a translation...' - Larry Niven, 1970.

The Future Of Elon Musk's Neuralink
'Cerebral Electromagnetic Emmission Amplification and Relay System call it artificial telepathy, if you like.' - Richard Meredith, 1969.

BloxVox Mutes Cellphone Convos
'had he not been talking into a hush-a-phone which he had plugged into the telephone jack...' - Robert Heinlein, 1940.

 

Google
  Web TechNovelgy.com   

Technovelgy (that's tech-novel-gee!) is devoted to the creative science inventions and ideas of sf authors. Look for the Invention Category that interests you, the Glossary, the Invention Timeline, or see what's New.

 

 

 

 

 

Current News

Orbital Display's Low Earth Orbit Advertisements
'A vast circle of scarlet stars came up into the greenish desert dusk.'

Neuromorphic Computing Hardare
'He had constructed an organ, a brain, of metal, entirely inorganic and lifeless...'

Vascularized Human Skin 3D Printed
Hey Fishboy!

Trillionaires Still Earth-Bound
'I shall never forget the sight... when the yellow gleam of the precious metal appeared under the star dust.'

Digit V2 Bipedal Robot From Agility Robotics
Oh, and now I suppose someone will develop the robotic porch pirate.

3D Printed Dubai Building Is World's Largest
'This thing will start at one end of ...a house and build it complete to the other end, following drawings only.'

Grow Plants On Moon Or Mars!
'In contrast to the airless desolation outside, the interior of this five-acre greenhouse was the one most desirable place to be.'

California Gets Shockwave Rider-Style Avoidance Zones
'It was cheaper to pay the refugees to go without up-to-the-minute equipment.'

Microbot Interstellar von Neumann Explorers
'Evidently they have never had a planet of their own...'

Hail SmartCan! Your Trash Bin Takes Itself Out
'...a waste can twenty feet away stirred into life.'

Finally! Microsoft Surface Neo And Surface Duo Implement Excellent Courier Idea
'Runcible, whose pages were thicker and more densely packed with computational machinery...'

Tap Strap 2 Now With Air Mouse
'He waved his hand and the circuit switched abruptly.'

Legal Profession Now Fairly Bristling With AI
'The virtual counsel appeared to be about forty-five years old and prosperous.'

Entire Planet Modeled In New MS Flight Sim
'CIC uses [it] to keep track of every bit of spatial information that it owns...'

FlyZoo Robot Hotel By Alibaba
'... hotels that specialized in non-human service.'

Implanted Memories Provide Songs To Birds
Finches can't tell the difference.

More SF in the News Stories

More Beyond Technovelgy science news stories

Home | Glossary | Invention Timeline | Category | New | Contact Us | FAQ | Advertise |
Technovelgy.com - where science meets fiction™

Copyright© Technovelgy LLC; all rights reserved.