Deep Speech 2: Mandarin and English Recognized

End-to-end deep learning presents the opportunity to improve speech recognition systems continually with increases in data and computation. Indeed, this paper proves that transcription performance can be vastly improved using the same approach for very different languages.

We show that an end-to-end deep learning approach can be used to recognize either English or Mandarin Chinese speech--two vastly different languages. Because it replaces entire pipelines of hand-engineered components with neural networks, end-to-end learning allows us to handle a diverse variety of speech including noisy environments, accents and different languages. Key to our approach is our application of HPC techniques, resulting in a 7x speedup over our previous system. Because of this efficiency, experiments that previously took weeks now run in days. This enables us to iterate more quickly to identify superior architectures and algorithms. As a result, in several cases, our system is competitive with the transcription of human workers when benchmarked on standard datasets. Finally, using a technique called Batch Dispatch with GPUs in the data center, we show that our system can be inexpensively deployed in an online setting, delivering low latency when serving users at scale.

As far as I know, the first sf reference to the idea of real, automated translation of language is the translatophone from a story of the same name by Frank Stockton - in 1901!

One of the most successful of these various contrivances, and the one, indeed, in which I was most deeply interested, was a small machine very much resembling in appearance the tube, with a mouth-piece at one end and an ear-piece at the other, frequently used by deaf persons, but very different in its construction and action. In the ordinary instrument the words spoken into the mouth-piece are carried through the tube to the ear, and are then heard exactly as they are spoken. When I used my instrument the person spoke into the mouth-piece exactly as if it were an ordinary tube, but the result was very different, for the great feature of my invention was that, no matter what language was spoken by the person at the mouth-piece, be it Greek, Choctaw, or Chinese, the words came to the ear in perfect English...

Via Deep Speech 2: End-to-End Speech Recognition in English and Mandarin.

Scroll down for more stories in the same category. (Story submitted 12/20/2015)

Follow this kind of news @Technovelgy.

| Email | RSS | Blog It | Stumble | del.icio.us | Digg | Reddit |

Would you like to contribute a story tip? It's easy:
Get the URL of the story, and the related sf author, and add it here.

Comment/Join discussion ( 0 )

Related News Stories - (" Communication ")

Realistic Translation With The Waverly Labs Ambassador
'The speech patterns you actually hear decode the brainwave matrix which has been fed into your mind by your Babel fish.' - By Douglas Adams, 1979.

Soli Gesture Tech Will Be In Google Pixel 4
'I enjoy watching this way, but - He waved his hand and the circuit switched abruptly.' - Philip K Dick, 1955.

Lost Language Meanings Found By Machine Learning
'The autopilot would need data before it could begin a translation...' - Larry Niven, 1970.

The Future Of Elon Musk's Neuralink
'Cerebral Electromagnetic Emmission Amplification and Relay System call it artificial telepathy, if you like.' - Richard Meredith, 1969.

 

Google
  Web TechNovelgy.com   

Technovelgy (that's tech-novel-gee!) is devoted to the creative science inventions and ideas of sf authors. Look for the Invention Category that interests you, the Glossary, the Invention Timeline, or see what's New.

 

 

 

 

 

Current News

Cruise Autonomous Car Drives Aimlessly For An Hour
Convincing video shows progress (and limitations).

Fast Charging A Bus In 20 Seconds
'... in almost every town and village.'

Realistic Translation With The Waverly Labs Ambassador
'The speech patterns you actually hear decode the brainwave matrix which has been fed into your mind by your Babel fish.'

Biotech Firms Raised $Millions For Anti-Agathics (Longevity Drugs)
'Against Death doth no simple grow.'

Out-Of-Work Blue Collar Robots Need Your Help
'His legs relaxed with a rattle as he cut off all power below his waist... and ran his eye down the Help Wanted - Robot column...'

The Dawn Of Orbiting Manufacturing In 2020?
'It can be mass-produced only in the orbiting factories.'

Smart Contact Lenses Charges With 3D Printed Antenna
'He realized that it was not quite a clear lens.'

Segway S-Pod Fulfills Dire 1928 SciFi Prophecy
'Noiselessly, on rubber-tired wheels, they journeyed down the long aisles...'

Physicist Inspired By SciFi And Seeing Back In Time
'Here is the chronoscope... Scansion depends upon a special curved field...'

Airbnb Has AI Psychiatrist Looking At Your Facebook
'It's illegal to hold back information during a psyche test.'

NASA's Electric Motor Scooter
'...all the [lunar] prospectors took bicycles along as a matter of course'

Moving Suns To Different Galactic Neighborhoods
'...to swerve their star from its course, the globemen made use of a simple physical principle.'

Students Surveilled By School Phone Apps
Cheer up, students. '...cracking my SchoolBook had been easy.'

Massage Robot Has Soft Hands, Er, Pads
'The automatic massager began to fumble gently over my naked form.'

Medical Tattoos Are STILL Being Researched
'Following the current craze, she has had a subdermal pattern of micro-channels implanted.'

Elon Musk's Traffic Tunnel Challenge Is Boring
'The car vibrated... threading the maze of local tubes.'

More SF in the News Stories

More Beyond Technovelgy science news stories

Home | Glossary | Invention Timeline | Category | New | Contact Us | FAQ | Advertise |
Technovelgy.com - where science meets fiction™

Copyright© Technovelgy LLC; all rights reserved.