Visual Speech Recognition - When Will HAL Read Lips For Real?
Visual Speech Recognition, also known as automated lip reading, is a field with a special meaning for science fiction fans. In the film 2001:A Space Odyssey, the HAL 9000 computer was able to read lips.
(HAL 9000 [background] eavesdrops on astronauts Poole and Bowman)
In the film, HAL's increasingly erratic behavior becomes a matter of concern for the astronauts. Since HAL can effectively monitor every part of the ship, the astronauts retire to a small pod to discuss the matter. Unfortunately, it turns out that somebody did research on computer lip-reading, and so HAL was on to them, with very unfortunate results for Poole.
In a recent paper, Ahmad Hassanat at Mu’tah University in Jordan provides a review of existing approaches, and suggestions for moving forward with VSR. He also outlines some of the challenges in actually creating a computer able to read lips, like the fictional HAL 9000.
(From Visual Speech Recognition chart)
The fundamental process of lip reading is to recognize a sequence of shapes formed by the mouth and then match it to a specific word or sequence of words.
There is a significant challenge here. During speech, the mouth forms between 10 and 14 different shapes, known as visemes. By contrast, speech contains around 50 individual sounds known as phonemes. So a single viseme can represent several different phonemes.
And therein lies the problem. A sequence of visemes cannot usually be associated with a unique word or sequence of words. Instead, a sequence of visemes can have several different solutions.
The first problem for automated lip reading is face and lip recognition. This has improved in leaps and bounds in recent years. A more difficult challenge is in recognizing, extracting and categorizing the geometric features of the lips during speech.
This is done by measuring the height and width of the lips as well as other features such as the shape of the ellipse bounding the lips, the amount of teeth on view and the redness of the image, which determines the amount of tongue that is visible.
Determining the exact contour of the lips is hard because of the relatively small difference between pixels showing face and lips.
Another problem is that some people are more expressive with their lips than others so it easier to interpret what they are saying from lip movements alone. Indeed, some people hardly move their lips at all and these so-called “visual-speechless persons” are almost impossible to interpret.
Hassanat’s own visual speech recognition system is remarkably good. His experiments achieve an average success rate of 76 percent, albeit in carefully controlled conditions. The success rate is even higher for women because of the absence of beards and mustaches.
Technovelgy readers may want to recall that, even in the surveillance classic 1984, the telescreen was always on, but whether or someone was watching was not clear.
There was of course no way of knowing whether you were being watched at any given moment. How often, or on what system, the Thought Police plugged in on any individual wire was guesswork.
With Visual Speech Recognition, thought, your conversation with others could be surveilled by machines even if people are not watching.
Via Technology Review and Visual Speech Recognition
Scroll down for more stories in the same category. (Story submitted 9/17/2014)
Follow this kind of news @Technovelgy.
| Email | RSS | Blog It | Stumble | del.icio.us | Digg | Reddit |
you like to contribute a story tip?
Get the URL of the story, and the related sf author, and add
Comment/Join discussion ( 0 )
Related News Stories -
Amazon Echo And Google Home Should Have Morality Software
'The Dwoskin Morality Rating-Computer could 'spot the slightest tendency to deviation' from the social norm...' - Kendall Foster Crossen, 1953.
Deepfakes From OpenAI GPT-2 Algorithm
'How can you compete with an IBM heavy-duty logomatic analogue?' - JG Ballard, 1971.
Fishy Facial Recognition Now Possible
'Palenkis can identify random line patterns better than any other species in the universe.' - Frank Herbert, 1969.
LawGeex AI Beats 20 Top Lawyers
'The Law Society has strict rules on the use of pseudo-intelligent software - terrified of putting... its members out of work.' - Greg Egan, 1991.
Technovelgy (that's tech-novel-gee!)
is devoted to the creative science inventions and ideas of sf authors. Look for
the Invention Category that interests
you, the Glossary, the Invention
Timeline, or see what's New.
'Metallic Wood' Strong Like Titanium, Floats In Water
'A metal... light as cork and stronger than steel...'
Seabreacher, H.G. Winter's 1939 Torpoon
'Ken lay full-length in the padded body compartment, his feet resting on the controlling bars of the directional planes, hands on the torpoon's engine levers.'
Abundant Robotics Autonomous Apple Harvester Robot
'... little machines, that went from plant to plant... cutting off the ripe fruit.'
Charging An Electric Car In 2019 (Video), 1912 (Photo) And 1894 (Fiction)
'Recharge the batteries... in almost every town and village...'
Japan Uses Explosives On Asteroid
'...a tiny, rocket-powered projectile, drove towards the mysterious bulk. It hit, exploding into a cloud of incandescent vapour.'
Get Your Speeder Flying Motorcycle From Jetpack Aviation
'The flycycles were miracles of compact design.'
FLIR Black Hornet 3 Palm-sized Drone
These drones can provide situational awareness beyond visual line-of-sight capability.
Dockworkers Protest Driverless Trucks
'It resembled conventional human-operated transportation vehicles, but with one exception -- there was no driver's cabin.'
Flying Car Concept By Kash Sirinanda
'Each one consists of a hub with many tiny spokes... On the end is a squat foot, rubber tread on the bottom...'
Unfurl The Future! Huawei Mate X versus Galaxy Fold
'A paper thin polycarbon screen unfurled silently from the top of the unit and immediately grew rigid.'
Amazon Echo And Google Home Should Have Morality Software
'The Dwoskin Morality Rating-Computer could 'spot the slightest tendency to deviation' from the social norm...'
China Building Robot Wives
'Want a life-companion, a pleasant one?'
China Social Credit System Like State-Run Whuffie
'At least there was no mandatory Whuffie check on the monorail platform...'
Project Soli Radar Gesture Chip Now FCC Approved
'He waved his hand and the circuit switched abruptly.'
Stan, Robot Valet, Will Drag Your Car Away
'He activated the grapple tracks. '
Jibo Home Robot Says Goodbye, Is Killswitched
'It resembles an oyster....'
More SF in the News Stories
More Beyond Technovelgy science news stories