Artificial Intelligence
Newly developed ip-reading technology could help in solving crimes and provide communication assistance for people with hearing and speech impairments according to the researchers who built it.
New lip-reading technology developed at the University of East Anglia could help in solving crimes and provide communication assistance for people with hearing and speech impairments.
Related articles
In the paper, 'Decoding visemes: Improving machine lip-reading', authors Dr Helen L. Bear and Prof Richard Harvey of UEA’s School of Computing Sciences, describe how their system can be applied “any place where the audio isn’t good enough to determine what people are saying.”“Lip-reading is one of the most challenging problems in artificial intelligence so it’s great to make progress on one of the trickier aspects, which is how to train machines to recognise the appearance and shape of human lips,” said Harvey
Dr Helen L. Bear |
"Potentially, a robust lip-reading system could be applied in a number of situations, from criminal investigations to entertainment."
Bear, said unique problems with determining speech arise when sound isn’t available – such as on CCTV footage – or if the audio is inadequate and there aren’t clues to give the context of a conversation. The sounds ‘/p/,’ ‘/b/,’ and ‘/m/’ all look similar on the lips, but now the machine lip-reading classification technology can differentiate between the sounds for a more accurate translation.She said, “We are still learning the science of visual speech and what it is people need to know to create a fool-proof recognition model for lip-reading, but this classification system improves upon previous lip-reading methods by using a novel training method for the classifiers.
“Potentially, a robust lip-reading system could be applied in a number of situations, from criminal investigations to entertainment. Lip-reading has been used to pinpoint words footballers have shouted in heated moments on the pitch, but is likely to be of most practical use in situations where are there are high levels of noise, such as in cars or aircraft cockpits.
“Crucially, whilst there are still improvements to be made, such a system could be adapted for use for a range of purposes – for example, for people with hearing or speech impairments. Alternatively, a good lip-reading machine could be part of an audio-visual recognition system."
While Bear and Harvey's work is impressive, it does conjure up an image of an AI system that may not always work in our favour...
0 comments:
Post a Comment