Natural spoken dialogue technology has long been a dream for many. Advances by European researchers are making this a reality. The results of their work could soon be used to allow us to verbally interact with technology in our everyday lives, from the music systems in our cars to functions in the homes of wheelchair users.
Interactions between human and computer are currently inefficient, particularly when we try talking. Previously, users have had to rely on specific commands making natural interactions in everyday language impossible.
Motivated by the idea of allowing people to say what they want to say, in the way they want to say it, the EU-funded TALK (Talk and Look, Tools for Ambient Linguistic Knowledge) project set about developing technology that would also allow for the systems to learn from the process.
“We developed methods for designing better, more natural, flexible, and adaptive spoken dialogue systems that learn from their interactions with users,” says Oliver Lemon from Edinburgh University and project coordinator. “We showed for the first time that machine learning techniques in Information State Update systems can lead to better human-computer interaction.”
And show it they did, at the IST 2006 conference in Helsinki, where project partners BMW, Bosch, and DFKI showcased some of the fruits of the project with SAMMIE, an in-car dialogue system for an MP3 player.
Sammie was installed in a BMW car. The system operates in German and English. This multilingual system is a first in human-computer interactions.
“This in-car system was extensively tested by BMW and Bosch, with real drivers, and was assessed to be less distracting and more comfortable than two competing systems,” says Lemon.
Information states versus graphs
First generation human-computer dialogue systems modelled speech interactions as a series of graphs. While this yields a functioning system, it is also an inefficient and fixed system. Key to the advances is a mathematical structure known as the Information State Update (ISU) approach, which was developed by TALK’s predecessor projects.
The approach, already used in many projects and applications around the world, uses information recorded in the course of human-computer-dialogue and saved in the ‘information state’ of the system.
This can include, for example, a formal meaning representation of information that has been uttered by a user, and a complete history of the dialogue up to that point. The computer or device then calculates the appropriate reaction for the situation, as well as updating the information state with new data on the user and context.
“This approach allows a level of flexibility, adaptivity, robustness, and naturalness of interaction which is superior to the previous techniques, which rely on simple finite state machine representations which essentially model conversations as enormous graphs,” says Lemon.
The TALK technologies can also be applied to different languages, graphical interfaces and operating systems, which means developing other applications that use the technology will be more time- and cost-effective.
“Given the continuous improvements in embedded computing devices, such as better processor speed and miniaturisation, we can expect that speech interfaces similar to SAMMIE can now be installed in vehicles at relatively low cost,” says Lemon.
“In fact, note that many new vehicles now do have voice control, like the BMW iDrive that is similar in many respects to the TALK project's SAMMIE system, although some of its features have been simplified.”
Out of the car and into the home
In addition to SAMMIE in the car, which allows the user to control the onboard music system by talking to it, the impressive list of developments using the ISU and machine learning technologies include TownInfo. The service gives tourists a talking guide of the place they are visiting. Another, AgendaTalk, functions as a voice-controlled calendar and diary.
Additionally, the technology has been applied to making the lives of housebound or mobility-restricted people easier. The European researchers developed MIMUS – a spoken dialogue system for smart homes for wheelchair users.
“The smart-home system is an excellent example of the potential of natural, robust, and adaptive spoken dialogue technology to transform the ways in which we will interact with IT in the future, for all members of society,” says Lemon.
The future today
The BMW iDrive, developed alongside the SAMMIE system, is proof that the TALK technology is the science of today and not tomorrow. But what about the other possibilities? When could these start filtering through to everyday life?
“Some other results, such as machine learning techniques, will take longer to influence spoken dialogue systems in everyday life, but the levels of adaptivity and robustness that the various TALK systems offer will be vital in future speech interfaces,” says Lemon.
And the researchers are continuing their studies, albeit under a different guise. A new EU funded project ‘CLASSIC’ (Computational Learning in Adaptive Systems for Spoken Conversation) will ensure the EU-researchers keep pushing forward the boundaries of human-computer interaction.
Source:
ICT Results
Related stories:
Scientists making waves with wireless ocean monitoring
The Great Barrier Reef and other sensitive environments could in future be managed with the help of a marine wireless network developed by University of Queensland (UQ)-led research.
Researchers develop neural implant that learns with the brain
Devices known as brain-machine interfaces could someday be used routinely to help paralyzed patients and amputees control prosthetic limbs with just their thoughts. Now, University of Florida researchers have taken the concept a step further, devising a way for computerized devices not only to translate brain signals into movement but also to evolve with the brain as it learns.
3-D photonic crystals make novel add-drop filters
Researchers at the U.S. Department of Energy's Ames Laboratory have come up with a potentially perfect way to sort and distribute the massive amounts of data that travel daily over optical fibers to people throughout the world. The new technology, a three-dimensional photonic crystal add-drop filter, promises greatly enhanced transmission of multiple wavelength channels (wavelengths of light) traveling along the same optical fiber. The innovative filter is a significant achievement in the effort to develop all-optical transport networks that would eliminate electrical components from optical transmission links and guarantee virtually flawless data reception to end users of the Internet and other fiber-based telecommunications systems.
Pollution from marine vessels linked to heart and lung disease
Pollution from marine shipping causes approximately 60,000 premature cardiopulmonary and lung cancer deaths around the world each year, according to a report scheduled to appear in the Dec. 15 issue of
Environmental Science and Technology, the journal of the American Chemical Society.
Machines might talk with humans by putting themselves in our shoes
While robots can do some remarkable things, they don't yet possess the gift of gab. Since the 1970s, researchers have been trying to develop a speech-based human-machine interface, but improvements are gradual, and some fear that the performance of current systems may not reach an adequate level for real-world applications.
Blackboard Problems Leave Vista on Double-Secret Probation
On college campuses, Microsoft's Vista operating system may be in danger of failing courses that use Blackboard, a key software program for communication between teachers and students.
Europe and Russia confirm closer space cooperation
Europe and Russia are strengthening their cooperation in space. The Head of the Federal Space Agency of the Russian Federation (Roscosmos), Anatoly Perminov, ESA Director General, Jean-Jacques Dordain, and European Commission Director General Heinz Zourek met today at Roscosmos in Moscow within the framework of the Tripartite Space Dialogue between the European Commission, European Space Agency and Roscosmos.
Adobe Ships Photoshop Lightroom 1.0
Adobe Systems announced that its Adobe Photoshop Lightroom 1.0 software is now out of beta and available for customers at a special introductory price of US $199. New Adobe Photoshop Lightroom is the professional photographer's essential toolbox, providing one easy application for managing, adjusting, and presenting large volumes of digital photographs.