It will mean stories can be defined, on the fly, with a precision greater than a library's card catalogue. The News Engine Web Services (NEWS) platform is aimed at news agencies, governments and large enterprises and will enable them to develop highly advanced analysis to raw text, with a vast number of potential applications.
News agencies will be able to automatically create very highly personalised news profiles for readers. Governments will be able to analyse social and political trends through newspaper reports, at a much higher level of detail than was possible previously, and large businesses will be able to study market and product developments.
The project that developed the platform even managed to develop a proof-of-concept service for analysing audio, by combining their system with a commercial voice recognition programme.
At the heart of this functionality is the powerful classification and ontology-based annotation system that can work across languages. "News classifications up to now typically consisted of about 12 terms, like sport, world news, finance, that a journalist knew off by heart," says Dr Ansgar Bernardi, deputy head of the Knowledge Management Group at DFKI, the German Research Centre for Artificial Intelligence, and coordinator of the IST-funded NEWS project.
"That's not very precise. Our system can automatically analyse a story and access 1300 classification terms to define it," says Bernardi.
What's more it can access a large ontology of terms related to the specific story definitions within a class, terms like president, head-of-state and government in the politics class, for example. The end result is a very large data set of standardised terms that define the story's content.
That data set can then be used in a huge variety of ways to potentially answer almost any query a user can imagine. A simple example: "Show me news items about the US president in January 2006" will deliver news items about George W. Bush in this time frame.
"We expect that platform users will take the basic functionality and develop around it to respond to the information they want to analyse," says Bernardi. The system also needs to be 'trained' for analysis of specific topics.
To avoid 'false positives', where two people of the same name are confused, for example, or where two cities have the same name, the NEWS team developed IdentityRank, an adaptive algorithm for instance disambiguation.
"It really started out as a by-product of our main work, but it works well and I think it may generate quite a bit of scientific interest," says Bernardi.
It's only one of NEWS' many achievements, and work will not stop there. "We have developed a great network during the project and the consortium has agreed to offer mutual support for a further two years. In the meantime we are pursuing commercial opportunities, several news agencies are interested in the platform, and we had a lot of exposure at CEBIT '05 and '06," says Bernardi.
Source:
IST Results
Related stories:
Closure of ‘Halo Wars' developer shocking
It's hard to believe that any developer making a game based on Halo could be shut down for financial reasons, but that's the fate awaiting Dallas-based Ensemble Studios.
New polling system will track viewers' instant impressions of debate
Among the millions of Americans watching Friday's first presidential debate will be 2,000 with cell phone or computer mouse in hand.
Oracle's Agile Buy Could Boost PLM
With Oracle's intended $495 million acquisition of software maker Agile Software, product lifecycle management may finally have its day.
Microsoft Outlines BI Strategy, Platform
Katmai, PerformancePoint Server and Excel will play key roles in Microsoft's quest to make business intelligence more ubiquitous.
Will 'Santa Rosa' Make a Big Splash in the Mobile Market?
Think of it as Centrino, part 4. While Advanced Micro Devices' attempted to steal the spotlight last week, all the company did was call attention to Intel's 'Santa Rosa' launch tomorrow, which represents the latest major update to Intel's Centrino mobile platform. Intel is now set to officially launch the platform Wednesday morning.
NBC/News Corp. Video Site to Be 'Mostly Free'
News Corp. and NBC Universal confirmed that their unnamed video site will launch this summer, featuring "mostly free" content together with partners AOL, Microsoft, and Yahoo. But don't say it competes with YouTube, they insisted.
Blinkx to lead in video search engine
When it comes to video searching on the Net, blinkx is big. Deeming itself the smartest and largest video search engine on the Web, blinkx.tv delivers 4 million hours of searchable content -- audio, video, and TV via RSS -- and boast more content than Google Video and Yahoo.
It's a wrap, GDC top 12 list
It's been a long week of talking game development tools, dealing with a cold and rainy San Jose, and listening to companies saying "Wait until E3," instead of answering questions. But this correspondent persevered to bring you the top 12 tools and items uncovered at the 2006 Game Developer's Conference.