Automatic analysis of the contents of multimedia documents requires to combine informations coming from various data types (audio, video, text...). In this paper, we propose an architecture that describes agents for processing flows of information. These agents can be applied to elementary data types (audio, video...), but also on the results produced by other agents. The architecture includes a Multimedia Flow Browser, which is able to display simultaneously visual representations of the various information flows produced by agents, and an Agent Editor, which provides a graphical interface to create new agents by combining existing agents. The architecture is open, so that it is possible to add new data types (and the procedures to visualize them) and new agents. A simulated example is presented to show the possible usage of this tool in an application based on TV News recordings.
A generic tool for content-based multimedia browsing
IJCAI 1995, International Joint Conference on Artificial Intelligence Workshop on Intelligent Multimedia Information Retrieval, August 20-25, 1995, Montreal, Canada
PERMALINK : https://www.eurecom.fr/publication/138