RELATING WORDS AND IMAGE SEGMENTS ON MULTIPLE LAYERS FOR EFFECTIVE BROWSING AND RETRIEVAL (TP-P6)

Author(s) :

Andrea Kutics	(Tokyo University of Technology / NTT Data Corporation, Japan)
Akihiko Nakagawa	(NTT Data Corporation, Japan)
Shoji Arai	(Japan Systems Co., Ltd., Japan)
Hiroyuki Tanaka	(NTT Data Corporation, Japan)
Sakuichi Ohtsuka	(NTT Data Corporation, Japan)

Abstract :

This work proposes a new method for relating words and image segments by finding semantic coherence between these two cues on multiple layers. The method is based on the matching of visual segment clusters with words on various levels of abstraction. Our purpose here is to ease two main problems encountered in content-based image retrieval, namely, lack of semantic information captured by visual feature-based indexing, and difficulty of handling subjectivity of user queries. The method is very promising for effective browsing and retrieval in large image data sets. It supports both target- and category-type browsing and searching schemes as well as textual and/or visual query specifications. Results of experiments on a wide, non-specific image domain suggests that step by step semantic inference on consecutive layers of image - word association helps to improve accuracy of retrieval and browsing.

Menu