IJCSI International Journal of Computer Science Issues, Vol. 9, Issue 2, No 3, March 2012 ISSN (Online): 1694-0814 www.IJCSI.org 223
Web Image Retrieval Search Engine based on Semantically Shared Annotation
Alaa Riad1, Hamdy Elminir2 and Sameh Abd-Elghany3
1Vice dean of Students Affair, Faculty of Computers and Information Sciences, Mansoura University Mansoura, Egypt
2Head of Electronic and Communication Dept, Misr Higher Institute of Engineering and Technology Mansoura, Egypt
3Faculty of Computers and Information Sciences, Mansoura University Mansoura, Egypt
Abstract perceives from the image [3]. This paper try to narrow the This paper presents a new majority voting technique that combines semantic gap problem and enhance the retrieval precision by the two basic modalities of Web images textual and visual features fusing the two basic modalities of Web images, i.e., Textual of image in a re-annotation and search based framework. The context (usually represented by keywords) and visual features proposed framework considers each web page as a voter to vote the for retrieval. This paper presents a new majority voting relatedness of keyword to the web image, the proposed approach is not only pure combination between image low level feature and technique that considers each web page as a voter to vote the textual feature but it take into consideration the semantic meaning of relatedness of keyword to the web image, the proposed each keyword that expected to enhance the retrieval accuracy. The approach is not only pure combination between image low proposed approach is not used only to enhance the retrieval accuracy level feature and textual feature but it take into consideration of web images; but also able to annotated the unlabeled images. the semantic meaning of each keyword that expected to Keywords: Web Image Retrieval, Ontology, Data Mining. enhance the retrieval accuracy.
This paper is organized as follows; the related work presented 1. Introduction in section 2, while, section 3 presents in details the architecture of the proposed approach, while section 4 With the advent of the Internet, billions of images are now concludes this research. freely available online [1]. The rapid growth in the volume of such images can make the task of finding and accessing image of interest, overwhelming for users Therefore, some 2. Related work additional processing is needed in order to make such collections searchable in a useful manner. The current Web Current research in web image retrieval suggested a joint use image retrieval search engines, including Google image existing Textual context and visual features can provide a search, Lycos and AltaVista photo finder, use text (i.e., better retrieval results [4,5]. The simplest approach for this surrounding words) to look for images, without considering method is based on counting the frequency-of-occurrence of image content [2,15]. However, when the surrounding words words for automatic indexing. This simple approach can be are ambiguous or even irrelevant to the image, the search extended by giving more weights to the words which occur in based on text only will result in many unwanted result the alt or src tag of the image or which can occur inside the images. head tag or any other important tags of the HTML document. In contrast, Content-Based Image Retrieval (CBIR) systems However, purely combination of traditional text-based are proposed to utilize only low-level features, such as color, retrieval and content-based retrieval is not adequate to deal texture and shape, to retrieve similar images. In general, the with the problem of image retrieval on the WWW. The first bottleneck to the efficiency of CBIR is the semantic gap reason is that there is already too much clutter and irrelevant between the high level image interpretations of the users and information on the web pages. These semantic features are the low level image features stored in the database for less accurate than annotating text. The second reason is due indexing and querying. In other words, there is a difference to the mismatch between the page author’s expression and the between what image features can distinguish and what people user’s understanding and expectation. This problem is similar
Copyright (c) 2012 International Journal of Computer Science Issues. All Rights Reserved. IJCSI International Journal of Computer Science Issues, Vol. 9, Issue 2, No 3, March 2012 ISSN (Online): 1694-0814 www.IJCSI.org 224
to the subjectivity of image annotation. The third reason is considered as being highly relevant to each other [13]. By due to the difficulty to find out the relationship between low- considering a near-duplicate (visually similar) images as a level features and high-level features. transaction and its associated keywords as the items in the transaction, it is very natural to discover correlations between The second approach takes a different stand and treats images image low level feature and keywords by applying and texts as equivalent data. It attempts to discover the association rules mining model [14]. correlation between visual features and textual words on an unsupervised basis, by estimating the joint distribution of 3.2. System Components features and words and posing annotation as statistical inference in a graphical model. For example image retrieval The proposed search engine is composed mainly on two system based on decision trees and rule induction was phase, preprocessing phase and semantic search phase. The presented in [7] to annotate web image using combination of preprocessing phase is responsible for data and image image feature and metadata, while in [8], a system that collection and semantic annotation, while semantic search automatically integrate the keyword and visual features for phase is responsible for image retrieval. The next sections web image retrieval by using association rule mining explain in details each phase and its components. technology. These approaches usually learn the keywords 3.2.1 Preprocessing Phase correlations according to the appearance of keywords in the web page, and the correlation may not reflect the real The preprocessing phase has the following main modules: correlation for annotating Web images or semantic meaning These modules are: (1) Web Crawler module, (2) Parser of keywords such as synonym [9]. Ontology-based image module, (3) Image Processing module, (4) NLP module, (5) retrieval is an effective approach to bridge the semantic gap Voting Module. Figure 1 shows these modules. Each module because it is more focused on capturing semantic content from these modules will be composed to a set of functions in which has the potential to satisfy user requirements better terms of system functionality. The following section of the [10,11]. While semantically rich ontology addresses the need research contains the description of each module and its for complete descriptions of image retrieval and improves the functions in details. precision of retrieval. However, the lack of text information which affects the performance of keyword approach is still a (1) Web Crawler Module problem in text ontology approach. Ontology works better with the combination of image features [12].this paper There are lots of images available on the Web pages. In order presents a new framework for web image retrieval search to collect these images, a crawler (or a spider, which is a engine which relay not only on ontology to discover the program that can automatically analyze the web pages and semantic relationship between different keywords inside the download the related pages). Instead of creating a new web page but also propose a new voting annotation technique ontology from scratch, we extend WordNet, the well-known extract the shared semantically related keywords from word ontology, to word-image ontology, WordNet is one of different web pages to eliminate and solve the problem of the most widely used, commonly supported and best subjectivity of image annotation of traditional approaches and developed ontologies. In WordNet, different senses and enhance the performance of the retrieval results by taking the relations are defined for each word. We will use Wordnet to semantic of the correlated data into consideration. provide a comprehensive list of all classes likely to have any kind of visual consistency. We do this by extracting all non- abstract nouns from the database, 75,062 of them in total. , by 3. Architecture of the Proposed Web Image collecting images for all nouns, we have a dense coverage of Retrieval Search Engine Prototype all visual forms.
3.1. General Overview (2) HTML Parsing Module
Due to there are many unrelated keywords are associated to In our system, we use HTML Parser to transform html the web image, in order to enhance the retrieval process of documents into DOM tree. the DOM Tree is based webpage web image, it should be decrease or remove these keywords. segmentation algorithm that automatically segments web The proposed approach trying to solve this shortcoming of pages into sections, with each section consisting of a web most of current systems by proposing a voting technique image and its contextual information (i.e. image segment), which based on frequent itemset mining to get the relation and then extract the text and images by traversing through the between low level feature of image content and high level DOM tree. . First, we generate the DOM tree for each web feature. page containing the web images. From the bottom, visual The more frequent the keyword occurs is the key of objects like image, text paragraph are identifier as basic measuring keyword correlation. If two or more keywords elements. The tags such as
, |