Academic
Publications
Improving pseudo-relevance feedback in web information retrieval using web page segmentation

Improving pseudo-relevance feedback in web information retrieval using web page segmentation,10.1145/775152.775155,Shipeng Yu,Deng Cai,Ji-Rong Wen,Wei

Improving pseudo-relevance feedback in web information retrieval using web page segmentation   (Citations: 131)
BibTex | RIS | RefWorks Download
In contrast to traditional document retrieval, a web page as a whole is not a good information unit to search because it often contains multiple topics and a lot of irrelevant information from navigation, decoration, and interaction part of the page. In this paper, we propose a VIsion-based Page Segmentation (VIPS) algorithm to detect the semantic content structure in a web page. Compared with simple DOM based segmentation method, our page segmentation scheme utilizes useful visual cues to obtain a better partition of a page at the semantic level. By using our VIPS algorithm to assist the selection of query expansion terms in pseudo-relevance feedback in web information retrieval, we achieve 27% performance improvement on Web Track dataset.
Conference: World Wide Web Conference Series - WWW , pp. 11-18, 2003
Cumulative Annual
View Publication
The following links allow you to view full publications. These links are maintained by other sources not affiliated with Microsoft Academic Search.
    • ...A commonly used query suggestion method [6, 21, 30, 32] is to find the additional keywords from the retrieved document of the initial query...
    • ...Recently, Yu et al. [32] selected the additional keywords from vision-based segments of Web pages in order to cope with the multiple topics residing therein...

    Lusong Liet al. MQSS: multimodal query suggestion and searching for video search

    • ...Some other approaches, e.g., [31, 4], make use of visual cues or eye tracker to improve passage feedback for web search: on the server side of a search engine, documents can be decomposed into topically difierent components via visual cues [31], while on the client side of users, gaze-based attention feedback [4] can go down to the sub-document level by exploiting evidence about which document parts the user looks at. However, such ...
    • ...Some other approaches, e.g., [31, 4], make use of visual cues or eye tracker to improve passage feedback for web search: on the server side of a search engine, documents can be decomposed into topically difierent components via visual cues [31], while on the client side of users, gaze-based attention feedback [4] can go down to the sub-document level by exploiting evidence about which document parts the user looks at. However, such ...

    Yuanhua Lvet al. Positional relevance model for pseudo-relevance feedback

    • ...Among all the query expansion methods, pseudo-relevance feedback (PRF) [9,10] is nearly the most attractive one because it does not require any user input...

    Yuanbin Wuet al. Pseudo-Relevance Feedback Based on mRMR Criteria

    • ...The work published by Yu et al. [6] is based on a visual page segmentation using the VIPS algorithm...

    Radek Burget. Visual Area Classification for Article Identification in Web Documents

    • ...page segmentation, that groups semantically related interface objects in a hierarchical structure and accordingly tag the semantic role of each object [Yu03]...
    • ...Many researchers have explored heuristic approaches [Ahm08, Bal06, Bur05, Buy01a, Buy01b, Che03, Che05, Gu02, Gup03, Mil02, Yan00, Yan01, Yu03] to discovering information organization underlying a Web page...

    Kang Zhanget al. Exploring semantic roles of Web interface components

Sort by: