Academic
Publications
A Tool for Computing the Visual Similarity of Web Pages

A Tool for Computing the Visual Similarity of Web Pages,10.1109/SAINT.2010.17,Mar ´ ia Alpuente,Daniel Romero

A Tool for Computing the Visual Similarity of Web Pages  
BibTex | RIS | RefWorks Download
Recently, we proposed a functional technique for identifying similar Web pages that is based on measuring tree similarity. The key idea behind the method is to transform each Web page into a compressed, normalized tree that effectively represents its visual structure. In this work, we develop an optimization of this technique that is based on memoization and that achieves significant improvements in efficiency in both time and space. This work also presents a tool that implements the proposed technique as well as two case studies for two real scenarios. Experiments on real documents show that the optimized algorithm performs significantly better than the original technique and demonstrate the practicality of our approach.
Cumulative Annual
View Publication
The following links allow you to view full publications. These links are maintained by other sources not affiliated with Microsoft Academic Search.