Unsupervised query segmentation using only query logs

Unsupervised query segmentation using only query logs,10.1145/1963192.1963239,Nikita Mishra,Rishiraj Saha Roy,Niloy Ganguly,Srivatsan Laxman,Monojit C

Unsupervised query segmentation using only query logs   (Citations: 1)
BibTex | RIS | RefWorks Download
We introduce an unsupervised query segmentation scheme that uses query logs as the only resource and can effectively capture the structural units in queries. We believe that Web search queries have a unique syntactic structure which is distinct from that of English or a bag-of-words model. The segments discovered by our scheme help understand this underlying grammatical structure. We apply a statistical model based on Hoeffding's Inequality to mine significant word n-grams from queries and subsequently use them for segmenting the queries. Evaluation against manually segmented queries shows that this technique can detect rare units that are missed by our Pointwise Mutual Information (PMI) baseline.
Conference: World Wide Web Conference Series - WWW , pp. 91-92, 2011
Cumulative Annual
View Publication
The following links allow you to view full publications. These links are maintained by other sources not affiliated with Microsoft Academic Search.
    • ...Finally, the very recent approach of Mishra et al. [14] compares with our method in terms of feature complexity...
    • ...Annotator P erformance Algorithm Measure MI [4] [19] [21] [7] [14] [10] Our...
    • ...We would like to thank Shane Bergsma and Qin Iris Wang (the authors of [4]); David J. Brenes, Daniel Gayo-Avello, and Rodrigo Garcia (the authors of [7]); Chao Zhang, Nan Sun, Xia Hu, Tingzhu Huang, and Tat-Seng Chua (the authors of [21]); and Nikita Mishra, Rishiraj Saha Roy, Niloy Ganguly, Srivatsan Laxman, and Monojit Choudhury (the authors of [14]) for providing us with their experimental data, especially for doing so on a very short ...

    Matthias Hagenet al. Query segmentation revisited

Sort by: