Measuring the degree of similarity between objects in text retrieval systems

Measuring the degree of similarity between objects in text retrieval systems, D. Ellis, J. Furner-Hines, P. Willett

Citations: 18
Published in 1993.
    • ...1987; Oberski, 1988; Wang, Wong, & Yao, 1992; Ellis, Furner-Hines, & Willett, 1993; Gmür,...
    • ...other (Hubálek, 1982; Ellis, Furner-Hines, & Willett, 1993)...
    • ...perspective, Ellis, Furner-Hines, and Willett (1993)...
    • ...product and the difference sum (Ellis, Furner-Hines, & Willett, 1993)...

    Jesper W. Schneider et al. Matrix comparison, Part 1: Motivation and important issues for measuri...

    • ...The computations were repeated with two common similarity measures, Cosine and Dice (Ellis, Furner-Hines, & Willett, 1993), and three different weighting schemes for document terms: relative frequency (term frequency over the number of tokens in the document), tf-idf (in the inquery form) (Callan, Croft, & Harding, 1992), and...

    Gheorghe Muresan et al. Topic modeling for mediated access to very large document collections

    • ...1979; Jones & Furnas, 1987; Ellis et al., 1993; Rorvig, 1999), and the choice of a specific measure may...
    • ...should settle down on those measures most appropriate for its needs. For the field of IR (Ellis et al., 1993)...
    • ...Equation 1 demonstrates this for the cosine coefficient, which is commonly used to measure interdocument relationships (Ellis et al., 1993)...
    • ...used - (Van Rijsbergen, 1979; Willett, 1983; Ellis et al., 1993), and secondly because such schemes weight...
    • ...found - which is in agreement with previous suggestions and findings (Van Rijsbergen, 1979; Willett, 1983; Ellis et al., 1993)...

    Anastasios Tombros et al. Query-Sensitive Similarity Measures for Information Retrieval

    • ...Experiments with the normalised Euclidean distance, and the Dice coefficient, did not produce significantly different results - again in agreement with previous suggestions and findings (Ellis, Furner-Hines, & Willett, 1993; Norreault et al., 1981; Van Rijsbergen, 1979; Willett, 1983)...

    Anastasios Tombros et al. The effectiveness of query-specific hierarchic clustering in informati...

    • ...Conventional measures of interdocument relationships (Ellis et al., 1993), such as the cosine coefficient for example, can not detect such a similarity, since they do not take into account the specific context (i.e...
    • ...Van Rijsbergen, 1979; Jones & Furnas, 1987; Ellis et al., 1993), and the choice of a specific measure may influence the outcome of the calculations...
    • ...For the field of IR (Ellis et al., 1993) have concluded that “the historical attachment to the association coefficients provided by the Dice and cosine formulae is in no need of revision”...
    • ...interdocument relationships (Ellis et al., 1993)...
    • ...The use of term weighting schemes for document vectors does not address this issue, firstly because such schemes are not always applied when calculating inter-object similarities - binary representations are often used - (Van Rijsbergen, 1979; Willett, 1983; Ellis et al., 1993), and secondly because such schemes weight terms according to their indexing importance within a document collection (Van Rijsbergen, 1979), and not according to ...
    • ...After initial experimentation with different vector weighting schemes for the cosine coefficient (binary weights, term frequency weights) no significant differences were found - which is in agreement with previous suggestions and findings (Van Rijsbergen, 1979; Willett, 1983; Ellis et al., 1993)...

    Anastasios Tombros et al. Query-sensitive similarity measures for the calculation of interdocume...
