Statistical and Semantic Similarity Between English Sentences - Anis Zaman - Books - LAP LAMBERT Academic Publishing - 9783659616389 - October 22, 2014
In case cover and title do not match, the title is correct

Statistical and Semantic Similarity Between English Sentences

Anis Zaman

Price
A$ 77.49

Ordered from remote warehouse

Expected delivery Feb 5 - 18
Add to your iMusic wish list

Statistical and Semantic Similarity Between English Sentences

This book presents various algorithms to compute semantic similarities between english texts. I explored three different algorithms for computing English sentence similarity. The first algorithm, which is well-explored in the literature [Salton and Buckley, 1988, Wu and Salton, 1981], weights words in each sentence according to term frequency and inverse document frequency (tf-idf ) and uses no semantic information. The second algorithm uses measures of the semantic distance between words belonging to the same part of speech. The third algorithm combines the tf-idf scores and the semantic distance scores between words. I evaluated the performance of the second and third algorithms on two data sets: O?Shea?s set of sentence pairs with human similarity judgements [Li et al., Aug, Rubenstein and Goodenough, 1965], and Microsoft Research?s sentence-level paraphrase dataset [Rus et al., 2012]. On O?Shea?s data set, the third algorithm more accurately matches human judgments than the second. On the Microsoft data set, there was not a significant difference between the two algorithms

Media Books     Paperback Book   (Book with soft cover and glued back)
Released October 22, 2014
ISBN13 9783659616389
Publishers LAP LAMBERT Academic Publishing
Pages 72
Dimensions 4 × 150 × 220 mm   ·   117 g
Language English