keyboard_arrow_up
Computing Semantic Similarity Measure Between Words Using Web Search Engine

Authors

Pushpa C N, Girish S, Nitin S K, Thriveni J, Venugopal K R and L M Patnaik, University Visvesvaraya College of Engineering, India

Abstract

Semantic Similarity measures between words plays an important role in information retrieval, natural language processing and in various tasks on the web. In this paper, we have proposed a Modified Pattern Extraction Algorithm to compute the supervised semantic similarity measure between the words by combining both page count method and web snippets method. Four association measures are used to find semantic similarity between words in page count method using web search engines. We use a Sequential Minimal Optimization (SMO) support vector machines (SVM) to find the optimal combination of page counts-based similarity scores and top-ranking patterns from the web snippets method. The SVM is trained to classify synonymous word-pairs and non-synonymous word-pairs. The proposed Modified Pattern Extraction Algorithm outperforms by 89.8 percent of correlation value.

Keywords

Information Retrieval, Semantic Similarity, Support Vector Machine, Web Mining, Web Search Engine, Web Snippets.

Full Text  Volume 3, Number 5