Paper Title

Efficient Text Mining Using Side Information of Documents

Authors

  • Rosemary Tripura
  • P.Selvaraj

Keywords

Data mining, text mining, stop words, word stemming, NLP

Abstract

As the volume of digital data grows, the number of text documents grows with it, and so does the need for text mining. These digital documents comprise the normal body text as well as side information. Side information comes in different formats, for example hyperlinks, and may contain information that is useful for mining. It is important to assess the value of the side information before including it in the data selected for the text mining process, because it may adversely affect the quality of the mined text. A principled way to perform the mining process is therefore required in order to maximize the benefits of side information. In this paper, we use the Naive Bayes model to create an effective text mining approach.
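
The abstract does not say how side information enters the Naive Bayes model, so the following is only a minimal sketch under stated assumptions, not the authors' method. It assumes side information (here, the host of a hyperlink) can be added to the body words as extra tokens with a "side:" prefix, and it uses a small multinomial Naive Bayes classifier with add-one smoothing. The corpus, the labels, the example.org hosts, and the names featurize and NaiveBayes are all hypothetical.

```python
import math
from collections import Counter, defaultdict


def featurize(body, side_info):
    """Turn body text and side information into one token list.

    Side tokens get a "side:" prefix (an assumption of this sketch)
    so they never collide with ordinary body words.
    """
    tokens = body.lower().split()
    tokens += ["side:" + s.lower() for s in side_info]
    return tokens


class NaiveBayes:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, docs, labels):
        self.class_counts = Counter(labels)       # documents per class
        self.token_counts = defaultdict(Counter)  # token counts per class
        self.vocab = set()
        for tokens, label in zip(docs, labels):
            self.token_counts[label].update(tokens)
            self.vocab.update(tokens)
        return self

    def predict(self, tokens):
        total_docs = sum(self.class_counts.values())
        best_label, best_score = None, float("-inf")
        for label, n_docs in self.class_counts.items():
            # log prior plus smoothed log likelihood of each known token
            score = math.log(n_docs / total_docs)
            denom = sum(self.token_counts[label].values()) + len(self.vocab)
            for tok in tokens:
                if tok in self.vocab:  # ignore tokens never seen in training
                    count = self.token_counts[label][tok]
                    score += math.log((count + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical toy corpus: (body text, side information, label)
train = [
    ("team wins final match", ["sports.example.org"], "sports"),
    ("striker scores late goal", ["sports.example.org"], "sports"),
    ("parliament passes new budget", ["politics.example.org"], "politics"),
    ("minister wins confidence vote", ["politics.example.org"], "politics"),
]
docs = [featurize(body, side) for body, side, _ in train]
labels = [label for _, _, label in train]
model = NaiveBayes().fit(docs, labels)

query = featurize("late vote wins", ["politics.example.org"])
print(model.predict(query))  # prints "politics"
```

In this toy corpus the body words of the query are deliberately split between the two classes, so the side token is what tips the decision. The same mechanism means that a misleading side token would pull the prediction the wrong way, which is why the value of side information has to be assessed before it is used.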

Article Type

Published

How To Cite

Rosemary Tripura, P.Selvaraj. "Efficient Text Mining Using Side Information of Documents". International Journal of Engineering Development and Research, ISSN: 2321-9939, Vol. 3, Issue 1, pp. 409-414, URL: https://rjwave.org/ijedr/papers/IJEDR1501075.pdf

Issue

Volume 3 Issue 1 

Pages 409-414
