Efficient Text Mining Using Side Information of Documents
- Rosemary Tripura
- P.Selvaraj
data mining, text mining, Stop word, word stemming, NLP.
Due to the increasing availability of digital data, text document continue to grow as well hence the need of text mining. These digital documents comprise of the normal body text as well as side information. The side information will be in different formats for example hyperlinks and may contain useful information for mining. It is of utmost importance that the value of the side information be ascertained before consideration in the data selected for the text mining process as it may give an adverse impact on the quality of text mined. A principled way to perform the mining process is therefore required so as to maximize on the benefits of side information. In this paper, we use the Naive Bayes model to create an effective text mining approach.
Rosemary Tripura, P.Selvaraj. "Efficient Text Mining Using Side Information of Documents".INTERNATIONAL JOURNAL OF ENGINEERING DEVELOPMENT AND RESEARCH ISSN:2321-9939, Vol.3, Issue 1, pp.409-414, URL :https://rjwave.org/ijedr/papers/IJEDR1501075.pdf
Volume 3 Issue 1
Pages. 409-414