Feature extraction and duplicate detection for text mining: A survey


Downloads per month over past year

Ramya, R.S. and Venugopal, K.R. and Iyengar, S.S. and Patnaik, L. (2016) Feature extraction and duplicate detection for text mining: A survey. Global Journal of Computer Science and Technology: C Software & Data Engineering, 16 (05). ISSN 0975-4172


Download (801kB) | Preview


Text mining, also known as Intelligent Text Analysis is an important research area. It is very difficult to focus on the most appropriate information due to the high dimensionality of data. Feature Extraction is one of the important techniques in data reduction to discover the most important features. Proce- ssing massive amount of data stored in a unstructured form is a challenging task. Several pre-processing methods and algo- rithms are needed to extract useful features from huge amount of data. The survey covers different text summarization, classi- fication, clustering methods to discover useful features and also discovering query facets which are multiple groups of words or phrases that explain and summarize the content covered by a query thereby reducing time taken by the user.

Item Type: Article
Uncontrolled Keywords: text feature extraction, text mining, query search, text classification
Subjects: Faculty of Engineering > Computer Science & Information Science Engineering
Divisions: University Visvesvarayya College of Engineering > Department of Computer Science and Information Science Engineering
Depositing User: Ms Laxmi Kamble
Date Deposited: 15 Sep 2021 07:22
Last Modified: 15 Sep 2021 07:22
URI: http://eprints-bangaloreuniversity.in/id/eprint/9517

Actions (login required)

View Item View Item