Journal Home \| Aim & Scope \| Author(s) Information \| Editorial Board \| MSP Download Statistics

Research Journal of Applied Sciences, Engineering and Technology

Abstract


2014(Vol.8, Issue:23)

Article Information: An Efficient Technique to Implement Similarity Measures in Text Document Clustering using Artificial Neural Networks Algorithm K. Selvi and R.M. Suresh Corresponding Author: K. Selvi Submitted: ‎September ‎18, ‎2014 Accepted: October 17, ‎2014 Published: December 20, 2014
Abstract:
Pattern recognition, envisaging supervised and unsupervised method, optimization, associative memory and control process are some of the diversified troubles that can be resolved by artificial neural networks. Problem identified: Of late, discovering the required information in massive quantity of data is the challenging tasks. The model of similarity evaluation is the central element in accomplishing a perceptive of variables and perception that encourage behavior and mediate concern. This study proposes Artificial Neural Networks algorithms to resolve similarity measures. In order to apply singular value decomposition the frequency of word pair is established in the given document. (1) Tokenization: The splitting up of a stream of text into words, phrases, signs, or other significant parts is called tokenization. (2) Stop words: Preceding or succeeding to processing natural language data, the words that are segregated is called stop words. (3) Porter stemming: The main utilization of this algorithm is as part of a phrase normalization development that is characteristically completed while setting up in rank recovery technique. (4) WordNet: The compilation of lexical data base for the English language is called as WordNet Based on Artificial Neural Networks, the core part of this study work extends n-gram proposed algorithm. All the phonemes, syllables, letters, words or base pair corresponds in accordance to the application. Future work extends the application of this same similarity measures in various other neural network algorithms to accomplish improved results. Key words: Artificial Neural Networks, natural language processing, porter stemming, similarity measure, wordnet , ,
Abstract	PDF	HTML

Cite this Reference: K. Selvi and R.M. Suresh, . An Efficient Technique to Implement Similarity Measures in Text Document Clustering using Artificial Neural Networks Algorithm. Research Journal of Applied Sciences, Engineering and Technology, (23): 2320-2328.

ISSN (Online): 2040-7467
ISSN (Print): 2040-7459

Information

Sales & Services

Home | Contact us | About us | Privacy Policy
Copyright © 2024. MAXWELL Scientific Publication Corp., All rights reserved