%0 Journal Article
%A Daniel I. Morariu and Lucian N. Vintan and Volker Tresp
%D 2008
%J International Journal of Computer and Information Engineering
%B World Academy of Science, Engineering and Technology
%I Open Science Index 21, 2008
%T Evolutionary Feature Selection for Text Documents using the SVM
%U https://publications.waset.org/pdf/15879
%V 21
%X Text categorization is the problem of classifying text documents into a set of predefined classes. After a preprocessing step, the documents are typically represented as large sparse vectors. When training classifiers on large collections of documents, both the time and memory requirements can be prohibitive. This justifies the application of feature selection methods to reduce the dimensionality of the document-representation vector. In this paper, we present three feature selection methods: Information Gain, Support Vector Machine feature selection (called SVM_FS), and Genetic Algorithm with SVM (called GA_SVM). We show that the best results were obtained with the GA_SVM method for a relatively small dimension of the feature vector.
%P 3172 - 3178