Multi-Class Document Classification Based on Deep Neural Network and Word2Vec

Authors

  • İlkay YELMEN Istanbul Aydın University

Keywords:

Document Classification, Multiclass Classification, Data Preprocessing, Word Embedding Methods, Machine Learning, Deep Learning

Abstract

With the increase in unstructured data, the importance of classification of text-based documents has increased. In particular, the classification of news texts and digital documentation provides easy access to the information sought. In this study, a large amount of news textual data was used. After the data set was preprocessed, Bag of Words (BoW), TF-IDF, Word2Vec and Doc2Vec word embedding methods were applied. In the classification phase, Random Forest (RF), Multilayer Perceptron (MLP), Support Vector Machine (SVM) and Deep Neural Network (DNN) algorithms were applied. As a result of the experimental studies, using the Word2Vec method together with the DNN algorithm performed the best result.

Downloads

Download data is not yet available.

Downloads

Published

03-02-2022

How to Cite

[1]
İlkay YELMEN, “Multi-Class Document Classification Based on Deep Neural Network and Word2Vec”, JAST, vol. 15, no. 1, pp. 59–65, Feb. 2022.

Issue

Section

Articles