LE QUY DON
Technical University
VietnameseClear Cookie - decide language by browser settings

Building Vietnamese topic modeling based on core terms and applying in text classification

Dao, T.T. and Thanh, T.D. and Hai, T.N. and Ngoc, V.H. (2015) Building Vietnamese topic modeling based on core terms and applying in text classification. In: 5th International Conference on Communication Systems and Network Technologies, CSNT 2015, 4 April 2015 through 6 April 2015.

Text
Building Vietnamese topic modeling based on core terms and applying in text classification.pdf

Download (224kB) | Preview

Abstract

In the languages, the occur of words are indicated about meaning of contents in text. Generative models for text, such as the topic model, have the potential to make important contributions to the statistical analysis of large document collections, and the development of a deeper understanding of human language learning and processing. In this paper, we proposed a novel method for building Vietnamese topic model based on core terms and conditional probability. With this approach, we reduced cost of time for building corpus. After that, we perform with Vietnamese text classification and the experimental show that, this corpus will help text classification system really effectively than traditional methods, higher accuracy and reduced complex data processing. © 2015 IEEE.

Item Type: Conference or Workshop Item (Paper)
Divisions: Faculties > Faculty of Information Technology
Identification Number: 10.1109/CSNT.2015.22
Uncontrolled Keywords: Classification (of information); Complex networks; Computational linguistics; Data handling; Data mining; Word processing; Conditional probabilities; Document collection; Generative model; Human language; Text classification; Text mining; Topic Modeling; Vietnamese; Text processing
Additional Information: Conference code: 115903. Language of original document: English.
URI: http://eprints.lqdtu.edu.vn/id/eprint/9948

Actions (login required)

View Item
View Item