LE QUY DON
Technical University
VietnameseClear Cookie - decide language by browser settings

A New Method for Vietnamese Text Correction using Sequence Tagging Models

Thi, T.B. and Hoang, H.L.N. and Thi, H.N. and Viet, A.P. (2023) A New Method for Vietnamese Text Correction using Sequence Tagging Models. In: UNSPECIFIED.

Full text not available from this repository. (Upload)

Abstract

In this paper, we present a new approach for Vietnamese text error correction. A corrector consists of a Transformer to encode the input sequence, and sequence tag-gers to perform both error detection and correction. For the taggers, we apply special tokens to process insertions, deletions, and substitutions. The correction is performed in many steps repeatedly until the stopping criteria are met. At each step, we just correct the source sentence with minimal spans of tokens. These solutions make two advantages including 1) detecting and correcting various error types of Vietnamese texts, and 2) not generating uncontrollable outputs as generative models. As a result, our approach has yielded remarkable performance. On realistic dataset, our proposal model archives 79.5 errors detected and 62.7 errors corrected; the highest SacreBLEU score is 86.10, that is a promising result. © 2023 IEEE.

Item Type: Conference or Workshop Item (UNSPECIFIED)
Divisions: Offices > Office of International Cooperation
Identification Number: 10.1109/KSE59128.2023.10299446
Uncontrolled Keywords: Error detection, Automating text correction; Errors correction; Grammar checking; Grammatical error correction; Grammatical errors; Input sequence; New approaches; Tagging models; Text correction; Vietnamese, Error correction
Additional Information: cited By 0; Conference of 15th International Conference on Knowledge and Systems Engineering, KSE 2023 ; Conference Date: 18 October 2023 Through 20 October 2023; Conference Code:194303
URI: http://eprints.lqdtu.edu.vn/id/eprint/11039

Actions (login required)

View Item
View Item