Thi, T.B. and Hoang, H.L.N. and Thi, H.N. and Viet, A.P. (2023) A New Method for Vietnamese Text Correction using Sequence Tagging Models. In: UNSPECIFIED.
Full text not available from this repository. (Upload)Abstract
In this paper, we present a new approach for Vietnamese text error correction. A corrector consists of a Transformer to encode the input sequence, and sequence tag-gers to perform both error detection and correction. For the taggers, we apply special tokens to process insertions, deletions, and substitutions. The correction is performed in many steps repeatedly until the stopping criteria are met. At each step, we just correct the source sentence with minimal spans of tokens. These solutions make two advantages including 1) detecting and correcting various error types of Vietnamese texts, and 2) not generating uncontrollable outputs as generative models. As a result, our approach has yielded remarkable performance. On realistic dataset, our proposal model archives 79.5 errors detected and 62.7 errors corrected; the highest SacreBLEU score is 86.10, that is a promising result. © 2023 IEEE.
Item Type: | Conference or Workshop Item (UNSPECIFIED) |
---|---|
Divisions: | Offices > Office of International Cooperation |
Identification Number: | 10.1109/KSE59128.2023.10299446 |
Uncontrolled Keywords: | Error detection, Automating text correction; Errors correction; Grammar checking; Grammatical error correction; Grammatical errors; Input sequence; New approaches; Tagging models; Text correction; Vietnamese, Error correction |
Additional Information: | cited By 0; Conference of 15th International Conference on Knowledge and Systems Engineering, KSE 2023 ; Conference Date: 18 October 2023 Through 20 October 2023; Conference Code:194303 |
URI: | http://eprints.lqdtu.edu.vn/id/eprint/11039 |