Vu, L. and Tra, D.V. and Nguyen, Q.U. (2016) Learning from imbalanced data for encrypted traffic identification problem. In: 7th Symposium on Information and Communication Technology, SoICT 2016, 8 December 2016 through 9 December 2016.
Learning from imbalanced data for encrypted traffic identification problem.pdf
Download (259kB) | Preview
Abstract
Identifying encrypted application trafic is an important issue for many network tasks including quality of service, fire-wall enforcement and security. One of the challenging problems of classifying encrypted application traffic is the imbalanced property of network data. Usually, the amount of unencrypted traffic is much higher than the amount of encrypted traffic. To date, the machine learning based approach for identifying encrypted traffic often solely focused on examining and improving algorithms. The techniques for addressing imbalanced data are rarely investigated. In this paper, we present a thorough analysis of the impact of various techniques for handling imbalanced data when machine learning approaches are applied to identifying encrypted traffic. The experiments are conducted on a wellknown network traffic dataset and the results showed that some techniques for addressing imbalanced data help machine learning algorithms to achieve better performance. © 2016 ACM.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Divisions: | Faculties > Faculty of Information Technology |
Identification Number: | 10.1145/3011077.3011132 |
Uncontrolled Keywords: | Artificial intelligence; Cryptography; Data handling; Learning systems; Network security; Quality of service; Encrypted traffic; Fire wall; Imbalanced data; Machine learning approaches; Network data; Network traffic; Learning algorithms |
Additional Information: | Conference code: 125331. Language of original document: English. |
URI: | http://eprints.lqdtu.edu.vn/id/eprint/9771 |