Мультиклассовая классификация корпуса смешанных текстов алгоритмами машинного обучения

Bibliographic Details
Parent link:Курзина, И. А. (химик ; 1972-). Перспективы развития фундаментальных наук=Prospects of Fundamental Sciences Development: сборник научных трудов XX Международной конференции студентов, аспирантов и молодых ученых, г. Томск, 25-28 апреля 2023 г..— .— Томск: Изд-во ТПУ, 2023
Т. 3 : Математика.— 2023.— С. 53-55
Main Author: Гузеев Е. В.
Corporate Author: Национальный исследовательский Томский политехнический университет
Other Authors: Семёнов М. Е. (727)
Summary:Заглавие с экрана
This paper compares different machine learning algorithms for multiclass classification of mixed texts corpus. The F1-score was used as a quality metric for the algorithm comparison. The algorithms with acceptable quality for the corpus of texts were selected in the process. The dataset includes 7863 rows and 4 features, the gradient boosting showed the best result based on metric F1=0.771.
Текстовый файл
Language:Russian
Published: 2023
Subjects:
Online Access:http://earchive.tpu.ru/handle/11683/80895
Format: Electronic Book Chapter
KOHA link:https://koha.lib.tpu.ru/cgi-bin/koha/opac-detail.pl?biblionumber=674348
Description
Summary:Заглавие с экрана
This paper compares different machine learning algorithms for multiclass classification of mixed texts corpus. The F1-score was used as a quality metric for the algorithm comparison. The algorithms with acceptable quality for the corpus of texts were selected in the process. The dataset includes 7863 rows and 4 features, the gradient boosting showed the best result based on metric F1=0.771.
Текстовый файл