Мультиклассовая классификация корпуса смешанных текстов алгоритмами машинного обучения
| Parent link: | Курзина, И. А. (химик ; 1972-). Перспективы развития фундаментальных наук=Prospects of Fundamental Sciences Development: сборник научных трудов XX Международной конференции студентов, аспирантов и молодых ученых, г. Томск, 25-28 апреля 2023 г..— .— Томск: Изд-во ТПУ, 2023 Т. 3 : Математика.— 2023.— С. 53-55 |
|---|---|
| Main Author: | |
| Corporate Author: | |
| Other Authors: | |
| Summary: | Заглавие с экрана This paper compares different machine learning algorithms for multiclass classification of mixed texts corpus. The F1-score was used as a quality metric for the algorithm comparison. The algorithms with acceptable quality for the corpus of texts were selected in the process. The dataset includes 7863 rows and 4 features, the gradient boosting showed the best result based on metric F1=0.771. Текстовый файл |
| Language: | Russian |
| Published: |
2023
|
| Subjects: | |
| Online Access: | http://earchive.tpu.ru/handle/11683/80895 |
| Format: | Electronic Book Chapter |
| KOHA link: | https://koha.lib.tpu.ru/cgi-bin/koha/opac-detail.pl?biblionumber=674348 |
| Summary: | Заглавие с экрана This paper compares different machine learning algorithms for multiclass classification of mixed texts corpus. The F1-score was used as a quality metric for the algorithm comparison. The algorithms with acceptable quality for the corpus of texts were selected in the process. The dataset includes 7863 rows and 4 features, the gradient boosting showed the best result based on metric F1=0.771. Текстовый файл |
|---|