Обработка решений судов Томской области и города Томска с помощью технологий OLAP и Data Mining

Bibliographic Details
Parent link:Технологии Microsoft в теории и практике программирования: сборник трудов XIII Всероссийской научно-практической конференции студентов, аспирантов и молодых ученых, г.Томск, 22-23 марта 2016 г./ Национальный исследовательский Томский политехнический университет (ТПУ), Институт кибернетики ; ред. кол. А. В. Лиепиньш [и др.]. [С. 67-69].— , 2016
Main Author: Хлопонин И. А.
Corporate Author: Национальный исследовательский Томский политехнический университет (ТПУ) Институт кибернетики (ИК) Кафедра вычислительной техники (ВТ)
Other Authors: Паршина Д. М. (727), Кудинов А. В. Антон Викторович
Summary:Заглавие с титульного экрана
The article is intended to analyze various data obtained from websites of regional and district Tomsk courts via advanced analytic technologies such as OLAP and Data Mining. The process of comparing structure open documents and their parsing using Python and NoSQL databases are considered in details.Near-duplicates and shingling, as well as regular expressions stand for analyzing and comparing texts, sentences and words. Due to these algorithms, the issue relating to extraction of necessary units can be sorted out effectively and quite accurately.
Published: 2016
Series:Геоинформационные системы и технологии
Subjects:
Online Access:http://earchive.tpu.ru/handle/11683/33215
Format: Electronic Book Chapter
KOHA link:https://koha.lib.tpu.ru/cgi-bin/koha/opac-detail.pl?biblionumber=620885
Description
Summary:Заглавие с титульного экрана
The article is intended to analyze various data obtained from websites of regional and district Tomsk courts via advanced analytic technologies such as OLAP and Data Mining. The process of comparing structure open documents and their parsing using Python and NoSQL databases are considered in details.Near-duplicates and shingling, as well as regular expressions stand for analyzing and comparing texts, sentences and words. Due to these algorithms, the issue relating to extraction of necessary units can be sorted out effectively and quite accurately.