Efficient data management tools for the heterogeneous big data warehouse

Dades bibliogràfiques
Parent link:Physics of Particles and Nuclei Letters: Scientific Journal
Vol. 13, iss. 5.— 2016.— [P. 689–692]
Autor corporatiu: Национальный исследовательский Томский политехнический университет (ТПУ) Институт кибернетики (ИК) Кафедра оптимизации систем управления (ОСУ) Научно-учебная лаборатория "Виртуальный промысел" (НУЛ ВП)
Altres autors: Alekseev A. A. Aleksandr Aleksandrovich, Osipova V. V. Viktoriya Viktorovna, Ivanov M. A. Maksim Anatoljevich, Klimentov A. A. Aleksey Anatoljevich, Grigorjeva N. V. Nina Valerjevna, Nalamvar H. S. Hitesh Sanzhay
Sumari:Title screen
The traditional RDBMS has been consistent for the normalized data structures. RDBMS served well for decades, but the technology is not optimal for data processing and analysis in data intensive fields like social networks, oil-gas industry, experiments at the Large Hadron Collider, etc. Several challenges have been raised recently on the scalability of data warehouse like workload against the transactional schema, in particular for the analysis of archived data or the aggregation of data for summary and accounting purposes. The paper evaluates new database technologies like HBase, Cassandra, and MongoDB commonly referred as NoSQL databases for handling messy, varied and large amount of data. The evaluation depends upon the performance, throughput and scalability of the above technologies for several scientific and industrial use-cases. This paper outlines the technologies and architectures needed for processing Big Data, as well as the description of the back-end application that implements data migration from RDBMS to NoSQL data warehouse, NoSQL database organization and how it could be useful for further data analytics.
Режим доступа: по договору с организацией-держателем ресурса
Idioma:anglès
Publicat: 2016
Matèries:
Accés en línia:http://dx.doi.org/10.1134/S1547477116050022
Format: Electrònic Capítol de llibre
KOHA link:https://koha.lib.tpu.ru/cgi-bin/koha/opac-detail.pl?biblionumber=653282

MARC

LEADER 00000naa0a2200000 4500
001 653282
005 20250227111551.0
035 |a (RuTPU)RU\TPU\network\18697 
035 |a RU\TPU\network\18641 
090 |a 653282 
100 |a 20170221d2016 k||y0rusy50 ba 
101 0 |a eng 
102 |a RU 
135 |a drcn ---uucaa 
181 0 |a i  
182 0 |a b 
200 1 |a Efficient data management tools for the heterogeneous big data warehouse  |f A. A. Alekseev [et al.] 
203 |a Text  |c electronic 
300 |a Title screen 
320 |a [References: p. 692 (4 tit.)] 
330 |a The traditional RDBMS has been consistent for the normalized data structures. RDBMS served well for decades, but the technology is not optimal for data processing and analysis in data intensive fields like social networks, oil-gas industry, experiments at the Large Hadron Collider, etc. Several challenges have been raised recently on the scalability of data warehouse like workload against the transactional schema, in particular for the analysis of archived data or the aggregation of data for summary and accounting purposes. The paper evaluates new database technologies like HBase, Cassandra, and MongoDB commonly referred as NoSQL databases for handling messy, varied and large amount of data. The evaluation depends upon the performance, throughput and scalability of the above technologies for several scientific and industrial use-cases. This paper outlines the technologies and architectures needed for processing Big Data, as well as the description of the back-end application that implements data migration from RDBMS to NoSQL data warehouse, NoSQL database organization and how it could be useful for further data analytics. 
333 |a Режим доступа: по договору с организацией-держателем ресурса 
461 |t Physics of Particles and Nuclei Letters  |o Scientific Journal 
463 |t Vol. 13, iss. 5  |v [P. 689–692]  |d 2016 
610 1 |a электронный ресурс 
610 1 |a труды учёных ТПУ 
610 1 |a СУБД 
610 1 |a система управления базами данных 
610 1 |a Relational Database Management System 
610 1 |a RDBMS 
610 1 |a NoSQL 
610 1 |a SQL 
610 1 |a Big Data 
610 1 |a Heterogeneous Data Warehouse 
610 1 |a хранилища 
610 1 |a данные 
610 1 |a Apache Hadoop 
610 1 |a Hive 
610 1 |a MongoDB 
610 1 |a Data Manipulation Language 
610 1 |a DML 
701 1 |a Alekseev  |b A. A.  |c specialist in the field of automatic control  |c Engineer of Tomsk Polytechnic University, Postgraduate  |f 1988-  |g Aleksandr Aleksandrovich  |3 (RuTPU)RU\TPU\pers\37959 
701 1 |a Osipova  |b V. V.  |c specialist in the field of informatics and computer technology  |c programmer, associate Professor of Tomsk Polytechnic University, candidate of technical Sciences  |f 1984-  |g Viktoriya Viktorovna  |3 (RuTPU)RU\TPU\pers\35513  |9 18695 
701 1 |a Ivanov  |b M. A.  |c ˆspecialist in the field of informatics and computer technology  |c Associate Professor of Tomsk Polytechnic University, Candidate of technical sciences  |f 1980-  |g Maksim Anatoljevich  |3 (RuTPU)RU\TPU\pers\37960 
701 1 |a Klimentov  |b A. A.  |g Aleksey Anatoljevich 
701 1 |a Grigorjeva  |b N. V.  |g Nina Valerjevna 
701 1 |a Nalamvar  |b H. S.  |c specialist in the field of automatic control  |c assistant of Tomsk Polytechnic University  |f 1987-  |g Hitesh Sanzhay  |3 (RuTPU)RU\TPU\pers\36316 
712 0 2 |a Национальный исследовательский Томский политехнический университет (ТПУ)  |b Институт кибернетики (ИК)  |b Кафедра оптимизации систем управления (ОСУ)  |b Научно-учебная лаборатория "Виртуальный промысел" (НУЛ ВП)  |3 (RuTPU)RU\TPU\col\20411 
801 2 |a RU  |b 63413507  |c 20170227  |g RCR 
856 4 |u http://dx.doi.org/10.1134/S1547477116050022 
942 |c CF