Efficient data management tools for the heterogeneous big data warehouse
| Parent link: | Physics of Particles and Nuclei Letters: Scientific Journal Vol. 13, iss. 5.— 2016.— [P. 689–692] |
|---|---|
| Autor corporatiu: | |
| Altres autors: | , , , , , |
| Sumari: | Title screen The traditional RDBMS has been consistent for the normalized data structures. RDBMS served well for decades, but the technology is not optimal for data processing and analysis in data intensive fields like social networks, oil-gas industry, experiments at the Large Hadron Collider, etc. Several challenges have been raised recently on the scalability of data warehouse like workload against the transactional schema, in particular for the analysis of archived data or the aggregation of data for summary and accounting purposes. The paper evaluates new database technologies like HBase, Cassandra, and MongoDB commonly referred as NoSQL databases for handling messy, varied and large amount of data. The evaluation depends upon the performance, throughput and scalability of the above technologies for several scientific and industrial use-cases. This paper outlines the technologies and architectures needed for processing Big Data, as well as the description of the back-end application that implements data migration from RDBMS to NoSQL data warehouse, NoSQL database organization and how it could be useful for further data analytics. Режим доступа: по договору с организацией-держателем ресурса |
| Idioma: | anglès |
| Publicat: |
2016
|
| Matèries: | |
| Accés en línia: | http://dx.doi.org/10.1134/S1547477116050022 |
| Format: | Electrònic Capítol de llibre |
| KOHA link: | https://koha.lib.tpu.ru/cgi-bin/koha/opac-detail.pl?biblionumber=653282 |
MARC
| LEADER | 00000naa0a2200000 4500 | ||
|---|---|---|---|
| 001 | 653282 | ||
| 005 | 20250227111551.0 | ||
| 035 | |a (RuTPU)RU\TPU\network\18697 | ||
| 035 | |a RU\TPU\network\18641 | ||
| 090 | |a 653282 | ||
| 100 | |a 20170221d2016 k||y0rusy50 ba | ||
| 101 | 0 | |a eng | |
| 102 | |a RU | ||
| 135 | |a drcn ---uucaa | ||
| 181 | 0 | |a i | |
| 182 | 0 | |a b | |
| 200 | 1 | |a Efficient data management tools for the heterogeneous big data warehouse |f A. A. Alekseev [et al.] | |
| 203 | |a Text |c electronic | ||
| 300 | |a Title screen | ||
| 320 | |a [References: p. 692 (4 tit.)] | ||
| 330 | |a The traditional RDBMS has been consistent for the normalized data structures. RDBMS served well for decades, but the technology is not optimal for data processing and analysis in data intensive fields like social networks, oil-gas industry, experiments at the Large Hadron Collider, etc. Several challenges have been raised recently on the scalability of data warehouse like workload against the transactional schema, in particular for the analysis of archived data or the aggregation of data for summary and accounting purposes. The paper evaluates new database technologies like HBase, Cassandra, and MongoDB commonly referred as NoSQL databases for handling messy, varied and large amount of data. The evaluation depends upon the performance, throughput and scalability of the above technologies for several scientific and industrial use-cases. This paper outlines the technologies and architectures needed for processing Big Data, as well as the description of the back-end application that implements data migration from RDBMS to NoSQL data warehouse, NoSQL database organization and how it could be useful for further data analytics. | ||
| 333 | |a Режим доступа: по договору с организацией-держателем ресурса | ||
| 461 | |t Physics of Particles and Nuclei Letters |o Scientific Journal | ||
| 463 | |t Vol. 13, iss. 5 |v [P. 689–692] |d 2016 | ||
| 610 | 1 | |a электронный ресурс | |
| 610 | 1 | |a труды учёных ТПУ | |
| 610 | 1 | |a СУБД | |
| 610 | 1 | |a система управления базами данных | |
| 610 | 1 | |a Relational Database Management System | |
| 610 | 1 | |a RDBMS | |
| 610 | 1 | |a NoSQL | |
| 610 | 1 | |a SQL | |
| 610 | 1 | |a Big Data | |
| 610 | 1 | |a Heterogeneous Data Warehouse | |
| 610 | 1 | |a хранилища | |
| 610 | 1 | |a данные | |
| 610 | 1 | |a Apache Hadoop | |
| 610 | 1 | |a Hive | |
| 610 | 1 | |a MongoDB | |
| 610 | 1 | |a Data Manipulation Language | |
| 610 | 1 | |a DML | |
| 701 | 1 | |a Alekseev |b A. A. |c specialist in the field of automatic control |c Engineer of Tomsk Polytechnic University, Postgraduate |f 1988- |g Aleksandr Aleksandrovich |3 (RuTPU)RU\TPU\pers\37959 | |
| 701 | 1 | |a Osipova |b V. V. |c specialist in the field of informatics and computer technology |c programmer, associate Professor of Tomsk Polytechnic University, candidate of technical Sciences |f 1984- |g Viktoriya Viktorovna |3 (RuTPU)RU\TPU\pers\35513 |9 18695 | |
| 701 | 1 | |a Ivanov |b M. A. |c specialist in the field of informatics and computer technology |c Associate Professor of Tomsk Polytechnic University, Candidate of technical sciences |f 1980- |g Maksim Anatoljevich |3 (RuTPU)RU\TPU\pers\37960 | |
| 701 | 1 | |a Klimentov |b A. A. |g Aleksey Anatoljevich | |
| 701 | 1 | |a Grigorjeva |b N. V. |g Nina Valerjevna | |
| 701 | 1 | |a Nalamvar |b H. S. |c specialist in the field of automatic control |c assistant of Tomsk Polytechnic University |f 1987- |g Hitesh Sanzhay |3 (RuTPU)RU\TPU\pers\36316 | |
| 712 | 0 | 2 | |a Национальный исследовательский Томский политехнический университет (ТПУ) |b Институт кибернетики (ИК) |b Кафедра оптимизации систем управления (ОСУ) |b Научно-учебная лаборатория "Виртуальный промысел" (НУЛ ВП) |3 (RuTPU)RU\TPU\col\20411 |
| 801 | 2 | |a RU |b 63413507 |c 20170227 |g RCR | |
| 856 | 4 | |u http://dx.doi.org/10.1134/S1547477116050022 | |
| 942 | |c CF | ||