Thursday, 26 February 2015, 16:30 HKT/SGT

Source: Fujitsu Ltd
Fujitsu Develops Column-Oriented Data-Processing Engine Enabling Fast, High-Volume Data Analysis in Database Systems
Accelerates analysis processing more than fifty-fold on a single server with PostgreSQL open-source database

TOKYO, Feb 26, 2015 - (JCN Newswire) - Fujitsu Laboratories Ltd. today announced that it has developed a column-oriented data-storage and processing engine that enables fast analysis of large volumes of data in a database system.

In recent years, column-oriented databases have emerged as a faster way to read and analyze large volumes of data, complementing existing row-oriented databases, which are suited to handling data updates. Existing approaches, however, have had one of two problems: either changes to row-oriented data cannot be automatically reflected in the column-oriented data, or the size of the column-oriented data is constrained by installed memory.

Fujitsu has developed an engine that runs on the PostgreSQL open-source database, does not depend on memory capacity, instantly updates column-oriented data in response to changes in row-oriented data, and processes column-oriented data quickly. The engine accelerates analysis by way of indexes(1), a mechanism that most database systems provide, so developers can use it without special consideration of whether the storage method is row-oriented or column-oriented. With a parallel-processing engine especially suited to column-oriented data, analyses run on a single CPU core are 4 times faster than before, and one server equipped with 15 CPU cores can run analyses at least 50 times faster.

Even on smaller computer systems with little memory, this technology enables real-time data analysis reflecting the latest data.

Details of this technology are being presented at the Seventh Forum on Data Engineering and Information Management (DEIM 2015), opening March 2 in Koriyama, Fukushima.

Background

Database systems efficiently process requests and return the results to a terminal, a workload known as online transaction processing (OLTP). They are widely used for processing changes to data, such as storing and using data from business systems.

Issues

In recent years, demand has grown for high-volume data analysis that is fast and available on demand, creating a need for a single database system that can handle OLTP and high-volume data analysis simultaneously. Row-oriented data is best suited to OLTP. Column-oriented data is better for data analysis but is slow at processing changes to data. One relatively recent solution is to store both row-oriented and column-oriented data as a way to accelerate analyses. With previous technologies, however, changes to the row-oriented data are not automatically reflected in the column-oriented data, and the amount of column-oriented data that can be held is limited by memory.
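The trade-off between the two layouts can be shown with a minimal sketch. The table, its column names, and the helper functions below are hypothetical and chosen only for illustration; they are not part of Fujitsu's engine or of PostgreSQL.

```python
# Hypothetical three-record table; names and values are illustrative only.
rows = [
    {"order_id": 1, "customer": "A", "amount": 120},
    {"order_id": 2, "customer": "B", "amount": 80},
    {"order_id": 3, "customer": "A", "amount": 50},
]

# Column-oriented copy of the same data: one list per column.
columns = {name: [record[name] for record in rows] for name in rows[0]}


def total_amount_row_store(rows):
    # Analysis on row-oriented data visits every whole record,
    # although only one field of each record is needed.
    return sum(record["amount"] for record in rows)


def total_amount_column_store(columns):
    # Analysis on column-oriented data reads a single contiguous list.
    return sum(columns["amount"])


def update_row_store(rows, position, new_record):
    # An OLTP-style change replaces one record in one place.
    rows[position] = dict(new_record)


def update_column_store(columns, position, new_record):
    # The same change has to touch every column list.
    for name, value in new_record.items():
        columns[name][position] = value
```

Aggregation favors the column layout because it reads one list, while a single-record change favors the row layout because it writes in one place. Keeping both copies, as the text describes, only helps if the two stay in step.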

About the Technology

Fujitsu has developed an engine for PostgreSQL open-source databases that instantly reflects updates to row-oriented data in the column-oriented data, stores column-oriented data without being dependent on memory capacity, and quickly analyzes column-oriented data. Massive volumes of column-oriented data can be stored by taking advantage of a new technique for managing column-oriented data. The engine also accelerates analysis by way of the indexes that typical database systems provide, and can be used without special consideration of whether the data is stored as row-oriented or column-oriented. On Query1 of the DBT-3 benchmark(2), which reads, filters, and aggregates data, the parallel-processing analysis engine, which has been optimized for column-oriented data, runs 4 times faster on a single CPU core than before. On a single server with 15 CPU cores, performance is at least 50 times faster.
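The two figures can be related with simple arithmetic. The sketch below assumes that both speedups are measured against the same baseline, which the release does not state explicitly; the function name is hypothetical.

```python
def parallel_efficiency(single_core_speedup, server_speedup, cores):
    """Split a total speedup into a per-core part and a parallel part.

    Assumes both speedups share one baseline (an assumption, not a
    statement from the release).
    """
    # Speedup attributable to using more cores rather than a faster engine.
    parallel_speedup = server_speedup / single_core_speedup
    # Fraction of ideal linear scaling achieved across the cores.
    return parallel_speedup, parallel_speedup / cores


# Figures quoted in the release: 4x on one core, at least 50x on 15 cores.
parallel_speedup, efficiency = parallel_efficiency(4.0, 50.0, 15)
```

Under that assumption, parallel execution contributes a factor of at least 12.5 across 15 cores, or roughly 83% of ideal linear scaling.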

Key features of the technology are as follows:

1. Large-volume column-oriented data storage

To efficiently manage large volumes of column-oriented data that cannot fit into memory, storage areas are managed in "extents," large units of roughly 260,000 records each. Storage areas are allocated and deleted, and free areas are reclaimed, one extent at a time. Because managing such large units while analyses are running could cause long wait times, Fujitsu adopted MultiVersion Concurrency Control (MVCC(3)), which allows analyses to run at the same time that storage areas are being managed.
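A minimal sketch of the idea follows, assuming a simple version counter in which each extent records the version where it became visible and the version where it was retired. The class names, the tiny extent size, and the reclaim rule are illustrative assumptions, not Fujitsu's implementation.

```python
from dataclasses import dataclass, field
from typing import List, Optional

EXTENT_RECORDS = 4  # tiny for illustration; the release cites roughly 260,000


@dataclass
class Extent:
    values: List[int]
    created_at: int                   # version in which the extent became visible
    deleted_at: Optional[int] = None  # version in which it was retired, if any


@dataclass
class ColumnStore:
    version: int = 0
    extents: List[Extent] = field(default_factory=list)

    def append_extent(self, values):
        if len(values) > EXTENT_RECORDS:
            raise ValueError("an extent holds at most EXTENT_RECORDS records")
        self.version += 1
        self.extents.append(Extent(list(values), created_at=self.version))

    def retire_extent(self, extent):
        # Mark the extent as deleted without disturbing running readers.
        self.version += 1
        extent.deleted_at = self.version

    def snapshot(self):
        return self.version

    def scan(self, snapshot):
        # A reader sees exactly the extents that existed at its snapshot.
        for extent in self.extents:
            created = extent.created_at <= snapshot
            alive = extent.deleted_at is None or extent.deleted_at > snapshot
            if created and alive:
                yield from extent.values

    def reclaim(self, oldest_active_snapshot):
        # Free extents that no active snapshot can still see.
        self.extents = [
            e for e in self.extents
            if e.deleted_at is None or e.deleted_at > oldest_active_snapshot
        ]


store = ColumnStore()
store.append_extent([1, 2, 3, 4])
store.append_extent([5, 6])
analysis_snapshot = store.snapshot()    # a long-running analysis starts here
store.retire_extent(store.extents[0])   # extent management proceeds meanwhile
store.append_extent([7, 8])
old_view = list(store.scan(analysis_snapshot))
new_view = list(store.scan(store.snapshot()))
```

The running analysis keeps reading the retired extent through its snapshot while newer readers no longer see it, so neither side waits for the other. The extent is freed only once `reclaim` is called with a snapshot newer than its retirement.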

2. Column-oriented indexes (column-store indexes)

A column-oriented index (column-store index) is created in the same way as other indexes. Creating one is a way to choose the data-storage method (row-oriented or column-oriented) that suits the queries being run against the database. When the row-oriented data from which a column-store index was created is updated, the column-oriented data is updated automatically. This frees users from having to think about the data-storage method.
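The behavior described above can be sketched as a table that maintains a column-oriented copy of selected columns on every write. The `Table` class and its methods are hypothetical and stand in for the database; they do not reflect the engine's actual interface.

```python
class Table:
    """Row-oriented table with optional column-oriented copies of chosen columns."""

    def __init__(self):
        self.rows = []            # row-oriented data: one dict per record
        self.column_indexes = {}  # column name -> list of values

    def create_column_index(self, *names):
        # Build the column-oriented copy from the existing rows.
        for name in names:
            self.column_indexes[name] = [row[name] for row in self.rows]

    def insert(self, row):
        self.rows.append(dict(row))
        # Every indexed column is extended as part of the same write.
        for name, values in self.column_indexes.items():
            values.append(row[name])

    def update(self, position, **changes):
        self.rows[position].update(changes)
        # Changes to indexed columns are mirrored immediately.
        for name, value in changes.items():
            if name in self.column_indexes:
                self.column_indexes[name][position] = value

    def total(self, name):
        # The caller never states which representation to read from.
        if name in self.column_indexes:
            return sum(self.column_indexes[name])
        return sum(row[name] for row in self.rows)


table = Table()
table.insert({"id": 1, "qty": 10})
table.insert({"id": 2, "qty": 5})
table.create_column_index("qty")
table.update(0, qty=7)
table.insert({"id": 3, "qty": 1})
```

Callers write rows and ask for totals in the same way whether or not the index exists, which is the sense in which the storage method becomes invisible to the user.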

3. Analysis engine optimized for column-oriented data and parallel processing using an original shared-memory structure

Simply using column-oriented data to improve read performance does not make the most of the benefits that column-oriented data can offer. Fujitsu developed an analysis engine that applies the same operation to multiple data items at once (vector processing), which improves performance even on a single CPU core. As a parallel-analysis mechanism, the company also developed a new shared-memory structure so that multiple processes operating in parallel in PostgreSQL can hand off data with little slowdown. On a server with 15 CPU cores, this can achieve at least a fifty-fold performance improvement over the previous PostgreSQL.
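The two ideas can be sketched together: a filter-and-aggregate step that works on a batch of column values at a time, and a partitioned run whose partial results are merged. The function names, batch size, and data are illustrative assumptions. Threads are used here only to show the partition-and-merge structure; the sketch does not model Fujitsu's shared-memory structure or PostgreSQL's process model, and it is not expected to run faster in Python.

```python
from concurrent.futures import ThreadPoolExecutor

BATCH = 4  # values handled per batch; illustrative only


def aggregate_range(quantity, price, max_quantity, start, stop, batch=BATCH):
    """Filter and aggregate one partition, a batch of values at a time."""
    total = 0
    count = 0
    for lo in range(start, stop, batch):
        hi = min(lo + batch, stop)
        q = quantity[lo:hi]
        p = price[lo:hi]
        # The same comparison is applied to the whole batch, then reduced.
        keep = [value <= max_quantity for value in q]
        total += sum(v for v, k in zip(p, keep) if k)
        count += sum(keep)
    return total, count


def parallel_aggregate(quantity, price, max_quantity, workers=3):
    """Split the columns into ranges, aggregate each, and merge the partials."""
    n = len(quantity)
    step = max(1, -(-n // workers))  # ceiling division, at least 1
    ranges = [(s, min(s + step, n)) for s in range(0, n, step)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partials = list(pool.map(
            lambda r: aggregate_range(quantity, price, max_quantity, *r),
            ranges,
        ))
    return sum(t for t, _ in partials), sum(c for _, c in partials)


quantity = [1, 5, 3, 9, 2, 8, 4, 7, 6, 10]
price = [10, 20, 30, 40, 50, 60, 70, 80, 90, 100]
serial_result = aggregate_range(quantity, price, 5, 0, len(quantity))
parallel_result = parallel_aggregate(quantity, price, 5)
```

Because each partition returns only a small partial aggregate, the merge step is cheap compared with the scan. That is why the cost of handing data between workers matters most for scaling.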

Results

This technology enables existing smaller systems with limited memory to achieve real-time analysis and utilization of big data in ways that were not possible before.

Future Plans

Fujitsu is aiming for a commercial implementation of this technology during fiscal 2015, as a part of Symfoware Server, Fujitsu's database product.

(1) Index: Hint information for searching a database more quickly.
(2) DBT-3 benchmark: A benchmark for measuring the performance of decision-support systems.
(3) MultiVersion Concurrency Control: A technique for ensuring consistency when there are simultaneous requests from multiple users. Used in many database systems.


Contact:
Fujitsu Limited
Public and Investor Relations
Tel: +81-3-3215-5259
URL: www.fujitsu.com/global/news/contacts/

Fujitsu Laboratories Ltd.
ICT Systems Laboratories 
Server Technologies Lab
E-mail: Retimer_ISSCC2015@ml.labs.fujitsu.com

