Four leading trends of database innovation

12/04/2021

By Li Feifei, Vice President of Alibaba Group

In the wake of the pandemic, one trend that emerged is the need for digitalization, and this will continue – and even grow – in the future. While it’s still not clear what that actual economic impact of Covid-19 will be, the Statista Research Department, predicts that the spending on technologies and services to enable digital transformation worldwide is expected to reach more two trillion US dollars. The Gulf Cooperation Council has taken great strides in digitalization, which is an essential step towards digital transformation. By 2025, the revenue from the AI market in the MENA region is forecasted to reach more than one billion US dollars.

As businesses – and even economies – become more digitalized, there’s no escaping that data and the ability to gain insights from it are at the heart of every successful organization – and the foundation of any successful insight-led architecture will remain the database.

Here is a look at four database trends that can help businesses and economies to surge ahead in the wake of the pandemic.

Hybrid Transaction/Analytical Processing (HTAP): Big Data and Databases

Over the past ten years, databases and big data have distanced themselves in their functionality. One is responsible for online workloads, while the other covers batch processing vast quantities of data. However, from the perspective of applications, with a single system that can solve the whole process of data generation, processing, storage, and consumption, it can help create a better online experience for customers as they navigate retail websites and services.

From the retailer’s perspective, to improve performance, they need to synchronize several million transactions every day; each bringing with it several different data types as they improve recommendations, and consequently, the experience for online shoppers. However, by achieving online and batch workload integration or HTAP, these problems can be easily solved. And this trend will continue to grow over the next decade, which is why Gartner classifies OPDBMS (Operational Database Management Systems) and DMSA (Data Management Solutions for Analytics) into one Cloud DBMS market, and this is the core logic behind the integration of the two fields.

Cloud-Native Architectures and Distributed Processing Techniques

It is a well-known fact that more and more organizations are adopting a cloud infrastructure. They are not only moving their generic business process applications on to the cloud, but also their mission critical applications and data.

According to recent industry research, 75 percent of all databases will be deployed or migrated to a cloud platform by 2022. This means that organizations are in need of cloud services and solutions that will support their cloud native architecture. In fact, business organizations are already developing or sourcing cloud native applications, as it allows the business processes to be more dynamic.

Combined with distributed processing techniques, without heavy infrastructure set up, users can enjoy the elasticity and high availability brought by cloud native databases with ease and efficiency.

Intelligent and Self-Driving Databases

Cloud computing has changed everything because it has fueled the growth of data. But, we are still far from real AI. We use deep neural networks today and they need large-scale data to be really useful. AI is a black box today, but AI tech used as heuristics has worked. It has made a mark in computer vision and speech recognition, for example.

Now, it is making a mark in databases too. We will have self-driving databases in the future, and our roadmap is to fully automate a database. The complexity in automating databases arises because usage changes from customer to customer, which makes it tough to automate the entire process.

However, we can use AI for common scenarios. For example, we can help different workloads from e-commerce or traditional systems to tune system parameters to improve their latency and scalability and use ML algorithms to ensure that databases are secure and running without anomaly.

Multi-model Databases

It’s hard to imagine that there was a time when there were other database models besides a relational one. However, there are more types, including document-oriented, graph database, time series, triple store, etc. In a rapidly digitizing world, arriving at insights from heterogeneous data will continue to remain a challenge as businesses seek to derive the most out of newer advances in technology such as AI, IoT, and beyond.

Databases will use cloud native design to decouple storage and compute, and be compatible with a wide variety of open-source standard interfaces. They will also support switching between open source systems and seamlessly connecting with multiple computing and analytical engines.

In addition, wide-column table and time series models will be supported, as well as the storage and analysis of structured, semi-structured and unstructured data. This will deliver improved performance for IoT, especially for the multi-model data store, computing and analytics of device metadata, device operation data (time series data) and device logs.