• A
  • A
  • A
  • ABC
  • ABC
  • ABC
  • А
  • А
  • А
  • А
  • А
Regular version of the site

AI to Enable Accurate Modelling of Data Storage System Performance

AI to Enable Accurate Modelling of Data Storage System Performance

© iStock

Researchers at the HSE Faculty of Computer Science have developed a new approach to modelling data storage systems based on generative machine learning models. This approach makes it possible to accurately predict the key performance characteristics of such systems under various conditions. Results have been published in the IEEE Access journal.

Data storage systems play an important role in today’s digital world, as they are responsible for the safety and prompt availability of vast amounts of information. These systems consist of many components, including controllers, HDD and SSD disks, as well as cache memory, which work together to ensure fast and efficient operation. To achieve optimal performance, it is essential to accurately predict how these systems will function in different scenarios, such as when the load on the system changes.

Researchers at the HSE Faculty of Computer Science developed a new approach to modelling data storage system performance, which relies on generative machine learning models. The authors proposed a method that provides high-precision predictions of the key performance characteristics of the systems: the number of input/output operations per second (IOPS) and latency.

The modelling includes two stages. First, the scientists collect data by measuring the system’s performance under various loads and configurations. This data is then fed to two special generative models: the CatBoost regression model and the normalizing flow model. CatBoost works well with tabular data and can accurately predict average values and performance deviations. The normalizing flow model produces a complete distribution of possible outcomes, taking into account data uncertainties and variability.

Mikhail Hushchyn

‘One of the main advantages of our method is that it does not require detailed knowledge of the internal structure of the system components. This is often impossible due to the manufacturers’ trade secrets. Instead, our generative models are trained directly on real-world data. For instance, in our study, we trained a model using 300,000 measurements. This makes our approach versatile and applicable to any type of data storage system,’ says study author Mikhail Hushchyn, a senior research fellow at the HSE Faculty of Computer Science.

The researchers tested the accuracy of the proposed approach using Little's law, a fundamental principle of queuing theory. According to test results, these predictions are highly consistent with real observations: prediction errors range from just 4–10% for IOPS and 3–16% for latency, while the correlation with the observed values reaches 0.99.

Aziz Temirkhanov

‘Our proposed approach opens up broad prospects for optimising and planning the operation of data centres. It makes it possible to predict the behaviour of the system amid load changes, identify potential performance issues, and optimise power consumption. Furthermore, expensive physical experiments are no longer required for accurate modelling,’ stated Aziz Temirkhanov, a junior research fellow at the Laboratory of Methods for Big Data Analysis.

The experimental code and measurements of the storage system performance are publicly available.

See also:

How Colour Affects Pricing: Why Art Collectors Pay More for Blue

Economists from HSE University, St Petersburg State University, and the University of Florida have found which colours in abstract paintings increase their market value. An analysis of thousands of canvases sold at auctions revealed that buyers place a higher value on blue and favour bright, saturated palettes, while showing less appreciation for traditional colour schemes. The article has been published in Information Systems Frontiers.

New Method for Describing Graphene Simplifies Analysis of Nanomaterials

An international team, including scientists from HSE University, has proposed a new mathematical method to analyse the structure of graphene. The scientists demonstrated that the characteristics of a graphene lattice can be represented using a three-step random walk model of a particle. This approach allows the lattice to be described more quickly and without cumbersome calculations. The study has been published in Journal of Physics A: Mathematical and Theoretical.

HSE Researchers Assess Creative Industry Losses from Use of GenAI

Speaking at the IPQuorum.Music forum on October 15, Leonid Gokhberg, HSE First Vice Rector, and Daniil Kudrin, an expert at the Centre for Industry and Corporate Projects of HSE ISSEK, presented the findings of the first study in Russia on the economic impact of GenAI on creative professions. The analysis shows that creators’ potential losses could reach one trillion roubles by 2030.

‘Fall into ML Has Firmly Established Itself as a Landmark Event in Russia’s AI Scene’

On October 24–25, 2025, the AI and Digital Science Institute of the HSE Faculty of Computer Science will host the fourth annual Fall into ML 2025 conference at the HSE Cultural Centre. The event is once again supported by its general partner, Sber. The focus this year is on breakthrough research and the future of fundamental AI.

Scientists Have Modelled Supercapacitor Operation at Molecular and Ionic Level

HSE scientists used supercomputer simulations to study the behaviour of ions and water molecules inside the nanopores of a supercapacitor. The results showed that even a very small amount of water alters the charge distribution inside the nanopores and influences the device’s energy storage capacity. This approach makes it possible to predict how supercapacitors behave under different electrolyte compositions and humidity conditions. The paper has been published in  Electrochimica Acta.  The study was supported by a grant from the Russian Science Foundation (RSF).

Designing an Accurate Reading Skills Test: Why Parallel Texts are Important in Dyslexia Diagnosis

Researchers from the HSE Centre for Language and Brain have developed a tool for accurately assessing reading skills in adults with reading impairments. It can be used, for instance, before and after sessions with a language therapist. The tool includes two texts that differ in content but are equal in complexity: participants were observed to read them at the same speed, make a similar number of errors, and understand the content to the same degree. Such parallel texts will enable more accurate diagnosis of dyslexia and better monitoring of the effectiveness of interventions aimed at addressing it. The paper has been published in Educational Studies.

Internal Clock: How Heart Rate and Emotions Shape Our Perception of Time

Our perception of time depends on heart rate—this is the conclusion reached by neuroscientists at HSE University. In their experiment, volunteers watched short videos designed to evoke specific emotions and estimated each video's duration, while researchers recorded their heart activity using ECG. The study found that the slower a participant's heart rate, the shorter they perceived the video to be—especially when watching unpleasant content. The study has been published in Frontiers in Psychology.

Scientists Identify Personality Traits That Help Schoolchildren Succeed Academically

Economists from HSE University and the Southern Federal University have found that personality traits such as conscientiousness and open-mindedness help schoolchildren improve their academic performance. The study, conducted across seven countries, was the first large-scale international analysis of the impact of character traits on the academic achievement of 10 and 15-year-olds. The findings have been published in the International Journal of Educational Research.

HSE Researchers Introduce Novel Symmetry-Aware Neural Network Architecture

Researchers at the HSE Laboratory for Geometric Algebra and Applications have developed a new neural network architecture that can accelerate and streamline data analysis in physics, biology, and engineering. The scientists presented their solution on July 16 in Vancouver at ICML 2025, one of the world's leading conferences on machine learning. Both the paper and the source code are publicly available.

Critique of Obscure Reason: Artificial Intelligence in the Perception of Mathematicians

Mathematicians at HSE University believe that there is no need to fear losing jobs because of the widespread use of AI, while at the same time they warn against uncritical acceptance of works and projects prepared with its help. AI, however, can be a useful tool in research, creating models and processing large volumes of information.