approved
Why Are Learned Indexes So Effective

A recent trend in algorithm design consists of augmenting classic data structures with machine learning models, which are better suited to reveal and exploit patterns and trends in the input data so to achieve outstanding practical improvements in space occupancy and time efficiency. This is especially known in the context of indexing data structures where despite few attempts in evaluating their asymptotic efficiency, theoretical results are yet missing in showing that learned indexes are probably better than classic indexes, such as B+ trees and their variants. In this paper, we present the first mathematically-grounded answer to this open problem. We obtain this result by discovering and exploiting a link between the original problem and a mean exit time problem over a proper stochastic process which, we show, is related to the space and time occupancy of those learned indexes. Our general result is then specialised to five well-known distributions: Uniform, Lognormal, Pareto, Exponential, and Gamma; and it is corroborated in precision and robustness by a large set of experiments.

Tags
Data and Resources
To access the resources you must log in
  • Link to the publicationHTML

    The resource: 'Link to the publication' is not accessible as guest user. You must login to access it!
  • PDFPDF

    The resource: 'PDF' is not accessible as guest user. You must login to access it!
Additional Info
Field Value
Creator Ferragina, Paolo, paolo.ferragina@unipi.it
Creator Lillo, Fabrizio, fabrizio.lillo@sns.it
Creator Vinciguerra, Giorgio, giorgio.vinciguerra@phd.unipi.it
Group Demography, Economy and Finance 2.0
Link http://proceedings.mlr.press/v119/
Publisher Proceedings of Machine Learning Research ISSN: 2640-3498
Source Volume 119: International Conference on Machine Learning, 13-18 July 2020, Virtual
Thematic Cluster Text and Social Media Mining [TSMM]
system:type ConferencePaper
Management Info
Field Value
Author Rapisarda Beatrice
Maintainer Vinciguerra Giorgio
Version 1
Last Updated 12 September 2023, 09:30 (CEST)
Created 3 February 2021, 12:11 (CET)