Undergrad accidentally shreds 40-year hash table gospel

May Be Interested In:Samsung’s Galaxy Tab S10 FE Plus will be bigger and cost more


It isn’t often that a decades-old assumption underpinning modern technology is overturned, but a recent paper based on the work of an undergraduate and his two co-authors has done just that.

That assumption refers to hash tables, and a conjecture based on work from the 1980s regarding the optimal way to store and query the data in them. The student, formerly of Rutgers University in New Jersey, came up with a new kind of hash table that is faster and uses fewer steps to find specific elements, all while being unaware of that conjecture.

As detailed by Quanta Magazine, Andrew Krapivin, now a graduate at the University of Cambridge, is one of the co-authors on a paper, “Optimal Bounds for Open Addressing Without Reordering,” published last month that sets out how his hash table can find elements faster than was previously considered possible.

Hash tables have been around since the 1950s, and are an example of a key-value store where a hash function is used to generate the index for the data value based on the key itself.

Previously, a historic paper authored by computer scientist Andrew Yao, “Uniform hashing is optimal,” had asserted that the best way of finding an individual element or an empty location in a hash table is simply to access potential locations randomly, an approach known as uniform probing.

The new paper states that despite its simplicity, Yao’s conjecture had never been settled.

There was a way to get around this involving the insertion algorithm carrying out a reordering process after insertion, i.e. to optimize the placement of elements within the hash table. But it wasn’t clear if this was a necessary step in order to speed things up.

The 2025 paper claims that even without reordering elements over time, it is possible to construct a hash table using Krapivin’s method that achieves far better probe complexity – the average number of locations that need to be checked (probed) to find a specific key – than previous hash table methods.

The authors of the paper say that they refer to their insertion strategy as “elastic hashing” because of the way that the algorithm often probes much further down the table before snapping back to the position it ends up using.

According to Quanta, the paper demonstrates that for Krapivin’s hash table method, the time required for worst-case queries and insertions is proportional to (log x)2, which is much faster than the previously assumed linear time complexity in x. Here, x is a number that describes how close the hash table is to being completely full, where x = 100 means the table is 99 percent full, and x = 1,000 means the table is 99.9 percent full.

Krapivin is said to have come up with this method after reading a paper titled “Tiny Pointers,” co-authored by his professor at Rutgers, and exploring how to further miniaturize pointers so they used even less memory space. ®

share Share facebook pinterest whatsapp x print

Similar Content

Sony announces PlayStation 5 rental scheme in UK
Sony announces PlayStation 5 rental scheme in UK
Newsom Announces Executive Order on Rebuilding As LA Continues to Burn
Newsom Announces Executive Order on Rebuilding As LA Continues to Burn
Sunita Williams performing a spacewalk outside the International Space Station (pic: NASA+)
‘Abandoned’ astro takes recordbreaking ninth spacewalk
Greens leader urges end to the AUKUS deal with 'very dangerous' Trump
Greens leader urges end to the AUKUS deal with ‘very dangerous’ Trump
Reason Matthew McConaughey was only paid $200k for Dallas Buyers Club role
Reason Matthew McConaughey was only paid $200k for Dallas Buyers Club role
Pic: iStock
Meta offers creators $5,000 to join Facebook and Instagram amid TikTok uncertainty
News of the Moment: Keeping You Informed | © 2025 | Daily News