Evaluating search key distribution impact on searching performance in large data streams

Bowonsak Srisungsittisunti, Jirawat Duangkaew, Nakarin Chaikaew

Abstract


The distribution pattern of search keys is assessed in this study by contrasting four methods of index searching on large-scale JSON files with data streams. The Adelson-Velskii and Landis (AVL) tree, binary search tree (BST), linear search (LS), and binary search (BS) are among the search strategies. We look at the normal distribution, left-skewed distribution, and right-skewed distribution of search-key distributions. According to the results, LS performs the slowest, averaging 653.166 milliseconds, whereas AVL tree performs better than the others in dense index, with an average search time of 0.005 milliseconds. With 0.011 milliseconds per keyword for sparse index, BS outperforms LS, which averages 1007.848 milliseconds. For dense indexing, an AVL tree works best; for sparse indexing, BS is recommended.


Keywords


Binary search tree; Dense indexing; Indexing efficiency; Large-scale dataset; Sparse indexing;

Full Text:

PDF


DOI: http://doi.org/10.11591/ijai.v14.i3.pp2537-2546

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

IAES International Journal of Artificial Intelligence (IJ-AI)
ISSN/e-ISSN 2089-4872/2252-8938 
This journal is published by the Institute of Advanced Engineering and Science (IAES).

View IJAI Stats