Improved Skips for Faster Postings List Intersection

Abstract

An information retrieval system responds to a query by automatically generating a list of relevant results. Its main components are the document processor, the matching function, and the query analyzer. Document retrieval systems are fundamentally based on Boolean, vector-space, probabilistic, or language models. This paper proposes a new methodology for the matching function of Boolean retrieval systems that extends the postings list data structure and makes the use of postings lists and skips more efficient. The net effect is a reduction in search time.

Keywords