FIELD: information technology.
SUBSTANCE: identification index for documents containing keywords is used. The index contains an encoded delta index list of the document identifier (ID), wherein the delta index list of the document ID contains a plurality of records, each record using a symbol to represent the value of the delta index of the document ID for each document from a plurality of documents in a search area containing a keyword. Each of the symbols of the delta index list of the document ID is compared with one category from a finite set of categories and with the index in each category from the finite set of categories. Each category contains a basic value and each symbol in the delta index list of the document ID is the sum of the basic value for the category compared with it and the value of the delta index of the document ID represented by said symbol.
EFFECT: faster search process and high accuracy of search results.
17 cl, 11 dwg
Authors
Dates
2013-12-27—Published
2009-05-13—Filed