FIELD: physics, computer technology.
SUBSTANCE: invention concerns data search and intellectual systems, particularly methods of information search in large document data base. Assessment matrix of correlations between words is defined and totaled in subject context matrix; interference elements are removed from context matrix; image of documents included in search index is generated in the form of a list for each unique term used in document; term frequency index in documents is developed; accuracy and comprehensiveness of notion manifestation are assessed; and assessments are added to frequency index. Search request is provided; numbers of documents where at least one request term is present are store in computer memory; requested notion manifestation degree in found documents is calculated as function of accuracy and comprehensiveness assessments; assessment W(i) is calculated as function of requested notion manifestation degree, proximity and order of requested words and correlation to requested word forms; found documents are sorted by W(i) assessment and presented to user.
EFFECT: reduction of search document image size from square to linear dependency on unique term number in document; reduced subject context size; enhanced computation efficiency and search accuracy.
Title | Year | Author | Number |
---|---|---|---|
SYSTEM AND METHOD FOR SEMANTIC SEARCH | 2013 |
|
RU2563148C2 |
METHOD AND SYSTEM OF SEMANTIC PROCESSING TEXT DOCUMENTS | 2016 |
|
RU2630427C2 |
METHOD FOR SYNTHESIS OF SELF-TEACHING SYSTEM FOR EXTRACTING KNOWLEDGE FROM TEXT DOCUMENTS FOR SEARCH ENGINES | 2002 |
|
RU2273879C2 |
METHOD OF CONSTRUCTING SEMANTIC MODEL OF DOCUMENT | 2011 |
|
RU2487403C1 |
METHOD OF CLUSTERING OF SEARCH RESULTS DEPENDING ON SEMANTICS | 2014 |
|
RU2564629C1 |
EXPANDING OF INFORMATION SEARCH POSSIBILITY | 2015 |
|
RU2618375C2 |
METHOD OF SEARCHING FOR INFORMATION IN TEXT ARRAY | 2008 |
|
RU2392660C2 |
METHOD AND SYSTEM FOR SEMANTIC SEARCH OF ELECTRONIC DOCUMENTS | 2011 |
|
RU2473119C1 |
METHOD OF SYNTHESIS OF SELF-TRAINED ANALYTICAL QUESTION-ANSWER SYSTEM WITH EXTRACTION OF KNOWLEDGE FROM TEXTS | 2007 |
|
RU2345416C1 |
METHOD AND SYSTEM FOR ARRANGING DIALOGUE WITH USER IN USER-FRIENDLY CHANNEL | 2018 |
|
RU2688758C1 |
Authors
Dates
2009-02-27—Published
2007-05-03—Filed