FIELD: information technology.
SUBSTANCE: invention relates to information searching and sampling. To achieve the technical outcome, a sequence of ambiguous information components is received from a user and transformed into one or more corresponding sequences of less ambiguous information components. These sequences of less ambiguous information are given as input data into the search engine. Search results are received from the search engine and presented to the user. Translation between these sets of characters and/or languages can be done by analysing use of terms in the aligned text. Probabilities can be associatively linked to each possible translation. These probabilities can be corrected by analysing interaction of the user with the search results.
EFFECT: possibility of searching using queries written in set of characters or language, which is different from the set of characters or language of documents, which are to be found and obtaining relevant search results.
45 cl, 16 dwg
Authors
Dates
2009-08-10—Published
2004-09-13—Filed