FIELD: machine learning.
SUBSTANCE: invention relates to methods and a system for forming a set of training objects for training a machine learning algorithm (hereinafter – MLA) performed by a server. The method is to perform obtaining of the first query sent to the server and associated with the first set of search results associated with the user action parameter. Then, based on the terms of the first query, a set of queries previously sent to the server that differ from the first query by the specified number of terms is obtained. Then, for a set of queries, a set of search results is got that differ from the search results in the first set of search results. After that, the similarity score of the first query and queries from the query set is calculated based on the first set of search results, other search results sets, and user action parameters in the first and other search results sets. Then, subsets of queries from a set of queries are determined based on a similarity score less than a predefined similarity threshold. Finally, a set of training objects is formed to be used as negative training examples for MLA training, with each training object containing the first query, another query from a subset of queries, and a similarity score of the first query and the other query.
EFFECT: technical result is increased efficiency of MLA training.
27 cl, 5 dwg
Title | Year | Author | Number |
---|---|---|---|
METHOD AND SYSTEM FOR EXPANDING SEARCH QUERIES IN ORDER TO RANK SEARCH RESULTS | 2018 |
|
RU2720905C2 |
SYSTEM AND METHOD OF FORMING TRAINING SET FOR MACHINE LEARNING ALGORITHM | 2017 |
|
RU2711125C2 |
METHOD AND SYSTEM FOR GENERATING FEATURE FOR RANGING DOCUMENT | 2018 |
|
RU2733481C2 |
METHOD AND SERVER FOR REPEATED TRAINING OF MACHINE LEARNING ALGORITHM | 2019 |
|
RU2743932C2 |
METHOD AND SYSTEM OF SELECTION FOR RANKING SEARCH RESULTS USING MACHINE LEARNING ALGORITHM | 2018 |
|
RU2731658C2 |
METHOD AND SYSTEM FOR CREATING ANNOTATION VECTORS FOR DOCUMENT | 2017 |
|
RU2720074C2 |
SYSTEM AND METHOD FOR FORMATION OF TRAINING SET FOR MACHINE LEARNING ALGORITHM | 2020 |
|
RU2790033C2 |
METHOD AND SERVER FOR TRAINING MACHINE LEARNING ALGORITHM IN OBJECT RANKING | 2020 |
|
RU2782502C1 |
SEARCH INDEX CONSTRUCTION METHOD AND SYSTEM USING MACHINE LEARNING ALGORITHM | 2018 |
|
RU2720954C1 |
METHOD AND SERVER FOR GENERATING META-ATTRIBUTE FOR RANGING DOCUMENTS | 2018 |
|
RU2721159C1 |
Authors
Dates
2021-03-02—Published
2018-12-29—Filed