FIELD: physics.
SUBSTANCE: invention relates to a method and a server for determining a training set for training a machine learning algorithm (MLA) for classifying digital objects. Method comprises obtaining by server of multiple training examples for training MLA, training example includes text data associated with corresponding digital object, and indication of true class of corresponding object; server ordering said plurality of training examples into an ordered sequence of training examples, said training example has previous training examples in an ordered sequence and subsequent training examples in an ordered sequence; generating, by the server, a text feature for said training example based on the text data in said training example, as well as text data and true classes of only previous training examples in an ordered sequence without taking into account text data in subsequent training examples; determining, by a server, a training set for MLA based on said training example, wherein the training set has training input and a label, the training input includes a text feature, the label represents the true class of the corresponding object, wherein the digital object is any of: a digital document provided as a search result in response to a search query, digital element recommended to the user of the content recommendation system, e-mail message intended for an e-mail platform user.
EFFECT: high reliability of the prediction model by reducing the risk and effect of retraining during the phase of using the prediction model.
56 cl, 8 dwg
Title | Year | Author | Number |
---|---|---|---|
METHOD AND SERVER FOR REPEATED TRAINING OF MACHINE LEARNING ALGORITHM | 2019 |
|
RU2743932C2 |
METHOD AND SYSTEM FOR GENERATING TRAINING DATA FOR MACHINE LEARNING ALGORITHM | 2021 |
|
RU2819647C2 |
METHODS AND SERVERS FOR DETERMINING METRIC-DEPENDENT THRESHOLDS USED WITH MULTIPLE NESTED METRICS FOR BINARY CLASSIFICATION OF A DIGITAL OBJECT | 2020 |
|
RU2795202C2 |
SEARCH INDEX CONSTRUCTION METHOD AND SYSTEM USING MACHINE LEARNING ALGORITHM | 2018 |
|
RU2720954C1 |
METHOD AND SYSTEM FOR GENERATING FEATURE FOR RANGING DOCUMENT | 2018 |
|
RU2733481C2 |
METHOD AND SERVER FOR TRAINING MACHINE LEARNING ALGORITHM IN TRANSLATION | 2020 |
|
RU2770569C2 |
METHOD AND SERVER FOR TRAINING MACHINE LEARNING ALGORITHM IN OBJECT RANKING | 2020 |
|
RU2782502C1 |
METHOD AND SERVER FOR TEACHING A NEURAL NETWORK TO FORM A TEXT OUTPUT SEQUENCE | 2020 |
|
RU2798362C2 |
METHODS AND SERVERS FOR RANKING DIGITAL DOCUMENTS IN RESPONSE TO A QUERY | 2020 |
|
RU2775815C2 |
MACHINE TRAINING | 2005 |
|
RU2391791C2 |
Authors
Dates
2024-04-19—Published
2020-11-19—Filed