CLUSTERING OF DOCUMENTS Russian patent published in 2022 - IPC G06F16/35 

Abstract RU 2768209 C1

FIELD: physics.

SUBSTANCE: invention relates to computer engineering for analyzing documents. Technical result is achieved by obtaining an input document; determining, by evaluating a document similarity function using one or more calculated attributes of the input document, a plurality of similarity metrics, where each similarity indicator from the plurality of similarity indicators reflects the degree of similarity between the input document and the corresponding cluster of documents from the plurality of document clusters; determining the maximum similarity score from the plurality of similarity indicators; determining that the input document does not belong to any of the document clusters from the plurality of document clusters if the maximum similarity score is below a threshold value; creating a new cluster of documents; and assigning an input document to a new cluster of documents.

EFFECT: high accuracy of clustering documents.

20 cl, 6 dwg

Similar patents RU2768209C1

Title Year Author Number
AUTOMATIC DETERMINATION OF SET OF CATEGORIES FOR DOCUMENT CLASSIFICATION 2018
  • Nikita Orlov
  • Konstantin Anisimovich
RU2701995C2
SYSTEM AND METHOD OF FORMING TRAINING SET FOR MACHINE LEARNING ALGORITHM 2017
  • Lakhman Konstantin Viktorovich
  • Chigorin Aleksandr Aleksandrovich
  • Yurchenko Viktor Sergeevich
RU2711125C2
RETRIEVING FIELDS USING NEURAL NETWORKS WITHOUT USING TEMPLATES 2019
  • Stanislav Semenov
RU2737720C1
SIMULTANEOUS RECOGNITION OF PERSON ATTRIBUTES AND IDENTIFICATION OF PERSON IN ORGANIZING PHOTO ALBUMS 2018
  • Savchenko Andrey Vladimirovich
RU2710942C1
SYSTEMS AND METHODS FOR DETECTING BEHAVIOURAL THREATS 2019
  • Dichiu Daniel
  • Niculae Stefan
  • Bosinceanu Elena A.
  • Zamfir Sorina N.
  • Dincu Andreea
  • Apostoae Andrei A.
RU2803399C2
SYSTEMS AND METHODS FOR DETECTING BEHAVIOURAL THREATS 2019
  • Dichiu Daniel
  • Niculae Stefan
  • Bosinceanu Elena A.
  • Zamfir Sorina N.
  • Dincu Andreea
  • Apostoae Andrei A.
RU2772549C1
METHOD OF CONSTRUCTING AND DETECTION OF THEME HULL STRUCTURE 2013
  • Bogdanova Daria Nikolaevna
  • Kopylov Nikolay Yurievich
RU2583716C2
SYSTEMS AND METHODS FOR DETECTING BEHAVIOURAL THREATS 2019
  • Dichiu Daniel
  • Niculae Stefan
  • Bosinceanu Elena A.
  • Zamfir Sorina N.
  • Dincu Andreea
  • Apostoae Andrei A.
RU2778630C1
AI TRANSACTION ADMINISTRATION SYSTEM 2020
  • Fehling, Ronny
  • Short, Samantha
  • De Goursac, Axel
  • Dubois, Raphael
  • Erlebach, Joerg
  • Von Funck, Karin
RU2777958C2
CHARACTER RECOGNITION USING A HIERARCHICAL CLASSIFICATION 2018
  • Aleksey Alekseevich Zhuravlev
RU2693916C1

RU 2 768 209 C1

Authors

Stanislav Semenov

Alexandra Antonova

Alexey Misyrev

Dates

2022-03-23Published

2020-11-13Filed