METHOD AND SYSTEM FOR CLUSTERING DOCUMENTS Russian patent published in 2021 - IPC G06F16/35 

Abstract RU 2757592 C1

FIELD: computer technology.

SUBSTANCE: method and a system for forming clusters of documents using a generalized metric parameter are implemented. The first document and the second document are received for a potential cluster containing the first document and the second document, and the first metric parameter is determined, indicating the degree of mutual complementation of the content of documents in the potential cluster, and the second metric parameter, indicating the degree of dilution of the content of documents in this potential cluster. The generalized metric parameter is determined based on the first metric parameter and the second metric parameter. Based on the generalized metric parameter, a cluster containing the first and second documents is formed. Another document (or documents) or clusters can be added to a cluster by defining an updated generalized metric parameter for a potential cluster and comparing the updated generalized metric parameter with the generalized metric parameter.

EFFECT: increase in the efficiency of document clustering and reduction in the loss of bandwidth of communication channels.

32 cl, 6 dwg

Similar patents RU2757592C1

Title Year Author Number
SYSTEM AND METHOD OF FORMING TRAINING SET FOR MACHINE LEARNING ALGORITHM 2017
  • Lakhman Konstantin Viktorovich
  • Chigorin Aleksandr Aleksandrovich
  • Yurchenko Viktor Sergeevich
RU2711125C2
METHOD AND SYSTEM FOR GENERATING AN OBJECT CARD 2018
  • Akulov Yaroslav Viktorovich
RU2739554C1
METHOD AND SYSTEM FOR GENERATING PUSH-NOTIFICATIONS ASSOCIATED WITH DIGITAL NEWS 2018
  • Akulov Yaroslav Victorovich
RU2731654C1
SYSTEM AND METHOD FOR CONTROL AND ORGANISATION OF WEB-BROWSER CACHE FOR OFFLINE BROWSING 2014
  • Dodonov Alexey Vladimirovich
  • Kpasichkov Ievgen Viktorovich
RU2608668C2
METHOD AND SYSTEM FOR DETECTING ABNORMAL VISITS TO WEBSITES 2019
  • Cherkasov Dmitry Aleksandrovich
  • Anisimov Aleksandr Vladimirovich
  • Gankin Grigory Mikhailovich
RU2775824C2
METHOD AND SYSTEM FOR CREATING ANNOTATION VECTORS FOR DOCUMENT 2017
  • Gusakov Aleksey Yurievich
  • Drozdovsky Andrey Dmitrievich
  • Duzhik Valery Ivanovich
  • Kalinin Pavel Vladimirovich
  • Naydin Oleg Pavlovich
  • Safronov Aleksandr Valerievich
RU2720074C2
OPTIMIZED BROWSER REPRODUCTION PROCESS 2014
  • Men Bipin
  • Istkhem Majkl
  • Syuj Khoj
  • Chzhou Syaobo
RU2638726C1
METHOD AND SYSTEM FOR DETERMINING FACT OF USER VISITING A POINT OF INTEREST 2020
  • Shishkin Aleksandr Leonidovich
  • Goltsman Irina Anatolevna
  • Petrov Danil Vadimovich
  • Shaposhnikov Denis Evgenevich
RU2767958C2
METHOD AND SERVER OF DEFINING THE ORIGINAL REFERENCE TO THE ORIGINAL OBJECT 2016
  • Borisova Tatyana Sergeevna
  • Zhivotvorev Dmitrij Sergeevich
RU2660593C2
OPTIMIZED BROWSER PLAYBACK PROCESS 2017
  • Meng, Biping
  • Eastham, Michael
  • Xu, Hui
  • Zhou, Xiaobo
RU2756482C2

RU 2 757 592 C1

Authors

Shagraev Aleksey Galimovich

Dates

2021-10-19Published

2019-02-08Filed