METHOD AND SYSTEM FOR CLASSIFYING AND FILTERING PROHIBITED CONTENT IN A NETWORK Russian patent published in 2020 - IPC G06F16/35 G06F21/00 G06N20/20 

Abstract RU 2738335 C1

FIELD: computer equipment.

SUBSTANCE: method of classifying and filtering content in a network, performed on a computing device comprising at least a processor and memory, which comprises instructions for executing a preparatory step, on which a collection of HTML documents is formed, wherein collection is formed so that each of documents included in it can be related to different classes of content; converting obtained from previous step data from HTML document into text; generating a token matrix for training the ensemble of classifiers; based on the produced token matrix, creating an ensemble of classifiers, comprising at least four classifiers, wherein for each classifier a decision priority is predetermined; a working step of obtaining a URL and downloading an associated HTML document; converting HTML document into text; generating a vector of pure tokens for the classifier ensemble; starting the ensemble of classifiers trained at the preparatory stage; outputting an analysis result comprising a result of classifying content contained in the received document; content is filtered based on the obtained class.

EFFECT: technical result consists in improvement of accuracy of classification and filtration of prohibited content in a network.

12 cl, 6 dwg

Similar patents RU2738335C1

Title Year Author Number
METHOD FOR ATTRIBUTION OF PARTIALLY STRUCTURED TEXTS FOR FORMATION OF NORMATIVE-REFERENCE INFORMATION 2020
  • Fedosin Sergei Alekseevich
  • Plotnikova Natalia Pavlovna
  • Martynov Vladislav Aleksandrovich
  • Ryskin Konstantin Eduardovich
  • Kuznetsov Dmitrii Aleksandrovich
  • Deniskin Aleksandr Vladimirovich
  • Vechkanova Iuliia Sergeevna
  • Fediushkin Nikolai Alekseevich
  • Tsilikov Nikita Sergeevich
RU2750852C1
METHOD FOR TEXTUAL INFORMATION RECOGNITION AND ITS INTEGRITY EVALUATION IN INTERNET ELECTRONIC DOCUMENTS 2013
  • Molchanov Artem Nikolaevich
  • Skurnovich Aleksej Valentinovich
  • Stel'Makh Ehduard Petrovich
  • Molchanov Il'Ja Nikolaevich
RU2550543C1
METHOD AND SYSTEM FOR EXTRACTING NAMED ENTITIES 2021
  • Vodolazskij Daniil Ivanovich
  • Gladkikh Prokhor Vladimirovich
  • Sorokin Semen Aleksandrovich
  • Cherkasov Roman Vladislavovich
  • Gazizov Kuat
RU2823914C2
RETRIEVAL OF INFORMATION OBJECTS USING A COMBINATION OF CLASSIFIERS ANALYZING LOCAL AND NON-LOCAL SIGNS 2018
  • Indenbom Evgenij Mikhajlovich
RU2686000C1
METHOD AND SYSTEM FOR STATIC ANALYSIS OF EXECUTABLE FILES BASED ON PREDICTIVE MODELS 2020
  • Prudkovskij Nikolaj Sergeevich
RU2759087C1
METHOD AND SYSTEM FOR ARRANGING DIALOGUE WITH USER IN USER-FRIENDLY CHANNEL 2018
  • Kuznetsov Nikita Aleksandrovich
  • Kiryanov Denis Pavlovich
  • Chernopyatov Andrej Sergeevich
  • Domanskaya Kristina Sergeevna
RU2688758C1
ESG-RATING WORD PROCESSING SYSTEM 2023
  • Lapina Vera Vladimirovna
  • Mylnikov Leonid Aleksandrovich
  • Storchevoj Maksim Anatolevich
RU2825081C1
METHOD OF DETERMINING PROFILE OF MOBILE DEVICE USER ON MOBILE DEVICE ITSELF AND DEMOGRAPHIC PROFILING SYSTEM 2016
  • Yoo Jaebong
  • Kryzhanovskiy Konstantin Alexandrovich
  • Podoynitsina Lyubov Vladimirovna
  • Romanenko Alexander Alexandrovich
  • Polubotko Dmitry Valerievich
  • Kazantsev Alexey Yurievich
  • Moiseenko Andrey Konstantinovich
  • Maslennikov Mstislav Vladimirovich
RU2647661C1
METHOD AND SERVER FOR PROCESSING TEXT SEQUENCE IN MACHINE PROCESSING TASK 2020
  • Yemelyanenko Dmitry Viktorovich
  • Provilkov Ivan Sergeevich
  • Voyta Elena Aleksandrovna
RU2775820C2
AUTOMATED LEGAL ADVICE SYSTEM CONTROL METHOD 2019
  • Prikhodko Olga Viktorovna
  • Khyurri Ruslan Vladimirovich
  • Prikhodko Olga Viktorovna
RU2718978C1

RU 2 738 335 C1

Authors

Prudkovskij Nikolaj Sergeevich

Dates

2020-12-11Published

2020-05-12Filed