METHOD AND SYSTEM FOR CLASSIFYING DISPLAY PAGES USING SUMMARIES Russian patent published in 2009 - IPC G06F17/30 

Abstract RU 2377645 C2

FIELD: physics; computer engineering.

SUBSTANCE: invention relates to means of classifying information. A system for classifying web pages uses a web page summarisation system to generate summaries of web pages. The summary of a web page may include sentences of the web page which are most closely related to the primary topic of the web page. The summarisation system may combine advantages of multiple summarisation techniques to identify sentences of a web page that represent the primary topic of the web page. Once the summary is generated, the classification system can apply conventional classification techniques to the summary to classify the web page. The classification system can use conventional classification techniques such as a simplified Bayesian classifier or a support vector technique to identify the classifications of a web page based on the summary generated by the summarisation system.

EFFECT: increased reliability of processed information.

66 cl, 8 dwg

Similar patents RU2377645C2

Title Year Author Number
METHOD AND SYSTEM FOR IDENTIFYING IMAGE RELATEDNESS USING LINK ANALYSIS AND PAGE LAYOUT 2005
  • Tsaj Dehn
  • Vehn' Tszi-Zhun
  • Ma Vehj-In
  • Kheh Sjaofehj
RU2390833C2
CHECKING RELEVANCE BETWEEN KEY WORDS AND WEBSITE CONTENT 2005
  • Chzhan Behn'Juj
  • Tszehn Khua-Tszjun'
  • Li Li
  • Nadzhm Tarek
  • Ma Vehj-In
  • Li In
  • Chehn' Chzhehn
RU2375747C2
METHOD AND SYSTEM FOR CALCULATING UNIT SIGNIFICANCE VALUE IN DISPLAY PAGE 2005
  • Lju Khaj
  • Vehn' Tszi-Zhun
  • Sun Zhujkhua
  • Ma Vehj-In
RU2387004C2
OFFERING ALLIED TERMS FOR MULTISEMANTIC INQUIRY 2005
  • Chzhan Behn'Juj
  • Tszehn Khua-Tszjun'
  • Li Li
  • Nadzhm Tarek
  • Ma Vehj-In
  • Li In
  • Chehn' Chzhehn
RU2393533C2
METHOD OF CONSTRUCTING SEMANTIC MODEL OF DOCUMENT 2011
  • Turdakov Denis Jur'Evich
  • Nedumov Jaroslav Rostislavovich
  • Sysoev Andrej Anatol'Evich
RU2487403C1
METHOD OF POSITIONING TEXT IN KNOWLEDGE SPACE BASED ON ONTOLOGY SET 2009
  • Anshukov Sergej Aleksandrovich
  • Bardin Valerij Vladimirovich
RU2476927C2
METHOD AND SYSTEM FOR COORDINATING WEB DATABASE SCHEMES 2005
  • Vehn' Tszi-Zhun
  • Ma Vehj-In
RU2386997C2
THEMATIC MODELS WITH A PRIORI TONALITY PARAMETERS BASED ON DISTRIBUTED REPRESENTATIONS 2018
  • Tutubalina Elena Viktorovna
  • Nikolenko Sergey Igorevich
RU2719463C1
BROWSING IMAGES THROUGH INTELLECTUALLY ANALYZED HYPERLINKED FRAGMENTS OF TEXT 2014
  • Bejker Sajmon Dzhon
  • Kannan Anitkha
  • Ramnatkh Krishnan
RU2696305C2
RECRUITMENT SYSTEM USING MACHINE LEARNING AND DOWNSIZING OF MULTIDIMENSIONAL DATA AND A METHOD FOR RECRUITING PERSONNEL USING MACHINE LEARNING AND LOWERING THE DIMENSION OF MULTIDIMENSIONAL DATA 2019
  • Danshchin Georgii Andreevich
  • Reushkin Viktor Viktorovich
  • Sidorov Aleksandr Alekseevich
RU2711717C1

RU 2 377 645 C2

Authors

Chzhan Behn'Juj

Shehn' Do

Tszehn Khua-Tszjun'

Ma Vehj-In

Chehn' Chzhehn

Dates

2009-12-27Published

2005-04-29Filed