METHOD AND APPARATUS FOR AUTOMATICALLY SUMMARISING CONTENTS OF ELECTRONIC DOCUMENTS Russian patent published in 2016 - IPC G06F17/22 

Abstract RU 2595594 C2

FIELD: information technology.

SUBSTANCE: invention relates to means of summarising an electronic document. Method comprises generating a feature vector for an electronic document, wherein feature vector comprises a plurality of features of electronic document. Weight coefficient is assigned to each of plurality of features. Summarisability core is assigned to electronic document to be summarised in accordance with weight coefficient assigned to each of plurality of features, wherein summarisability score indicates whether electronic document is summarisable. Method then includes determining if electronic document is summarisable. Electronic document is split into a plurality of parts, wherein each of plurality of parts is associated with a respective length, corresponding to an informativeness score, and a respective coherence score. Method then includes automatically selecting a subset of plurality of parts, such that an aggregate informativeness score of subset is maximised, while an aggregate length of subset is less than or equal to a maximum length. Subset is then arranged as a summary of electronic document.

EFFECT: technical result is improved relevance for finding documents.

23 cl, 7 dwg

Similar patents RU2595594C2

Title Year Author Number
RECOVERY OF TEXT ANNOTATIONS RELATED TO INFORMATION OBJECTS 2017
  • Bulgakov Ilya Aleksandrovich
  • Indenbom Evgenij Mikhajlovich
RU2665261C1
RETRIEVAL OF INFORMATION OBJECTS USING A COMBINATION OF CLASSIFIERS ANALYZING LOCAL AND NON-LOCAL SIGNS 2018
  • Indenbom Evgenij Mikhajlovich
RU2686000C1
ANNOTATION BY MEANS OF SEARCHING 2007
  • Chzhan Lej
  • Van Sin'-Tszin
  • Tszin Fehn
  • Ma Vehj-In
RU2439686C2
CLASSIFIER TRAINING USED FOR EXTRACTING INFORMATION FROM TEXTS IN NATURAL LANGUAGE 2018
  • Matskevich Stepan Evgenevich
  • Bulgakov Ilya Aleksandrovich
RU2681356C1
METHOD AND SYSTEM FOR STORING AND SEARCHING INFORMATION EXTRACTED FROM TEXT DOCUMENTS 2015
  • Matskevich Stepan Evgenievich
RU2605077C2
ANNOTATION IDENTIFICATION TO IMAGE DESCRIPTION 2015
  • Li Majkl Chun-Chi
RU2699416C2
EXTRACTING INFORMATION OBJECTS WITH THE HELP OF A CLASSIFIER COMBINATION 2017
  • Matskevich Stepan Evgenevich
  • Starostin Anatolij Sergeevich
  • Sukhodolov Dmitrij Andreevich
RU2679988C1
AUTOMATIC DETECTION AND EXTRACTION OF PREVIOUS ANNOTATIONS, RELEVANT FOR IMAGING STUDY, FOR EFFICIENT VIEWING AND REPORTING 2013
  • Mabotuvana Tkhusitkha Danandzhaya De Silva
  • Tsyan Yuechen
  • Sevenster Merlejn
  • Mankovich Gebriel Rajan
RU2640009C2
CLASSIFICATION OF DOCUMENTS BY LEVELS OF CONFIDENTIALITY 2019
  • Zyuzin Andrej Andreevich
  • Uskova Olesya Vladimirovna
RU2732850C1
SYSTEM AND METHOD FOR SYNCHRONIZING INTERACTIONS BETWEEN SEVERAL SOFTWARE CLIENTS IN MEETING WITH NOTARY 2018
  • Jenkins, Alexander James
  • Kinsel, Patrick A.
  • Pase, Adam
RU2772345C2

RU 2 595 594 C2

Authors

Mani Inderdzhit

Siurana Eudzhenio

D'Alojsio-Montilla Nikolas

Suonson Bart K.

Dates

2016-08-27Published

2012-09-11Filed