SEARCH TABLES UNDERSTANDING Russian patent published in 2018 - IPC G06F17/30 G06F7/02 

Abstract RU 2671047 C2

FIELD: computer equipment.

SUBSTANCE: invention relates to computer engineering. Technical result is achieved by selecting a predetermined number of columns from the table as subject candidate columns, each candidate column is potentially suitable for the correct subject table column, with each subject candidate column including a plurality of values; for each subject candidate column: determining the joint occurrence for values in the subject candidate column, including determining how often the values in the subject candidate column also occur in the correct subject columns in a variety of other tables, calculating an estimate for the subject candidate column based on the said determined collaborative occurrence, the computed estimate showing the likelihood that the subject candidate column is the correct subject column; and classifying the subject candidate column as one of the correct subject table column and a non-proprietary column of the table based on the calculated score for the subject candidate column.

EFFECT: technical result consists in increasing the efficiency of detecting one or more subject table columns.

33 cl, 11 dwg

Similar patents RU2671047C2

Title Year Author Number
EXTRACTING INFORMATION FROM STRUCTURED DOCUMENTS CONTAINING TEXT IN NATURAL LANGUAGE 2015
  • Danielyan Tatiana Vladimirovna
  • Bulgakov Ilya Aleksandrovich
RU2607976C1
SYSTEM AND METHOD FOR SELECTING RELEVANT PAGE ITEMS WITH IMPLICITLY SPECIFYING COORDINATES FOR IDENTIFYING AND VIEWING RELEVANT INFORMATION 2015
  • Tsyplyaev Maksim Viktorovich
  • Vinokurov Nikita Alekseevich
RU2708790C2
METHOD AND SYSTEM OF SEMANTIC PROCESSING TEXT DOCUMENTS 2016
  • Mitelkov Dmitrij Vladimirovich
  • Novikov Andrej Yurevich
  • Satin Boris Borisovich
RU2630427C2
CONSTRUCTING QUERIES FOR EXECUTION OVER MULTI-DIMENSIONAL DATA STRUCTURES 2014
  • Khyuz Gregori
  • Koulson Majkl Dzh.
  • St-Sir Aleksandr Tristan
  • Mokhamud Fajsal
  • Palmer-Boroski Tereza
  • Shiperski Klemens
  • Dumitru Marius
RU2679977C1
METHOD AND SYSTEM FOR STORING AND SEARCHING INFORMATION EXTRACTED FROM TEXT DOCUMENTS 2015
  • Matskevich Stepan Evgenievich
RU2605077C2
RECOVERY OF TEXT ANNOTATIONS RELATED TO INFORMATION OBJECTS 2017
  • Bulgakov Ilya Aleksandrovich
  • Indenbom Evgenij Mikhajlovich
RU2665261C1
METHOD OF DETECTING TRAINING DATA FOR MACHINE LEARNING OF COMPUTER SYSTEM OF INDUSTRIAL INTERNET OF THINGS POWERED BY RECHARGEABLE BATTERY 2023
  • Grebeshkov Aleksandr Iurevich
  • Batyrshina Iana Aleksandrovna
RU2819568C1
LONG-TERM STORAGE OF TYPES AND COPIES OF NET DATA 2005
  • Kakivaja Gopala Krishna K.R.
  • Dani Savitri N.
RU2400803C2
METHOD AND SYSTEM FOR AUTOMATIC LEGAL DECISION-MAKING 2019
  • Karpets Mikhail Valerevich
  • Nakipov Iskander Nailevich
  • Denisov Ilya Vyacheslavovich
  • Emelyanov Yaroslav Igorevich
  • Volkova Olga Sergeevna
  • Novikov Mikhail Yurevich
  • Kuznetsov Maksim Viktorovich
  • Burlakova Marina Valerievna
  • Krylova Darya Andreevna
  • Klykov Gleb Igorevich
  • Shulga Sergej Aleksandrovich
RU2732071C1
METHODS AND SYSTEMS FOR CONVERTING MATRIXES BASED ON SPARSE VECTORS 2019
  • Maxwell, Evan
  • Barnard, Leland
  • Yadav, Ashish
  • Staples, Jeffrey
  • Reid, Jeffrey
  • Habegger, Lukas
RU2764557C1

RU 2 671 047 C2

Authors

Wang, Zhongyuan

Zoryn, Kanstantsyn

Chen, Zhimin

Chakrabarti, Kaushik

Finnigan, James P.

Narasayya, Vivek R.

Chaudhuri, Surajit

Ganjam, Kris

Dates

2018-10-29Published

2014-06-30Filed