METHOD OF EXTRACTING USEFUL CONTENT FROM MOBILE APPLICATION SETUP FILES FOR FURTHER COMPUTER DATA PROCESSING, PARTICULARLY SEARCH Russian patent published in 2015 - IPC G06F17/00 

Abstract RU 2568276 C2

FIELD: physics, computer engineering.

SUBSTANCE: invention relates to digital data processing using computer systems, particularly to methods of processing data, particularly meant for special-purpose functions and mobile applications. A method of extracting useful content from setup files of mobile applications for further computer data processing comprises steps of downloading, from the Internet onto a server, an application setup file which is always a some zip-file; selecting an archive extract utility therefor; in case of successful selection of an archive extract utility, unzipping the downloaded setup file into a directory with files; analysing the obtained directory, making a list of files contained therein; selecting a file for further analysis from the list; selecting software for reading the file by searching all known formats; in case of successful selection of software for reading the file, analysing the selected file for search of primary content; creating a list of internal addresses of the primary content in the form of a set of lines; moving to analysis of the next file until there are files in the directory; performing analysis of the text content of the list of internal addresses of the primary content and dividing the text of each line into a set of characters which identify a method of storing the corresponding unit of content, a set of characters which identifies a document to which said unit of content relates, and a set of characters which identifies the type of said unit of content; dividing the lines of internal addresses of the unit of content based on the storage method into secondary content and useful content; deleting the secondary content; selecting on the remaining list groups of lines with internal addresses of units of content having groups of characters with completely matching position and text, which reflect the content storage method; performing statistical filtering of the selected groups; performing analysis of the text content of the lines of the list of addresses on the set of characters identifying the document, and selecting groups of addresses of units of content relating to each document of the useful content of the application; downloading, from the application, useful content relating to each document into a separate file, thereby creating application documents; indexing the obtained application document files for association therewith, thereby creating a description of the content thereof; storing, in a database, the name of the application, a link to the application and the description of the application; downloading the setup file of a new application and repeating all of the described sequences; performing computer processing of the obtained database; storing the created indexable database array on a server; using for search queries of users received via the Internet.

EFFECT: automatic extraction of useful content from setup files of mobile applications for further indexing, computer data processing and storage of the useful content of mobile applications in a database on a server for further search.

13 cl, 2 dwg

Similar patents RU2568276C2

Title Year Author Number
METHOD OF MANAGING WEB SITE DATA 2018
  • German Mikhail Sergeevich
RU2691834C1
CHECKING METHOD OF WEB PAGES FOR CONTENT IN THEM OF TARGET AUDIO AND/OR VIDEO (AV) CONTENT OF REAL TIME 2013
  • Orel Denis Olegovich
  • Fomichev Aleksej Nikolaevich
RU2530671C1
MACHINE-SENSIBLE INFORMATION PROCESSING METHOD 2016
  • Vasilev Vladimir Yaroslavovich
  • Zilberman Mark Sholemovich
  • Kiselev Oleg Mikhajlovich
RU2625936C1
SEARCH INDEX FORMAT OPTIMISATION 2009
  • Khassanov Raif
  • Merrigan Chehdd Krejton
  • Petriuk Mikhaj
  • Kokhan Artem Ivanovich
RU2503058C2
GENERATION OF BROWSER SUGGESTIONS BASED ON INTERNET OF THINGS DEVICE DATA 2015
  • Patten Michael J.
  • Kapadia Ritika
RU2711057C2
METHOD, SYSTEM AND COMPUTER DEVICE FOR PROVIDING COMMUNICATION SERVICES BETWEEN RESOURCES IN COMMUNICATION NETWORKS AND INTERNET TO PERFORM TRANSACTIONS 2002
  • Serebrennikov Oleg Aleksandrovich
RU2273107C2
METHOD OF GENERATING AND USING RECURSIVE INDEX OF SEARCH ENGINES 2011
  • Serebrennikov Oleg Aleksandrovich
RU2459242C1
SYSTEM AND METHOD TO COMBINE PASSIVE AND ACTIVE MODES 2008
  • Got'E Ehrik
  • Ljubbers Villem
  • Zherar Fransua
RU2454820C2
DELEGATED MANAGEMENT OF DISTRIBUTED RESOURCES 2004
  • Gochiman Chiprian
RU2360368C2
SYSTEM AND METHOD FOR COLLECTING INFORMATION FOR DETECTING PHISHING 2016
  • Volkov Dmitrij Aleksandrovich
RU2671991C2

RU 2 568 276 C2

Authors

Nagornyj Aleksej Sergeevich

Dates

2015-11-20Published

2014-01-24Filed