FIELD: computer technology.
SUBSTANCE: invention relates to the field of computer technology, namely to a method for compressing genome sequence data. The method of compressing genomic sequence data includes obtaining a reading record by one or more computers, determining by one or more computers the correspondence of the reading record to the reading that is accurately mapped to the reference sequence or inaccurately mapped to the reference sequence, based on the determination by one or more computers that the reading record corresponds to the reading that is inaccurately mapped to the reference sequence, determination by one or more computers of how much the number of inconsistencies of an inaccurately matched reading satisfies a given threshold the number of nonconformities, and based on the determination that the number of nonconformities satisfies a given threshold number of nonconformities (i) obtaining by one or more computers an offset relative to the previous nonconformity that is less than the maximum encoded offset value, and (ii) encoding by one or more computers of each nonconformity of an inaccurately matched reading and an offset relative to the previous nonconformity in a record size 1 byte.
EFFECT: technical result is aimed at reducing data compression losses.
31 cl, 4 dwg,
Title | Year | Author | Number |
---|---|---|---|
METHOD OF COMPRESSING GENOME SEQUENCE DATA | 2020 |
|
RU2815860C1 |
METHODS OF ENCODING AND DECODING INFORMATION | 2017 |
|
RU2659025C1 |
GENOMIC INFRASTRUCTURE FOR LOCAL AND CLOUD PROCESSING AND ANALYSIS OF DNA AND RNA | 2017 |
|
RU2804029C2 |
METHOD FOR CODING VIDEO DATA SIGNAL FOR USE WITH MULTIDIMENSIONAL VISUALIZATION DEVICE | 2014 |
|
RU2667605C2 |
DEVICE AND METHOD FOR ACCELERATION OF COMPRESSION AND DECOMPRESSION OPERATIONS | 2014 |
|
RU2629440C2 |
ENTROPY CODER FOR IMAGE COMPRESSION | 2011 |
|
RU2575679C2 |
GENOMIC INFRASTRUCTURE FOR LOCAL AND CLOUD PROCESSING AND ANALYSIS OF DNA AND RNA | 2017 |
|
RU2761066C2 |
METHOD OF GROUP CODING OF RASTER-TYPE DATA STREAM | 2004 |
|
RU2350035C2 |
SEARCH INDEX FORMAT OPTIMISATION | 2009 |
|
RU2503058C2 |
METHOD AND DEVICE FOR VIDEO ENCODING BASED ON LONG-TERM REFERENCE FRAME, COMPUTING DEVICE AND DATA CARRIER | 2021 |
|
RU2799709C1 |
Authors
Dates
2023-11-15—Published
2020-09-11—Filed