FIELD: biotechnology.
SUBSTANCE: method for identifying somatic mutations in a plurality of variants is described, including: obtaining a plurality of variants containing somatic mutations and germline variants; applying a database filter to a specified set of options. The following is carried out: determination of the first germline variants in the specified set of variants, and each of the specified first germline variants has a number of alleles in the first reference set of variants, which is greater than or equal to the threshold number of alleles. Next, a proximity filter is applied to the specified set of variants, including: grouping variants from the specified set of variants into a plurality of groups, with variants located in the same region of the genome being grouped into the same group, identifying variants from the database in the specified set of variants wherein the variant from the database is present in the second reference set of variants, and identifying second germline variants in said plurality of variants, wherein each of said second germline variants has an allele frequency within the approximate allele frequency range of the at least one variant from the database in the same group as the specified second germline variant. Somatic mutations in said plurality of variants are then determined by removing said identified first and second germline variants from said plurality of variants. A method for determining tumour mutational burden for a tumour is also disclosed, comprising: obtaining sequence data from a biological sample containing a tumour cell; determining a plurality of variants based on said sequence data; and determining the number of somatic mutations in said plurality of variants in accordance with the above method, wherein said number of somatic mutations represents the tumour mutational burden of the tumour. A computerized method for identifying somatic mutations in a variety of variants is also described.
EFFECT: invention expands the arsenal of tumour diagnostic tools.
42 cl, 11 dwg, 3 tbl, 5 ex
Title | Year | Author | Number |
---|---|---|---|
CALCULATION OF THE BURDEN OF TUMOUR MUTATIONS USING TUMOUR RNA SEQUENCING DATA AND CONTROLLED MACHINE LEARNING | 2020 |
|
RU2759205C1 |
SYSTEM AND METHOD OF INTERPRETING ALLELES USING GRAPH-BASED REFERENCE GENOME | 2019 |
|
RU2809124C2 |
A DEEP LEARNING FRAME FOR IDENTIFYING SEQUENCE PATTERNS THAT CAUSE SEQUENCE SPECIFIC ERRORS (SSE) | 2019 |
|
RU2745733C1 |
INDIVIDUALIZED ANTI-TUMOR VACCINES | 2012 |
|
RU2779946C2 |
INDIVIDUALISED VACCINES FOR CANCER | 2012 |
|
RU2670745C9 |
PREDICTION OF IMMUNOGENICITY OF T-CELL EPITOPES | 2014 |
|
RU2724370C2 |
METHODS FOR TRAINING DEEP CONVOLUTIONAL NEURAL NETWORKS BASED ON DEEP LEARNING | 2018 |
|
RU2767337C2 |
METHODS AND SYSTEMS FOR OBTAINING SETS OF UNIQUE MOLECULAR INDICES WITH HETEROGENEOUS LENGTH OF MOLECULES AND CORRECTING ERRORS THEREIN | 2018 |
|
RU2766198C2 |
IDENTIFICATION, PRODUCTION AND USE OF NEOANTIGENS | 2016 |
|
RU2729116C2 |
SPLICING SITES CLASSIFICATION BASED ON DEEP LEARNING | 2018 |
|
RU2780442C2 |
Authors
Dates
2024-02-14—Published
2019-10-30—Filed