FIELD: data processing.
SUBSTANCE: invention relates to a method and a system for generating synthetic data. Method comprises obtaining an original data sample containing information on data subjects, and at least one set of data associated with said data subjects; classifying information contained in the data sample to identify attributes related to sensitive data (SD) and attributes not related to SD; at least one synthetic data generator is trained using machine learning algorithms based on: data sample; values of SD classes obtained at the previous stage; metadata about data for training and primary and foreign keys, linking information about data subjects and data set; at least one set of synthetic data is generated by means of a trained generator, in which the set of attributes of the data subjects differs from the sets of attributes of the data subjects contained in the original data sample, with preservation of primary and foreign keys containing information on the data subjects.
EFFECT: possibility of generating synthetic data, the structure of which corresponds to the original data, while ensuring confidentiality of sensitive data.
10 cl, 7 dwg, 10 tbl
Title | Year | Author | Number |
---|---|---|---|
METHOD FOR IMAGE GENERATION BASED ON USER PREFERENCE ANALYSIS | 2023 |
|
RU2812413C1 |
SYSTEM FOR RECOVERY OF ROCK SAMPLE THREE-DIMENSIONAL STRUCTURE | 2018 |
|
RU2718409C1 |
METHOD OF TRANSMITTING MOTION OF A SUBJECT FROM A VIDEO TO AN ANIMATED CHARACTER | 2019 |
|
RU2708027C1 |
METHOD FOR OBTAINING LOW-DIMENSIONAL NUMERIC REPRESENTATIONS OF SEQUENCES OF EVENTS | 2020 |
|
RU2741742C1 |
METHOD AND SYSTEM FOR AUTOMATIC GENERATION OF A PROGRAM CODE FOR AN ENTERPRISE DATA WAREHOUSE | 2017 |
|
RU2683690C1 |
TRAINING GAN (GENERATIVE ADVERSARIAL NETWORKS) TO CREATE PIXEL-BY-PIXEL ANNOTATION | 2019 |
|
RU2735148C1 |
METHOD AND SYSTEM FOR PARAPHRASING TEXT | 2023 |
|
RU2814808C1 |
TWO-MODE IMAGING INCLUDING QUALITY METRICS | 2011 |
|
RU2589383C2 |
METHOD FOR BUILDING SYNTHETIC CT IMAGES BASED ON MRI IMAGE DATA | 2020 |
|
RU2778112C2 |
METHOD FOR GENERATING THREE-DIMENSIONAL POINT CLOUDS | 2020 |
|
RU2745445C1 |
Authors
Dates
2024-08-08—Published
2023-10-18—Filed