FIELD: physics.
SUBSTANCE: invention relates to method for uniform distribution of data when forming data marts on server equipment. Method includes steps of: allocating the recorded information, which is characterized by the number of lines and the occupied space in the files on the disk in the file system on the server equipment; determining the number of rows and the size of the data mart in bytes that it occupies in files on a disk in a file system on server equipment; redistributing the rows from the data mart source files, forming new files, each of which contains no more than the specified number of rows, calculated according to the expression, where the maximum number of lines per file is equal to the ratio of the sum of the total number of lines in the data mart, multiplied by the target block size in bytes of each generated file, and the size of the disk space occupied by the mart in bytes, minus one, to the size of the disk space occupied by the mart in bytes, with dropping of the fractional part, wherein target size in bytes of each generated file is equal to set constant value for each corresponding separate cluster; or redistributing lines from the data mart source files to form new files, wherein the value of the number of generated files is equal to the ratio of the sum of the size of the data mart in bytes, which it occupies in files on a disk in the file system, and the target size in bytes of each generated file minus one, to the target size in bytes of each generated file, with dropping of the fractional part; based on the obtained values of the number of lines in the file or the number of files, performing the obtained sets of lines uniform recording on discs in the file system on the server equipment.
EFFECT: providing uniform distribution of data when creating data marts on server equipment.
2 cl, 6 dwg
Title | Year | Author | Number |
---|---|---|---|
METHOD AND SYSTEM FOR FORMING PARTITIONED DATA MARTS CONTAINING GEODATA AND THEIR USE IN PROCESS OF DATA STORAGE OPERATION | 2023 |
|
RU2811359C1 |
METHOD AND SYSTEM FOR MANAGING METADATA IN HIGH-LOAD CLOUD ENVIRONMENTS | 2024 |
|
RU2829567C1 |
METHOD OF PROCESSING DATA IN HYBRID STORAGE | 2023 |
|
RU2831216C1 |
DYNAMIC REAL-TIME FILE SYSTEMS COMPATIBILITY DETERMINATION | 2020 |
|
RU2808634C1 |
METHOD OF CONSTRUCTING A DISTRIBUTED INFORMATION SYSTEM | 2018 |
|
RU2699683C1 |
METHOD OF RECORDING INFORMATION ON WRITE-ONCE OPTICAL DISCS ARRANGED IN HYBRID STRUCTURES | 2024 |
|
RU2839304C1 |
DEVICE FOR EDITING, METHOD FOR EDITING AND DATA CARRIER | 2000 |
|
RU2263954C2 |
METHOD AND SYSTEM FOR AUTOMATED GENERATION AND FILLING OF DATA MARTS USING DECLARATION DESCRIPTION | 2022 |
|
RU2795902C1 |
TRANSPARENT RECOVERY AFTER FAILURE | 2012 |
|
RU2595903C2 |
METHOD OF PROTECTING AVAILABILITY AND SECURITY OF STORED DATA AND SYSTEM FOR ADJUSTABLE PROTECTION OF STORED DATA | 2014 |
|
RU2584755C2 |
Authors
Dates
2025-05-21—Published
2024-10-24—Filed