FIELD: physics.
SUBSTANCE: invention relates to the field of deep learning, in particular to the method and device for compilation of neural networks. Method is executed by at least one processor and comprises steps of: a) obtaining a neural network model and transforming it into a unified format representing a directed graph, where the graph vertices are operations of the neural network model, b) determining the capabilities of the neural network model operations on the main or auxiliary computing device, c) separating the directed graph into a main subgraph and preprocessing and postprocessing subgraphs, d) generating a sequence of starting operations of the neural network model based on the interaction of the main subgraph sections with the preprocessing and postprocessing subgraphs, e) performing the operations of the neural network model of the main subgraph to optimize their execution, resulting in an optimized unified format, f) converting the optimized unified format into a high-level dialect, g) converting a high-level dialect into a common level dialect, h) converting a common level dialect into a low-level dialect, i) generating a binary data stream based on a low-level dialect and splitting the binary data stream into separate instructions for parallel processing on hardware units of the main computing device, j) executing instructions obtained in step i) on a main computing device, and performing operations from preprocessing and postprocessing subgraphs on an auxiliary computing device based on a start-up sequence generated at step d).
EFFECT: high efficiency of the neural network compilation process.
13 cl, 2 dwg
Title | Year | Author | Number |
---|---|---|---|
TRAFFIC CLASSIFICATION SYSTEM | 2018 |
|
RU2697648C2 |
MARKUP LANGUAGE AND OBJECT MODEL FOR VECTOR GRAPHICS | 2003 |
|
RU2321892C2 |
ARTIFICIAL NEURAL NETWORKS COMPRESSION SYSTEM BASED ON ITERATIVE APPLICATION OF TENSOR APPROXIMATIONS | 2019 |
|
RU2734579C1 |
METHOD AND APPARATUS FOR JOINT DEBAYERING AND IMAGE NOISE ELIMINATION USING A NEURAL NETWORK | 2020 |
|
RU2764395C1 |
METHOD FOR CONSTRUCTING PROCESSORS FOR OUTPUT IN CONVOLUTIONAL NEURAL NETWORKS BASED ON DATA-FLOW COMPUTING | 2020 |
|
RU2732201C1 |
METHOD AND SYSTEM FOR MODIFYING PROGRAM CODE | 2023 |
|
RU2824522C1 |
SYSTEM AND METHOD FOR STATIC ANALYSIS OF EXECUTABLE BINARY CODE AND SOURCE CODE USING FUZZY LOGIC | 2021 |
|
RU2783152C1 |
METHOD AND SYSTEM FOR EXECUTING PROGRAM CODE USING NEURAL NETWORKS | 2024 |
|
RU2834580C1 |
METHOD OF DIVIDING CONVOLUTION OPERATION ON DEVICE WITH MATRIX MULTIPLIERS BASED ON SYSTOLIC ARRAYS | 2024 |
|
RU2830039C1 |
METHOD AND SYSTEM FOR EXECUTING PROGRAM CODE | 2024 |
|
RU2834855C1 |
Authors
Dates
2025-03-03—Published
2024-11-22—Filed