CN112463387B - Method for identifying deep learning model on local server based on GPU space-time resource consumption - Google Patents

Method for identifying deep learning model on local server based on GPU space-time resource consumption

Info

Publication number
CN112463387B
CN112463387B (application CN202011427759.XA)
Authority
CN
China
Prior art keywords
deep learning
data
local server
gpu
learning model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011427759.XA
Other languages
Chinese (zh)
Other versions
CN112463387A (en)
Inventor
宣琦
刘壮壮
朱振强
朱城超
郑俊杰
陈作辉
张鑫辉
翔云
邱君瀚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University of Technology ZJUT
Original Assignee
Zhejiang University of Technology ZJUT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University of Technology ZJUT filed Critical Zhejiang University of Technology ZJUT
Priority to CN202011427759.XA priority Critical patent/CN112463387B/en
Publication of CN112463387A publication Critical patent/CN112463387A/en
Application granted granted Critical
Publication of CN112463387B publication Critical patent/CN112463387B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
    • G06F9/5044Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering hardware capabilities
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5011Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resources being hardware resources other than CPUs, Servers and Terminals
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/047Probabilistic or stochastic networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/049Temporal neural networks, e.g. delay elements, oscillating neurons or pulsed inputs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Abstract

The method for identifying a deep learning model on a local server based on GPU space-time resource consumption comprises the following steps: S1, building an experiment platform for collecting utilization rate and power consumption data of the local server GPU; S2, running various common deep learning models on the local server using open-source deep learning frameworks such as TensorFlow and PyTorch; S3, capturing the utilization rate and power consumption data of the GPU in real time while the local server runs the deep learning models; S4, processing the acquired data; and S5, constructing a convolutional neural network to classify the acquired data and presenting the test results as a confusion matrix. The final results show that, when a deep learning model runs, the GPU utilization rate and power consumption of the local server are correlated with the model running inside it, and by analyzing this correlation a great deal of useful information about the model can be obtained.

Description

Method for identifying deep learning model on local server based on GPU space-time resource consumption
Technical Field
The invention relates to the field of deep learning model identification and is mainly applied to the security of deep learning models, in particular to a method for identifying a deep learning model on a local server based on GPU space-time resource consumption.
Background
Side-channel attacks are an attack technique based on side-channel information. Side-channel information refers to information in an encryption device other than the explicit information directly related to the ciphertext, such as the device's GPU utilization, GPU memory occupancy, power consumption, electromagnetic radiation and time consumption. As artificial intelligence penetrates military, civilian and other industries, the security of artificial intelligence and the associated attack and defense techniques receive more and more attention. Traditional security analysis of artificial intelligence stays at the software level: perturbations are added to the input of a deep learning model so that the model output is misclassified, i.e., adversarial examples are generated. According to the degree of knowledge about the model, such attacks are generally divided into black-box attacks, in which little or no model information is known, and white-box attacks, in which the model architecture, training data, model weights and the like are known. Clearly, the more that is known about the model, the more threatening the attack; white-box attacks are generally more effective than black-box attacks. Any additional model information therefore benefits the attacker. In fact, while a model is running, an attacker can analyze the space-time resource consumption of the device at the hardware level to obtain part of the model information; such analyzed information is collectively referred to as side-channel information, and cracking the model through side-channel information is called a side-channel attack.
Running deep learning applications on servers, such as image recognition, signal recognition and network classification, is very common. Because running a deep learning model requires extremely high computing power, a common application scenario is to deploy the model on a server and transmit the model's inputs and outputs over a local network or the Internet. A large portion of the deep learning models that provide artificial intelligence services to the market are therefore deployed on servers. For this type of model, the information of the running model cannot be obtained directly from the server, but the model's usage of the server's hardware resources can be observed and then used to determine which model is running. For a local server, the side-channel information available to an attacker includes the CPU cache, data transmission time and the like; in the prior art, deep learning model information has been stolen by exploiting the CPU cache. The invention provides a method for identifying a deep learning model in a server by acquiring the utilization rate and power consumption of the GPU, which identifies different deep learning models through simple practical operation.
Disclosure of Invention
In order to overcome the defect of the prior art that traditional attack methods can hardly identify the class of a deep learning model in a local server, the invention provides a method for directly identifying the deep learning model in the local server through the utilization rate and power consumption information of the GPU.
The technical conception of the invention is as follows: experiments show that when a deep learning model runs, the GPU utilization rate and power consumption of the local server are correlated with the model running inside it; for the same input, the more complex the model's parameters, the higher the GPU utilization rate and power consumption, and vice versa. By analyzing this correlation, information about the deep learning model in the local server can be deduced from the GPU utilization rate and power consumption, allowing an attacker to turn a black-box attack on the local server into a gray-box or even white-box attack and thus improve the attack's success rate.
The technical scheme adopted by the invention for achieving the aim of the invention is as follows:
A method for identifying a deep learning model on a local server based on GPU space-time resource consumption comprises the following steps:
S1, building an experiment platform for collecting utilization rate and power consumption data of the local server GPU;
S2, running various common deep learning models on the local server using open-source deep learning frameworks such as TensorFlow and PyTorch;
S3, capturing the utilization rate and power consumption data of the GPU in real time while the local server runs the deep learning models;
S4, processing the acquired data;
S5, constructing a convolutional neural network to classify the acquired data, and presenting the test results as a confusion matrix.
Further, the step S3 specifically includes: acquiring, in real time, the utilization rate and power consumption data of the GPU while different deep learning models run, where the inputs of the different deep learning models can be one-dimensional time-series data, two-dimensional image data or complex network data.
Further, the step S4 specifically includes:
S4.1, performing linear normalization on the acquired data; the purpose of normalization is to limit the acquired data to the range 0 to 1, which speeds up the gradient-descent search for the optimal solution and improves recognition accuracy, the formula being as follows:
x' = (x - min(x)) / (max(x) - min(x))
where x is the original acquired data, min(x) is the minimum value of the original data, max(x) is the maximum value of the original data, and x' is the normalized value of the original data.
S4.2, converting the normalized data into 512×512 two-dimensional gray-scale pictures, which fully exploits the advantages of the convolutional neural network in image feature extraction and reduces computational complexity; each picture is then marked with its corresponding label so that the data can be directly input into the convolutional neural network for training and testing.
Further, the step S5 specifically includes: the convolutional neural network comprises four convolutional layers; 3×3 and 1×1 convolution kernels are used to compress the number of model parameters; the number of feature channels is doubled after each pooling operation to preserve the features as completely as possible; the ReLU nonlinear activation function is used to reduce the probability of neuron death and of overfitting during training; and the softmax activation function turns the output of the neural network into a probability distribution, making classification more accurate.
The beneficial effects of the invention are as follows:
(1) The local server platform is reasonably utilized; deployment is simple, data acquisition is convenient, and analysis is easy.
(2) The invention shows that when a deep learning model runs, the GPU utilization rate and power consumption of the local server are correlated with the model running inside it, and much useful information about the model can be obtained by analyzing this correlation.
(3) The method for identifying the deep learning model in the local server can change a traditional adversarial-example attack from a black-box attack into a gray-box or even white-box attack, remarkably improving attack accuracy.
(4) The method of converting one-dimensional time-series data into two-dimensional images fully exploits the advantages of the convolutional neural network in image feature extraction, reduces computational complexity, and provides an approach to the difficulty of processing one-dimensional time-series data.
(5) The convolutional neural network algorithm achieves a good classification effect on the one-dimensional time-series data converted into images.
Drawings
FIG. 1 is a flow chart of the method of the present invention;
FIG. 2(a) is a visualization of the raw GPU utilization data, and FIG. 2(b) is a visualization of the normalized GPU utilization data;
FIG. 3(a) is a visualization of the raw GPU power consumption data, and FIG. 3(b) is a visualization of the normalized GPU power consumption data;
FIG. 4 is the gray-scale image converted from the normalized GPU utilization data;
FIG. 5 is the gray-scale image converted from the normalized GPU power consumption data;
FIG. 6 shows the classification results of the image recognition models, whose input is two-dimensional image data;
FIG. 7 shows the classification results of the signal recognition models, whose input is one-dimensional time-series data;
FIG. 8 shows the classification results of the node classification models, whose input is complex network data.
Detailed Description
The following describes the embodiments of the present invention in further detail with reference to the drawings.
Referring to FIG. 1 to FIG. 8, a method for identifying a deep learning model on a local server based on GPU space-time resource consumption includes the following steps:
S1, building an experimental platform for collecting utilization rate and power consumption data of the local server GPU, the platform being specifically as follows:
the local server is configured with an Intel i7-6700 CPU and an NVIDIA GTX TITAN X (Pascal) GPU;
S2, running various common deep learning models on the local server using open-source deep learning frameworks such as TensorFlow and PyTorch, specifically as follows:
the image recognition models run on the server are alexnet, mobilenetv1, mobilenetv2, inceptionv3, resnetv1 and resnetv2; the signal recognition models are fx_crmr, nin, alexnet, fx_resnet and lstm; the node classification models are demo_net, gat, gcn, graphsage, h-gcn and mixhop;
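For illustration, the image recognition models can be exercised with repeated inference while the sampler of step S3 below records the GPU trace. The sketch below is only an assumed driver: the torchvision constructors stand in for the model variants named above, and the dummy 224×224 input, batch size and iteration count are not specified by the patent.

    import torch
    import torchvision.models as models

    # hypothetical victim set; torchvision equivalents stand in for the variants named above
    victim_models = {
        "alexnet": models.alexnet(),
        "mobilenet_v2": models.mobilenet_v2(),
        "resnet50": models.resnet50(),
    }

    def run_inference(name, batch=16, steps=200, device="cuda"):
        """Repeatedly run inference so the GPU utilization and power trace can be recorded."""
        net = victim_models[name].to(device).eval()
        x = torch.randn(batch, 3, 224, 224, device=device)   # dummy two-dimensional image input
        with torch.no_grad():
            for _ in range(steps):
                net(x)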
s3: capturing the utilization rate and power consumption data of the GPU in real time in the process of running the deep learning model by the local server; the method specifically comprises the following steps:
the python program package pynvml and psutil can be combined to collect the utilization rate and power consumption data of the GPU when different deep learning models run in real time, wherein the input of the signal recognition model is one-dimensional time sequence data, the input of the image recognition model is two-dimensional image data and the input of the node classification model is complex network data;
s4, processing the acquired data; the method specifically comprises the following steps:
s4.1, carrying out linear normalization processing on the acquired data, as shown in fig. 2 and 3, wherein the purpose of normalization is to limit the acquired data between 0 and 1, so that the speed of gradient descent to obtain the optimal solution is increased, and the recognition accuracy is improved, wherein the formula is as follows:
wherein x is the minimum value in the original data acquired by min (x), the maximum value in the original data acquired by max (x), and the value after normalization of the x' original data;
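A minimal sketch of this normalization, together with the conversion to 512×512 gray-scale pictures described in step S4.2 below; zero-padding or truncating each trace to 512×512 samples and the use of Pillow for saving are assumptions, since the patent does not state how traces of other lengths are handled.

    import numpy as np
    from PIL import Image

    def trace_to_image(trace, size=512):
        """Normalize a 1-D GPU trace to [0, 1] and reshape it into a size x size grayscale image."""
        x = np.asarray(trace, dtype=np.float32)
        x_norm = (x - x.min()) / (x.max() - x.min() + 1e-12)   # x' = (x - min(x)) / (max(x) - min(x))
        flat = np.zeros(size * size, dtype=np.float32)
        n = min(len(x_norm), size * size)
        flat[:n] = x_norm[:n]                                  # zero-pad or truncate to size*size samples
        img = (flat.reshape(size, size) * 255).astype(np.uint8)
        return Image.fromarray(img, mode="L")                  # 8-bit grayscale picture

For example, trace_to_image(gpu_util_trace).save("alexnet_util_0001.png") could store one sample, with the model name in the file name acting as the class label.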
s4.2, converting normalized data into 512 grayscale images shown in figures 4 and 5, fully playing the advantages of the convolutional neural network in terms of image feature extraction, reducing the computational complexity, and then marking each image with a corresponding label, so that the images can be directly input into the convolutional neural network for training and testing;
s5, constructing a convolutional neural network to classify the acquired data, and presenting a test result in a confusion matrix mode; the method specifically comprises the following steps:
s5.1, the convolutional neural network comprises four convolutional layers, the number of model parameters is compressed by using 3 multiplied by 3 and 1 multiplied by 1 convolution kernels, the number of characteristic channels is doubled after each maximum pooling operation to keep the integrity of the characteristics as much as possible, a relu nonlinear activation function is used to reduce the probability of death and overfitting of neurons in training, and the softmax activation function enables the output of the neural network to become probability distribution, so that classification is more accurate;
s5.2, inputting the processed data into a convolutional neural network, training the training set, and outputting a classification result by a test set through a confusion matrix, wherein the number represents the classification precision such as 0.96 represents 96% accuracy, FIG. 6 is the classification result of the image recognition model, FIG. 7 is the classification result of the signal recognition model, and FIG. 8 is the classification result of the node classification model.
S6, by combining the model classification results with image-model, signal-model and node-model attack methods, targeted attacks can be carried out on the deep learning model running on the local server, thereby improving attack accuracy.
The embodiments described in the present specification are merely examples of implementation forms of the inventive concept, and the scope of protection of the present invention should not be construed as being limited to the specific forms set forth in the embodiments; the scope of protection of the present invention also extends to equivalent technical means that those skilled in the art can conceive on the basis of the inventive concept.

Claims (1)

1. A method for identifying a deep learning model on a local server based on GPU space-time resource consumption, comprising the steps of:
s1, building an experiment platform for collecting utilization rate and power consumption data of a local server GPU;
S2, running deep learning models on a local server by using the TensorFlow and PyTorch open-source deep learning frameworks;
S3, capturing the utilization rate and power consumption data of the GPU in real time while the local server runs the deep learning models, specifically comprising the following steps:
the method comprises the steps that the utilization rate and the power consumption data of the GPU when different deep learning models run are collected in real time, and input signals of the different deep learning models are one-dimensional time sequence data, two-dimensional image data and complex network data; s4, processing the acquired data; the method specifically comprises the following steps:
s4.1, carrying out linear normalization processing on the acquired data, wherein the formula is as follows:
wherein x is the minimum value in the original data acquired by min (x), the maximum value in the original data acquired by max (x), and the value after normalization of the x' original data;
s4.2, converting the normalized data into 5125×12 gray-scale pictures, and marking each picture with a corresponding label;
s5, constructing a convolutional neural network to classify the acquired data, and presenting the test result in a confusion matrix mode specifically comprises the following steps:
the convolutional neural network comprises four convolutional layers, the convolutional kernels are 3 multiplied by 3 and 1 multiplied by 1, a maximum pooling layer is added behind each convolutional layer, two fully-connected layers are used behind a first fully-connected layer, a relu activation function is used behind a second fully-connected layer, a softmax activation function is used behind a second fully-connected layer, and a prediction result is output.
CN202011427759.XA 2020-12-07 2020-12-07 Method for identifying deep learning model on local server based on GPU space-time resource consumption Active CN112463387B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011427759.XA CN112463387B (en) 2020-12-07 2020-12-07 Method for identifying deep learning model on local server based on GPU space-time resource consumption

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011427759.XA CN112463387B (en) 2020-12-07 2020-12-07 Method for identifying deep learning model on local server based on GPU space-time resource consumption

Publications (2)

Publication Number Publication Date
CN112463387A CN112463387A (en) 2021-03-09
CN112463387B (en) 2024-03-29

Family

ID=74800370

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011427759.XA Active CN112463387B (en) 2020-12-07 2020-12-07 Method for identifying deep learning model on local server based on GPU space-time resource consumption

Country Status (1)

Country Link
CN (1) CN112463387B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113676311A (en) * 2021-07-05 2021-11-19 浙江工业大学 Method and system for obtaining deep learning model structure based on side channel information

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11055139B2 (en) * 2018-06-12 2021-07-06 International Business Machines Corporation Smart accelerator allocation and reclamation for deep learning jobs in a computing cluster

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110048827A (en) * 2019-04-15 2019-07-23 电子科技大学 A kind of class template attack method based on deep learning convolutional neural networks
CN111291860A (en) * 2020-01-13 2020-06-16 哈尔滨工程大学 Anomaly detection method based on convolutional neural network feature compression
CN111401567A (en) * 2020-03-20 2020-07-10 厦门渊亭信息科技有限公司 Universal deep learning hyper-parameter optimization method and device

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Digit Recognition Based on Deep Learning on the TensorFlow Platform; Jin Tao; Zhang Yongai; Information Technology and Network Security; 2018-04-10 (No. 04); full text *

Also Published As

Publication number Publication date
CN112463387A (en) 2021-03-09

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant