CN113807598A

CN113807598A - SVM heavy oil pipeline corrosion rate prediction method based on PSO-WOA hybrid optimization

Info

Publication number: CN113807598A
Application number: CN202111127589.8A
Authority: CN
Inventors: 谢飞; 苏禹铭; 王丹; 孙东旭
Original assignee: Liaoning Shihua University; Shenyang University of Technology
Current assignee: Liaoning Shihua University; Shenyang University of Technology
Priority date: 2021-09-26
Filing date: 2021-09-26
Publication date: 2021-12-17

Abstract

The invention relates to a PSO-WOA hybrid optimization-based SVM thick oil pipeline corrosion rate prediction method, which uses target thick oil pipeline detection data as a data set, optimizes parameters of a Support Vector Machine (SVM) by combining a PSO (particle swarm optimization) -WOA (whale) hybrid optimization algorithm, and an established model is applied to the fields of thick oil pipeline steel corrosion rate prediction and corrosion protection. The method disclosed by the invention combines the advantages of optimal solution searching by PSO and local optimal WOA jumping out, optimizes the model parameters, improves the prediction precision of the model under a limited data set, and provides reliable reference for the operation and maintenance of the heavy oil pipeline.

Description

SVM heavy oil pipeline corrosion rate prediction method based on PSO-WOA hybrid optimization

Technical Field

The invention relates to a PSO (particle swarm) -WOA (whale) hybrid optimization-based thick oil pipeline corrosion rate prediction method based on a Support Vector Machine (SVM), which is applied to the field of corrosion and protection of oil and gas pipelines.

Background

The prediction of the corrosion rate of the buried pipeline is always a hot point of research of scholars at home and abroad, and the prediction of the corrosion rate has great significance on the basic design of the pipeline and the related safety, environmental and economic consequences.

Because various different environmental parameters which can directly influence and predict a pipeline corrosion rate model are quite complex, and the corrosion rate and various related parameters in a medium system are always in a nonlinear relationship with each other, the purpose of predicting the metal corrosion speed by using the established model is possible to be realized under the condition that accurate corrosion rate and related parameter data can be obtained, so that some scholars begin to establish a corrosion rate prediction model by using a neural network. After a neural network prediction model is developed for a period of time, how to improve the prediction performance of the model and better establish the prediction model suitable for pipeline corrosion becomes a technical problem which is always eagerly solved in the field.

Disclosure of Invention

The invention aims to provide a PSO-WOA hybrid optimization-based SVM heavy oil pipeline corrosion rate prediction method, which integrates the advantages of two algorithms, optimizes model parameters and improves the prediction accuracy of the model.

In order to achieve the purpose, the technical scheme of the invention is as follows: a corrosion rate prediction method for an SVM heavy oil pipeline based on PSO-WOA hybrid optimization comprises the following steps:

s1, selecting environment factors which are representative and relevant to corrosion behaviors in the operation process of the thick oil pipeline as input variables;

s2, detecting relevant parameters along the pipeline on site by taking the thick oil pipeline as a research object, and sorting detection data to serve as a model data set;

s3, selecting 22 groups of sample data as training samples, establishing a thick oil pipeline corrosion rate prediction model, using 8 groups of sample data as test samples, and evaluating the optimized model precision;

s4, establishing a corrosion rate prediction model of the SVM heavy oil pipeline based on PSO-WOA hybrid optimization;

and step S5, evaluating the model prediction effect according to the relevant evaluation indexes.

In an embodiment of the present invention, the representative and relevant environmental factors of the corrosion behavior of the thick oil pipeline in step S1 include soil resistivity, water content, and Cl along the pipeline^-Content, potential gradient, oxidation-reduction potential, pH value.

In an embodiment of the present invention, in step S4, an SVM is used as a basic neural network model for thick oil pipeline corrosion rate prediction, and the specific implementation steps are as follows:

(1) the basic principle of SVM is through a non-linear mapping function

Mapping an input sample x to a high-dimensional feature space F, creating a linear regression function according to the principle of minimizing the structural risk, converting each group of data in a training sample into the high-dimensional space, and performing linear regression on data in the high-dimensional space, so that the nonlinear fitting problem of an original sample is converted into the linear regression problem of the sample in the high-dimensional space, wherein the obtained fitting function is as follows:

y′＝f(x)＝w·Φ(x)+b (1)

wherein x is the input vector of the sample, w is the weight vector, w belongs to R, b is the deviation vector of the sample, b belongs to R, y' is the predicted value;

when solving the regression fitting problem by SVM, fitting errors are considered and allowed, and a linear insensitive loss function (epsilon) is introduced on the basis of the support vector machine to obtain a regression support vector machine, thus, the regression problem is converted into a solution of an objective function minimizing the structural risk, i.e., values of w and b of the objective function are minimized,

whereinC is greater than 0, which is a penalty factor, and controls the penalty degree of each sample to exceed the error epsilon, and relaxation factors ζ i and ζ i' are introduced to reduce the error generated in the training process, wherein the relaxation factors are respectively

The upper limit and the lower limit of the sample training error under the condition of (1) represent a degree of deviation of the sample from an outlier or from a boundary, epsilon refers to a parameter of a linear insensitive loss function, and when the absolute error of a predicted value and a measured value is less than epsilon, the absolute error is ignored; otherwise, it will be included in the error;

(2) introducing Lagrange multiplier a when solving equation_iAnd b_iThe Lagrange function is constructed, the function expression of the problem is regarded as a convex quadratic programming problem, so that the parameter partial derivatives can be solved respectively, and the dual is an initial formula

Is converted into

On the one hand, they are approximate, and on the other hand, w and b can be solved first, and for lambda, lambda has a value if and only if the data falls on two imaginary lines; for other samples, λ does not work, i.e. λ only works on the support vector, the problem solution is simplified; therefore, after the dual problem is converted, convex optimization is simplified into maximization of a quadratic form, and a saddle point of a Lagrangian equation can be solved; determining and setting the partial derivative of each variable to zero; after introducing lagrangian and using dual principle, convert to:

(3) when solving the non-linear problem by means of SVM, the input samples x are passed through a non-linear mapping function

Is mapped onto a high-dimensional feature space F, and then a linear regression of the high-dimensional feature space is performed, the transformation from the low-dimensional space to the high-dimensional space being by a kernel function

The function can not replace dot products in a high-dimensional space, and the use of a nonlinear mapping function is avoided

The calculation time and complexity are greatly reduced; the Radial basis function is used, as follows:

where g is the width of the kernel function, i.e. the kernel parameter (g > 0), x_i-x_jIs Euclidean norm, and after introducing kernel parameters, optimization target conversion is as follows:

solving the SVM nonlinear regression model function as follows:

x₁，x₂，...，x_nis an input variable, a_i-b_iAre network weights and each output vector is a linear transformation of the intermediate node support vector with respect to f (x).

In the step S4, the parameters of the SVM model are optimized by using a PSO-WOA hybrid optimization method, and the specific implementation steps are as follows:

(1) the steps of optimizing the penalty factor C and the kernel parameter g through the PSO algorithm are as follows:

initialization: randomly assigning values to the speed and position of the particles, and learningHabit factor c₁And c₂Carrying out assignment to obtain a penalty factor C and a kernel parameter g of the SVM;

and (3) fitness evaluation: calculating fitness function values of all particles, and initializing a local optimal value and a global optimal value;

and (3) updating: updating the velocity and position of the particle to obtain a new population, comparing the fitness value with its own historical optimum, and updating the global optimum of the population parameters C and g, the velocity (v) of the particle and its current position (CB) being updated according to equations (7) and (8),

v^t+1＝wv^t+c₁r₁(Pbest^t-CB^t)+c₂r₂(Gbest^t-CB^t) (7)

CB^t+1＝CB^t+v^t (8)

where t denotes the number of iterations, r₁And r₂Is [0,1 ]]Two random variables of (1), c₁And c₂Is the learning coefficient, Pbest is the particle optimal position, and Gbest is the particle swarm optimal position;

(2) the specific steps of the WOA optimization algorithm are as follows:

given a random number p ∈ (0,1), if p <0.5 and | A | <1, we randomly find prey, the artificial whale algorithm uses random individual positions in its population for food finding, and updates their spatial positions using the following formula:

X_t+1＝X_rand-A·D (9)

t denotes the number of iterations that have been performed, the individual position is denoted by X, and the individual X is randomly selected before the position is updated_randThe total length of (2) is represented by D ═ C · X_rand-X_tI represents that the parameter A is in [ -2,2]Generated randomly, and in addition, random individual X_randEffect on distance of present Individual X Using C as [0,2 ]]Random number representation;

if p <0.5 and | A | is >1, continue to wrap around the prey,

after the artificial whale has searched for food, the spatial location is updated with the following equation:

X_t+1＝X_best-A·|C·X_best-X_t| (10)

wherein the position of the food corresponds to X_bestThe location of the globally optimal individual in the population;

if p is more than or equal to 0.5, carrying out spiral predation;

the characteristics of the motion track of the artificial whale and the logarithmic spiral are consistent, and the artificial whale can swim to the optimal individual X_bestIt also updates its spatial location:

X_t+1＝D_best·e^bl·cos2πl+X_t (11)

after iterative update with X_t+1X representing whale position, single X before position update_bestLength of (D) ═ X_best-X_tI calculation, constant of spiral track forming is expressed by b, and is in [ -1,1 [)]Assigning a value to l randomly;

substituting the optimized model parameters into an SVM model to calculate an adaptive value, wherein a fitness calculation formula of an objective function when the WOA is combined with the SVM to carry out regression prediction is as follows:

where M is the number of samples, y_i、y_iSetting a WOA optimization target as c and g parameters of an SVM model and taking an obtained optimal solution as an SVM model parameter, wherein the WOA optimization target is a true value and a model prediction value;

(3) a new solution aiming at the organic fusion of the PSO of the WOA in the exploration stage and the development stage;

the initial population is started and updated by the PSO, and then the obtained solution is updated again by the WOA,

v^t+1＝wv^t+c₁r₁(Whalebest^t-CB^t)+c₂r₂(Gbest^t-CB^t) (13)

as shown in formula (13), the WOA optimization result is fused into a PSO mathematical model formula (7), the particle swarm position is updated according to a new equation, the optimal position of the particle responsible for solving the optimal solution of the complex nonlinear problem is replaced by a while position, the related decision variables are updated, the solution is efficiently guided to the optimal solution, the WOA enables the particle to reach the optimal value more quickly, the calculation time is reduced, and finally, if the determined iteration number is reached, the developed PSO-WOA optimization process is ended.

In an embodiment of the present invention, the penalty factor C obtained after the hybrid optimization is 5.65, and the kernel parameter g is 0.5.

Compared with the prior art, the invention has the following beneficial effects: the hybrid optimization algorithm disclosed by the invention integrates the advantages of the PSO in the development stage and the WOA algorithm in the exploration stage, optimizes the model parameters by using the hybrid optimization algorithm, combines main detection parameters influencing the corrosion rate prediction of the thick oil pipeline, and improves the accuracy of model prediction under the condition of limited field detection data quantity. The number of items to be detected is reduced, and the demand for data acquisition is reduced on the premise of ensuring the prediction precision. A corrosion rate prediction model which is more suitable for the thick oil pipeline and has more excellent prediction performance is established, and effective reference is provided for thick oil pipeline corrosion rate research.

And (3) taking the target thick oil pipeline detection data as a data set, and performing a hybrid optimization SVM pipeline external corrosion rate prediction model of particle swarm optimization and whale optimization algorithm (PSO-WOA). WOA is used in the exploration phase, which is primarily the ability of the algorithm to try a large number of possible solutions because it uses logarithmic spiral paths and therefore covers a large uncertain search space with less computation time. The particle swarm algorithm has the capability of obtaining a near-optimal solution, and is an algorithm for mining an optimal solution from an unknown search space, so that a local optimal solution is provided. And comparing the model performance with the related evaluation indexes of other optimization models to verify the advantages and reliability of the model performance. The method is applied to the thick oil pipeline corrosion rate prediction model, combines the advantages of the PSO and the WOA optimization algorithm, optimizes the parameters of the model, and improves the prediction accuracy of the model.

Drawings

Fig. 1 is a diagram of an SVM neural network.

FIG. 2 is a flow chart of prediction of a PSO-WOA hybrid optimization SVM model.

FIG. 3 is a comparison graph of the predicted results of four model test sets.

FIG. 4 is a comparison graph of the prediction errors of the four model test sets.

Detailed Description

The invention provides a PSO-WOA hybrid optimization-based SVM heavy oil pipeline corrosion rate prediction method, which integrates the advantages of two algorithms, optimizes model parameters and improves the prediction accuracy of the model. The method is realized as follows:

s1, selecting representative and relevant environmental factors of corrosion behavior in the running process of the thick oil pipeline, and taking soil resistivity, water content, C1-content, potential gradient, oxidation-reduction potential and pH value along the pipeline as input variables;

In the step S4, the concrete implementation steps of using the SVM as the thick oil pipeline corrosion rate prediction basic neural network model are as follows:

(1) the basic principle of SVM is through a non-linear mapping function

The input samples x are mapped onto a high-dimensional feature space F. The linear regression function is created according to the principle of minimizing the structural risk, each group of data in the training sample is transformed into a high-dimensional space, and the data is processed in the high-dimensional spaceAnd (4) performing linear regression, so that the problem of nonlinear fitting of the original sample is converted into the problem of linear regression of the sample in a high-dimensional space. The fitting function obtained is:

y′＝f(x)＝w·Φ(x)+b (1)

where x is the input vector of samples, w is the weight vector, w ∈ R, b is the bias vector of samples, b ∈ R, y' is the predictor.

When solving the regression fitting problem by SVM, fitting errors are taken into account and allowed. And introducing a linear insensitive loss function (epsilon) on the basis of the support vector machine to obtain a regression support vector machine. Thus, the regression problem is converted into a solution to the objective function that minimizes the risk of the structure, i.e. minimizes the values of w and b of the objective function.

Where C > 0, which is a penalty factor. It controls the punishment of each sample to exceed the error epsilon. In order to reduce the error generated in the training process, relaxation factors ζ i and ζ i' are introduced, wherein the relaxation factors are respectively

The upper and lower limits of the sample training error under (1) indicate the degree to which the sample deviates from the outlier or from the boundary. ε refers to the parameter of the linear insensitive loss function. When the absolute error between the predicted value and the measured value is less than epsilon, the absolute error is ignored; otherwise, it will be included in the error.

(2) Introducing Lagrange multiplier a when solving equation_iAnd b_iTo construct the lagrangian function. The functional expression of the problem is regarded as a convex quadratic programming problem, so that the parameters can be solved by partial derivatives respectively. Dual is an initial formula

Is converted into

One is their approximation and the other is such that w, b can be solved first, and for λ, λ has a value if and only if the data falls on the two imaginary lines. For other samples, λ does not work, i.e., λ only works on the support vectors, the problem solution is simplified. Thus after converting it to a dual problem, convex optimization is simplified to a quadratic form of maximization and the saddle points of the lagrange equation can be solved. The partial derivative of each variable is determined and set to zero. After introducing lagrangian and using dual principle, convert to:

Is mapped onto the high-dimensional feature space F. Then, a linear regression of the high-dimensional feature space is performed. The transformation from the low-dimensional space to the high-dimensional space is performed by a kernel function

To be realized. The function can not replace dot products in a high-dimensional space, and the use of a nonlinear mapping function is avoided

And greatly reduces the calculation time and complexity. There are four common SVM kernel functions, and the present invention employs a Radial basis function, as follows:

where g is the width of the kernel function, i.e. the kernel parameter (g > 0), x_i-x_jIs the euclidean norm. After nuclear parameters are introduced, optimization targeting is converted into:

solving the SVM nonlinear regression model function as follows:

the structure diagram of the regression SVM is shown in FIG. 1. In fig. 1, each intermediate node corresponds to a support vector. x is the number of₁，x₂，...，x_nIs an input variable, a_i-b_iAre network weights and each output vector is a linear transformation of the intermediate node support vector with respect to f (x). The kernel function setting of the SVM model has great influence on the prediction result, but the kernel function is selected in no specified setting mode, so the method also comprises the step of using an optimization algorithm to find the parameters and the penalty factors of the optimal kernel function.

The method also comprises the step of optimizing the parameters of the SVM model by using a PSO-WOA mixed optimization method, and the specific implementation steps are as follows:

(1) the particle swarm optimization technology is developed for simulating the behaviors of animals and birds to find a global optimization function, the algorithm can optimize a penalty factor C and a kernel parameter g, and has the characteristics of simple principle, only need of a few calculation parameters, execution of quick search and high efficiency. The algorithm can display the dynamic particle tracking of the current search state and adjust the search strategy to achieve the best search. The steps of optimizing the penalty factor C and the kernel parameter g through the PSO algorithm are as follows:

initialization: randomly assigning the speed and position of the particle and learning factor c₁And c₂And assigning to obtain a penalty factor C and a kernel parameter g of the SVM.

And (3) fitness evaluation: and calculating the fitness function values of all the particles, and initializing the local optimal value and the global optimal value.

And (3) updating: and updating the speed and the position of the particles to obtain a new population, comparing the fitness value with the historical optimal value of the fitness value, and updating the global optimal values of the population parameters C and g. Essentially, the particle velocity (v) and its current position (CB) are updated according to equations (7) and (8).

v^t+1＝wv^t+c₁r₁(Pbest^t-CB^t)+c₂r₂(Gbest^t-CB^t) (7)

CB^t+1＝CB^t+v^t (8)

Where t denotes the number of iterations, r₁And r₂Is [0,1 ]]Two random variables of (1), c₁And c₂Is the learning coefficient, Pbest is the particle optimal position, and Gbest is the particle swarm optimal position.

(2) The WOA algorithm is inspired by unique foraging behavior of the whale of the sitter and is a new meta-heuristic system optimization design algorithm. The location update behavior of the WOA algorithm consists of three phases: in the feeding stage, the feeding mode is to use random individual positions in the population; shrinking the periphery and updating the spatial position; spiral predation, when artificial whales swim to the best individual X_bestWhen this happens, it also follows the trajectory of the logarithmic spiral and updates its spatial position again. The specific steps of the WOA optimization algorithm are as follows:

given a random number p ∈ (0,1), if p <0.5 and | A | <1, look for prey randomly. The artificial whale algorithm uses random individual positions in its population for food finding and updates their spatial positions using the following formula:

X_t+1＝X_rand-A·D (9)

t denotes the number of iterations that have been performed, the individual position is denoted by X, and the individual X is randomly selected before the position is updated_randThe total length of (2) is represented by D ═ C · X_rand-X_tAnd | represents. Parameter A is [ -2,2]And randomly generating. In addition, random individuals X_randEffect on distance of present Individual X Using C as [0,2 ]]And (4) random number representation.

If p <0.5 and | A | >1, continue to wrap around the prey.

X_t+1＝X_best-A·|C·X_best-X_t| (10)

wherein the position of the food corresponds to X_bestThe location of the globally optimal individual in the population.

If p is more than or equal to 0.5, spiral predation is carried out.

X_t+1＝D_best·e^bl·cos2πl+X_t (11)

after iterative update with X_t+1X representing whale position, single X before position update_bestLength of (D) ═ X_best-X_tI calculation, constant of spiral track forming is expressed by b, and is in [ -1,1 [)]And assigning a value to the l randomly.

And substituting the optimized model parameters into the SVM model to calculate an adaptive value. The fitness calculation formula of the target function when the WOA is combined with the SVM to carry out regression prediction is as follows:

where M is the number of samples, y_i、y_i' true value and model predicted value. Setting the WOA optimization target as c and g parameters of the SVM model, and taking the obtained optimal solution as SVM model parameters. The whale optimization algorithm has the advantages that the special mechanism that the prey position is randomly selected is benefited, the trouble that the algorithm is trapped in a local optimal solution is avoided, and the target searching capability is further enhanced.

(3) The hybrid optimization algorithm used in the invention is a new solution aiming at organic fusion of the PSOs of the WOA in the exploration stage and the development stage. The particle swarm optimization algorithm is an algorithm for mining an optimal solution from an unknown search space, but is easy to fall into local stagnation, the exploration capability of WOA jumping out of local optimization is strong, the optimal solution and the optimal solution are combined with optimal characteristics (the exploration of WOA and the utilization of PSO), the capability of finding the optimal solution of the problem is improved, and the problem of falling into local stagnation or local optimization is avoided.

Since both algorithms are randomization techniques, we use an uncertain search space from the start to the maximum iteration ceiling during the iteration process. The initial population is started and updated by the PSO and then the obtained solution is updated again by the WOA.

v^t+1＝wv^t+c₁r₁(Whalebest^t-CB^t)+c₂r₂(Gbest^t-CB^t) (13)

As shown in formula (13), the WOA optimization result is fused into the PSO mathematical model formula (7), and the particle swarm position is updated according to the new formula. The optimal position of the particles responsible for solving the optimal solution of the complex nonlinear problem is replaced by a while position, and related decision variables are updated, so that the solution can be efficiently guided to the optimal solution. WOA brings the particles to the optimum value faster, reducing computation time. Finally, if the determined number of iterations is reached, the developed PSO-WOA optimization process is ended. The model optimization process is shown in flow chart 2.

The method comprises the following steps of establishing a PSO-WOA hybrid optimization SVM thick oil pipeline corrosion rate prediction model, establishing GA-SVM, PSO-SVM and WOA-SVM thick oil pipeline corrosion rate prediction models, and comparing prediction effects of various methods, wherein the prediction models comprise:

selecting soil resistivity, water content and Cl^-And six environmental factors of content, potential gradient, oxidation-reduction potential and pH value are used as input variables of the thick oil pipeline corrosion rate prediction model. And (3) taking the target thick oil pipeline as a research object, and obtaining 30 groups of detection data through data detected on site. 22 groups of data are selected as training data, and 8 groups of data are selected as test data, and are used for testing the prediction effect of the model and carrying out comparative analysis with other models.

Model training: firstly, a thick oil pipeline corrosion rate prediction model based on PSO-WOA hybrid optimization SVM is established, and meanwhile, a GA-SVM, PSO-SVM and WOA-SVM thick oil pipeline corrosion rate prediction model is established for comparison of prediction effects. The test set output results of the trained model are shown in fig. 3, and the prediction error comparison graph of the test set output results is shown in fig. 4.

And (4) analyzing a prediction result:

and (3) evaluating the prediction performance by using a regression decision coefficient and combining five evaluation indexes of mean square error MSE, mean absolute error MAE, mean absolute percentage error MAPE, root mean square error RMSE, residual square and SSE, wherein the five indexes are respectively positioned as follows:

mean square error MSE:

mean absolute error MAE:

mean absolute percent error MAPE:

root mean square error RMSE:

residual sum of squares SSE:

where n is the parameter variable value function of the test model in a prediction data unit group in the prediction data set, y is_i' true predicted values, y, respectively, of the test model, which are verified by the true predictions_iFor testing model samples to be predictedAnd (5) true prediction value.

According to the evaluation indexes, the comparison of the predicted effect performance of the four models is obtained, and is shown in table 1.

As can be seen from fig. 3, fig. 4 and table 1, the prediction performance of the PSO-WOA hybrid optimization SVM is improved compared with the other three algorithms, the average absolute errors of the GA-SVM, the PSO-SVM, the WOA-SVM and the PSO-WOA-SVM are respectively 0.0251, 0.0155, 0.0114 and 0.0055, the average absolute error of the PSO-WOA-SVM is lower than those of the other three optimization algorithms, and the average absolute percentage errors MAPE are respectively 7.88%, 4.76%, 3.64% and 1.71%, and the RMSE is respectively 0.0289, 0.0174, 0.0131 and 0.0057 by further analyzing other prediction indexes. The prediction index of the PSO-WOA-SVM can be found to be improved in an all-around manner compared with the other three algorithms, and the prediction performance of the PSO-WOA-SVM model is proved to be improved in comparison with the other three algorithms. Compared with the other three models, the method has the advantages that the prediction stability is realized, and the error of corrosion rate prediction in the pipeline operation risk evaluation is reduced.

The matters not described in detail in the present specification belong to the prior art known to those skilled in the art, and the above embodiments are only for illustrating the present invention and not for limiting the present invention. Although the related embodiments of the present invention and the accompanying drawings are disclosed for illustrative purposes, those skilled in the art will appreciate that; various substitutions, changes, modifications and the like are possible without departing from the spirit and scope of the present invention and the appended claims. Therefore, all equivalent technical solutions also belong to the scope of the present invention, and the protection scope of the present invention should be defined by the claims, not limited to the disclosure of the best embodiment and the accompanying drawings.

TABLE 1 comparison of model prediction Performance evaluation indexes

Claims

1. A corrosion rate prediction method for an SVM heavy oil pipeline based on PSO-WOA hybrid optimization is characterized by comprising the following steps:

2. The PSO-WOA hybrid optimization-based SVM thick oil pipeline corrosion rate prediction method as claimed in claim 1, wherein the representative and relevant environmental factors of thick oil pipeline corrosion behavior in the step S1 comprise soil resistivity, water content and Cl along the pipeline^-Content, potential gradient, oxidation-reduction potential, pH value.

3. The PSO-WOA hybrid optimization-based SVM thick oil pipeline corrosion rate prediction method as claimed in claim 1, wherein in the step S4, SVM is used as a basic neural network model for thick oil pipeline corrosion rate prediction, and the method is implemented by the following steps:

(1) the basic principle of SVM is through a non-linear mapping function

Mapping an input sample x to a high-dimensional feature space F, creating a linear regression function according to the principle of minimizing the structural risk, converting each group of data in a training sample into the high-dimensional space, and performing linear regression on the data in the high-dimensional space, thereby converting the nonlinear fitting problem of an original sample into a sample in the high-dimensional spaceThe fitting function obtained by the linear regression problem of (1) is:

y′＝f(x)＝w·Φ(x)+b (1)

wherein C is>0, which is a penalty factor, which controls the penalty of each sample to exceed the error epsilon, and in order to reduce the error generated during the training, relaxation factors ζ i and ζ i' are introduced, respectively

Is converted into

On the one hand they are similar, and on the other hand they areTo solve for w, b first, for λ, λ has a value if and only if the data falls on the two imaginary lines; for other samples, λ does not work, i.e. λ only works on the support vector, the problem solution is simplified; therefore, after the dual problem is converted, convex optimization is simplified into maximization of a quadratic form, and a saddle point of a Lagrangian equation can be solved; determining and setting the partial derivative of each variable to zero; after introducing lagrangian and using dual principle, convert to:

The calculation time and complexity are greatly reduced; the radial basis (Radialbasis) function is used, as follows:

where g is the width of the kernel function, i.e. the kernel parameter (g)>0)，x_i-x_jIs Euclidean norm, and after introducing kernel parameters, optimization target conversion is as follows:

solving the SVM nonlinear regression model function as follows:

x₁，x₂,…,x_nis an input variable, a_i-b_iAre network weights and each output vector is a linear transformation of the intermediate node support vector with respect to f (x).

4. The PSO-WOA hybrid optimization-based SVM heavy oil pipeline corrosion rate prediction method as claimed in claim 1, wherein in the step S4, a PSO-WOA hybrid optimization method is used for optimizing parameters of an SVM model, and the method is implemented by the following steps:

initialization: randomly assigning the speed and position of the particle and learning factor c₁And c₂Carrying out assignment to obtain a penalty factor C and a kernel parameter g of the SVM;

v^t+1＝wv^t+c₁r₁(Pbest^t-CB^t)+c₂r₂(Gbest^t-CB^t) (7)

CB^t+1＝CB^t+v^t (8)

(2) the specific steps of the WOA optimization algorithm are as follows:

given a random number p e (0,1), if p <0.5 and | a | <1, we randomly find prey, the artificial whale algorithm uses random individual positions in its population for food finding, and updates their spatial positions using the following formula:

X_t+1＝X_rand-A·D (9)

if p <0.5 and | A | >1, continue to wrap around the prey,

X_t+1＝X_best-A·|C·X_best-X_t| (10)

if p is more than or equal to 0.5, carrying out spiral predation;

X_t+1＝D_best·e^bl·cos2πl+X_t (11)

v^t+1＝wv^t+c₁r₁(Whalebest^t-CB^t)+c₂r₂(Gbest^t-CB^t) (13)