CN116469481B - LF refined molten steel composition forecasting method based on XGBoost algorithm - Google Patents
LF refined molten steel composition forecasting method based on XGBoost algorithm Download PDFInfo
- Publication number
- CN116469481B CN116469481B CN202310726453.1A CN202310726453A CN116469481B CN 116469481 B CN116469481 B CN 116469481B CN 202310726453 A CN202310726453 A CN 202310726453A CN 116469481 B CN116469481 B CN 116469481B
- Authority
- CN
- China
- Prior art keywords
- model
- molten steel
- xgboost
- value
- data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 229910000831 Steel Inorganic materials 0.000 title claims abstract description 51
- 239000010959 steel Substances 0.000 title claims abstract description 51
- 239000000203 mixture Substances 0.000 title claims abstract description 15
- 238000013277 forecasting method Methods 0.000 title claims abstract description 8
- 238000000034 method Methods 0.000 claims abstract description 29
- 238000004519 manufacturing process Methods 0.000 claims abstract description 16
- 238000011156 evaluation Methods 0.000 claims abstract description 13
- 229910052782 aluminium Inorganic materials 0.000 claims abstract description 9
- 238000012549 training Methods 0.000 claims description 17
- 230000006870 function Effects 0.000 claims description 15
- 238000012360 testing method Methods 0.000 claims description 7
- 230000002159 abnormal effect Effects 0.000 claims description 6
- 238000013450 outlier detection Methods 0.000 claims description 5
- 238000012216 screening Methods 0.000 claims description 5
- 238000004458 analytical method Methods 0.000 claims description 3
- 238000012163 sequencing technique Methods 0.000 claims description 3
- 230000000694 effects Effects 0.000 claims 1
- 238000007670 refining Methods 0.000 abstract description 20
- 238000009628 steelmaking Methods 0.000 abstract description 7
- 238000010079 rubber tapping Methods 0.000 abstract description 6
- 238000009749 continuous casting Methods 0.000 abstract description 4
- 238000007781 pre-processing Methods 0.000 abstract description 2
- XKRFYHLGVUSROY-UHFFFAOYSA-N Argon Chemical compound [Ar] XKRFYHLGVUSROY-UHFFFAOYSA-N 0.000 description 30
- 229910052786 argon Inorganic materials 0.000 description 15
- XEEYBQQBJWHFJM-UHFFFAOYSA-N Iron Chemical compound [Fe] XEEYBQQBJWHFJM-UHFFFAOYSA-N 0.000 description 6
- 238000003723 Smelting Methods 0.000 description 4
- XAGFODPZIPBFFR-UHFFFAOYSA-N aluminium Chemical compound [Al] XAGFODPZIPBFFR-UHFFFAOYSA-N 0.000 description 4
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 3
- 229910052742 iron Inorganic materials 0.000 description 3
- 239000001301 oxygen Substances 0.000 description 3
- 229910052760 oxygen Inorganic materials 0.000 description 3
- 238000011084 recovery Methods 0.000 description 3
- 239000002893 slag Substances 0.000 description 3
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 229910000519 Ferrosilicon Inorganic materials 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 230000001737 promoting effect Effects 0.000 description 2
- 235000010627 Phaseolus vulgaris Nutrition 0.000 description 1
- 244000046052 Phaseolus vulgaris Species 0.000 description 1
- OOFUFVVNELMCSU-UHFFFAOYSA-N [Ar].[Fe] Chemical compound [Ar].[Fe] OOFUFVVNELMCSU-UHFFFAOYSA-N 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000007872 degassing Methods 0.000 description 1
- 238000010891 electric arc Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 238000009849 vacuum degassing Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16C—COMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
- G16C20/00—Chemoinformatics, i.e. ICT specially adapted for the handling of physicochemical or structural data of chemical particles, elements, compounds or mixtures
- G16C20/20—Identification of molecular entities, parts thereof or of chemical compositions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/10—Pre-processing; Data cleansing
- G06F18/15—Statistical pre-processing, e.g. techniques for normalisation or restoring missing data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/211—Selection of the most significant subset of features
- G06F18/2113—Selection of the most significant subset of features by ranking or filtering the set of features, e.g. using a measure of variance or of feature cross-correlation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
- G06F18/2148—Generating training patterns; Bootstrap methods, e.g. bagging or boosting characterised by the process organisation or structure, e.g. boosting cascade
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/243—Classification techniques relating to the number of classes
- G06F18/24323—Tree-organised classifiers
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16C—COMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
- G16C20/00—Chemoinformatics, i.e. ICT specially adapted for the handling of physicochemical or structural data of chemical particles, elements, compounds or mixtures
- G16C20/70—Machine learning, data mining or chemometrics
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Bioinformatics & Computational Biology (AREA)
- Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Evolutionary Biology (AREA)
- Chemical & Material Sciences (AREA)
- Computing Systems (AREA)
- Crystallography & Structural Chemistry (AREA)
- Databases & Information Systems (AREA)
- Spectroscopy & Molecular Physics (AREA)
- General Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Software Systems (AREA)
- Probability & Statistics with Applications (AREA)
- Health & Medical Sciences (AREA)
- Carbon Steel Or Casting Steel Manufacturing (AREA)
- Treatment Of Steel In Its Molten State (AREA)
Abstract
The invention discloses an LF refined molten steel composition forecasting method based on XGBoost algorithm, which comprises the following steps: collecting LF refining initial parameters; preprocessing data; constructing an XGBoost model by using a plot-importance function in an XGBoost module under a python environment; the relative ratio of the predicted value to the actual value in the XGBoost model uses the average absolute error MAE and the R square as evaluation criteria to reflect the quality of the predicted value; and predicting the molten steel components after the model evaluation is qualified. The whole process is simple, and the components Al, si and Mn of molten steel in the initial stage of LF refining are predicted by using an XGBoost algorithm, so that the converter tapping and the initial stage of LF refining are guided, the stability of the components of molten steel is improved, the steelmaking production efficiency is improved, the steelmaking cost is reduced, and qualified high-quality molten steel is provided for the continuous casting process.
Description
Technical Field
The invention relates to the technical field of steelmaking, in particular to an LF refined molten steel composition forecasting method based on an XGBoost algorithm.
Background
In modern steelmaking processes mainly comprising an electric arc furnace or a converter, LF refining, vacuum degassing and continuous casting, the LF process plays a role in supporting the top and bottom and is known as a buffer in the steelmaking process. The temperature and the components of the molten steel after primary smelting generally have certain fluctuation, and can be controlled by LF refining, so that molten steel with qualified temperature and components is provided for degassing or continuous casting procedures regularly. In view of the advantages, almost all excellent steel is produced through an LF refining process, but enterprises face some common problems in the LF process at present, for example, a primary refining furnace end point often has certain fluctuation, a steel ladle arrives at a station and is added with a deoxidizer and slag according to experience of operators, so that fluctuation of molten steel components and slagging property in the primary LF stage is large, the fluctuation is often genetic, and finally the fluctuation of product quality is large. How to realize stable control of molten steel components in the initial stage of LF refining is a great difficulty facing current metallurgical workers.
Disclosure of Invention
The invention aims to overcome the defects of the prior art, provides an LF refined molten steel component forecasting method based on an XGBoost algorithm, utilizes the XGBoost algorithm to forecast the contents of three elements of Al, si and Mn in the molten steel component in the initial stage of LF refining, and aims to guide the converter tapping and the initial process of LF refining and improve the stability of the molten steel component.
In order to achieve the above purpose, the invention adopts the following technical scheme: an LF refined molten steel composition forecasting method based on XGBoost algorithm comprises the following steps:
s1: combining metallurgical principle analysis and experience of field operators, primarily screening out characteristic parameters affecting molten steel components in the initial stage of LF, combining a large amount of actual production data, integrating and storing into a database;
s2: duplicate functions in python are utilized to remove repeated items in the data, and dropna functions are utilized to remove null values; the z-score method is utilized to normalize the data, and the influence of the variation size and the numerical value size of the dimension and the variable is eliminated; deleting abnormal points by using a 3 sigma outlier detection method;
s3: the preprocessed data is used for representing the correlation among the features by using pearson correlation coefficients, and the features with little meaning on model training are removed;
s4: constructing an XGBoost model by using a plot-importance function in an XGBoost module under a python environment, establishing a relation between input features and output features, sequencing contribution degrees of the output features according to the input features, and finally eliminating features with contribution degree scores smaller than 20 to a target index;
s5: the relative ratio of the predicted value to the actual value in the XGBoost model uses the average absolute error MAE and the R square as evaluation criteria to reflect the quality of the predicted value, and the specific formula is as follows:
,
,
wherein:n、mpredicting the number of samples;is a model predictive value; />Is an actual value; />Is the average value;RSSis the sum of squares of the residuals;
s6: if in step S5MAEValue sum R 2 When the values are in the preset range, judging that the XGBoost model is qualified in evaluation, entering a step S7, otherwise, failing, and carrying out operations of the steps S2 to S5 again;
s7: and predicting the molten steel composition by adopting a qualified XGBoost model.
As a specific implementation manner, the data preprocessed in the step S3 is randomly divided into a training group and a test group, the XGBoost model is provided with three models of Si, mn and Al, and training parameters of the three models are set as follows:
the boost is set as a tree model gbtree;
the maximum depth max-depth of the tree is set to 6;
the learning rate cta is set to 0.03 in the Si and Mn model and 0.01 in the Al model;
the gbtree classifier number was set to 900 in the Si, mn model and 1000 in the Al model.
As a specific embodiment, in the Si modelMAEThe value setting range is less than 0.015, in Mn modelMAEThe value setting range is less than 0.023 and is in an Al modelMAEThe value setting range is less than 0.003, and the three modelsR 2 The value setting ranges are all greater than 0.98.
As a specific embodiment, the method for forecasting the composition of the LF refined molten steel further includes the following steps:
s8: after step S7, the actual production data in a period of time in the future is collected and included in the sample data and stored in the database, and then steps S2 to S7 are repeated.
Due to the application of the technical scheme, compared with the prior art, the invention has the following advantages: the invention predicts the Al, si and Mn components of the molten steel in the initial stage of LF refining by using an XGBoost algorithm, and aims to guide the tapping of a converter and the initial process of LF refining, improve the stability of the components of the molten steel, improve the steelmaking production efficiency, reduce the steelmaking cost and provide qualified high-quality molten steel for the continuous casting process.
Drawings
FIG. 1 is a characteristic variable-importance score chart of the molten steel component Al in the LF refining treatment in example 1;
FIG. 2 is a characteristic variable-importance score chart of the LF refining treatment molten steel composition Si in example 1;
FIG. 3 is a characteristic variable-importance score chart of the LF refining treatment molten steel component Mn in example 1;
FIG. 4 is a graph showing the comparison between the measured and predicted values of the initial Al content in LF refining in example 1;
FIG. 5 is a graph showing comparison between measured and predicted values of Si content in the initial stage of LF refining in example 1;
FIG. 6 is a graph showing the comparison of measured and predicted values of Mn content in the initial stage of LF refining in example 1;
FIG. 7 is a flow chart of the initial molten steel composition prediction in LF refining based on XGBoost algorithm.
Detailed Description
The technical scheme of the invention is further described below with reference to the accompanying drawings and specific embodiments.
The invention provides an LF refined molten steel composition forecasting method based on XGBoost algorithm, which is shown in FIG. 1 and comprises the following steps:
s1, acquiring LF refining initial parameters: combining metallurgical principle analysis and experience of field operators, primarily screening out characteristic parameters affecting molten steel components in the initial stage of LF, collecting a large amount of actual production data, integrating and storing the data in a database;
s2, data preprocessing: duplicate functions in python are utilized to remove repeated items in the data, and dropna functions are utilized to remove null values; the z-score method is utilized to normalize the data, and the influence of the variation size and the numerical value size of the dimension and the variable is eliminated; deleting abnormal points by using a 3 sigma outlier detection method;
s3, representing the correlation among the features by using pearson correlation coefficients, and eliminating the features with little meaning on model training;
s4, constructing an XGBoost model by using the training set:
the preprocessed data are randomly divided into a training group and a testing group, wherein an XGBoost model is provided with three models of Si, mn and Al, and training parameters of the three models are set as follows:
the boost is set as a tree model gbtree;
the maximum depth max-depth of the tree is set to 6;
the learning rate cta is set to 0.03 in the Si and Mn model and 0.01 in the Al model;
the number of the gbtree classifiers is set to 900 in the Si and Mn models and 1000 in the Al models;
constructing an XGBoost model by using a plot-importance function in an XGBoost module under a python environment, establishing a relation between input features and output features, sequencing contribution degrees of the output features according to the input features, and finally eliminating features with contribution degree scores smaller than 20 to a target index;
s5, calculating evaluation indexes MAE and R 2 :
Comparing the predicted value with the actual value in the XGBoost model by using average absolute errors MAE and R 2 As an evaluation criterion, the quality of the predicted value is reflected, and the specific formula is as follows:
,
,
wherein:n、mpredicting the number of samples;is a model predictive value; />Is an actual value; />Is the average value;RSSis the sum of squares of the residuals;
s6, verifying the XGBoost model by using a test set: if Si content prediction model in step S5MAEIn a model for forecasting Mn content with a value less than 0.015MAEThe value is less than 0.023 and the Al content is predicted in the modelMAEThe value should be less than 0.003,R 2 all are larger than 0.98, judging that the forecast model is evaluated to be qualified, entering the step S7, otherwise, failing, and carrying out operations of the steps S2 to S5 again;
s7, model prediction result treatment: adopting a qualified XGBoost model to predict molten steel components, and ending;
s8, after the qualified XGBoost model is adopted for predicting the molten steel components, collecting actual production data in a period of time in the future, incorporating the actual production data into a sample data base, and then repeating the steps S2 to S7.
Example 1
In the example, taking actual production data of Jiangsu Yongsteel group company two steel works from 2019 1 month 1 day to 2023 1 month 1 day as an example, the content of Al element in LF refined molten steel is predicted, and the steps are as follows:
s1, integrating 2020 group data acquired on site and storing the integrated 2020 group data into a database, and primarily screening out parameters possibly related to the content of Al element in refined molten steel in an LF furnace, namely, input characteristic parameters, wherein the method comprises the following steps: LF furnace start-up temperature, furnace cover furnace age, ladle temperature, smelting period, total oxygen amount, molten iron weight, aluminum block addition amount, net promoting dosage, high-purity ferrosilicon addition amount, converter end point C content, converter end point S content, argon-back Mn content, argon-back Mo content and the like;
s2, eliminating repeated items in the data by using a duplicate function in python, and eliminating null values by using a dropna function; the z-score method is utilized to normalize the data, and the influence of the variation size and the numerical value size of the dimension and the variable is eliminated; deleting abnormal points by using a 3 sigma outlier detection method, and remaining 2000 groups of data;
s3, representing the correlation among the features by using pearson correlation coefficients, and eliminating the features with little meaning on model training;
s4, constructing an Al model by using the training set:
the preprocessed data are randomly divided into 1600 groups of training data, 400 groups of test data, and training parameters are set as follows:
the boost is set as a tree model gbtree;
the maximum depth max-depth of the tree is set to 6;
the learning rate cta is 0.01;
the gbtree classifier number is 1000,
constructing an Al model by using a plot-importance function in an XGBoost module under a python environment, establishing a relation between input features and output features, sorting contribution degrees of the output features according to the input features, finally eliminating features with a contribution degree score smaller than 20 to a target index, and finally selecting feature parameters, wherein the figure 1 is output feature parameter sorting of 30 bits before the importance of the output features of the Al model, and comprises the following steps: aluminum bean addition amount, furnace cover furnace age, LF furnace inlet temperature, argon post-S content, ladle temperature, smelting period, converter tapping molten steel weight, argon post-Si content, molten steel total oxygen content, nitrogen consumption, LF furnace age, argon station inlet time, tapping process time consumption, slag recovery value, converter endpoint S content, argon post-Mo content, argon post-Mn content, converter endpoint C content, argon post-Nb content, argon post-V content, aluminum element proportion in molten steel, steam recovery amount, difference between converter tapping and LF furnace inlet temperature, actual slag MgO content, gun head age, molten iron temperature, difference between molten iron-argon post-Ni element content, recovery heat value and argon post-Cr content;
s5, calculating evaluation indexes MAE and R 2 :
Comparing the predicted value with the actual value in the Al model by using average absolute errors MAE and R 2 As an evaluation criterion, the quality of the predicted value is reflected, and the specific formula is as follows:
,
,
wherein:n、mthe number of samples is predicted, and 2000 is taken as the number;is a model predictive value; />Is an actual value; />Is the average value;RSSis the sum of squares of the residuals;
the predicted value in the Al model is compared with the actual value, as shown in the comparison with reference to fig. 4, by calculation, when the number of classifiers of the Al model is set to 1000,MAEin the form of 0.002345,R 2 0.983041 is reached;
s6, verifying the XGBoost model by using a test set: in Al modelMAEThe value is smaller than the set value of 0.003 at the same timeR 2 If the set value is greater than 0.98, judging that the Al model is qualified in evaluation, and entering step S7;
s7, model prediction result treatment: predicting the Al content in the molten steel component by adopting a qualified Al model, and ending;
s8, after the qualified Al model is adopted to predict the content of Al in the molten steel component, collecting actual production data in a period of time in the future, incorporating the actual production data into a sample data and storing the sample data into a database, and then repeating the steps S2 to S7.
Example 2
The content of Si and Mn elements in the refined molten steel of the LF furnace is predicted by adopting the same method as in the embodiment 1:
s1, integrating 43147 groups of data acquired on site and storing the data into a database, and primarily screening out parameters possibly related to the content of Si and Mn elements in refined molten steel of an LF furnace, namely, input characteristic parameters, wherein the method comprises the following steps: LF furnace start-up temperature, furnace cover furnace age, ladle temperature, smelting period, total oxygen amount, molten iron weight, aluminum block addition amount, net promoting dosage, high-purity ferrosilicon addition amount, converter end point C content, converter end point S content, argon-back Mn content, argon-back Mo content and the like;
s2, eliminating repeated items in the data by using a duplicate function in python, and eliminating null values by using a dropna function; the z-score method is utilized to normalize the data, and the influence of the variation size and the numerical value size of the dimension and the variable is eliminated; deleting abnormal points by using a 3 sigma outlier detection method, and remaining 43053 groups of data;
s3, representing the correlation among the features by using pearson correlation coefficients, and eliminating the features with little meaning on model training;
s4, constructing two models of Si and Mn by using a training set:
the Si model and the Mn model randomly divide 43053 data into 34296 data and 8757 data, and training parameters of the two models are set as follows:
the boost is set as a tree model gbtree;
the maximum depth max-depth of the tree is set to 6;
the learning rate cta is 0.03;
the number of the gbtree classifiers is 900;
constructing an XGBoost model by using a plot-importance function in an XGBoost module under a python environment, establishing a relation between input features and output features, sorting the contribution degree of the output features according to the input features, and finally eliminating features with the contribution degree score smaller than 20 to a target index, wherein the output feature parameters of the Si model with the contribution degree score smaller than 20 are sorted, as shown in figures 2 and 3, and figure 2 is the output feature parameter sorting of the first 30 bits of the importance of the output features, and the method comprises the following steps: si content after argon, furnace age of LF furnace, si content before argon, S content after argon and the like; FIG. 3 is a ranking of output characteristic parameters of the first 30 bits of the Mn model output characteristic importance, including Mn content after argon, mn content before argon, C content after argon, S content after argon, etc.;
s5, calculating evaluation indexes MAE and R 2 :
Calculated by the same formula as in example 1, where n and m are 43053, and the predicted values in both Si and Mn models are compared with the actual values, as shown in FIGS. 5 and 6, by calculation, when the number of classifiers for the Si model and Mn model is set to 900, the two models are obtainedMAE0.014151 and 0.022906 respectively,R 2 0.988921 is achieved;
s6, verifying the XGBoost model by using a test set: in Si modelMAEA value less than the set point of 0.015, in Mn modelMAEThe value is less than the set value of 0.023, and the two modelsR 2 The values are all larger than the set value of 0.98, the Si model and the Mn model are judged to be qualified, and the step S7 is carried out;
s7, model prediction result treatment: respectively predicting Si and Mn contents in molten steel components by adopting a qualified Si model and an Mn model, and ending;
s8, after the qualified Si model and Mn model are adopted to predict the Si and Mn content in the molten steel component, collecting actual production data in a period of time in the future, incorporating the actual production data into a sample data and storing the sample data into a database, and then repeating the steps S2 to S7.
Therefore, the overall forecasting accuracy of the LF refined molten steel composition forecasting model is higher, the method has good guiding significance for actual production of a production line, and along with the continuous increase of the subsequent data quantity, redundant abnormal points and noise are eliminated, and the accuracy of the forecasting model is increased.
The above embodiments are provided to illustrate the technical concept and features of the present invention and are intended to enable those skilled in the art to understand the content of the present invention and implement the same, and are not intended to limit the scope of the present invention. All equivalent changes or modifications made in accordance with the spirit of the present invention should be construed to be included in the scope of the present invention.
Claims (3)
1. An LF refined molten steel composition forecasting method based on an XGBoost algorithm is characterized by comprising the following steps:
s1: combining metallurgical principle analysis and experience of field operators, primarily screening out characteristic parameters affecting molten steel components in the initial stage of LF, combining a large amount of actual production data, integrating and storing into a database;
s2: duplicate functions in python are utilized to remove repeated items in the data, and dropna functions are utilized to remove null values; the z-score method is utilized to normalize the data, and the influence of the variation size and the numerical value size of the dimension and the variable is eliminated; deleting abnormal points by using a 3 sigma outlier detection method;
s3: the preprocessed data is used for representing the correlation among the features by using pearson correlation coefficients, and the features with little meaning on model training are removed;
s4: constructing an XGBoost model by using a plot-importance function in an XGBoost module under a python environment, establishing a relation between input features and output features, sequencing contribution degrees of the output features according to the input features, and finally eliminating features with contribution degree scores smaller than 20 to a target index;
s5: the relative ratio of the predicted value to the actual value in the XGBoost model is used as a main evaluation standard, the difference value between the predicted value and the actual value is reflected, the R square is used as an auxiliary evaluation standard, and the fitting effect of the model is reflected, wherein the specific formula is as follows:
wherein: n and m are the number of sample predictions;is a model predictive value; y is i Is an actual value; />Is the average value; RSS is the sum of squares of the residuals;
s6: if the MAE value and R in step S5 2 When the values are in the preset range, judging that the XGBoost model is qualified in evaluation, entering a step S7, otherwise, failing, and carrying out operations of the steps S2 to S5 again;
s7: adopting a qualified XGBoos t model to predict molten steel components;
here, the data preprocessed in step S3 is randomly divided into a training group and a test group, and the XGBoost model is provided with three models of Si, mn, and Al, where training parameters of the three models are set as follows:
the boost is set as a tree model gbtree;
the maximum depth max-depth of the tree is set to 6;
the learning rate cta is set to 0.03 in the Si and Mn model and 0.01 in the Al model;
the gbtree classifier number was set to 900 in the Si, mn model and 1000 in the Al model.
2. The method for predicting the composition of LF refined molten steel based on XGBoost algorithm as set forth in claim 1, wherein the set range of MAE values in Si model is less than 0.015, the set range of MAE values in Mn model is less than 0.023, the set range of MAE values in Al model is less than 0.003, R of the three models 2 The value setting ranges are all greater than 0.98.
3. The LF refined molten steel composition prediction method based on xgboost algorithm according to claim 1, characterized in that the LF refined molten steel composition prediction method further comprises the steps of:
s8: after step S7, the actual production data in a period of time in the future is collected and included in the sample data and stored in the database, and then steps S2 to S7 are repeated.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310726453.1A CN116469481B (en) | 2023-06-19 | 2023-06-19 | LF refined molten steel composition forecasting method based on XGBoost algorithm |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310726453.1A CN116469481B (en) | 2023-06-19 | 2023-06-19 | LF refined molten steel composition forecasting method based on XGBoost algorithm |
Publications (2)
Publication Number | Publication Date |
---|---|
CN116469481A CN116469481A (en) | 2023-07-21 |
CN116469481B true CN116469481B (en) | 2023-08-29 |
Family
ID=87177453
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310726453.1A Active CN116469481B (en) | 2023-06-19 | 2023-06-19 | LF refined molten steel composition forecasting method based on XGBoost algorithm |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116469481B (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117077545B (en) * | 2023-10-13 | 2023-12-26 | 南通钢安机械制造有限公司 | Molten steel fluidity optimization method and system based on silicon content control |
CN118016203B (en) * | 2024-04-10 | 2024-06-14 | 苏州方兴信息技术有限公司 | LF refining temperature forecasting method based on mechanism model and XGBoost algorithm |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH05222429A (en) * | 1992-02-17 | 1993-08-31 | Nkk Corp | Method for predicting slopping scale of molten steel and slag in refining process |
CN102766728A (en) * | 2012-06-25 | 2012-11-07 | 攀钢集团研究院有限公司 | Method and device for real-time prediction of sulfur content of molten steel in refining process of ladle refining furnace |
CN114242179A (en) * | 2021-12-09 | 2022-03-25 | 中冶赛迪重庆信息技术有限公司 | Method and system for predicting yield of refined alloy element, electronic device, and medium |
CN114861535A (en) * | 2022-04-27 | 2022-08-05 | 江苏集萃工业过程模拟与优化研究所有限公司 | Water treatment process machine learning modeling method using CFD as data preprocessing |
CN115015161A (en) * | 2022-05-30 | 2022-09-06 | 长春大学 | Coal quality component prediction method |
CN115906538A (en) * | 2023-01-09 | 2023-04-04 | 湖南华联云创信息科技有限公司 | Method for predicting molten steel components in ladle refining furnace |
-
2023
- 2023-06-19 CN CN202310726453.1A patent/CN116469481B/en active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH05222429A (en) * | 1992-02-17 | 1993-08-31 | Nkk Corp | Method for predicting slopping scale of molten steel and slag in refining process |
CN102766728A (en) * | 2012-06-25 | 2012-11-07 | 攀钢集团研究院有限公司 | Method and device for real-time prediction of sulfur content of molten steel in refining process of ladle refining furnace |
CN114242179A (en) * | 2021-12-09 | 2022-03-25 | 中冶赛迪重庆信息技术有限公司 | Method and system for predicting yield of refined alloy element, electronic device, and medium |
CN114861535A (en) * | 2022-04-27 | 2022-08-05 | 江苏集萃工业过程模拟与优化研究所有限公司 | Water treatment process machine learning modeling method using CFD as data preprocessing |
CN115015161A (en) * | 2022-05-30 | 2022-09-06 | 长春大学 | Coal quality component prediction method |
CN115906538A (en) * | 2023-01-09 | 2023-04-04 | 湖南华联云创信息科技有限公司 | Method for predicting molten steel components in ladle refining furnace |
Also Published As
Publication number | Publication date |
---|---|
CN116469481A (en) | 2023-07-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN116469481B (en) | LF refined molten steel composition forecasting method based on XGBoost algorithm | |
CN108676955B (en) | A kind of BOF Steelmaking Endpoint carbon content and temprature control method | |
CN104630410B (en) | A kind of pneumatic steelmaking quality real-time dynamic forecast method based on data parsing | |
CN106096788B (en) | Converter steelmaking process cost control method and system based on PSO _ ELM neural network | |
CN114611844B (en) | Method and system for determining alloy addition amount in converter tapping process | |
CN109359723A (en) | Based on the converter terminal manganese content prediction technique for improving regularization extreme learning machine | |
CN112036081B (en) | Method for determining addition amount of silicon-manganese alloy in converter tapping based on yield prediction | |
CN113435114B (en) | Method for accurately controlling nitrogen content of steel grade of CV-RH-CC process route | |
CN111915080A (en) | Raw fuel cost optimal proportioning method based on molten iron quality constraint | |
CN111020118B (en) | RH endpoint temperature prediction method based on particle swarm optimization case reasoning | |
Shi et al. | Key issues and progress of industrial big data-based intelligent blast furnace ironmaking technology | |
JP6617842B1 (en) | Metal material design support method and design support device | |
CN111798023B (en) | Comprehensive coke ratio prediction method in steelmaking sintering production | |
CN102968644A (en) | Method for predicting smelting finishing point of argon-oxygen refined iron alloy | |
CN111518980A (en) | Correction method and system for converter end point carbon content prediction model | |
CN115456264B (en) | Endpoint carbon content and endpoint temperature prediction method for small and medium-sized converter | |
Han et al. | BOF oxygen control by mixed case retrieve and reuse CBR | |
CN113362903B (en) | Method for intelligently adding lime in TSC (thyristor switched capacitor) stage of large converter | |
CN109840309A (en) | A kind of calculation method of iron melt desulfurizing agent dosage | |
JP6160283B2 (en) | Slapping prediction method in converter blowing. | |
Zhao et al. | Optimal batching plan of deoxidation alloying based on principal component analysis and linear programming | |
CN116127345B (en) | Converter steelmaking process mode design method based on deep clustering generation countermeasure network | |
CN117708708B (en) | Quality rating and recommending method for ferroalloy for steel making | |
CN117744503A (en) | Prediction method for tapping temperature of converter | |
Huang et al. | Prediction of alloy yield based on optimized BP neural network |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |