CN118153783A

CN118153783A - Intelligent water-saving pipeline cleaning optimization system based on artificial intelligence

Info

Publication number: CN118153783A
Application number: CN202410565984.1A
Authority: CN
Inventors: 杨郭荣; 李艳林
Original assignee: Beijing Younengchuang Energy Saving Technology Co ltd
Current assignee: Beijing Younengchuang Energy Saving Technology Co ltd
Priority date: 2024-05-09
Filing date: 2024-05-09
Publication date: 2024-06-07
Anticipated expiration: 2044-05-09

Abstract

The invention relates to the technical field of pipeline cleaning, and particularly discloses an intelligent water-saving pipeline cleaning optimization system based on artificial intelligence. The scheme combines a depth deterministic strategy gradient algorithm and a self-adaptive sampling control method to formulate a cleaning strategy, so that the pipeline cleaner can learn autonomously and adapt to the change of a cleaning environment; the neural network is used for constructing a deterministic strategy function and an action value function, and the pipeline cleaner can intelligently adjust cleaning parameters through real-time rewarding feedback, so that efficient learning and optimizing processes are realized; the stability and reliability of the current state are evaluated in real time by calculating the change rate and the confidence of the state space, and the cleaning parameters and the sampling frequency of the sensor are dynamically adjusted according to the actual situation, so that the method is better suitable for the change of the environment, and the cleaning effect and the cleaning efficiency are improved.

Description

Intelligent water-saving pipeline cleaning optimization system based on artificial intelligence

Technical Field

The invention relates to the technical field of pipeline cleaning, in particular to an intelligent water-saving pipeline cleaning optimization system based on artificial intelligence.

Background

The household pipeline cleaning is critical to maintaining the normal operation of a pipeline system, preventing blockage and prolonging the service life of a pipeline, and the traditional pipeline cleaning method is usually static, lacks intelligent decision, and has the problems of resource waste and high labor cost; the general pipeline cleaning method lacks a real-time feedback mechanism, cannot be timely adjusted and optimized according to the actual cleaning effect and environmental change, and has the problems that only the cleaning effect is focused, and factors such as cleaning cost, cleaning time and the like are ignored; the general strategy formulation method is often based on experience or rules, and cannot flexibly adjust the strategy according to real-time environmental changes, especially in a complex pipeline cleaning environment, a plurality of states and parameter changes are involved, and the problem that the complexity cannot be adapted exists.

Disclosure of Invention

Aiming at the problems of resource waste and high labor cost caused by the fact that the traditional pipeline cleaning method is usually static and lacks intelligent decision, the scheme adopts a depth deterministic strategy gradient algorithm and an adaptive sampling control method to combine and formulate a cleaning strategy, so that the pipeline cleaner can autonomously learn and adapt to the change of the cleaning environment, and unnecessary resource waste and labor cost are reduced; aiming at the problems that a general pipeline cleaning method lacks a real-time feedback mechanism, cannot be timely adjusted and optimized according to actual cleaning effects and environmental changes, only focuses on the cleaning effects and neglects factors such as cleaning cost and cleaning time, the scheme uses a neural network to construct a deterministic strategy function and an action value function, and the pipeline cleaner can intelligently adjust cleaning parameters through real-time rewarding feedback to realize efficient learning and optimizing processes and comprehensively consider a plurality of factors, so that the pipeline cleaner can better balance the cleaning effects, cost and time in the learning process; aiming at the problems that a general strategy formulation method is often based on experience or rules, the strategy cannot be flexibly adjusted according to real-time environment changes, and particularly in a complex pipeline cleaning environment, a plurality of states and parameter changes are involved, and the complexity cannot be adapted.

The intelligent water-saving pipeline cleaning optimization system based on artificial intelligence comprises a thermal circulation module, a water quality detection module and a pipeline cleaning module;

The thermal circulation module comprises a hot water source, water outlet points, water using points, a hot water pipe and a water return pipe, the water flow direction in the pipeline is set, hot water flows out from the water outlet points, is conveyed to each water using point along the hot water pipe, flows through the water return pipe and is conveyed back to the hot water source, thermal circulation information is formed, and the thermal circulation information is transmitted to the water quality detection module;

The water quality detection module receives thermal cycle information from the thermal cycle module, deploys a sensor at each water consumption point, constructs a cleaning strategy and sets a cleaning instruction; the sensor is used for monitoring water quality data, including pH value, turbidity, solubility and heavy metal amount, the cleaning strategy is constructed based on a depth deterministic strategy gradient algorithm and a self-adaptive sampling control method, the cleaning instruction consists of a cleaning switch and cleaning parameters, the cleaning parameters comprise a water temperature value, a water pressure value and a cleaning agent concentration, whether the cleaning effect of the pipeline reaches the standard is judged according to the monitored water quality data, and if the cleaning effect of the pipeline reaches the standard, the cleaning instruction is set to be zero; otherwise, setting a cleaning switch to be 1, setting a cleaning parameter according to a cleaning strategy, and sending a cleaning instruction to the pipeline cleaning module;

the pipeline cleaning module receives a cleaning instruction from the water quality detection module, a pipeline cleaner is installed, the longest cleaning time is set, the pipeline cleaner is used for setting the water temperature and the water pressure of pipeline cleaning, releasing a cleaning agent, analyzing the cleaning instruction, and stopping cleaning if a cleaning switch is 0; otherwise, continuing to clean according to the cleaning parameters until the longest cleaning time is reached.

Further, in the water quality detection module, the construction of the cleaning strategy includes the following steps:

Step S1: strategy initialization, namely initializing a cleaning strategy by using a depth deterministic strategy gradient algorithm, wherein the strategy comprises defining a state space and an action space, defining a reward function, initializing an action, and constructing a deterministic strategy function and an action value function;

Step S2: self-adaptive sampling control, namely calculating confidence coefficient according to the change rate of a state space, wherein the change rate of the state space is obtained by monitoring a sensor, and the cleaning parameters, the sampling frequency and the mode of the sensor are dynamically adjusted;

Step S3: strategy training, namely, iteratively training a cleaning strategy through interaction between a pipeline cleaner and a cleaning environment, and maximizing a pipeline cleaning effect;

Step S4: and evaluating performance of the cleaning strategy, feeding the performance of the cleaning strategy back to the cleaning strategy, and adjusting and optimizing the cleaning strategy according to actual requirements.

Further, in step S1, the policy initialization includes the following steps:

Step S11: defining a state space and an action space, wherein the state space is water quality data monitored by a sensor in real time, the state comprises pH value, turbidity, solubility of oxygen and heavy metal amount, the action space is a cleaning parameter regulated by a pipeline cleaner, and the actions comprise regulating a water temperature value, regulating a water pressure value and regulating a cleaning agent concentration;

step S12: defining a reward function, and designing the reward function as a feedback signal of the pipeline cleaner by combining the pipeline cleaning effect, the cleaning cost and the cleaning time, wherein the following formula is used:

；

In the method, in the process of the invention, Representing a bonus function that is based on the received data,The cleaning effect of the pipeline is shown,The cost of the cleaning is indicated and,The time for the cleaning is indicated and the time for the cleaning is indicated,Coefficients representing trade-offs of different factors;

Step S13: initializing an action, defining a value range of the action, and initializing the action to a random value in the value range;

Step S14: constructing a deterministic strategy function, namely constructing the deterministic strategy function by using a neural network, wherein the deterministic strategy function represents the optimal action taken by the pipeline cleaner in a given state, and an input layer comprises 4 neurons for inputting the state; the hidden layer is designed as a fully connected layer containing 64 neurons, connected using a ReLU activation function; the output layer is designed as a full-connection layer containing 3 neurons, and outputs the value of the corresponding action, and the formula is as follows:

；

In the method, in the process of the invention, The state of the display is indicated and,The action is represented by an action which,Representing the function of the ReLU activation,A matrix of weights representing the hidden layer is presented,The bias vector of the hidden layer is represented,Representing a weight matrix of the output layer,A bias vector representing an output layer;

step S15: constructing an action value function, namely constructing the action value function by using a neural network, wherein the action value represents the expected return of a specific action taken by the pipeline cleaner under a given state, and the input layer comprises 7 neurons for inputting a state-action pair; the hidden layer is designed as a fully connected layer containing 64 neurons, connected using a ReLU activation function; the output layer comprises a neuron and outputs a corresponding action value, and the formula is as follows:

；

In the method, in the process of the invention, The function of the action value is represented,A parameter representing a function of the action value,A state-action pair is represented and,A matrix of weights representing the hidden layer is presented,The bias vector of the hidden layer is represented,Representing a weight matrix of the output layer,Representing the offset vector of the output layer.

Further, in step S2, the adaptive sampling control includes the steps of:

Step S21: calculating the confidence coefficient, estimating the confidence coefficient of the current state based on the change rate of the state space, and if the change rate is smaller, the current state is relatively stable and the confidence coefficient is higher; conversely, if the rate of change is greater, then the confidence is lower, and the formula is as follows:

；

In the method, in the process of the invention, The degree of confidence is indicated and,The rate of change of a certain state in the state space,Indicating the maximum rate of change of this state,Standard deviation representing the feature;

Step S22: adjusting the cleaning parameters, dynamically adjusting the cleaning parameters according to the change rate and the confidence coefficient of the state space, and keeping the current cleaning parameters unchanged when the change rate is small and the confidence coefficient is high; when the change rate is large or the confidence is low, the water temperature, the water pressure and the concentration of the cleaning agent are adjusted;

Step S23: adjusting the sampling frequency and mode of the sensor according to the confidence coefficient, and reducing the sampling frequency to save energy and resources when the confidence coefficient is higher; when the confidence is lower, the sampling frequency is increased to obtain more data to evaluate the current state.

By adopting the scheme, the beneficial effects obtained by the invention are as follows:

(1) Aiming at the problems of resource waste and high labor cost caused by the lack of intelligent decision making in the conventional pipeline cleaning method, the scheme adopts a depth deterministic strategy gradient algorithm and a self-adaptive sampling control method to combine and formulate a cleaning strategy, so that the pipeline cleaner can autonomously learn and adapt to the change of the cleaning environment, and unnecessary resource waste and labor cost are reduced.

(2) Aiming at the problems that a general pipeline cleaning method lacks a real-time feedback mechanism, cannot be timely adjusted and optimized according to actual cleaning effects and environmental changes, only focuses on the cleaning effects and neglects factors such as cleaning cost and cleaning time, the scheme uses a neural network to construct a deterministic strategy function and an action value function, through real-time rewarding feedback, a pipeline cleaner can intelligently adjust cleaning parameters, an efficient learning and optimizing process is realized, and a plurality of factors are comprehensively considered, so that the pipeline cleaner can better balance the cleaning effects, cost and time in the learning process.

(3) Aiming at the problems that a general strategy formulation method is often based on experience or rules, the strategy cannot be flexibly adjusted according to real-time environment changes, and particularly in a complex pipeline cleaning environment, a plurality of states and parameter changes are involved, and the complexity cannot be adapted.

Drawings

FIG. 1 is a schematic diagram of an intelligent water-saving pipeline cleaning optimization system based on artificial intelligence;

Fig. 2 is a thermal cycle information chart of the pipe cleaning according to the present invention.

The accompanying drawings are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate the invention and together with the embodiments of the invention, serve to explain the invention.

Detailed Description

The following description of the embodiments of the present invention will be made clearly and fully with reference to the accompanying drawings, in which it is evident that the embodiments described are only some, but not all embodiments of the invention; all other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.

Referring to fig. 1 and 2, the intelligent water-saving pipeline cleaning optimization system based on artificial intelligence provided by the invention comprises a thermal circulation module, a water quality detection module and a pipeline cleaning module;

In a second embodiment, referring to fig. 1, the water quality detection module according to the above embodiment, the cleaning strategy is constructed, and the method includes the following steps:

By executing the operation, aiming at the problems of resource waste and high labor cost caused by the lack of intelligent decision in the conventional pipeline cleaning method, the scheme adopts a depth deterministic strategy gradient algorithm and an adaptive sampling control method to combine and formulate a cleaning strategy, so that the pipeline cleaner can autonomously learn and adapt to the change of the cleaning environment, and unnecessary resource waste and labor cost are reduced.

Embodiment three, referring to fig. 1, the embodiment is based on the above embodiment, and in step S1, the policy initialization includes the following steps:

；

By executing the operation, the pipeline cleaner can intelligently adjust the cleaning parameters through real-time reward feedback, realize efficient learning and optimizing processes, comprehensively consider a plurality of factors, and enable the pipeline cleaner to better balance the cleaning effect, cost and time in the learning process.

Fourth embodiment, referring to fig. 1, the adaptive sampling control in step S2, which is based on the above embodiment, includes the following steps:

；

By executing the operation, the general strategy formulation method is often based on experience or rules, the strategy cannot be flexibly adjusted according to real-time environment changes, and particularly in a complex pipeline cleaning environment, a plurality of states and parameter changes are involved, and the problem that the complexity cannot be adapted exists.

In the fifth embodiment, referring to fig. 1, the embodiment is based on the above embodiment, in the pipe cleaning module, the longest cleaning time is 1 hour, and if the pipe cleaning effect does not reach the standard after the longest cleaning time is reached, the pipe maintenance is required.

Embodiment six, referring to fig. 1, the embodiment is based on the above embodiment, and in step S12, the cleaning cost includes water consumption, cleaning agent consumption, and power consumption.

An embodiment seven, referring to fig. 1, based on the above embodiment, in step S4, the performance evaluation includes collecting time, manpower, water resources and cleaning agent costs consumed in the cleaning process, comparing these costs with the cleaning effect, and evaluating the economical efficiency and efficiency of the cleaning strategy; collecting and analyzing historical cleaning data, and evaluating long-term effects and trends of cleaning strategies so as to optimize and improve; and evaluating whether the cleaned water flow is smoother, the water quality is improved, and dirt residues exist in the pipeline.

In the embodiment, the flow of pipeline cleaning is specifically that hot water is released by a hot water source, hot water flows out from a water outlet point, a cleaning agent is dissolved into a pipeline after passing through a pipeline cleaner, rust and dirt in the pipeline begin to be cleaned, water quality data is monitored by a sensor before the water flows to each water using point, cleaned sewage flows out from the water using point, the water flows circulate, and cleaning is stopped after the cleaning effect of the pipeline reaches the standard or the longest cleaning time is reached.

It is noted that relational terms such as first and second, and the like are used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Moreover, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus.

Although embodiments of the present invention have been shown and described, it will be understood by those skilled in the art that various changes, modifications, substitutions and alterations can be made therein without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents.

The invention and its embodiments have been described above with no limitation, and the actual construction is not limited to the embodiments of the invention as shown in the drawings. In summary, if one of ordinary skill in the art is informed by this disclosure, a structural manner and an embodiment similar to the technical solution should not be creatively devised without departing from the gist of the present invention.

Claims

1. Intelligent water-saving pipeline cleaning optimization system based on artificial intelligence, which is characterized in that: the device comprises a thermal circulation module, a water quality detection module and a pipeline cleaning module;

2. The intelligent water saving pipe cleaning optimization system based on artificial intelligence according to claim 1, wherein: in the water quality detection module, the construction of the cleaning strategy comprises the following steps:

3. The intelligent water saving pipe cleaning optimization system based on artificial intelligence according to claim 2, wherein: in step S1, the policy initialization includes the following steps:

step S12: defining a reward function, and designing the reward function as a feedback signal of the pipeline cleaner by combining the pipeline cleaning effect, the cleaning cost and the cleaning time;

Step S14: constructing a deterministic strategy function, namely constructing the deterministic strategy function by using a neural network, wherein the deterministic strategy function represents the optimal action taken by the pipeline cleaner in a given state, and an input layer comprises 4 neurons for inputting the state; the hidden layer is designed as a fully connected layer containing 64 neurons, connected using a ReLU activation function; the output layer is designed as a full-connection layer containing 3 neurons and outputs the value of the corresponding action;

Step S15: constructing an action value function, namely constructing the action value function by using a neural network, wherein the action value represents the expected return of a specific action taken by the pipeline cleaner under a given state, and the input layer comprises 7 neurons for inputting a state-action pair; the hidden layer is designed as a fully connected layer containing 64 neurons, connected using a ReLU activation function; the output layer comprises a neuron and outputs a corresponding action value.

4. An artificial intelligence based intelligent water saving pipeline cleaning optimization system according to claim 3, characterized in that: in step S2, the adaptive sampling control includes the steps of:

step S21: calculating the confidence coefficient, estimating the confidence coefficient of the current state based on the change rate of the state space, and if the change rate is smaller, the current state is relatively stable and the confidence coefficient is higher; conversely, if the rate of change is greater, the confidence is lower;