CN113133035A - LTE high-load cell discrimination method and system - Google Patents

LTE high-load cell discrimination method and system Download PDF

Info

Publication number
CN113133035A
CN113133035A CN202110490568.6A CN202110490568A CN113133035A CN 113133035 A CN113133035 A CN 113133035A CN 202110490568 A CN202110490568 A CN 202110490568A CN 113133035 A CN113133035 A CN 113133035A
Authority
CN
China
Prior art keywords
index
samples
inflection point
screening
load cell
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110490568.6A
Other languages
Chinese (zh)
Other versions
CN113133035B (en
Inventor
吴宇庆
徐俊凯
刘飞浪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
HUBEI POST TELECOMMUNICATION PLANNING DESIGN CO Ltd
Original Assignee
HUBEI POST TELECOMMUNICATION PLANNING DESIGN CO Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by HUBEI POST TELECOMMUNICATION PLANNING DESIGN CO Ltd filed Critical HUBEI POST TELECOMMUNICATION PLANNING DESIGN CO Ltd
Priority to CN202110490568.6A priority Critical patent/CN113133035B/en
Publication of CN113133035A publication Critical patent/CN113133035A/en
Application granted granted Critical
Publication of CN113133035B publication Critical patent/CN113133035B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W24/00Supervisory, monitoring or testing arrangements
    • H04W24/02Arrangements for optimising operational condition
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W24/00Supervisory, monitoring or testing arrangements
    • H04W24/08Testing, supervising or monitoring using real traffic

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Mobile Radio Communication Systems (AREA)

Abstract

The invention relates to a high-load cell screening method and system, belongs to the technical field of communication, and particularly relates to an LTE high-load cell screening method and system. According to the invention, the high-load cell to be expanded can be screened out only according to the wireless performance statistical data extracted by the LTE network manager. Compared with the prior art, the method has two positive effects: compared with the traditional algorithm, the error of the method related by the invention is at least one order of magnitude smaller. The method related by the invention is a non-parametric algorithm, does not need to manually appoint a threshold, directly derives a result according to the wireless performance statistical data, and is more objective.

Description

LTE high-load cell discrimination method and system
Technical Field
The invention relates to a high-load cell screening method and system, belongs to the technical field of communication, and particularly relates to an LTE high-load cell screening method and system.
Background
Along with the rapid construction and development of the domestic LTE network, the requirements for network capacity analysis and optimization are stronger and stronger, and the core work is used for discriminating the LTE high-load cells to be expanded. For example, an operator performs capacity expansion with reference to the device carrying capacity, the number of effective RRC connected users, and the cell throughput based on the channel utilization, analyzes the performance statistical data extracted by the network manager, and meets the "high load capacity to be expanded" condition when the cell meets the following conditions:
threshold one (large flow): the average utilization rate of the downlink PRB of the cell in the busy hour is more than 50 percent, and the throughput of the cell in the busy hour is more than 6 GB; threshold two (multi-user): the average utilization rate of downlink PRBs in a cell is more than 50% in a busy hour, and the maximum number of effective RRC connections is more than 200;
and (3) statistical conditions are as follows: and the large data platform extracts full-month data according to months, counts up reaching a capacity expansion threshold one or two when busy for at least 4 days continuously for 7 days, and outputs a capacity expansion list.
The traditional algorithm for discriminating the high-load cell to be expanded by using the specified expansion threshold is a parameter algorithm. However, since the parameter of the capacity expansion threshold directly affects several billion dollars of investment per year only in china, operators generally uniformly account the important parameter nationwide at the group level and issue the important parameter to provinces and cities for execution. In actual projects, the user behavior difference of different places is large, the user models are different in regions, and on the other hand, various marketing measures such as 'no-flow package' and the like cause the user models to fluctuate dramatically. Therefore, the parameter of 'one cutting' is difficult to be refined according to the user group or the specific area, and the error is large.
Therefore, it is a technical problem that needs to be solved urgently at present to improve an intelligent monitoring system in the prior art to meet the requirements of different application scenarios.
Disclosure of Invention
The following presents a simplified summary of one or more aspects in order to provide a basic understanding of such aspects. This summary is not an extensive overview of all contemplated aspects, and is intended to neither identify key or critical elements of all aspects nor delineate the scope of any or all aspects. Its sole purpose is to present some concepts of one or more aspects in a simplified form as a prelude to the more detailed description that is presented later.
The invention mainly aims to solve the problems that the capacity expansion index is single and cannot meet the requirement of refining a user group or a specific area in the prior art, and provides a method and a system for discriminating an LTE high-load cell. According to the method and the system, a threshold does not need to be specified manually, the result is directly derived according to the wireless performance statistical data, and the capacity expansion judgment result is more accurate.
In order to solve the problems, the scheme of the invention is as follows:
an LTE high load cell discrimination method comprises the following steps:
step 1, extracting statistical data of performance indexes of a cell in an area to be evaluated, and eliminating invalid numbers;
step 2, selecting a first index and at least one second index from the statistical data of the performance indexes, and calculating a Pearson correlation coefficient of the second index and the first index;
step 3, fitting the first index and the second index by taking the first index as a horizontal coordinate and the second index as a vertical coordinate; obtaining a screening inflection point K based on a plurality of sample mean values before the second index and the corresponding Pearson correlation coefficients;
step 4, rejecting samples with vertical coordinates smaller than the vertical coordinates corresponding to the inflection point K and horizontal coordinates smaller than the horizontal coordinates corresponding to the inflection point K, and repeating the step 3-4 until the screening inflection point is at the leftmost end or the rightmost end of the fitting line;
and 5, screening out the high-load cell according to a preset condition based on the final sample.
In at least one embodiment of the present invention, in step 1, samples with an average PRC connected user number smaller than a predetermined value are rejected.
In at least one embodiment of the present invention, the first indicator is an average number of RRC connected users; the second index is
And the average occupancy rate of the downlink PRB and/or the downlink user plane traffic of the PDCP layer.
In at least one embodiment of the present invention, the mean of the first several samples of the second index is ranked with the square of the Pearson's correlation coefficient, r2And multiplying to obtain the ordinate of the inflection point, and taking the abscissa corresponding to the inflection point on the fitting curve as the abscissa of the inflection point.
In at least one embodiment of the invention, in said step 4, the conditions for screening the high load cell are:
there were at least 4 high load samples for 7 consecutive days;
or
There were at least 7 high load samples for 15 consecutive days;
or
There were at least 14 high load samples for 30 consecutive days.
An LTE high load cell screening system, comprising:
the network data extraction module is used for extracting the statistical data of the performance indexes of the cells of the area to be evaluated and eliminating invalid numbers;
the correlation coefficient determining module is used for selecting a first index and at least one second index from the performance index statistical data and calculating the Pearson correlation coefficient of the second index and the first index;
the screening inflection point determining module is used for fitting the first index and the second index by taking the first index as an abscissa and taking the second index as an ordinate; obtaining a screening inflection point K based on a plurality of sample mean values before the second index and the corresponding Pearson correlation coefficients;
the relevant data eliminating module is used for eliminating samples of which the vertical coordinate is smaller than the vertical coordinate corresponding to the inflection point K and the horizontal coordinate is smaller than the horizontal coordinate corresponding to the inflection point K, and the screening inflection point determining module and the relevant data eliminating module are repeatedly called until the screening inflection point is at the leftmost end or the rightmost end of the fitting line;
and the high-load cell screening module screens out the cell to be expanded according to preset conditions based on the final sample.
In at least one embodiment of the present invention, in the network data extraction module, samples whose average number of PRC connection users is smaller than a predetermined value are rejected.
In at least one embodiment of the present invention, the first indicator is an average number of RRC connected users; the second index is
And the average occupancy rate of the downlink PRB and/or the downlink user plane traffic of the PDCP layer.
In at least one embodiment of the invention, the secondThe mean value of a plurality of samples with indexes ranked in the front and the square r of the correlation coefficient of the Pearson2And multiplying to obtain the ordinate of the inflection point, and taking the abscissa corresponding to the inflection point on the fitting curve as the abscissa of the inflection point.
In at least one embodiment of the present invention, in the high load cell screening module, the conditions for screening the high load cell are:
there were at least 4 high load samples for 7 consecutive days;
or
There were at least 7 high load samples for 15 consecutive days;
or
There were at least 14 high load samples for 30 consecutive days.
As can be seen from the above description: compared with the traditional method, the error of the method related by the invention is at least one order of magnitude smaller; the method related by the invention is a non-parametric algorithm, does not need to manually appoint a threshold, directly derives a result according to the wireless performance statistical data, and is more objective.
Drawings
The accompanying drawings, which are incorporated herein and form a part of the specification, illustrate embodiments of the present invention and, together with the description, further serve to explain the principles of the invention and to enable a person skilled in the pertinent art to make and use the disclosure.
Fig. 1 is a scatter diagram of "average downlink PRB occupancy" and "average number of RRC connected users" (1472 high-load samples obtained by a threshold method).
Fig. 2 illustrates a scatter diagram of "PDCP layer downlink user plane traffic" and "average number of RRC connected users" (1472 high load samples obtained by the thresholding method);
fig. 3 illustrates a scatter diagram (total data) of "downlink PRB average occupancy" and "average number of RRC-connected users";
fig. 4 illustrates a scatter diagram (total data) of "PDCP layer downlink user plane traffic" and "average number of RRC-connected users";
fig. 5 illustrates a "scatter diagram of the average occupancy of downlink PRBs" and an "average number of RRC-connected users" (high-load sample list V1);
fig. 6 illustrates a scatter plot of "PDCP layer downlink user plane traffic" and "average number of RRC connected users" (high load sample list V1);
fig. 7 illustrates a "scatter diagram of the average occupancy rate of downlink PRBs" and an "average number of RRC-connected users" (high-load sample finalize list);
fig. 8 illustrates a scatter plot (high load sample final list) of "PDCP layer downlink user plane traffic" and "average number of RRC connected users";
embodiments of the present invention will be described with reference to the accompanying drawings.
Detailed Description
Examples
The purpose of the embodiment is achieved by the following principle: under the influence of a scheduling algorithm, the capacity and performance indexes of the LTE network are influenced and restricted mutually, and under the condition of low load, the load and throughput indexes are in quasi-linear and positive correlation with the number of effective users; in high load, once the resource is short, the relation that the load and the throughput index and the number of effective users are converted into negative correlation is the 'saturation' state. By utilizing the characteristics, the embodiment analyzes the wireless performance statistical data, identifies the inflection point of which the index is changed from positive correlation to negative correlation, and identifies the sample in a saturated state, so as to discriminate the high-load cell to be expanded.
The specific steps of the embodiment are as follows:
step 1, extracting 24-hour performance statistical data of N days of m cells in a certain area from a network manager.
The data should include: load indexes (uplink PRB average occupancy rate and downlink PRB average occupancy rate), cell busy hour throughput indexes (PDCP layer uplink user plane traffic and PDCP layer downlink user plane traffic) and average RRC connection user number.
Description of related indexes:
n: the number of days for extracting the statistical data is generally 7, 15 and 30.
Average utilization rate of uplink PRB: the average number of occupied uplink PRBs/number of PRBs in a cell is 100%, and the use condition of uplink physical resources is reflected.
Average utilization rate of downlink PRB: the average number of occupied downlink PRBs/number of PRBs in a cell is 100%, and the use condition of the downlink physical resources is reflected.
Average number of RRC connected users: and counting the average number of the RRC connections existing at the same time, sampling the measurement parameters by presetting a measurement time interval to obtain the number of the RRC connections existing at the same time in the given cell, and then averaging.
Cell busy hour throughput: the empty service flow is divided into the number of bytes of uplink and downlink services, and the index reflects the uplink and downlink flows of the empty.
And 2, carrying out pretreatment.
And eliminating samples with the average RRC connection user number of 0. The term "total data" in the following text refers to performance statistics data of m cells Nx24 hours in a certain area after eliminating samples with the average number of RRC connected users being 0.
And 3, calculating the Pearson correlation coefficient of all the data.
The pearson correlation coefficient is a linear correlation coefficient, and the correlation coefficient is represented by r, which describes the degree of the linear correlation between two variables. The value of r is between-1 and +1, if r >0, the two variables are positively correlated, namely the larger the value of one variable is, the larger the value of the other variable is; if r <0, it indicates that the two variables are negatively correlated, i.e., the larger the value of one variable, the smaller the value of the other variable.
r2Refers to the scale that can be explained in a fitted line (linear relationship). For example: if r of y and x2The variation of y of 0.7, i.e. 70%, can be explained by the best fit line of x and y, the remaining 30% being affected by other factors. A larger absolute value of r indicates a stronger correlation.
Step 4, load index analysis is carried out based on all data
And generating a scatter diagram of the average occupancy rate of the downlink PRB and the average user number of RRC connection from samples in all the data and a linear fitting line.
Calculating the mean value of the maximum 5 statistical values of the 'average occupancy rate of the downlink PRB' and the square r of the correlation coefficient of the Pearson obtained in the step 32The product is taken as the ordinate of the point K _ 0. By fittingAnd calculating the abscissa of the K _ 0. See fig. 3.
The significance of the point K _0 is to give an ideal case: samples at the lower left of the point K _0 (the average occupancy rate of the downlink PRB is less than the ordinate of the K _0, and the average user number of RRC connection is less than the abscissa of the K _0) all satisfy a fitting line (linear relation), namely all samples are low-load samples; samples above or to the right of point K _0 (the "average occupancy of downlink PRBs > K _0 ordinate" or "average number of RRC connected users > K _0 abscissa") all do not satisfy the fit line (linear relationship), but are affected by other factors, i.e. all high-load samples.
Step 5, carrying out throughput analysis based on all the obtained data
And (3) generating a scatter diagram of the downlink user plane flow of the PDCP layer and the average RRC connection user number and a fitting line (linear relation) by using samples in all data.
Calculating the average value of the maximum 5 statistical values of the downlink user plane flow of the PDCP layer and the square r of the Pearson correlation coefficient obtained in the step 32The product is taken as the ordinate of the point K _ 1. The abscissa of K _1 is calculated by a linear fit line. See fig. 4.
The significance of the point K _1 is to give an ideal case: samples at the lower left of the point K _1 (the vertical coordinate of the downlink user plane flow of the PDCP layer is less than K _1, and the horizontal coordinate of the average RRC connection user number is less than K _1) all meet a fitting line (linear relation), namely all are low-load samples; samples above or to the right of point K _1 (the "PDCP layer downlink user plane traffic > K _1 ordinate" or "average RRC connected user number > K _1 abscissa") all do not satisfy the fit line (linear relationship), but are affected by other factors, i.e. all high load samples.
Step 6, obtaining a high load sample list
And screening low-load samples of the average occupancy rate of the downlink PRBs (K _0 ordinate), the downlink user plane flow rate of the PDCP layer (K _1 ordinate) and the average RRC connection user number (Min) (K _0 abscissa, K _1 abscissa) from the statistical data of a certain region, and obtaining a high-load sample list.
Step 7, carrying out load analysis based on the high load sample list
And generating a scatter diagram (figure 5) of the average occupancy rate of the downlink PRB and the average user number of RRC connection by using the high-load sample list, and generating a LOESS fitting line (a is 0.5, and has a nonlinear relation).
The LOESS refers specifically to local weight polynomial fitting. Each point on the data set is fitted with a low order polynomial, the closer to the point to be fitted the higher the weight, and conversely the further away the weight is.
If the highest point (point K _0^ ') of the generated fit line is not at the left end or the right end of the interval, the fit line has a segment of' average occupancy of downlink PRB 'linearly related to' average number of RRC connected users ', because there is a part of low-load samples (located at the lower left of point K _0^' in FIG. 5, namely, 'average occupancy of downlink PRB < K _0^' ordinate 'and' average number of RRC connected users '< K _0^' abscissa) in the 'high-load sample list v 1'.
Step 8, carrying out throughput analysis based on high load sample list
Samples in the high-load sample list are used to generate a scattergram (fig. 6) of "PDCP layer downlink user plane traffic" and "average number of RRC connected users", and a loses fitting line (a is 0.5, i.e. a non-linear relationship).
Maximum Point if fitted to line (Point K'1) If the fitting line is not at the left end or the right end of the interval, the fitting line has a segment of PDCP layer downlink user plane traffic which is in positive linear correlation with the average RRC connection user number, which indicates that a part of low-load samples (located at the lower left of the point K _1^' in figure 6, namely the segment of PDCP layer downlink user plane traffic) exist<K _1^ ordinate ' and ' average number of RRC-connected users '<K _1^' abscissa).
Step 9, correcting to obtain a second high-load sample list
Screening low-load sample 'downlink PRB average occupancy rate' from all data<K _0^ ' ordinate ' and ' PDCP layer downlink user plane traffic<K _1^ ordinate ' and ' average number of RRC-connected users '<Min(K'0Abscissa, K'1Abscissa). And correcting to obtain a second high-load sample list.
And 10, repeating the steps 7-9, and correcting the high-load sample list until the ' average occupancy rate of the downlink PRB ' and the ' average user number of RRC connections ' are fitted, the ' flow rate of the downlink user plane of the PDCP layer and the ' average user number of RRC connections ' are fitted, and the highest points of the two fitted lines are all at the left end or the right end of the interval.
A third list of high load samples is obtained. The samples are used for generating a scatter diagram (shown in figure 7) of the average occupancy rate of the downlink PRB and the average number of RRC connection users, and a scatter diagram (shown in figure 8) of the downlink user plane flow rate of the PDCP layer and the average number of the RRC connection users, wherein the highest points of the fitted lines are all at the left end of the interval, namely the average occupancy rate of the downlink PRB and the downlink user plane flow rate of the PDCP layer are in a negative correlation relation with the average number of the RRC connection users, and the characteristic is the radio performance characteristic in a typical high-load state.
Step 11, discriminating 'high load cell to be expanded'
And (3) statistical conditions are as follows: when the number of samples in the third final list of high-load samples of a cell is at least 4 in 7 consecutive days (or at least 7 in 15 consecutive days, or at least 14 in 30 consecutive days), the cell is the high-load cell to be expanded.
In summary, the conventional algorithm for discriminating the high-load cell to be expanded by using the specified expansion threshold is a parameter algorithm. However, since the parameter of the capacity expansion threshold directly affects several billion dollars of investment per year only in china, operators generally uniformly account the important parameter nationwide at the group level and issue the important parameter to provinces and cities for execution. In actual projects, the user behavior difference of different places is large, the user models are different in regions, and on the other hand, various marketing measures such as 'no-flow package' and the like cause the user models to fluctuate dramatically. Therefore, the parameter of 'one cutting' is difficult to be refined according to the user group or the specific area, and the error is large.
The present embodiment will be described in detail with reference to specific application examples.
First, a conventional capacity expansion method will be described.
For example, when an operator expands a project at a university, 7x 24-hour wireless performance statistics (10447 samples) of each cell of the university are extracted, and 1472 samples are screened out to reach an expansion threshold of one or two by using the above algorithm. However, in the analysis of 1472 high-load samples obtained by the threshold method, 300 low-load samples are found:
in the "scattergram of the average occupancy rate of downlink PRBs" and the "average number of users connected to RRC" (see fig. 1), the highest point (point K _0) of the fitted line is not at the left end or the right end of the interval, and the fitted line has a segment of the average occupancy rate of downlink PRBs positively correlated (approximately linear) with the "average number of users connected to RRC" because there is a part of low-load samples (located at the lower left of the point K _0, that is, "average occupancy rate of downlink PRBs < K _0 ordinate" and "average number of users connected to RRC" < K _0 abscissa). In a scattergram of "downlink user plane traffic of PDCP layer" and "average number of RRC connected users" (see fig. 2), the highest point (point K _1) of the fit line is not located at the left end or the right end of the interval, and the fit line has a segment of "downlink user plane traffic of PDCP layer" positively correlated (approximately linear) with "average number of RRC connected users", because there is a part of low load samples (located at the left lower part of the point K _1, that is, "downlink user plane traffic of PDCP layer < K _1 ordinate" and "average number of RRC connected users" < K _1 abscissa).
The "final high-load sample list" has 1596 samples, and in the "scatter diagram of" average occupancy rate of downlink PRB and "average number of RRC connected users" (as shown in fig. 7) and the "scatter diagram of" downlink user plane traffic of PDCP layer and "average number of RRC connected users" (as shown in fig. 8), the highest points of the fitted line are all at the left end of the interval, and the "average occupancy rate of downlink PRB" and "downlink user plane traffic of PDCP layer" are in a negative correlation with the "average number of RRC connected users", which is a typical radio performance characteristic in a high-load state. Therefore, the threshold method misses 1596- (1472-.
Therefore, the error of the high-load sample obtained by the threshold method is (300+ 424)/1596-45.4%.
According to the statistical conditions: "the statistics reaches the first threshold or the second threshold when busy hour statistics reaches for at least 4 days continuously for 7 days, and outputs the expansion list", and the expansion list of the threshold method (based on 1472 high-load samples) has 30 high-load cells to be expanded. However, compared with the expansion list based on 1596 high-load samples, which is the final high-load sample list, the expansion list of the threshold method (based on 1472 high-load samples) has 3 misjudgment cells and 3 missed judgment cells to be expanded, and the error of the expansion scheme is 20%.
Next, a method employing this embodiment will be described:
1. and extracting performance statistical data of m cells Nx24 hours in a certain area from the network management system.
The data shall include; load indexes (uplink PRB average occupancy rate and downlink PRB average occupancy rate), cell busy hour throughput indexes (PDCP layer uplink user plane traffic and PDCP layer downlink user plane traffic) and average RRC connection user number.
2. And (4) carrying out pretreatment.
Samples with the average RRC connection user number of 0 are removed, and 9412 samples in the statistical data are obtained.
3. Pearson correlation coefficients for the "statistics" are calculated.
Figure BDA0003052322980000121
Figure BDA0003052322980000131
Significant correlation at the.01 level (double-sided).
4. Load index analysis of "statistical data
And generating a scatter diagram of the average occupancy rate of the downlink PRB and the average user number of RRC connection according to the statistical data and a fitting line (linear relation).
Calculating the mean value of the maximum 5 statistical values of the 'average occupancy rate of the downlink PRB' and the square r of the correlation coefficient of the Pearson obtained in the step 32Multiplication, product as point K0The ordinate of (c). Calculating to obtain K through a fitted line0The abscissa of (a). See fig. 3.
5. Throughput analysis of "statistics
And generating a scatter diagram of the downlink user plane flow of the PDCP layer and the average RRC connection user number and a fitting line (linear relation) by using the statistical data.
Calculating the average value of the maximum 5 statistical values of the downlink user plane flow of the PDCP layer and the square r of the Pearson correlation coefficient obtained in the step 32Multiplication, product as point K1The ordinate of (c). Calculating to obtain K through a fitted line1The abscissa of (a). See fig. 4.
6. Obtain "high load sample List V1"
Figure BDA0003052322980000141
Screening out downlink PRB average occupancy rate from certain area statistical data<K0Ordinate 'and' PDCP layer downlink user plane traffic<K1Ordinate "and" average number of RRC-connected users<Min(K0Abscissa, K1Abscissa) "of the low load samples, a" high load sample list V1 "is obtained, in which 1680 samples are present.
7. Load analysis of "high load sample List V1
The "high load sample list V1" is used to generate a "downlink PRB average occupancy" and "average number of RRC connected users" scatter diagram (fig. 5), and a loses fitting line (a is 0.5, a non-linear relationship).
Highest point of generated fitted line (point K'0) Not at the left end or the right end of the interval, the fitting line has a segment of ' average occupancy rate of downlink PRB ' linearly related to ' average number of RRC connected users ', because a part of low-load samples (located at point K ' in FIG. 5) exist in the ' high-load sample list v1 '0Lower left of (i.e. "average occupancy of downlink PRB<K'0Ordinate "and" average number of RRC-connected users "<K'0Abscissa).
8. Throughput analysis of "high load sample inventory V1
The "high load sample list V1" is used to generate a scattergram of "PDCP layer downlink user plane traffic" and "average number of RRC connected users" (fig. 6), and a loses fitting line (a ═ 0.5, i.e., a non-linear relationship).
Highest point of the fitted line (point K'1) If the fitting line is not at the left end or the right end of the interval, the segment of PDCP layer downlink user plane flow rate is linearly related to the average RRC connection user number, which indicates that a part of low-load samples (located at point K 'in FIG. 6) exist'1Lower left of (1), i.e. "PDCP layer downlink user plane traffic<K'1Ordinate "and" average number of RRC-connected users "<K'1Abscissa).
9. The correction obtained "high load sample List V2"
Figure BDA0003052322980000151
Figure BDA0003052322980000161
Screening low-load sample downlink PRB average occupancy rate from statistical data<K'0Ordinate 'and' PDCP layer downlink user plane traffic<K'1Ordinate "and" average number of RRC-connected users "<Min(K'0Abscissa, K'1Abscissa). The correction obtains a "high load sample list V2".
10. And 7-9, correcting the high-load sample list until a fitting line of the average occupancy rate of the downlink PRB and the average user number of RRC connections, and a fitting line of the downlink user surface flow rate of the PDCP layer and the average user number of RRC connections, wherein the highest point of the two fitting lines is at the left end or the right end of the interval.
A "high load sample final manifest" was obtained, of which 1596 samples. In a scatter diagram of "downlink PRB average occupancy" and "average number of RRC connected users" (as shown in fig. 7) and a scatter diagram of "PDCP layer downlink user plane traffic" and "average number of RRC connected users" (as shown in fig. 8), the highest point of the fitted line is at the left end of the interval, and the "downlink PRB average occupancy" and the "PDCP layer downlink user plane traffic" are in a negative correlation with the "average number of RRC connected users", which is a typical radio performance characteristic in a high load state.
The sample error e of the "final list of high load samples" can be quantitatively estimated as follows:
the "high load sample list v 1" sample error is first calculated.
In the high load sample list v1, the low load sample downlink PRB average occupancy rate<K'0Ordinate 'and' PDCP layer downlink user plane traffic<K'1Ordinate "and" average number of RRC-connected users "<Min(K'0Abscissa, K'1Abscissa), the number of false positives is 303; the high load samples are 1680-.
Sample error "high load sample list v 1":
Figure BDA0003052322980000162
sample error "high load sample Final List":
Figure BDA0003052322980000171
since the "high load sample list v 1" is calculated from the maximum 5 statistical values of "average occupancy of downlink PRBs" and "downlink user plane traffic of PDCP layer", the "total number of samples of the final list of high load samples" in the above formula is divided by 5.
11. Discrimination of 'high load cell to be expanded'
According to the statistical conditions: when the number of samples in the "high load sample final list" of a cell is at least 4 in 7 consecutive days (or at least 7 in 15 consecutive days, or at least 14 in 30 consecutive days), the cell is the high load cell to be expanded.
Based on the "final list of high load samples" 1596 expanded lists of high load samples, see table 1 for details.
The error of the expanded list is
Figure BDA0003052322980000172
In summary, the conventional algorithm for discriminating the high-load cell to be expanded by using the specified expansion threshold is a parameter algorithm. When the threshold method is adopted in the embodiment, the error of the high-load sample is 45.4%; according to the statistical conditions: "the statistics reaches the first threshold or the second threshold when busy hour counts for at least 4 consecutive 7 days", and the error of the output expansion list is 20%.
By adopting the nonparametric algorithm specifically related to the invention, the error of the high-load sample is 1.8 percent; according to the same statistical conditions: when the number of samples of a certain cell in the final list of high-load samples is at least 4 in 7 continuous days, the error of the output expansion list is 0.9 percent.
Compared with the traditional algorithm, the error of the method related by the invention is at least one order of magnitude smaller.
From the above, it can be seen that: the nonparametric algorithm can discriminate the high-load cell to be expanded only according to the wireless performance statistical data extracted by the LTE network management system. Compared with the prior art, the method has two positive effects:
compared with the traditional algorithm, the method related by the invention has the error at least one order of magnitude smaller
The method related by the invention is a non-parametric algorithm, does not need to manually appoint a threshold, directly derives a result according to the wireless performance statistical data, and is more objective.
In this embodiment, while, for purposes of simplicity of explanation, the methodologies are shown and described as a series of acts, it is to be understood and appreciated that the methodologies are not limited by the order of acts, as some acts may, in accordance with one or more embodiments, occur in different orders and/or concurrently with other acts from that shown and described herein or not shown and described herein, as may be understood by those of ordinary skill in the art.
Those of skill would further appreciate that the various illustrative logical blocks, modules, circuits, and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, computer software, or combinations of both. To clearly illustrate this interchangeability of hardware and software, various illustrative components, blocks, modules, circuits, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the overall system. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present invention.
The steps of a method or algorithm described in connection with the embodiments disclosed herein may be embodied directly in hardware, in a software module executed by a processor, or in a combination of the two. A software module may reside in RAM memory, flash memory, ROM memory, EPROM memory, EEPROM memory, registers, hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art. An exemplary storage medium is coupled to the processor such the processor can read information from, and write information to, the storage medium. In the alternative, the storage medium may be integral to the processor. The processor and the storage medium may reside in an ASIC. The ASIC may reside in a user terminal. In the alternative, the processor and the storage medium may reside as discrete components in a user terminal.
In one or more exemplary embodiments, the functions described may be implemented in hardware, software, firmware, or any combination thereof. If implemented in software as a computer program product, the functions may be stored on or transmitted over as one or more instructions or code on a computer-readable medium. Computer-readable media includes both computer storage media and communication media including any medium that facilitates transfer of a computer program from one place to another. A storage media may be any available media that can be accessed by a computer. By way of example, and not limitation, such computer-readable media can comprise RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium that can be used to carry or store desired program code in the form of instructions or data structures and that can be accessed by a computer. Any connection is properly termed a computer-readable medium. For example, if the software is transmitted from a web site, server, or other remote source using a coaxial cable, fiber optic cable, twisted pair, Digital Subscriber Line (DSL), or wireless technologies such as infrared, radio, and microwave, then the coaxial cable, fiber optic cable, twisted pair, DSL, or wireless technologies such as infrared, radio, and microwave are included in the definition of medium. Disk (disk) and disc (disc), as used herein, includes Compact Disc (CD), laser disc, optical disc, Digital Versatile Disc (DVD), floppy disk and blu-ray disc where disks (disks) usually reproduce data magnetically, while discs (discs) reproduce data optically with lasers. Combinations of the above should also be included within the scope of computer-readable media.
It is noted that references in the specification to "one embodiment," "an example embodiment," "some embodiments," etc., indicate that the embodiment described may include a particular feature, structure, or characteristic, but every embodiment may not necessarily include the particular feature, structure, or characteristic. Moreover, such phrases are not necessarily referring to the same embodiment. Further, when a particular feature, structure, or characteristic is described in connection with an embodiment, it is submitted that it is within the knowledge of one skilled in the art to effect such feature, structure, or characteristic in connection with other embodiments whether or not explicitly described.
The previous description of the disclosure is provided to enable any person skilled in the art to make or use the disclosure. Various modifications to the disclosure will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other variations without departing from the spirit or scope of the disclosure. Thus, the disclosure is not intended to be limited to the examples and designs described herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims (10)

1. An LTE high load cell discrimination method is characterized by comprising the following steps:
step 1, extracting statistical data of cell performance indexes of an area to be evaluated, and rejecting samples with the average RRC connection user number of 0;
step 2, selecting a first index and at least one second index from the statistical data of the performance indexes, and calculating a Pearson correlation coefficient of the second index and the first index;
step 3, fitting the first index and the second index by taking the first index as a horizontal coordinate and the second index as a vertical coordinate; obtaining a screening inflection point K based on a plurality of sample mean values before the second index and the corresponding Pearson correlation coefficients;
step 4, rejecting samples with vertical coordinates smaller than the vertical coordinates corresponding to the inflection point K and horizontal coordinates smaller than the horizontal coordinates corresponding to the inflection point K, and repeating the step 3-4 until the screening inflection point is at the leftmost end or the rightmost end of the fitting line;
and 5, screening out the high-load cell according to a preset condition based on the final sample.
2. The LTE high-load cell discrimination method according to claim 1, wherein in step 1, samples with an average PRC connection user number smaller than a predetermined value are removed.
3. The method for discriminating the LTE high-load cell according to claim 1, wherein the first index is an average number of RRC connected users; the second index is the average occupancy rate of the downlink PRB and/or the downlink user plane traffic of the PDCP layer.
4. The LTE high load cell screening method according to claim 1, wherein the mean value of a plurality of samples with the second index ranked at the top and the square r of the Pearson correlation coefficient2And multiplying to obtain the ordinate of the inflection point, and taking the abscissa corresponding to the inflection point on the fitting curve as the abscissa of the inflection point.
5. The method for screening the high-load LTE cell according to claim 1, wherein in said step 4, the conditions for screening the high-load cell are:
there were at least 4 high load samples for 7 consecutive days;
or
There were at least 7 high load samples for 15 consecutive days;
or
There were at least 14 high load samples for 30 consecutive days.
6. An LTE high load cell screening system, comprising:
the network data extraction module is used for extracting the statistical data of the performance indexes of the cells of the area to be evaluated and eliminating invalid numbers;
the correlation coefficient determining module is used for selecting a first index and at least one second index from the performance index statistical data and calculating the Pearson correlation coefficient of the second index and the first index;
the screening inflection point determining module is used for fitting the first index and the second index by taking the first index as an abscissa and taking the second index as an ordinate; obtaining a screening inflection point K based on a plurality of sample mean values before the second index and the corresponding Pearson correlation coefficients;
the relevant data eliminating module is used for eliminating samples of which the vertical coordinate is smaller than the vertical coordinate corresponding to the inflection point K and the horizontal coordinate is smaller than the horizontal coordinate corresponding to the inflection point K, and the screening inflection point determining module and the relevant data eliminating module are repeatedly called until the screening inflection point is at the leftmost end or the rightmost end of the fitting line;
and the high-load cell screening module screens out the cell to be expanded according to preset conditions based on the final sample.
7. The LTE high-load cell screening system according to claim 6, wherein samples with an average PRC connection user number smaller than a predetermined value are removed from the network data extraction module.
8. The LTE high load cell screening system of claim 6, wherein the first indicator is an average number of RRC connected users; the second index is the average occupancy rate of the downlink PRB and/or the downlink user plane traffic of the PDCP layer.
9. The LTE high load cell screening system according to claim 6, wherein the mean value of a plurality of samples with the second index ranked at the top and the square r of the Pearson correlation coefficient2And multiplying to obtain the ordinate of the inflection point, and taking the abscissa corresponding to the inflection point on the fitting curve as the abscissa of the inflection point.
10. The LTE high-load cell screening method according to claim 6, wherein in the high-load cell screening module, the conditions for screening the high-load cell are as follows:
there were at least 4 high load samples for 7 consecutive days;
or
There were at least 7 high load samples for 15 consecutive days;
or
There were at least 14 high load samples for 30 consecutive days.
CN202110490568.6A 2018-06-06 2018-06-06 LTE high-load cell discrimination method and system Active CN113133035B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110490568.6A CN113133035B (en) 2018-06-06 2018-06-06 LTE high-load cell discrimination method and system

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810573488.5A CN108777870B (en) 2018-06-06 2018-06-06 LTE high-load cell discrimination method and system based on Pearson coefficient
CN202110490568.6A CN113133035B (en) 2018-06-06 2018-06-06 LTE high-load cell discrimination method and system

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
CN201810573488.5A Division CN108777870B (en) 2018-06-06 2018-06-06 LTE high-load cell discrimination method and system based on Pearson coefficient

Publications (2)

Publication Number Publication Date
CN113133035A true CN113133035A (en) 2021-07-16
CN113133035B CN113133035B (en) 2022-07-22

Family

ID=64025767

Family Applications (2)

Application Number Title Priority Date Filing Date
CN202110490568.6A Active CN113133035B (en) 2018-06-06 2018-06-06 LTE high-load cell discrimination method and system
CN201810573488.5A Active CN108777870B (en) 2018-06-06 2018-06-06 LTE high-load cell discrimination method and system based on Pearson coefficient

Family Applications After (1)

Application Number Title Priority Date Filing Date
CN201810573488.5A Active CN108777870B (en) 2018-06-06 2018-06-06 LTE high-load cell discrimination method and system based on Pearson coefficient

Country Status (1)

Country Link
CN (2) CN113133035B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111132215B (en) * 2019-12-18 2023-06-20 广东海格怡创科技有限公司 Hot spot area positioning method, device, computer equipment and storage medium
CN111010665B (en) * 2019-12-18 2021-03-16 广东海格怡创科技有限公司 Hotspot range positioning method and device, computer equipment and storage medium
CN113259954B (en) * 2020-02-13 2023-02-28 ***通信集团广东有限公司 Method and device for determining quality difference processing strategy and electronic equipment

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6055433A (en) * 1996-09-20 2000-04-25 Northern Telecom Limited Data processing system and method for balancing a load in a communications network
CN103039118A (en) * 2010-06-28 2013-04-10 高通股份有限公司 Resource utilization measurements for heterogeneous networks
CN106714239A (en) * 2016-12-14 2017-05-24 北京拓明科技有限公司 LTE network load automatic balancing method and system
CN106851736A (en) * 2016-12-15 2017-06-13 北京拓明科技有限公司 LTE adjacent areas parameter automatic optimization method based on load
CN107277844A (en) * 2017-07-25 2017-10-20 山东浪潮商用***有限公司 A kind of communication network high load capacity cell method for early warning based on time series

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8682310B2 (en) * 2010-05-28 2014-03-25 Telefonaktiebolaget L M Ericsson (Publ) Network-based area positioning for capacity and coverage improvement
CN104469817B (en) * 2013-09-18 2018-10-19 上海无线通信研究中心 The method for controlling small base station dormancy based on load and interference level
CN106612516A (en) * 2015-10-26 2017-05-03 中兴通讯股份有限公司 Parameter optimization method and device
CN107371178B (en) * 2017-08-28 2019-10-18 浪潮软件集团有限公司 high-load cell optimization method and device

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6055433A (en) * 1996-09-20 2000-04-25 Northern Telecom Limited Data processing system and method for balancing a load in a communications network
CN103039118A (en) * 2010-06-28 2013-04-10 高通股份有限公司 Resource utilization measurements for heterogeneous networks
CN106714239A (en) * 2016-12-14 2017-05-24 北京拓明科技有限公司 LTE network load automatic balancing method and system
CN106851736A (en) * 2016-12-15 2017-06-13 北京拓明科技有限公司 LTE adjacent areas parameter automatic optimization method based on load
CN107277844A (en) * 2017-07-25 2017-10-20 山东浪潮商用***有限公司 A kind of communication network high load capacity cell method for early warning based on time series

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
郭奕吟: "基于多维度数据的LTE网络智能容量优化研究", 《数字通信世界》 *
龙青良等: "基于用户感知的LTE网络负荷评估体系及扩容研究", 《邮电设计技术》 *

Also Published As

Publication number Publication date
CN113133035B (en) 2022-07-22
CN108777870B (en) 2021-06-08
CN108777870A (en) 2018-11-09

Similar Documents

Publication Publication Date Title
CN108777870B (en) LTE high-load cell discrimination method and system based on Pearson coefficient
CN103491556B (en) A kind of method and device of network adjustment
CN109905882B (en) Network capacity expansion method and device
CN110380881B (en) Network capacity expansion prediction method and device
CN109391950B (en) Method, device, equipment and medium for predicting terminal distribution
CN107071788B (en) Spectrum sensing method and device in cognitive wireless network
CN104661244A (en) Method and device for evaluating PCI (physical cell identifier) mod3 interference
CN105429792B (en) User behavior flow acquisition methods and device, user behavior analysis method and system
CN106982441A (en) A kind of determination method and device of cell capacity-enlarging
WO2010015145A1 (en) Method and system for filtering and monitoring program behaviors
US20110023116A1 (en) Method and apparatus for spam short message detection
US8682293B1 (en) Telecom fraud detection using social pattern
CN105139227A (en) Data calculating method and apparatus
CN104883705A (en) Problem positioning method for data service complaints and device thereof
CN102523286B (en) Method and device for obtaining credit degree of service
CN103607731A (en) Method and device for processing measurement reports
CN103955192B (en) A kind of curve form data sampling method for sewage work
CN105095382A (en) Method and device for sample distributed clustering calculation
CN101980209B (en) Adaptive multi-field search engine calling method and system
CN107086923B (en) Communication network performance index analysis method and device
CN102487334B (en) Method and device for determining associated alarm information
CN103746753A (en) Data prediction method based on cognitive wireless network
CN103841601B (en) A kind of method of accurate assessment TD SCDMA inter-cell frequency scrambling interference menace level
CN114417184A (en) Seed node screening method, device, equipment and storage medium
CN112597323A (en) Remote sensing image storage and migration method and device and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant