CN103257896B - A kind of Max-D job scheduling method under cloud environment - Google Patents

A kind of Max-D job scheduling method under cloud environment Download PDF

Info

Publication number
CN103257896B
CN103257896B CN201310038329.2A CN201310038329A CN103257896B CN 103257896 B CN103257896 B CN 103257896B CN 201310038329 A CN201310038329 A CN 201310038329A CN 103257896 B CN103257896 B CN 103257896B
Authority
CN
China
Prior art keywords
resource
time
tcir
rtcir
cloud environment
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201310038329.2A
Other languages
Chinese (zh)
Other versions
CN103257896A (en
Inventor
李千目
陆路
侯君
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing China Network Technology Co., Ltd.
Original Assignee
LIANYUNGANG RESEARCH INSTITUTE OF NANJING UNIVERSITY OF SCIENCE AND TECHNOLOGY
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by LIANYUNGANG RESEARCH INSTITUTE OF NANJING UNIVERSITY OF SCIENCE AND TECHNOLOGY filed Critical LIANYUNGANG RESEARCH INSTITUTE OF NANJING UNIVERSITY OF SCIENCE AND TECHNOLOGY
Priority to CN201310038329.2A priority Critical patent/CN103257896B/en
Publication of CN103257896A publication Critical patent/CN103257896A/en
Application granted granted Critical
Publication of CN103257896B publication Critical patent/CN103257896B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses the Max D job scheduling method under a kind of cloud environment.This algorithm, by estimating each task operation time in resource, is that the most suitable resource of task choosing performs with Max D-algorithm, and when actual operating efficiency is higher than estimated efficiency, mates task with resource.This job scheduling method can the load balancing of resource under effective guarantee cloud environment, and operation average operating time can be made to reduce, increase the handling capacity of system.

Description

A kind of Max-D job scheduling method under cloud environment
Technical field
The present invention relates to cloud environment job scheduling method, the Max-D job scheduling method under a kind of cloud environment.
Technical background
In recent years, cloud computing mode quickly grows, and framework and the method for operation of IT industry are changed the most therewith.Computing power is wanted by cloud computing Asking reduction, the most supermatic management mode seldom needs manual intervention, substantial saving in enterprise procurement and artificial cost.This also makes High-performance computer, high end storage, high-end server markets are gradually tied up by the cluster of low side devices;Traditional data center is with low cost Cloud computing center replace;A large amount of software application are release in cloud platform in the way of service, and even many application and game can be transported in " cloud " OK.
Cloud computing workload to be dealt with and data volume are huge, and the system almost moment is all processing operation and the data of magnanimity the most how Cloud resource is reasonably distributed, operation is dispatched efficiently, enabling meet the use demand of user, allow operation that user submits to The process time is shorter, execution cost is less, and it is the emphasis in cloud computing and difficult point that the load of simultaneity factor maintains the state of a relative equilibrium.Just It is the demand owing to needing farthest to meet QoS of customer (Quality of Service, QOS) under cloud environment, so research is made under cloud environment Industry dispatching method significant.Inappropriate job scheduling strategy can cause the waste of cloud resource, increases execution time and the cost of user job, Even more so that system congestion cannot continue to provide service time serious;And suitable job scheduling can be on the premise of meeting the demand of user, by resource Waste reduces as far as possible, reaches the expection of user and cloud service provider.Therefore, study under a kind of cloud environment that can meet user and enterprise demand Job scheduling method is necessary.
A.Vouk proposes Min-Min job scheduling method in " Cloud Computing-Issues, Research and Implementations ". Min-Min dispatching method estimates that each unscheduled operation minimum completion time obtains minimum completion time set, then by complete for the minimum of All Jobs The one-tenth time compares, and chooses the minimum operation of deadline in set and is dispatched to suitably calculate node.Min-Min method enables to single work The deadline of industry is less, but can produce the unbalanced of load, and the flat near deadline of operation is longer.
Summary of the invention
It is an object of the invention to provide the job scheduling method under a kind of cloud environment so that when calculating node processing operation under cloud environment, can keep load all Weighing apparatus, and reduce the average completion time of operation.
The technical scheme realizing the object of the invention is:
A kind of Max-D job scheduling method under cloud environment, step is as follows:
The first step: determine all calculating resources and the set of idling-resource in cloud environment.
Second step: submit to priority to be ranked into queue by operation operation to be allocated, the new operation submitted to is added into this queue tail.
3rd step: be scheduling the operation after sequence, uses the suitable resource of Max-D method choice to perform.
For the Max-D method of the 3rd step, its step is as follows:
Step 3.1: to all operations to be allocated, calculates operation averaged power spectrum in all calculating resources and runs the time;
Step 3.2: the averaged power spectrum calculating each operation ran between time and its operation time minimum in the calculating resource of single free time Difference Di, and record this calculating resource;
Step 3.3: find the operation that difference Di is maximum in All Jobs, and this Di is designated as D;
Step 3.4: if D >=0, then assign operation and process to the resource of record, this resource is removed from idling-resource set simultaneously;If D < 0, then redefine resource and the idling-resource set of distribution, joins in idling-resource set, then by the resource completing its distribution operation Return step 3.1.
Step 3.5: repeat step 3.2 to step 3.4 until the resource for all application operations is assigned with operation.
Step 3.1 calculates resource averaged power spectrum deadline method as follows:
Assume that cloud environment is by n unallocated operation T={t1,t2,...tnAnd m resource R={r1,r2,...rmComposition, each resource simultaneously can only One operation is processed;Resource number idle in resource is k, is designated as R'={r1',r2',...rk', wherein k < m;Operation ti is in resource rj On estimation run the time be TCirj, then operation ti average operating time in all resources is
The operation ti deadline in resource rj, be the residual completion time of operation and operation ti just performed on rj on rj when completing Between sum.
Assuming in cloud environment, for same class operation, the data volume that the speed that resource processes processes to it is directly proportional.Operation i is in resource r Estimated Time Of Completion is just to run the residual completion time of operation in resource r and operation i performs time sum in resource r:
TCir j ( k + 1 ) = RTCir j ( k ) &times; 1 - p r o p r o + &lsqb; ( 1 - &rho; ) TCir j ( k ) M ( k ) + &rho; RTCir j ( k ) M ( k ) p r o &rsqb; &times; M ( k + 1 ) , r j &Element; R - - - ( 1 )
Wherein, TCirj(k+1) deadline needed for representing resource rj process operation ti, TCirjK () represents that previous operation is in resource rj On the prediction deadline;M (k) is the ratio running this operation required time with run unit operation required time;RTCirjK () represents previous Individual operation actual run time on rj, pro (0 < pro≤1) represents the completed percentage of previous operation, if resource rj is idling-resource, i.e. Previous operation has performed, then pro=1, and above-mentioned formula can be reduced to
TCir j ( k + 1 ) = &lsqb; ( 1 - &rho; ) TCir j ( k ) M ( k ) + &rho; RTCir j ( k ) M ( k ) &rsqb; &times; M ( k + 1 ) , r j &Element; R &prime; - - - ( 2 )
Time TCir is performed by the estimation of operation previous in this resourcej(k) and actual execution time RTCirjK (), uses formula (1) to carry out Estimation obtains the unscheduled operation execution time in certain resource.But, in the stage that system has just started, each resource was also not carried out operation, Then the execution time of resource cannot be estimated by the implementation status of previous operation.Therefore when system just starts, for all resources, order
TCirj(0)=RTCirj(0)=0 (3)
First the most pending operation can select the resource being not carried out operation to perform, and after resource has performed first operation, is just made Actual execution time RTCir of industryj(1), TCir is madej(1) equal to RTCirj(1), then the operation time of operation afterwards is estimated according to formula (1) Calculate.
The method calculating difference D in step 3.2 is as follows:
The operation ti minimum operation time on the node of all unallocated work is designated as mUTCi=min{TCir1',TCir2',...,TCirk', note TCir is met under recordj'=mUTCiUnallocated operation rj ', and remember BRi=rj', then according to formula Di=AvgTCi-mUTCi, Difference Di to operation i.
Compared with prior art, its remarkable advantage: 1, compared to conventional scheduling method, the job scheduling of the present invention only can be by operation for the present invention It is assigned in the resource of free time, it is ensured that the equilibrium of load under cloud environment, does not haves the situation that part resource transships and other resources are idle;
2, compared to conventional scheduling method, the present invention is that operation selects most suitable resource by Max-D method, decreases averagely completing of operation Time, improve the throughput of system.
Accompanying drawing explanation
Accompanying drawing is the flow chart of Max-D method of the present invention.
Detailed description of the invention
The present invention will be further described below in conjunction with the accompanying drawings.
Assume that cloud environment is by n unallocated operation T={t1,t2,...tnAnd m resource R={r1,r2,...rmComposition, each resource simultaneously can only One operation is processed;Resource number idle in resource is k, is designated as R'={r1',r2',...rk', wherein k < m;Operation ti is in resource rj On estimation run the time be TCirj, then operation ti average operating time in all resources isOperation ti is all The minimum operation time on the node of unallocated work is designated as mUTCi=min{TCir1',TCir2',...,TCirk', record satisfied TCirj'=mUTCiUnallocated operation rj ', and remember BRi=rj';
When needing the operation set non-NULL of scheduling, the following operation of execution:
Step 1: All Jobs in operation set T is calculated AvgTC respectivelyi
Step 2: each operation ti is found mUTCi, and calculate Di=AvgTCi-mUTCi
Step 3: find operation ti so that Di=Max{D1,D2,...,Dn, if there being multiple operation to meet condition, then arrive according to these operations Order select ti;
Step 4: if Di>=0, then assignment operation ti is to the process of resource BRi, resource BRi is removed from idling-resource set R ' simultaneously; If Di< 0, then reappraise resource and the idling-resource set of distribution, joins in idling-resource set, so by the resource completing its distribution operation Rear return step (1).
Step 5: repeat step 2 to step 4 until the resource for all application operations is assigned with operation.
The operation ti deadline in resource rj, be the residual completion time of operation and operation ti just performed on rj on rj when completing Between sum.
Assume that, for same class operation, the data volume that the speed that resource processes processes to it is directly proportional herein.Anticipated in resource r of operation i completes Time is just to run the residual completion time of operation in resource r and operation i performs time sum in resource r:
TCir j ( k + 1 ) = RTCir j ( k ) &times; 1 - p r o p r o + &lsqb; ( 1 - &rho; ) TCir j ( k ) M ( k ) + &rho; RTCir j ( k ) M ( k ) p r o &rsqb; &times; M ( k + 1 ) , r j &Element; R
Wherein, TCirj(k+1) deadline distributed to by operation ti needed for resource rj processes, TCir are representedjK () represents that previous operation is in money The prediction deadline on the rj of source;M (k) is the ratio running this operation required time with run unit operation required time;RTCirjK () represents Previous operation actual run time on rj, pro (0 < pro≤1) represents the completed percentage of previous operation, if resource rj is idling-resource, The most previous operation has performed, then pro=1, and above-mentioned formula can be reduced to
TCir j ( k + 1 ) = &lsqb; ( 1 - &rho; ) TCir j ( k ) M ( k ) + &rho; RTCir j ( k ) M ( k ) &rsqb; &times; M ( k + 1 ) , r j &Element; R &prime;
According to formula, the unscheduled operation time that performs in certain resource can perform the time by the estimation of operation previous in this resource TCirj(k) and actual execution time RTCirjK () is estimated.But, in the stage that system has just started, each resource was also not carried out operation, Then the execution time of resource cannot be estimated by the implementation status of previous operation.Therefore when system just starts, for all resources, order
TCirj(0)=RTCirj(0)=0
First the most pending operation can select to be not carried out the resource of operation, after resource has performed first operation, has just obtained the reality of operation Execution time RTCirj(1), TCir is madej(1) equal to RTCirj(1), then the operation time of operation afterwards can be estimated according to formula (1).

Claims (3)

1. the Max-D job scheduling method under a cloud environment, it is characterised in that step is as follows:
The first step: determine all calculating resources and the set of idling-resource in cloud environment;
Second step: submit to priority to be ranked into queue by operation operation to be allocated, the new operation submitted to is added into this queue tail;
3rd step: be scheduling the operation after sequence, uses the suitable resource of Max-D method choice to perform;Max-D method, its step Rapid as follows:
Step 3.1: to all operations to be allocated, calculates operation averaged power spectrum in all calculating resources and runs the time;
Step 3.2: the averaged power spectrum calculating each operation ran between time and its operation time minimum in the calculating resource of single free time Difference Di, and record this calculating resource;
Step 3.3: find the operation that difference Di is maximum in All Jobs, and this Di is designated as D;
Step 3.4: if D >=0, then assign operation and process to the resource of record, this resource is removed from idling-resource set simultaneously;If D < 0, then redefine resource and the idling-resource set of distribution, joins in idling-resource set, then by the resource completing its distribution operation Return step 3.1;
Step 3.5: repeat step 3.2 to step 3.4 until the resource for all application operations is assigned with operation.
Max-D job scheduling method under cloud environment the most according to claim 1, it is characterised in that calculate resource in described step 3.1 Averaged power spectrum deadline method is as follows:
Assume that cloud environment is by n unallocated operation T={t1,t2,...tnAnd m resource R={r1,r2,...rmComposition, each resource simultaneously can only One operation is processed;Resource number idle in resource is k, is designated as R'={r1',r2',...rk', wherein k < m;Operation ti is in resource rj On estimation run the time be TCirj, then operation ti average operating time in all resources is
The operation ti deadline in resource rj, be the residual completion time of operation and operation ti just performed on rj on rj when completing Between sum;
Assuming in cloud environment, for same class operation, the data volume that the speed that resource processes processes to it is directly proportional, and operation i is in resource r Estimated Time Of Completion is just to run the residual completion time of operation in resource r and operation i performs time sum in resource r:
TCir j ( k + 1 ) = RTCir j ( k ) &times; 1 - p r o p r o + &lsqb; ( 1 - &rho; ) TCir j ( k ) M ( k ) + &rho; RTCir j ( k ) M ( k ) p r o &rsqb; &times; M ( k + 1 ) , r j &Element; R - - - ( 1 )
Wherein, TCirj(k+1) deadline needed for representing resource rj process operation ti, TCirjK () represents that previous operation is in resource rj On the prediction deadline;M (k) is the ratio running this operation required time with run unit operation required time;RTCirjK () represents previous Individual operation actual run time on rj, pro (0 < pro≤1) represents the completed percentage of previous operation, if resource rj is idling-resource, i.e. Previous operation has performed, then pro=1, and above-mentioned formula can be reduced to
TCir j ( k + 1 ) = &lsqb; ( 1 - &rho; ) TCir j ( k ) M ( k ) + &rho; RTCir j ( k ) M ( k ) &rsqb; &times; M ( k + 1 ) , r j &Element; R &prime; - - - ( 2 )
Time TCir is performed by the estimation of operation previous in this resourcej(k) and actual execution time RTCirjK (), uses formula (1) to carry out Estimation obtains the unscheduled operation execution time in certain resource;
When system just starts, for all resources, order
TCirj(0)=RTCirj(0)=0 (3)
First pending operation can select the resource being not carried out operation to perform, and after resource has performed first operation, has just obtained operation Actual execution time RTCirj(1), TCir is madej(1) equal to RTCirj(1), then the operation time of operation afterwards is estimated according to formula (1).
Max-D job scheduling method under cloud environment the most according to claim 1, it is characterised in that calculate difference D in step 3.2 Method is as follows:
The operation ti minimum operation time on the node of all unallocated work is designated as mUTCi=min{TCir1',TCir2',...,TCirk', note TCir is met under recordj'=mUTCiUnallocated operation rj ', and remember BRi=rj', then according to formula Di=AvgTCi-mUTCi, Difference Di to operation i.
CN201310038329.2A 2013-01-31 2013-01-31 A kind of Max-D job scheduling method under cloud environment Expired - Fee Related CN103257896B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310038329.2A CN103257896B (en) 2013-01-31 2013-01-31 A kind of Max-D job scheduling method under cloud environment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310038329.2A CN103257896B (en) 2013-01-31 2013-01-31 A kind of Max-D job scheduling method under cloud environment

Publications (2)

Publication Number Publication Date
CN103257896A CN103257896A (en) 2013-08-21
CN103257896B true CN103257896B (en) 2016-09-28

Family

ID=48961831

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310038329.2A Expired - Fee Related CN103257896B (en) 2013-01-31 2013-01-31 A kind of Max-D job scheduling method under cloud environment

Country Status (1)

Country Link
CN (1) CN103257896B (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103530182A (en) * 2013-10-22 2014-01-22 海南大学 Working scheduling method and device
CN104951368B (en) * 2014-03-28 2019-02-22 中国电信股份有限公司 Resource dynamic allocation device and method
CN105446979B (en) 2014-06-27 2019-02-01 华为技术有限公司 Data digging method and node
CN108270833B (en) * 2016-12-31 2021-07-16 ***通信集团安徽有限公司 Automatic scheduling method, device and system for rendering cloud resources
CN106790636A (en) * 2017-01-09 2017-05-31 上海承蓝科技股份有限公司 A kind of equally loaded system and method for cloud computing server cluster
CN108509256B (en) * 2017-02-28 2021-01-15 华为技术有限公司 Method and device for scheduling running device and running device
CN110445939B (en) * 2019-08-08 2021-03-30 中国联合网络通信集团有限公司 Capacity resource prediction method and device

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100369398C (en) * 2001-12-29 2008-02-13 中兴通讯股份有限公司 Method and device of continuous channet unit dispatching used in CDMA system
TWI426393B (en) * 2010-02-12 2014-02-11 Elitegroup Computer Sys Co Ltd Cloud-computating resource scheduling method and system applying the same

Also Published As

Publication number Publication date
CN103257896A (en) 2013-08-21

Similar Documents

Publication Publication Date Title
CN103257896B (en) A kind of Max-D job scheduling method under cloud environment
CN108009023B (en) Task scheduling method based on BP neural network time prediction in hybrid cloud
CN105718479B (en) Execution strategy generation method and device under cross-IDC big data processing architecture
CN109582448B (en) Criticality and timeliness oriented edge calculation task scheduling method
CN104765640B (en) A kind of intelligent Service dispatching method
CN105260005B (en) Cloud workflow schedule optimization method towards energy consumption
CN104023042B (en) Cloud platform resource scheduling method
CN102243598A (en) Task scheduling method and system in distributed data warehouse
CN114138486A (en) Containerized micro-service arranging method, system and medium for cloud edge heterogeneous environment
CN104142860A (en) Resource adjusting method and device of application service system
CN103401939A (en) Load balancing method adopting mixing scheduling strategy
CN103455375B (en) Load-monitoring-based hybrid scheduling method under Hadoop cloud platform
CN110308967A (en) A kind of workflow cost based on mixed cloud-delay optimization method for allocating tasks
CN108270805B (en) Resource allocation method and device for data processing
KR101770191B1 (en) Resource allocation and apparatus
CN106776025A (en) A kind of computer cluster job scheduling method and its device
CN107292419A (en) The Cost Optimization strategy that dynamic Multi-workflow scheduling is performed in a kind of mixing cloud environment
Delavar et al. A synthetic heuristic algorithm for independent task scheduling in cloud systems
CN104598311A (en) Method and device for real-time operation fair scheduling for Hadoop
CN104156505A (en) Hadoop cluster job scheduling method and device on basis of user behavior analysis
CN103617083A (en) Storage scheduling method and system, job scheduling method and system and management node
CN111782627B (en) Task and data cooperative scheduling method for wide-area high-performance computing environment
Mao et al. Optimal scheduling algorithm of MapReduce tasks based on QoS in the hybrid cloud
CN107797870A (en) A kind of cloud computing data resource dispatching method
CN112306696A (en) Energy-saving and efficient method and system for unloading edge computing tasks

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20170713

Address after: High tech Zone Nanjing city Jiangsu province 210000 Liufang Road No. 8 Building 7 layer

Patentee after: Nanjing China Network Technology Co., Ltd.

Address before: 222000 No. 2 Chenguang Road, Sinpo District, Jiangsu, Lianyungang

Patentee before: Lianyungang Research Institute of Nanjing University of Science and Technology

TR01 Transfer of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20160928

Termination date: 20200131

CF01 Termination of patent right due to non-payment of annual fee