CN108243030A - Backup server selection management method - Google Patents

Backup server selection management method

Info

Publication number
CN108243030A
Authority
CN
China
Prior art keywords
server
abnormal
servers
type
backup
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201611213267.4A
Other languages
Chinese (zh)
Inventor
曾飞传
安西民
任丽君
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Space Star Technology (beijing) Co Ltd
Original Assignee
Space Star Technology (beijing) Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Space Star Technology (beijing) Co Ltd
Priority to CN201611213267.4A
Publication of CN108243030A
Legal status: Pending

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06 Management of faults, events, alarms or notifications
    • H04L41/0654 Management of faults, events, alarms or notifications using network fault recovery
    • H04L41/0668 Management of faults, events, alarms or notifications using network fault recovery by dynamic selection of recovery network elements, e.g. replacement by the most appropriate element after failure
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14 Error detection or correction of the data by redundancy in operation
    • G06F11/1402 Saving, restoring, recovering or retrying
    • G06F11/1446 Point-in-time backing up or restoration of persistent data
    • G06F11/1458 Management of the backup or restore process
    • G06F11/1464 Management of the backup or restore process for networked environments
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/22 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks comprising specially adapted graphical user interfaces [GUI]
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00 Arrangements for monitoring or testing data switching networks
    • H04L43/16 Threshold monitoring

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Hardware Redundancy (AREA)

Abstract

The invention provides a backup server selection management method that selects a suitable backup server according to the type of the server in which an abnormality occurs, so as to satisfy the system's optimal requirements and improve the efficiency of recovery from server abnormalities.

Description

Backup server selection management method
【Technical field】
The invention belongs to the field of server operating-state monitoring, and in particular relates to a backup server selection management method.
【Background art】
In general, monitoring servers for abnormalities requires manual troubleshooting. Manual troubleshooting wastes human resources, has low accuracy, and does not allow remote monitoring or visual management.
The prior art also includes some methods for remotely monitoring the operating state of the computing servers in a system. However, when a server abnormality is found, the backup server is usually selected at random, and such a backup server generally fails to meet the system's optimal requirements in terms of performance and physical location.
In view of the above problems, a new backup server selection management method is urgently needed, one that selects a suitable backup server according to the type of the server in which the abnormality occurs, satisfies the system's optimal requirements, and improves the efficiency of recovery from server abnormalities.
【Summary of the invention】
To solve the above problems of the prior art, the invention proposes a backup server selection management method.
The technical solution adopted by the invention is as follows:
A backup server selection management method, characterized in that the system comprises multiple computing servers, a state management server and multiple standby servers, wherein the state management server is connected to the multiple computing servers and the standby servers and manages the state of the multiple computing servers, the method comprising the following steps:
(1) the state management server classifies the multiple computing servers, dividing them into computation-type servers, storage-type servers and communication-type servers according to their differing performance in computation, storage and communication;
(2) the state management server monitors in real time whether any of the multiple computing servers becomes abnormal;
(3) when the state management server detects that a computing server has become abnormal, it selects a server that is not abnormal from the multiple computing servers as the backup server, wherein the selected backup server is of the same type as the abnormal computing server.
The beneficial effects of the invention include: a suitable backup server is selected according to the type of the server in which the abnormality occurs, which satisfies the system's optimal requirements and improves the efficiency of recovery from server abnormalities.
【Description of the drawings】
The drawings described here provide a further understanding of the invention and form part of this application, but do not unduly limit the invention. In the drawings:
Fig. 1 is a structural diagram of the system of the invention;
Fig. 2 is a flowchart of the backup server selection management method of the invention.
【Detailed description】
The invention is described in detail below with reference to the drawings and specific embodiments. The illustrative embodiments and their descriptions serve only to explain the invention and do not limit it.
Fig. 1 shows the system to which the invention is applied. The system comprises multiple computing servers, a state management server and multiple standby servers, wherein the state management server is connected to the multiple computing servers and the standby servers and manages the state of the multiple computing servers.
In one embodiment, the system is a cloud system comprising multiple cloud computing servers, a state visualization management server and multiple standby servers, wherein the state visualization management server is connected to the multiple cloud computing servers and the standby servers and manages the state of the multiple cloud computing servers.
Referring to Fig. 2, Embodiment 1: a backup server selection management method comprising the following steps:
(1) the state management server classifies the multiple computing servers, dividing them into computation-type servers, storage-type servers and communication-type servers according to their differing performance in computation, storage and communication;
(2) the state management server monitors in real time whether any of the multiple computing servers becomes abnormal;
(3) when the state management server detects that a computing server has become abnormal, it selects a server that is not abnormal from the multiple computing servers as the backup server, wherein the selected backup server is of the same type as the abnormal computing server.
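The selection rule in step (3) of Embodiment 1 can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Server` class, its field names and the `select_backup` helper are hypothetical and do not come from the patent, and taking the first matching server is an arbitrary tie-break that the text does not specify.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Server:
    name: str
    kind: str            # "compute", "storage" or "communication" (assumed labels)
    abnormal: bool = False


def select_backup(failed: Server, servers: List[Server]) -> Optional[Server]:
    """Return a healthy server of the same type as the failed one, if any."""
    for candidate in servers:
        if candidate is failed or candidate.abnormal:
            continue
        if candidate.kind == failed.kind:
            return candidate  # first match wins (assumption, not from the source)
    return None  # the source does not say what happens when no match exists


servers = [
    Server("a", "compute", abnormal=True),
    Server("b", "storage"),
    Server("c", "compute"),
]
backup = select_backup(servers[0], servers)
print(backup.name if backup else "no match")  # prints "c"
```

Returning `None` when no same-type server is healthy leaves the fallback policy to the caller, since the description does not define one.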
Embodiment 2: the method comprises the following steps:
(1) The state visualization management server classifies the multiple cloud computing servers.
In one embodiment, the cloud computing servers are divided into computation-type, storage-type and communication-type servers according to their differing performance in computation, storage and communication. A computation threshold, a storage threshold and a communication threshold are set. If a server's computing capability is above the computation threshold, it is judged to be a candidate computation-type server; computing capability can be measured by the number of requests each server processes per second. If a server's storage capability is above the storage threshold, it is judged to be a candidate storage-type server; storage capability can be measured by each server's hard-disk storage capacity. If a server's communication capability is above the communication threshold, it is judged to be a candidate communication-type server; communication capability can be measured by the number of data packets each server sends and receives per unit time, or by its information transfer rate.
If a server is judged to be a candidate for only one of the computation, storage and communication types, it is classified as that type.
If a server is judged to be a candidate for at least two of the computation, storage and communication types, that is, at least two of its storage, communication and computing capabilities exceed their respective thresholds, the server is classified as the type whose capability exceeds its threshold by the highest ratio.
For example, suppose a server's storage capability exceeds the storage threshold by 20%, its computing capability exceeds the computation threshold by 30%, and its communication capability exceeds the communication threshold by 40%. The server is first judged to be a candidate computation-type, storage-type and communication-type server. Because the 40% by which its communication capability exceeds the communication threshold is the highest ratio, the server is finally classified as a communication-type server.
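The threshold-based classification above, including the tie-break by highest excess ratio, can be sketched as follows. The function name `classify`, the dictionary keys and the numeric threshold values are illustrative assumptions; the patent gives only the 20% / 30% / 40% example and does not say how a server that exceeds no threshold is classified, so the sketch returns `None` in that case.

```python
from typing import Dict, Optional


def classify(capability: Dict[str, float],
             threshold: Dict[str, float]) -> Optional[str]:
    """Return the type whose threshold is exceeded by the largest ratio."""
    # Relative excess over the threshold, kept only for exceeded thresholds.
    excess = {
        kind: (capability[kind] - threshold[kind]) / threshold[kind]
        for kind in threshold
        if capability[kind] > threshold[kind]
    }
    if not excess:
        return None  # case not covered by the source (assumption)
    return max(excess, key=excess.get)


# Hypothetical units: requests per second, terabytes, packets per second.
thresholds = {"compute": 1000.0, "storage": 10.0, "communication": 500.0}

# Mirrors the example in the text: +30% compute, +20% storage, +40% communication.
server = {"compute": 1300.0, "storage": 12.0, "communication": 700.0}
print(classify(server, thresholds))  # prints "communication"
```

A server that exceeds only one threshold falls out of the same code path, because `excess` then has a single entry.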
(2) The state visualization management server obtains the actual physical addresses of the multiple cloud computing servers.
In one embodiment, each of the multiple cloud computing servers determines its actual physical address by means of a built-in GPS module and sends that address to the state visualization management server.
(3) According to the obtained actual physical addresses, the state visualization management server divides the multiple cloud computing servers into computer clusters in several regions, such that the physical distance between the computers of the cluster within a region is within a predetermined threshold range.
(4) A standby server is set up in each of the above regions.
In one embodiment, the standby server may be placed near the location of one of the computers in the computer cluster of each region.
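Step (3) above states only that servers in the same region must lie within a predetermined distance of one another; it does not name a grouping algorithm. The sketch below assumes GPS positions given as (latitude, longitude) in degrees, uses the haversine great-circle distance, and applies a simple greedy grouping. The function names, the 50 km threshold and the coordinates are all hypothetical.

```python
import math
from typing import Dict, List, Tuple

EARTH_RADIUS_KM = 6371.0


def haversine_km(p: Tuple[float, float], q: Tuple[float, float]) -> float:
    """Great-circle distance between two (lat, lon) points given in degrees."""
    lat1, lon1, lat2, lon2 = map(math.radians, (*p, *q))
    a = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def group_into_regions(positions: Dict[str, Tuple[float, float]],
                       max_km: float) -> List[List[str]]:
    """Greedily place each server in the first region where it is within
    max_km of every existing member; otherwise open a new region."""
    regions: List[List[str]] = []
    for name, pos in positions.items():
        for region in regions:
            if all(haversine_km(pos, positions[m]) <= max_km for m in region):
                region.append(name)
                break
        else:
            regions.append([name])
    return regions


positions = {
    "s1": (39.90, 116.40),
    "s2": (39.95, 116.45),
    "s3": (31.23, 121.47),
    "s4": (31.20, 121.50),
}
print(group_into_regions(positions, max_km=50.0))
```

The greedy pass depends on input order and is not guaranteed to produce the fewest regions; it is chosen here only because it keeps every pair in a region within the threshold, which is the one property the text requires.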
(5) The display terminal of the state visualization management server generates a server state management map.
In one embodiment, the server state management map is marked with multiple regions. Each computer server and each redundant computer in the computer cluster of each region is marked at the map position corresponding to its actual physical address. The computing servers in a computer cluster are represented by computer server icons of the same shape and distinguished by different computer server identifiers; the redundant computers are represented by redundant computer icons of the same shape and distinguished by different redundant computer identifiers.
(6) The state visualization management server monitors in real time whether any of the multiple cloud computing servers becomes abnormal.
(7) If the state visualization management server detects that a cloud computing server has become abnormal, it marks the abnormal cloud computing server on the server state management map, and selects and determines a backup server.
In one embodiment, the abnormality mark may be an abnormality icon, with different types of abnormality represented by different icons. The abnormality is one of three types: processing abnormality, communication abnormality and storage abnormality.
Further:
(7-1) when the state visualization management server detects that one cloud computing server has become abnormal, then:
(7-1-1) the abnormal cloud computing server is marked as abnormal on the server state management map;
(7-1-2) the type of the abnormal cloud computing server is determined, the type being one of computation-type server, storage-type server and communication-type server;
(7-1-3) the cluster region in which the abnormal cloud computing server is located is determined;
(7-1-4) a server that is not abnormal is selected from the cloud computing servers in the determined cluster region as the backup server, wherein the selected backup server is of the same type as the abnormal cloud computing server;
(7-2) when the remote state visualization management server detects that m cloud computing servers have become abnormal, where m ≥ 2, then:
(7-2-1) the m abnormal cloud computing servers are marked as abnormal on the server state management map;
(7-2-2) the cluster region in which each of the m abnormal cloud computing servers is located is determined;
(7-2-3) if the m abnormal cloud computing servers are each located in a different cluster region, then for each of them a server that is not abnormal is selected from the cloud computing servers in its cluster region as the backup server, wherein the selected backup server is of the same type as the abnormal cloud computing server;
(7-2-4) if the m abnormal cloud computing servers are located in k cluster regions, each of which has n or more abnormal cloud computing servers, where 2 ≤ n ≤ m and k ≥ 1, then in each of the k cluster regions, n-1 cloud computing servers that are not abnormal are selected from the cloud computing servers in that region as backup servers, the n-1 selected backup servers being of the same types as n-1 of the abnormal cloud computing servers; at the same time, the standby server in each of the k cluster regions is activated to serve as one further backup server for the cluster in that region.
Suppose 4 cloud computing servers become abnormal and are located in 2 cluster regions, with 2 abnormal servers in each region. Each cluster region has 6 computing servers and one standby server. The types of the abnormal servers are determined. In each cluster region, 1 server that is not abnormal is selected as a backup server; its type matches that of one of the 2 abnormal servers in the region, and it backs up that server. For the other abnormal server, which is of a different type, the standby server in the cluster region is selected as its backup server. Because the abnormal servers account for a high proportion of the servers in each cluster region (2/6), the standby server is used as a backup server instead of another running server in the cluster. This avoids a larger impact on the operation of the other servers in the region and improves the overall operating efficiency of the system.
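The per-region rules in steps (7-1) and (7-2) can be sketched as a single planning function. This is a minimal illustration, not the patented implementation: the `Server` class, `plan_backups` and all server names are hypothetical, the choice of which abnormal server receives the standby server (here, the last one listed in its region) is an assumption, and a `None` entry marks a case the text does not cover.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class Server:
    name: str
    region: str
    kind: str            # "compute", "storage" or "communication" (assumed labels)
    abnormal: bool = False


def plan_backups(servers: List[Server],
                 standby_by_region: Dict[str, str]) -> Dict[str, Optional[str]]:
    """Map each abnormal server's name to the name of its backup."""
    by_region: Dict[str, List[Server]] = defaultdict(list)
    for s in servers:
        by_region[s.region].append(s)

    plan: Dict[str, Optional[str]] = {}
    for region, members in by_region.items():
        failed = [s for s in members if s.abnormal]
        healthy = [s for s in members if not s.abnormal]
        # With n >= 2 failures in a region, n-1 are covered by running
        # same-type servers and one by the region's standby server.
        use_standby = len(failed) >= 2
        for i, f in enumerate(failed):
            if use_standby and i == len(failed) - 1:
                plan[f.name] = standby_by_region[region]
                continue
            match = next((h for h in healthy if h.kind == f.kind), None)
            if match is None:
                plan[f.name] = None  # no same-type server left (not covered by the source)
            else:
                healthy.remove(match)  # each running server backs up at most one peer
                plan[f.name] = match.name
    return plan


servers = [
    Server("a1", "A", "compute", abnormal=True),
    Server("a2", "A", "storage", abnormal=True),
    Server("a3", "A", "compute"),
    Server("a4", "A", "storage"),
    Server("a5", "A", "communication"),
    Server("a6", "A", "compute"),
    Server("b1", "B", "communication", abnormal=True),
    Server("b2", "B", "communication"),
]
plan = plan_backups(servers, {"A": "A-standby", "B": "B-standby"})
print(plan)
```

Region A mirrors the worked example above: 2 of 6 servers are abnormal, one is backed up by a running server of the same type and the other by the standby server. Region B has a single failure, so its standby server stays idle.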
(8) The selected backup server is activated, and a backup-server activation mark is applied to it on the server state management map.
By the above method, the invention selects a suitable backup server according to the type of the server in which the abnormality occurs, which satisfies the system's optimal requirements and improves the efficiency of recovery from server abnormalities.
The above is only a preferred embodiment of the invention. All equivalent changes or modifications made according to the structures, features and principles described within the scope of this patent application are included within the scope of this patent application.

Claims (2)

1. A backup server selection management method, characterized in that the system comprises multiple computing servers, a state management server and multiple standby servers, wherein the state management server is connected to the multiple computing servers and the standby servers and manages the state of the multiple computing servers, the method comprising the following steps:
(1) the state management server classifies the multiple computing servers, dividing them into computation-type servers, storage-type servers and communication-type servers according to their differing performance in computation, storage and communication;
(2) the state management server monitors in real time whether any of the multiple computing servers becomes abnormal;
(3) when the state management server detects that a computing server has become abnormal, it selects a server that is not abnormal from the multiple computing servers as the backup server, wherein the selected backup server is of the same type as the abnormal computing server.
2. The backup server selection management method according to claim 1, characterized in that the abnormality is one of three types: processing abnormality, communication abnormality and storage abnormality.
CN201611213267.4A 2016-12-23 2016-12-23 Backup server selection management method Pending CN108243030A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611213267.4A CN108243030A (en) 2016-12-23 2016-12-23 Backup server selection management method

Publications (1)

Publication Number Publication Date
CN108243030A true CN108243030A (en) 2018-07-03

Family

ID=62704671

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611213267.4A Pending CN108243030A (en) 2016-12-23 2016-12-23 Backup server selection management method

Country Status (1)

Country Link
CN (1) CN108243030A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109117322A (en) * 2018-08-28 2019-01-01 郑州云海信息技术有限公司 A kind of control method, system, equipment and the storage medium of server master-slave redundancy

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070198529A1 (en) * 2006-02-14 2007-08-23 Yukio Ogawa Management computer and communication system
US7752013B1 (en) * 2006-04-25 2010-07-06 Sprint Communications Company L.P. Determining aberrant server variance
CN103475696A (en) * 2013-08-23 2013-12-25 汉柏科技有限公司 System and method for monitoring state of cloud computing cluster server
CN103959270A (en) * 2011-10-07 2014-07-30 英特尔公司 Mechanism for employing and facilitating dynamic and remote memory collaboration at computing devices
CN205490680U (en) * 2016-04-01 2016-08-17 北京轻元科技有限公司 High available cloud computing system based on general server and universal exchange
CN105959145A (en) * 2016-06-04 2016-09-21 广东中兴新支点技术有限公司 Method and system for parallel management server of high availability cluster
CN111737078A (en) * 2020-05-12 2020-10-02 华南理工大学 Load type-based adaptive cloud server energy consumption measuring and calculating method, system and equipment
CN113010576A (en) * 2021-03-19 2021-06-22 中国建设银行股份有限公司 Method, device, equipment and storage medium for capacity evaluation of cloud computing system

Legal Events

Date Code Title Description
PB01 Publication
CB02 Change of applicant information
Address after: 101399 No. 2 East Airport Road, Shunyi Airport Economic Core Area, Beijing (1st, 5th and 7th floors of Industrial Park 1A-4)
Applicant after: Zhongke Star Map Co.,Ltd.
Address before: 101399 Building 1A-4, National Geographic Information Technology Industrial Park, Guomen Business District, Shunyi District, Beijing
Applicant before: GEOVIS TECHNOLOGY (BEIJING) Co.,Ltd.
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
Application publication date: 20180703