CN108376107A - Method, apparatus, device and storage medium for server fault detection - Google Patents

Method, apparatus, device and storage medium for server fault detection

Info

Publication number
CN108376107A
Authority
CN
China
Prior art keywords
ipmi
watchdog
device
information
abnormal device
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810171335.8A
Other languages
Chinese (zh)
Inventor
袁传博
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhengzhou Yunhai Information Technology Co Ltd
Original Assignee
Zhengzhou Yunhai Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhengzhou Yunhai Information Technology Co Ltd
Priority to CN201810171335.8A
Publication of CN108376107A
Legal status: Pending

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00: Error detection; Error correction; Monitoring
    • G06F11/07: Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703: Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0751: Error or fault detection not based on redundancy
    • G06F11/0754: Error or fault detection not based on redundancy by exceeding limits
    • G06F11/0757: Error or fault detection not based on redundancy by exceeding limits by exceeding a time limit, i.e. time-out, e.g. watchdogs

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Debugging And Monitoring (AREA)

Abstract

This application discloses a method for server fault detection, comprising: starting the IPMI Watchdog when system startup is triggered, and judging whether the IPMI Watchdog is in the enabled state after a preset duration; if so, acquiring the register data of the devices; and then analyzing the register data according to a preset analysis rule to obtain information about the abnormal device. The IPMI Watchdog is thus used to judge whether an abnormal device is currently present, and if so, the information about the abnormal device is obtained by analyzing the device register data. This avoids locating the abnormal device by manual troubleshooting and makes server fault detection more convenient. This application also discloses an apparatus, a device and a computer-readable storage medium for server fault detection, all of which have the above beneficial effects.

Description

Method, apparatus, device and storage medium for server fault detection
Technical field
The present invention relates to the field of device detection, and in particular to a method, apparatus, device and computer-readable storage medium for server fault detection.
Background
With the rapid development of information technology and the rollout of cloud computing and the Internet of Things, users have higher requirements for the reliability and information processing capability of servers. Compared with traditional servers, new-generation servers based on the Purley platform have great advantages in computing performance and reliability, so they are being applied more and more widely. As the number of devices in a server grows, the factors that can cause a device to fail also multiply, and fault detection for the devices in a server becomes increasingly important.
In the prior art, fault detection for servers based on the Purley platform adopts a strategy of centralized and hierarchical management within the cabinet: fault monitoring, fault management and fault recording for all devices in the servers are carried out uniformly at the whole-cabinet level. After a device abnormality is detected, a prompt message is issued so that operating personnel can check the devices in the server and find the abnormal one. However, with this approach the location of the fault is still found by manual troubleshooting after the abnormality has been detected, which consumes a great deal of manpower and material resources.
Therefore, how to improve the convenience of server fault detection is a technical problem that those skilled in the art currently need to solve.
Summary of the invention
In view of this, an object of the present invention is to provide a method for server fault detection that can improve the convenience of server fault detection; another object of the present invention is to provide an apparatus, a device and a computer-readable storage medium for server fault detection, all of which have the above beneficial effect.
To solve the above technical problem, the present invention provides a method for server fault detection, comprising:
starting the IPMI Watchdog when system startup is triggered, and judging whether the IPMI Watchdog is in the enabled state after a preset duration;
if so, acquiring the register data of the devices;
analyzing the register data according to a preset analysis rule to obtain information about the abnormal device.
Preferably, the method further comprises:
recording the information about the abnormal device in a fault log.
Preferably, after judging that the IPMI Watchdog is in the enabled state after the preset duration, the method further comprises:
pausing the timer of the IPMI Watchdog.
Preferably, after the information about the abnormal device is recorded in the fault log, the method further comprises:
displaying the content of the fault log on a web page.
Preferably, after the information about the abnormal device is recorded in the fault log, the method further comprises:
raising an alarm with a warning device.
Preferably, the warning device is specifically a buzzer and/or an indicator light.
Preferably, the warning device provides multiple alarm signals, each corresponding to a different fault condition.
To solve the above technical problem, the present invention also provides an apparatus for server fault detection, comprising:
a judgment module, configured to start the IPMI Watchdog when system startup is triggered, and to judge whether the IPMI Watchdog is in the enabled state after a preset duration;
an acquisition module, configured to acquire the register data of the devices if so;
an analysis module, configured to analyze the register data according to a preset analysis rule to obtain information about the abnormal device.
To solve the above technical problem, the present invention also provides a device for server fault detection, comprising:
a memory, configured to store a computer program;
a processor, configured to implement the steps of any of the above methods for server fault detection when executing the computer program.
To solve the above technical problem, the present invention also provides a computer-readable storage medium on which a computer program is stored; when executed by a processor, the computer program implements the steps of any of the above methods for server fault detection.
The method for server fault detection provided by the present invention comprises: starting the IPMI Watchdog when system startup is triggered, and judging whether the IPMI Watchdog is in the enabled state after a preset duration; if so, acquiring the register data of the devices; and then analyzing the register data according to a preset analysis rule to obtain information about the abnormal device.
As can be seen, when the IPMI Watchdog is judged to be still in the enabled state after the preset duration, an abnormal device is present in the current server, so the register data of the devices is acquired and analyzed according to the preset analysis rule to obtain information about the abnormal device. In other words, the IPMI Watchdog is used to judge whether an abnormal device is currently present, and if so, the information about the abnormal device is obtained by analyzing the device register data. This avoids locating the abnormal device by manual troubleshooting and makes server fault detection more convenient.
To solve the above technical problem, the present invention also provides an apparatus, a device and a computer-readable storage medium for server fault detection, all of which have the above beneficial effect.
Brief description of the drawings
To illustrate the technical solutions of the embodiments of the present invention or of the prior art more clearly, the drawings needed in the description of the embodiments or the prior art are briefly introduced below. Obviously, the drawings described below are only some embodiments of the present invention; those of ordinary skill in the art can obtain other drawings from the drawings provided without creative effort.
Fig. 1 is a flowchart of a method for server fault detection provided by an embodiment of the present invention;
Fig. 2 is a flowchart of another method for server fault detection provided by an embodiment of the present invention;
Fig. 3 is a structural diagram of an apparatus for server fault detection provided by an embodiment of the present invention;
Fig. 4 is a structural diagram of a device for server fault detection provided by an embodiment of the present invention.
Detailed description
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings in the embodiments. Obviously, the described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by those of ordinary skill in the art on the basis of the embodiments of the present invention without creative effort fall within the protection scope of the present invention.
The core of the embodiments of the present invention is to provide a method for server fault detection that can improve the convenience of server fault detection; another core of the present invention is to provide an apparatus, a device and a computer-readable storage medium for server fault detection, all of which have the above beneficial effect.
To help those skilled in the art better understand the solution of the present invention, the present invention is described in further detail below with reference to the drawings and specific embodiments.
Fig. 1 is a flowchart of a method for server fault detection provided by an embodiment of the present invention. As shown in the figure, the method specifically comprises:
S10: Start the IPMI Watchdog when system startup is triggered;
S20: Judge whether the IPMI Watchdog is in the enabled state after a preset duration.
It can be understood that, to guard against the main application dying for unknown reasons or running away, the IPMI Watchdog is generally used to ensure that the system can restart. The IPMI Watchdog is a set of timer logic defined by the server BMC (Baseboard Management Controller), which exposes three IPMI (Intelligent Platform Management Interface) command interfaces: SetWDT, GetWDT and ResetWDT.
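As a rough illustration of the three command interfaces, the sketch below lists the watchdog command codes and decodes the "timer use" byte of a GetWDT response. The numeric values and bit layout are recalled from the IPMI v2.0 specification rather than taken from this patent, so they are assumptions to verify against the specification; `WatchdogState` and `decode_timer_use` are hypothetical names.

```python
from dataclasses import dataclass

# Assumed IPMI v2.0 values (not stated in the patent; verify before use).
NETFN_APP = 0x06       # "App" network function that carries the watchdog commands
CMD_RESET_WDT = 0x22   # ResetWDT: (re)start the countdown
CMD_SET_WDT = 0x24     # SetWDT: configure timer use, expiry action and countdown
CMD_GET_WDT = 0x25     # GetWDT: read back configuration and running state

TIMER_USE = {1: "BIOS FRB2", 2: "BIOS/POST", 3: "OS Load", 4: "SMS/OS", 5: "OEM"}


@dataclass(frozen=True)
class WatchdogState:
    running: bool
    timer_use: str


def decode_timer_use(byte: int) -> WatchdogState:
    """Decode the first data byte of a GetWDT response (assumed layout)."""
    running = bool(byte & 0x40)                   # bit 6: timer is started
    use = TIMER_USE.get(byte & 0x07, "reserved")  # bits 2:0: who owns the timer
    return WatchdogState(running, use)
```

Under these assumptions, the `running` flag is the "enabled state" that step S20 checks.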
In a specific implementation, the BIOS (Basic Input Output System), the OS (Operating System) and other OEM (original equipment manufacturer) applications can use the BMC's IPMI command interfaces to set the timer duration of the WDT (Watchdog Timer) and the action to take after it expires, and to enable or disable the WDT.
When system startup is triggered, the IPMI Watchdog is started at the same time, and it is then judged whether the IPMI Watchdog is in the enabled state after the preset duration. In general, the WDT is disabled once the system has started normally; if an abnormal device is present in the server, however, the WDT cannot be disabled through SetWDT. Judging whether the IPMI Watchdog is still enabled after the preset duration therefore reveals whether an abnormal device is currently present.
In other words, the IPMI Watchdog judges whether an abnormal device is currently present by monitoring the BIOS, the OS and the POST of other OEM applications. For example, when OS startup is triggered, the SetWDT interface is used to set the timer duration and the action on expiry and to start the IPMI Watchdog at the same time; once the OS has finished starting, SetWDT is used again to disable the IPMI Watchdog. The purpose is to monitor whether the OS crashes during startup. If a device in the server is abnormal, the OS startup process will crash, and after the crash the OS has no chance to disable the IPMI Watchdog with SetWDT. When the IPMI Watchdog timer expires, the WDT is triggered to perform the corresponding action on the server, such as Time Expired (timeout), Power Cycle, Reset or Down (go offline).
More specifically, when the system starts, the power-on self-test program (POST, Power-On Self-Test) checks the devices on the server to judge whether they are working normally. If an abnormal device is present, the IPMI Watchdog cannot be disabled.
It should be noted that the preset duration is generally shorter than the timer duration of the IPMI Watchdog. That is, after system startup is triggered and the IPMI Watchdog is started at the same time, whether the IPMI Watchdog is in the enabled state is judged before the IPMI Watchdog timer expires. It can be understood that if the preset duration exceeded the timer duration of the IPMI Watchdog, the system might already have crashed, and the fault condition of the current system could not be judged effectively. This embodiment does not limit the preset duration, which is generally set according to the timer duration of the IPMI Watchdog.
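Steps S10 and S20 can be sketched as follows. This is a minimal simulation under stated assumptions, not the patented implementation: `SimulatedWatchdog` is a hypothetical stand-in for the BMC timer, and the `boot` and `sleep` callables are injected so the logic can be exercised without real hardware or real waiting.

```python
from typing import Callable


class SimulatedWatchdog:
    """Hypothetical stand-in for the BMC watchdog; not a real IPMI client."""

    def __init__(self, timeout_s: float) -> None:
        self.timeout_s = timeout_s
        self.running = False

    def start(self) -> None:
        self.running = True

    def stop(self) -> None:
        self.running = False


def abnormal_device_suspected(
    wdt: SimulatedWatchdog,
    preset_s: float,
    boot: Callable[[SimulatedWatchdog], None],
    sleep: Callable[[float], None],
) -> bool:
    """Return True if the watchdog is still enabled after the preset duration."""
    if preset_s >= wdt.timeout_s:
        # The check must happen before the watchdog itself expires.
        raise ValueError("preset duration must be shorter than the watchdog timeout")
    wdt.start()         # S10: start the watchdog together with system startup
    boot(wdt)           # a healthy startup disables the watchdog; a hung one does not
    sleep(preset_s)     # wait for the preset duration
    return wdt.running  # S20: still enabled means an abnormal device is suspected
```

For example, `abnormal_device_suspected(SimulatedWatchdog(300), 120, lambda w: w.stop(), lambda s: None)` models a healthy startup and returns `False`.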
S30: If so, acquire the register data of the devices.
It can be understood that if the IPMI Watchdog is still in the enabled state after the preset duration, that is, it has not been disabled, an abnormal device is present in the server, and the register data of the devices is therefore acquired.
It should be noted that in this embodiment the server BMC reads the corresponding register data over the PECI protocol. It can be understood that the register data of some devices is spread across several different registers, so all register data of the relevant device needs to be acquired.
It should be noted that the acquired register data can additionally be stored in a black box log (Black Box) to facilitate subsequent operations. It can be understood that a black box log is a log file that stores binary information and is generally read by a computer program.
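To make the idea of a binary black box log concrete, here is a small sketch that packs register readings into fixed-size binary records and reads them back. The record layout (16-bit device id, 16-bit register id, 32-bit value, little-endian) is purely an assumption for illustration; the patent does not define the log format.

```python
import struct
from typing import List, Tuple

# Hypothetical record layout: device id (u16), register id (u16), value (u32).
RECORD = struct.Struct("<HHI")

Reading = Tuple[int, int, int]


def pack_records(readings: List[Reading]) -> bytes:
    """Serialize register readings into a binary black box blob."""
    return b"".join(RECORD.pack(*reading) for reading in readings)


def unpack_records(blob: bytes) -> List[Reading]:
    """Parse a binary black box blob back into register readings."""
    if len(blob) % RECORD.size:
        raise ValueError("truncated black box log")
    return [RECORD.unpack_from(blob, offset)
            for offset in range(0, len(blob), RECORD.size)]
```

A fixed-size record keeps the reader trivial, which matches the remark that such logs are read by a program rather than by a person.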
S40: Analyze the register data according to a preset analysis rule to obtain information about the abnormal device.
Specifically, the register data is analyzed according to the preset analysis rule, which includes but is not limited to decoding and parsing, to obtain an analysis result, and the information about the abnormal device is derived from that result. More specifically, if the register data was stored in the black box log in S30, then S40 correspondingly consists of analyzing the register data stored in the black box log according to the preset analysis rule to obtain the information about the abnormal device.
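One way to picture a "preset analysis rule" is a table that maps register bits to devices and fault descriptions. The sketch below is only that: the register names, bit masks and device labels are invented for illustration and do not come from the patent or from any real platform.

```python
from typing import Dict, List, NamedTuple


class Rule(NamedTuple):
    register: str      # hypothetical register name
    mask: int          # bit(s) that signal a fault
    device: str        # device the register belongs to
    description: str   # human-readable meaning of the fault bit


# Hypothetical rule table; real rules would come from platform documentation.
RULES = [
    Rule("CPU0_STATUS", 0x01, "CPU0", "internal error flag set"),
    Rule("DIMM_STATUS", 0x04, "DIMM A1", "uncorrectable memory error"),
]


def analyze(registers: Dict[str, int], rules: List[Rule] = RULES) -> List[Dict[str, str]]:
    """Apply every rule to the register snapshot and collect abnormal devices."""
    findings = []
    for rule in rules:
        value = registers.get(rule.register, 0)
        if value & rule.mask:
            findings.append({"device": rule.device, "description": rule.description})
    return findings
```

Keeping the rules in data rather than in code means new devices can be covered by extending the table.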
The method for server fault detection provided by this embodiment comprises: starting the IPMI Watchdog when system startup is triggered, and judging whether the IPMI Watchdog is in the enabled state after a preset duration; if so, acquiring the register data of the devices; and then analyzing the register data according to a preset analysis rule to obtain information about the abnormal device. The IPMI Watchdog is thus used to judge whether an abnormal device is currently present, and if so, the information about the abnormal device is obtained by analyzing the device register data. This avoids locating the abnormal device by manual troubleshooting and makes server fault detection more convenient.
On the basis of the above embodiment, this embodiment further describes and optimizes the technical solution. Specifically, after judging that the IPMI Watchdog is in the enabled state after the preset duration, the method further comprises:
pausing the timer of the IPMI Watchdog.
That is, after it is judged that the IPMI Watchdog is in the enabled state after the preset duration, the timer of the IPMI Watchdog is set to a paused state. The timers of the IPMI Watchdog come in many types, and this embodiment does not limit the type. Nor does this embodiment limit the specific way in which the timer is paused, as long as the purpose of this embodiment is achieved. Pausing the timer prevents the IPMI Watchdog from reaching its timeout during fault analysis and bringing the server down.
Fig. 2 is a flowchart of another method for server fault detection provided by an embodiment of the present invention. As shown in Fig. 2, on the basis of the above embodiment, this embodiment further describes and optimizes the technical solution. Specifically, the method further comprises:
S50: Record the information about the abnormal device in a fault log.
It should be noted that the fault log is a log file for storing device fault information. Once the information about abnormal devices is recorded in the fault log, past analysis results, that is, information about historical abnormal devices, can be obtained by viewing the fault log; the fault log can also be downloaded to local storage for other operations. This embodiment does not limit the specific form of the record. The analysis result may include the time at which the fault occurred, the type of the faulty device, the fault level, the error code, a fault description and handling suggestions.
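The fields listed for an analysis result suggest a simple record structure. The sketch below stores each record as one JSON line; the field names and the JSON-lines format are assumptions chosen for illustration, since the embodiment leaves the record form open.

```python
import json
from dataclasses import asdict, dataclass


@dataclass
class FaultRecord:
    time: str          # when the fault occurred
    device_type: str   # type of the faulty device
    level: str         # fault level
    error_code: str
    description: str
    suggestion: str    # handling suggestion


def to_log_line(record: FaultRecord) -> str:
    """Serialize one fault record as a single JSON line."""
    return json.dumps(asdict(record), ensure_ascii=False, sort_keys=True)


def from_log_line(line: str) -> FaultRecord:
    """Parse a JSON line back into a fault record."""
    return FaultRecord(**json.loads(line))
```

One line per record keeps the log appendable and easy to download or filter later.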
As a preferred embodiment, after the information about the abnormal device is recorded in the fault log, the method further comprises:
S60: Display the content of the fault log on a web page.
It can be understood that, to make it easier for the operator to learn about abnormal devices in the server, the content of the fault log, that is, the information about the abnormal devices, can be displayed in the form of a web page. Specifically, the web page can be the server's Web page or another web page; this embodiment does not limit the specific type of web page.
In addition, the fault log can be displayed on the web page in full, or its content can be filtered before display; this embodiment does not limit which content of the fault log is displayed. Displaying the content of the fault log on a web page adds another way of obtaining the information about abnormal devices, which is more practical in real applications.
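Filtering the fault log before display could look like the sketch below. The entry keys (`level`, `device_type`) and the function name are hypothetical; the embodiment does not prescribe any filter criteria.

```python
from typing import Dict, Iterable, List, Optional


def select_for_display(
    entries: Iterable[Dict[str, str]],
    level: Optional[str] = None,
    device_type: Optional[str] = None,
) -> List[Dict[str, str]]:
    """Return all entries, or only those matching the given filters."""
    return [
        entry for entry in entries
        if (level is None or entry.get("level") == level)
        and (device_type is None or entry.get("device_type") == device_type)
    ]
```

Calling it with no filters corresponds to displaying the whole log.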
On the basis of the above embodiment, this embodiment further describes and optimizes the technical solution. Specifically, after the information about the abnormal device is recorded in the fault log, the method further comprises:
raising an alarm with a warning device.
The warning device is specifically a buzzer and/or an indicator light.
The warning device provides multiple alarm signals, each corresponding to a different fault condition.
It can be understood that in practical applications the display is itself one of the devices in the server, and when the display is abnormal the operator cannot obtain the information about the abnormal device through it. Therefore, after the information about the abnormal device is recorded in the fault log, an alarm is additionally raised with a warning device, specifically a buzzer and/or an indicator light.
Specifically, in practical applications a buzzer or an indicator light is generally used, because either gives an alarm prompt intuitively.
The warning device provides multiple alarm signals, each corresponding to the fault condition of a different abnormal device. More specifically, fault information can be indicated by different sound sequences: the sound of the buzzer can be a combination of short beeps and long beeps, with different combinations corresponding to different fault contents. For example, a short beep can indicate that the buzzer is working normally; short beeps can indicate that the supply voltage is unstable; a long continuous beep can indicate a memory fault; and three long beeps followed by two short beeps can indicate a graphics card fault. Similarly, fault information can be indicated by different light sequences: the light of the indicator can be a combination of flashing and steady light, again with different combinations corresponding to different fault contents, which is not described further here. This embodiment does not limit the correspondence between alarm signals and fault information.
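The mapping from fault conditions to beep sequences can be sketched as a lookup table. The fault names and patterns below are illustrative assumptions, since the embodiment explicitly leaves the correspondence open; the only property the sketch relies on is that every pattern is distinct.

```python
from enum import Enum
from typing import Dict, Tuple


class Beep(Enum):
    SHORT = "."
    LONG = "-"


# Hypothetical fault-to-pattern table; any distinct patterns would do.
ALARM_PATTERNS: Dict[str, Tuple[Beep, ...]] = {
    "self_test_ok": (Beep.SHORT,),
    "power_unstable": (Beep.SHORT, Beep.SHORT, Beep.SHORT),
    "memory_fault": (Beep.LONG,),
    "graphics_card_fault": (Beep.LONG, Beep.LONG, Beep.LONG, Beep.SHORT, Beep.SHORT),
}


def pattern_for(fault: str) -> str:
    """Render the beep pattern for a fault as dots (short) and dashes (long)."""
    return "".join(beep.value for beep in ALARM_PATTERNS[fault])
```

The same table could drive an indicator light by treating short and long as flashing and steady light.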
As can be seen, indicating fault information through a buzzer and/or an indicator light makes the fault information available more directly and quickly, which improves the convenience of the fault detection method.
The embodiments of the method for server fault detection provided by the present invention have been described in detail above. The present invention also provides an apparatus, a device and a computer-readable storage medium for server fault detection corresponding to the method. Because the embodiments of the apparatus, device and computer-readable storage medium correspond to the method embodiments, reference is made to the description of the method embodiments, and the details are not repeated here.
Fig. 3 is a structural diagram of an apparatus for server fault detection provided by an embodiment of the present invention. As shown in the figure, the apparatus for server fault detection comprises:
a judgment module 31, configured to start the IPMI Watchdog when system startup is triggered, and to judge whether the IPMI Watchdog is in the enabled state after a preset duration;
an acquisition module 32, configured to acquire the register data of the devices if so;
an analysis module 33, configured to analyze the register data according to a preset analysis rule to obtain information about the abnormal device.
The apparatus for server fault detection provided by this embodiment has the beneficial effects of the above method for server fault detection.
Fig. 4 is a structural diagram of a device for server fault detection provided by an embodiment of the present invention, comprising:
a memory 41, configured to store a computer program;
a processor 42, configured to implement the following steps when executing the computer program:
starting the IPMI Watchdog when system startup is triggered, and judging whether the IPMI Watchdog is in the enabled state after a preset duration;
if so, acquiring the register data of the devices;
analyzing the register data according to a preset analysis rule to obtain information about the abnormal device.
The device for server fault detection provided by this embodiment has the beneficial effects of the above method for server fault detection.
To solve the above technical problem, the present invention also provides a computer-readable storage medium on which a computer program is stored; when executed by a processor, the computer program implements the following steps:
receiving the trigger information sent by the power-on self-test program when it detects an abnormal device;
acquiring the register data corresponding to the abnormal device according to the trigger information, and storing the register data in a black box log;
analyzing the register data in the black box log according to a preset analysis rule to obtain an analysis result.
The computer-readable storage medium provided by this embodiment has the beneficial effects of the above method for server fault detection.
The method, apparatus, device and computer-readable storage medium for server fault detection provided by the present invention have been described in detail above. Specific examples are used herein to explain the principle and implementation of the present invention, and the description of the above embodiments is only intended to help in understanding the method of the present invention and its core idea. It should be pointed out that those of ordinary skill in the art can make improvements and modifications to the present invention without departing from its principle, and these improvements and modifications also fall within the protection scope of the claims of the present invention.
The embodiments in this specification are described progressively: each embodiment focuses on its differences from the other embodiments, and the identical or similar parts of the embodiments can be referred to one another. Because the apparatus disclosed in the embodiments corresponds to the method disclosed in the embodiments, its description is relatively brief, and the relevant details can be found in the description of the method.
Those skilled in the art will further appreciate that the units and algorithm steps of the examples described in connection with the embodiments disclosed herein can be implemented in electronic hardware, computer software, or a combination of the two. To illustrate the interchangeability of hardware and software clearly, the composition and steps of each example have been described above in general terms of function. Whether these functions are implemented in hardware or software depends on the specific application and the design constraints of the technical solution. Those skilled in the art can use different methods to implement the described functions for each specific application, but such implementations should not be considered beyond the scope of the present invention.
The steps of a method or algorithm described in connection with the embodiments disclosed herein can be implemented directly in hardware, in a software module executed by a processor, or in a combination of the two. The software module can reside in random access memory (RAM), internal memory, read-only memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, registers, a hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the technical field.

Claims (10)

1. A method for server fault detection, characterized by comprising:
starting the IPMI Watchdog when system startup is triggered, and judging whether the IPMI Watchdog is in the enabled state after a preset duration;
if so, acquiring the register data of the devices;
analyzing the register data according to a preset analysis rule to obtain information about the abnormal device.
2. The method according to claim 1, characterized by further comprising:
recording the information about the abnormal device in a fault log.
3. The method according to claim 1, characterized in that, after judging that the IPMI Watchdog is in the enabled state after the preset duration, the method further comprises:
pausing the timer of the IPMI Watchdog.
4. The method according to claim 2, characterized in that, after the information about the abnormal device is recorded in the fault log, the method further comprises:
displaying the content of the fault log on a web page.
5. The method according to claim 2, characterized in that, after the information about the abnormal device is recorded in the fault log, the method further comprises:
raising an alarm with a warning device.
6. The method according to claim 5, characterized in that the warning device is specifically a buzzer and/or an indicator light.
7. The method according to claim 5, characterized in that the warning device provides multiple alarm signals, each of the alarm signals corresponding to a different fault condition.
8. An apparatus for server fault detection, characterized by comprising:
a judgment module, configured to start the IPMI Watchdog when system startup is triggered, and to judge whether the IPMI Watchdog is in the enabled state after a preset duration;
an acquisition module, configured to acquire the register data of the devices if so;
an analysis module, configured to analyze the register data according to a preset analysis rule to obtain information about the abnormal device.
9. A device for server fault detection, characterized by comprising:
a memory, configured to store a computer program;
a processor, configured to implement the steps of the method for server fault detection according to any one of claims 1 to 7 when executing the computer program.
10. A computer-readable storage medium, characterized in that a computer program is stored on the computer-readable storage medium, and the computer program, when executed by a processor, implements the steps of the method for server fault detection according to any one of claims 1 to 7.
CN201810171335.8A 2018-03-01 2018-03-01 Method, apparatus, device and storage medium for server fault detection Pending CN108376107A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810171335.8A CN108376107A (en) 2018-03-01 2018-03-01 Method, apparatus, device and storage medium for server fault detection

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810171335.8A CN108376107A (en) 2018-03-01 2018-03-01 Method, apparatus, device and storage medium for server fault detection

Publications (1)

Publication Number Publication Date
CN108376107A true CN108376107A (en) 2018-08-07

Family

ID=63018282

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810171335.8A Pending CN108376107A (en) 2018-03-01 2018-03-01 Method, apparatus, device and storage medium for server fault detection

Country Status (1)

Country Link
CN (1) CN108376107A (en)

Cited By (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109614280A (en) * 2018-12-10 2019-04-12 浪潮(北京)电子信息产业有限公司 A kind of state test method storing equipment
CN109766248A (en) * 2019-01-04 2019-05-17 浪潮商用机器有限公司 System failure signal acquiring method, device, server and readable storage medium storing program for executing
CN109947602A (en) * 2019-03-29 2019-06-28 浪潮商用机器有限公司 Partition recovery method, apparatus, equipment and medium based on powerVM
CN109947586A (en) * 2019-03-20 2019-06-28 浪潮商用机器有限公司 A kind of method, apparatus and medium of isolated fault equipment
CN109976959A (en) * 2019-03-27 2019-07-05 苏州浪潮智能科技有限公司 A kind of portable device and method for server failure detection
CN110008105A (en) * 2019-04-11 2019-07-12 苏州浪潮智能科技有限公司 A kind of BMC time reservation method, device and electronic equipment and storage medium
CN110058979A (en) * 2019-04-18 2019-07-26 苏州浪潮智能科技有限公司 A kind of temperature reads monitoring method, BMC and the storage medium of failure failure
CN110187996A (en) * 2019-05-30 2019-08-30 苏州浪潮智能科技有限公司 BMC host process method for diagnosing faults, device, equipment and readable storage medium storing program for executing
CN111124774A (en) * 2019-11-24 2020-05-08 苏州浪潮智能科技有限公司 Method and related device for testing stability of server in starting process
CN111124725A (en) * 2019-11-29 2020-05-08 苏州浪潮智能科技有限公司 Fault positioning method, device, equipment and computer readable storage medium
CN111143173A (en) * 2020-01-02 2020-05-12 山东超越数控电子股份有限公司 Server fault monitoring method and system based on neural network
CN112084050A (en) * 2019-06-14 2020-12-15 北京北方华创微电子装备有限公司 Information recording method and system
CN112699705A (en) * 2019-10-22 2021-04-23 杭州海康威视数字技术股份有限公司 Information acquisition method, fault positioning method and device and electronic equipment
CN112732477A (en) * 2021-04-01 2021-04-30 四川华鲲振宇智能科技有限责任公司 Method for fault isolation by out-of-band self-checking
CN113176973A (en) * 2021-05-14 2021-07-27 山东英信计算机技术有限公司 PSU power supply black box log time stamp recording method, device, equipment and medium
CN113312214A (en) * 2021-06-10 2021-08-27 北京百度网讯科技有限公司 Method, apparatus, electronic device and storage medium for operating computer
WO2021169270A1 (en) * 2020-02-27 2021-09-02 平安科技(深圳)有限公司 Server fault pre-warning method, device, computer apparatus, and storage medium
CN114676019A (en) * 2022-03-25 2022-06-28 苏州浪潮智能科技有限公司 Method, device, equipment and storage medium for monitoring state of central processing unit
CN117370052A (en) * 2023-09-14 2024-01-09 广州宇中网络科技有限公司 Microservice fault analysis method, device, equipment and storage medium
CN117792863A (en) * 2024-02-27 2024-03-29 深圳供电局有限公司 Industrial switch field visual fault detection method, system and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040260841A1 (en) * 2003-06-19 2004-12-23 Mathew Tisson K. Method, apparatus, and system for internet protocol communication over intelligent platform management bus
CN101277213A (en) * 2007-03-30 2008-10-01 上海未来宽带技术及应用工程研究中心有限公司 System and method for developing IPMC based on event-driven principle
CN103294585A (en) * 2012-03-02 2013-09-11 鸿富锦精密工业(深圳)有限公司 Server monitoring system
CN103500133A (en) * 2013-09-17 2014-01-08 华为技术有限公司 Fault locating method and device

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
LONGSHAN_2009: "uboot第一阶段分析" (Analysis of the first stage of U-Boot), 《GEEKSHARE》 *

Cited By (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109614280A (en) * 2018-12-10 2019-04-12 浪潮(北京)电子信息产业有限公司 A kind of state test method storing equipment
CN109766248A (en) * 2019-01-04 2019-05-17 浪潮商用机器有限公司 System failure signal acquiring method, device, server and readable storage medium storing program for executing
CN109947586A (en) * 2019-03-20 2019-06-28 浪潮商用机器有限公司 A kind of method, apparatus and medium of isolated fault equipment
CN109976959A (en) * 2019-03-27 2019-07-05 苏州浪潮智能科技有限公司 A kind of portable device and method for server failure detection
CN109947602A (en) * 2019-03-29 2019-06-28 浪潮商用机器有限公司 Partition recovery method, apparatus, equipment and medium based on powerVM
CN110008105A (en) * 2019-04-11 2019-07-12 苏州浪潮智能科技有限公司 A kind of BMC time reservation method, device and electronic equipment and storage medium
CN110058979A (en) * 2019-04-18 2019-07-26 苏州浪潮智能科技有限公司 A kind of temperature reads monitoring method, BMC and the storage medium of failure failure
CN110187996A (en) * 2019-05-30 2019-08-30 苏州浪潮智能科技有限公司 BMC host process method for diagnosing faults, device, equipment and readable storage medium storing program for executing
CN112084050A (en) * 2019-06-14 2020-12-15 北京北方华创微电子装备有限公司 Information recording method and system
CN112699705A (en) * 2019-10-22 2021-04-23 杭州海康威视数字技术股份有限公司 Information acquisition method, fault positioning method and device and electronic equipment
CN111124774A (en) * 2019-11-24 2020-05-08 苏州浪潮智能科技有限公司 Method and related device for testing stability of server in starting process
CN111124774B (en) * 2019-11-24 2022-08-05 苏州浪潮智能科技有限公司 Method and related device for testing stability of server in starting process
CN111124725A (en) * 2019-11-29 2020-05-08 苏州浪潮智能科技有限公司 Fault positioning method, device, equipment and computer readable storage medium
CN111143173A (en) * 2020-01-02 2020-05-12 山东超越数控电子股份有限公司 Server fault monitoring method and system based on neural network
WO2021169270A1 (en) * 2020-02-27 2021-09-02 平安科技(深圳)有限公司 Server fault pre-warning method, device, computer apparatus, and storage medium
CN112732477A (en) * 2021-04-01 2021-04-30 四川华鲲振宇智能科技有限责任公司 Method for fault isolation by out-of-band self-checking
CN112732477B (en) * 2021-04-01 2021-06-29 四川华鲲振宇智能科技有限责任公司 Method for fault isolation by out-of-band self-checking
CN113176973A (en) * 2021-05-14 2021-07-27 山东英信计算机技术有限公司 PSU power supply black box log time stamp recording method, device, equipment and medium
CN113312214A (en) * 2021-06-10 2021-08-27 北京百度网讯科技有限公司 Method, apparatus, electronic device and storage medium for operating computer
CN113312214B (en) * 2021-06-10 2024-05-31 北京百度网讯科技有限公司 Method, apparatus, electronic device and storage medium for operating computer
CN114676019A (en) * 2022-03-25 2022-06-28 苏州浪潮智能科技有限公司 Method, device, equipment and storage medium for monitoring state of central processing unit
CN114676019B (en) * 2022-03-25 2024-06-28 苏州浪潮智能科技有限公司 Method, device, equipment and storage medium for monitoring state of central processing unit
CN117370052A (en) * 2023-09-14 2024-01-09 广州宇中网络科技有限公司 Microservice fault analysis method, device, equipment and storage medium
CN117370052B (en) * 2023-09-14 2024-04-26 广州宇中网络科技有限公司 Microservice fault analysis method, device, equipment and storage medium
CN117792863A (en) * 2024-02-27 2024-03-29 深圳供电局有限公司 Industrial switch field visual fault detection method, system and storage medium

Similar Documents

Publication Publication Date Title
CN108376107A (en) A kind of method, apparatus, equipment and the storage medium of server failure detection
CN108287775A (en) A kind of method, apparatus, equipment and the storage medium of server failure detection
WO2022160756A1 (en) Server fault positioning method, apparatus and system, and computer-readable storage medium
Chen et al. Towards intelligent incident management: why we need it and how we make it
CN105518629B (en) Cloud deployment base structural confirmation engine
Xu et al. Early detection of configuration errors to reduce failure damage
CN108388489B (en) Server fault diagnosis method, system, equipment and storage medium
CN107660289B (en) Automatic network control
Panda et al. IASO: A Fail-Slow Detection and Mitigation Framework for Distributed Storage Services
CN104899119A (en) Method for automatically detecting hard disk abnormity
KR20060046276A (en) Method, system, and apparatus for providing custom product support for a software program based upon states of program execution instability
US8984335B2 (en) Core diagnostics and repair
US11573848B2 (en) Identification and/or prediction of failures in a microservice architecture for enabling automatically-repairing solutions
US10430202B2 (en) Dual purpose boot registers
CN110457907B (en) Firmware program detection method and device
US20190079854A1 (en) Systems and methods for executing tests
CN112732503B (en) BIOS problem positioning method and device and computer readable storage medium
CN108572895B (en) Stability test method for automatically checking software and hardware configuration under Linux
JP2017091077A (en) Pseudo-fault generation program, generation method, and generator
JP2007133870A (en) Method for measuring autonomic ability of computing system, system, and computer program
CN107704333A (en) Failure store method, device and the readable storage medium storing program for executing of SAN storage system
Flora et al. My services got old! Can Kubernetes handle the aging of microservices?
CN109614279B (en) Industrial personal computer self-checking system and control method thereof and related equipment
CN111188782A (en) Fan redundancy test method and device and computer readable storage medium
Ljubuncic Problem-solving in high performance computing: A situational awareness approach with Linux

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180807