CN100472471C - System and method for obtaining fault in-situ information for computer operating system - Google Patents

System and method for obtaining fault in-situ information for computer operating system Download PDF

Info

Publication number
CN100472471C
CN100472471C CNB2006100576026A CN200610057602A CN100472471C CN 100472471 C CN100472471 C CN 100472471C CN B2006100576026 A CNB2006100576026 A CN B2006100576026A CN 200610057602 A CN200610057602 A CN 200610057602A CN 100472471 C CN100472471 C CN 100472471C
Authority
CN
China
Prior art keywords
operating system
module
memory
information
running time
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CNB2006100576026A
Other languages
Chinese (zh)
Other versions
CN101025709A (en
Inventor
周涛
周建
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Lenovo Beijing Ltd
Original Assignee
Lenovo Beijing Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lenovo Beijing Ltd filed Critical Lenovo Beijing Ltd
Priority to CNB2006100576026A priority Critical patent/CN100472471C/en
Publication of CN101025709A publication Critical patent/CN101025709A/en
Application granted granted Critical
Publication of CN100472471C publication Critical patent/CN100472471C/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Debugging And Monitoring (AREA)

Abstract

The invention is a computer operating system failure field information obtaining system and method, comprising EFI BIOS and operating system running on EFI BIOS, as well as operating system monitoring module, counter and hardware key module, where the EFI BIOS comprises memory allocating module and running time service module; after the system is power-on, when EFI BIOS initializes the memory, the memory allocating module divides physical memory into operating system memory and failure analysis system memory; when the operating system runs, it starts and is resident at the operating system monitoring module, and the operating system monitoring module collects operating system field information; when the operating system breaks down, the operating system monitoring module informs the running time service module of the event of operating system breakdown; the running time service module establishes an failure analysis system environment for obtaining system failure field information to obtain the operating system failure field information; and it can assure making system analysis and diagnosis in the failure field of the operating system and obtain the failure field information.

Description

The system and method that a kind of computer operating system fault in-situ information is obtained
Technical field
The present invention relates to computer realm, particularly relate to the system and method that a kind of computer operating system fault in-situ information is obtained.
Background technology
The present computer operations system (Operation System, OS), multiple task operating system particularly, all comparatively complicated as Windows operating system, a complete operating system is in the process of operation, because new fault may appear in the operation of a plurality of application programs or new procedures.Usually, existing operating system all has certain operations system diagnostics and maintaining method, working condition that can monitor operating system, and when might going wrong in advance to User Alarms.But when some fatal errors take place in operating system (visit etc. of crossing the border as EMS memory error, application program), operating system can be collapsed, comprise endless loop (promptly crashing) or produce the mistake (blue screen) that can't discern, at this moment common way is that computer machine is restarted.Yet at this moment all computer failure field datas will all be lost, and can't further carry out fault analysis, also just can't find the basic reason of problem.And do not get rid of the problem of machine, and hidden danger still exists, and the stability of system can not get guaranteeing that performance can not get ensureing, may break down once more when restriction condition satisfies, and the user will descend to its degree of belief.Therefore, how in computer operating system,, become the problem that industry presses for solution if crash or during collapse, obtain fault in-situ information in operating system.
The method of existing processing operating system fatal error has following several:
1, after operating system (as the Windows operating system) collapse, carry out dump (dump) by the dump process of operating system, the pattern of existing 3 kinds of dumps is respectively little core dump (64K); The core core dump; Core dump is fully just analyzed dump file by the analysis tool program then.
But all there is following defective in these the 3 kinds methods of handling operating system fatal error:
These the 3 kinds transformations of handling the dump pattern of operating system fatal error need be provided with under operating system, need take a large amount of core memory spaces if carry out full core dump, if but (little core dump) can lose a lot of information again very little, certainly, the user can may be suitable for different dump patterns according to different application and collapse, but when computer operating system is collapsed, can only carry out dump, be other dump pattern and can not rechange according to the dump pattern that configures.On the other hand, the dump that this method still needs to depend under the operating system is carried out, if serious operating system failure situation occurring, under the situation of also collapsing as dump, perhaps under the situation that the internal memory of the local storage of operating system breaks down, can't carry out the preservation of field data.
2, fault in-situ information is restarted, obtained to computer operating system at the scene by system manager or operating system developer when fatal error takes place, and carries out operation elements such as the diagnosis of computer failure and maintenance.
But the shortcoming of this disposal route also is conspicuous, its major defect is to need system manager or operating system developer to operate to the scene, need take system manager or operating system developer's plenty of time and diagnosis and the maintenance that energy carries out operating system like this.And because the scene when not having computer crashes, therefore system manager or operating system developer accurate fault location just, pinpoint the problems and can only by virtue of experience move for a long time with a large amount of analysis tool programs, obtain field failure information, its efficient is very low, and it is also lower really to find out the probability of field data of computer operating system fault.Therefore, this method can not obtain general application in real world applications.
Summary of the invention
The system and method that a kind of computer failure field data that the objective of the invention is to overcome above-mentioned defective and provide obtains, it can guarantee that operating system analyzes system at the scene of breaking down and diagnose, obtain the information of fault in-situ, comprise memory information etc.
Be the system that realizes that a kind of computer operating system fault in-situ information that the object of the invention provides is obtained, operate based on EFI BIOS and the operating system that operates on the EFI BIOS;
Comprise the operating system monitor module, described EFI BIOS comprises memory allocating module and running time service module;
Described memory allocating module is used for after computer system powers on, and EFI BIOS is in the pre-vectoring phase, and EFI BIOS is divided into operating system memory and trouble analysis system internal memory with physical memory during to internal memory initialization;
Described operating system monitor module moves and resides in the described operating system, is used for collecting the operating system field data and preserve when operating system normally being moved; Simultaneously when operating system collapse, the running time service module of the event notice EFI BIOS that operating system is collapsed.
Described running time service module, run on the trouble analysis system memory headroom, be used for the trouble analysis system file that obtains operating system failure information is carried out initialization, the trouble analysis system environment of operating system failure field data is obtained in foundation, the analytic system support environment is provided, obtain the operating system failure field data, selection operation system failure field data and preservation position, and preserve the information content to this position.
System of the present invention can also comprise counter and Dongle module;
Described counter is used for regularly computer running time; The operating system monitor module is regularly rewritten counter, prevents that counter from overflowing; When counter overflows, then produce and interrupt, trigger EFI BIOS interruption supervisor, start the Dongle module;
The Dongle module is used for the pointer of system program is adjusted to the running time service module of trouble analysis system memory headroom, thereby the control of computer system is forwarded in the running time service module of EFI BIOS.
Described operating system is Windows operating system.
The method that the present invention also provides a kind of computer operating system fault in-situ information to obtain is characterized in that, comprises the following steps:
Steps A) after computing machine powered on, during EFI BIOS initialization internal memory, memory allocating module was divided into operating system memory and trouble analysis system internal memory with physical memory;
Step B) during operating system, start and park operation system monitoring module, the operating system monitor module is collected the operating system field data; When operating system collapse, with the running time service module of the event notice EFI BIOS of operating system collapse;
Step C) running time service module runs on the trouble analysis system memory headroom, sets up the trouble analysis system environment that obtains the operating system failure field data, obtains the operating system failure field data.
Described step B) also comprises the following steps:
Step B1) during operating system, start and park operation system monitoring module, the operating system monitor module is regularly write counter;
Step B2) when the operating system collapse, when entering EFI BIOS System Management Mode, the Dongle module starts running time service module with system program pointed running time service module.
Described step C) comprises the following steps:
Step C1) running time service module is uploaded the device drives based on EFI;
Step C2) analysis tool selection operation system failure scene memory content, the memory location, on-the-spot information also preserved in record.
Described step C1) also comprises the following steps:
Running time service module loads EFI web service, sets up connection Network Based, after loading the network interface card driving, sends system's warning by network to control end, notice control end system this operating system current state.
Described device drives comprises that network interface card drives, the IDE/SCSI device drives, and USB device drives, the PCI device drives.
Described field data comprises cpu busy percentage information, memory usage information, class in content of registers information and the progress information or multiclass information combination.
The invention has the beneficial effects as follows: the present invention utilizes EFI BIOS to finish the analysis that operating system is crashed, when computer operating system breaks down, utilization is independent of the memory headroom and the analysis operation environment of operating system to be analyzed the reason of computer operating system collapse, obtains fault in-situ information.It judges preservation computer operating system current state, and when operating system breaks down, enter memory headroom independently and do not destroy the region of memory and the internal storage state of operating system, and region of memory and the internal storage state to operating system analyzed also in conjunction with the bottom diagnosis to hardware system under analysis environments, obtain the field data of mechanical disorder, obtain the reason that computer operating system breaks down.
Description of drawings
Fig. 1 is the system architecture synoptic diagram that computer operating system fault in-situ information of the present invention is obtained;
Fig. 2 is a monitoring module course of work process flow diagram among Fig. 1;
Fig. 3 is a counter interrupt module process flow diagram of the present invention;
Fig. 4 is an EFI BIOS running time service module process flow diagram of the present invention.
Embodiment
Further describe the system and method that a kind of computer operating system fault in-situ information of the present invention is obtained below in conjunction with accompanying drawing 1~4.
The present invention overcomes existing Basic Input or Output System (BIOS) (Basic Input/Output System, BIOS) weakness, utilize improvement and the raising of EFI BIOS aspect this working time (runtime) function and pre-guiding (pre-boot) function two, solved the problem of when computer operating system is collapsed, obtaining the computer failure field data.
The present invention will be referred to Extensible Firmware Interface (EFI) technology, below we are introduced it earlier:
Extensible Firmware Interface (Extensible Firmware Interface, EFI) be occurred in 1999 continue to use the interface routine of new generation of Basic Input or Output System (BIOS) (BIOS) for many years in order to replacement, about the introduction of Extensible Firmware Interface, see UEFI forum for details and introduce http://www.UEFI.org about the EFI technology.EFI BIOS is between hardware device and operating system (such as Windows or Linux).Different with traditional BIOS, write in EFI BIOS use whole world higher level lanquage C language the most widely, it provides not only has the function of traditional B IOS but also the expanded function that is better than traditional B IOS is arranged, on design mechanism and framework, also be different from the realization of traditional B IOS, it is BIOS interface specification of future generation, this just means the development that has more slip-stick artist can participate in EFI BIOS, adds many more valuable functions.
The basic function that EFI BIOS possesses is:
The hardware platform initialization;
Support starts the operating system;
The platform management instrument of separating system.
The mode of operation of EFI BIOS can simply reduce: start-up system, standard firmware platform initialization, then carry out relative program from loading EFI driver storehouse and reaching, in EFI BIOS system start-up menu, choose the system that will enter and submit to and start guidance code to EFI BIOS, normally then enter system, otherwise will end to start service and return EFI BIOS system start-up menu.
In the fault in-situ information getting method to computer operating system of the present invention, the particularly description of carrying out with Windows operating system, but the present invention is suitable for the situation of the operating system except Windows operating system equally.
As shown in Figure 1, the system that computer operating system fault in-situ information of the present invention is obtained includes:
(1) memory allocating module 1, be used at the hardware structure of supporting EFI BIOS, power in computer system after (Power on), EFI BIOS is in pre-guiding (pre-boot) stage, EFI BIOS is during to internal memory initialization, a part of internal memory is kept, and at this moment, the memory size that EFI BIOS provides to Windows operating system is exactly the size that the system physical internal memory size deducts reserved memory.
Simultaneously, memory allocating module 1 will be used to obtain the trouble analysis system file of operating system failure information and put into reserved memory area, for entering when the operating system collapse takes place.
Power in system after (Power on), EFI BIOS carries out initialization in pre-guiding (pre-boot) stage with internal memory, starts memory allocating module 1, and memory allocating module 1 is divided into two parts with internal memory:
A part is an operating system memory, be used for Windows operating system and entering operating system installation (OS load) after the stage, operating system is controlled this partial memory, each process program use of distributing to operating system and moving on operating system.
Another part is for being left the trouble analysis system internal memory, it is initialized as the trouble analysis system environment and reserves memory headroom when EFI BIOS starts, the trouble analysis system file is only distributed in this reserved memory space, and this segment space can not be found and use to Windows operating system after startup.(runtime) service module 3 working time of operation EFI BIOS in this headspace, this module functions is to set up to obtain the necessary operating environment of failure message, as load the driving of EFI network interface card (NIC), load analysis tool (diagnostic tool) and preserve the content of Windows operating system failure information.
(2) the operating system monitor module 2, are used for collecting when operating system the operating system field data; Simultaneously in the operation requests of finding that operating system can not the response application program, when the operating system collapse occurring, with (runtime) service module 3 working time of the event notice EFI BIOS of operating system collapse.
(3) counter (Timer) 4 is used for regularly computer running time; Operating system monitor module 2 is regularly rewritten counter 4, prevents that counter 4 from overflowing; When counter 4 overflows, then produce and interrupt, trigger EFI BIOS interruption supervisor, start Dongle module 5.
When operating system was normally moved, operating system monitor module 2 was regularly rewritten the counter register of the middle counter 4 of south bridge ICH chip (I/OController HUB) in the computer hardware, produced thereby assurance counter 4 can not overflow and overflowed interruption; When operating system is collapsed, monitor module 2 then resident and that operate in the operating system also can't normally move, therefore can not regularly write the counter register of south bridge counter 4, thereby cause counter 4 regularly not reset, counter 4 overflows, produce and interrupt, triggered EFI BIOS interruption supervisor, start Dongle module 5.
(4) the Dongle module 5, be used for the pointer of system program is adjusted to (runtime) service module 3 working time of trouble analysis system memory headroom, thereby make the control of computer system forward the working time of EFI BIOS in (runtime) service module 3.
(5) working time of EFI BIOS (runtime) service module 3, be used for the parts of the analysis environments of obtaining operating system failure information are carried out initialization, the initialization that when entering running time service module 3, need carry out hardware system, make it set up living trouble analysis system environment, the analytic system support environment is provided, as load the driver of EFI NIC, load analysis tool (diagnostic tool), selection operation system failure field data and preservation position, and preserve the information content to this position.
When Windows operating system was collapsed, information such as the internal memory when analysis tool 6 is passed through the read operation system crash, content of registers were in conjunction with the bottom of system hardware being diagnosed the information of obtaining the system failure, the concrete reason that the diagnostic operation system breaks down.Selection operation system failure field data and preserve the position then, and preserve the information content to this position.
Behind the Windows os starting, under the situation of normal operation, operating system monitor module 2 operation and when residing in operating system, collect the field data of Windows operating system, comprise cpu busy percentage, memory usage, content of registers, progress information, and be written in the internal memory fixed memory space of operating system management and go.
Simultaneously, operating system monitor module 2 is regularly rewritten the counter register of south bridge counter 4, guarantees that counter 4 can not overflow and produces and overflow interruption.
If operating system collapse, monitor module 2 then resident and that operate in the operating system also can't normally move, therefore can not regularly rewrite the counter register of south bridge counter 4, thereby cause counter 4 regularly not reset, counter 4 overflows, produce and interrupt, triggered EFI BIOS interruption supervisor, enter into EFI BIOS System Management Mode (System Management, SM), start Dongle module 5, Dongle module 5 is (runtime) service module 3 working time in the system program pointed trouble analysis system memory headroom, thereby the system control of making forwards the working time of EFI BIOS in (runtime) service module 3.
The device drives that working time, (runtime) service module 3 was at first uploaded based on EFI, comprise network interface card (NIC), the IDE/SCSI device drives, USB device drives, the PCI device drives, by uploading above driving, make EFI BIOS obtain the control of relevant device, can carry out the operation of I/O equipment; Then, load EFI web service (service), set up connection Network Based, preferably, the network that is based on http connects, after loading network interface card (NIC) driving, send system's warning by network to a certain control end, notice remote side administration person system this operating system current state; At last, analysis tool (diagnostictool) 6 is according to the instruction of remote console, and as the memory content of selective dump (dump), store as USB the memory location, the network storage, and perhaps local hard drive storage, thus note on-the-spot information.
Further describe the method that a kind of computer failure field data of the present invention obtains below in conjunction with said system:
Steps A: after computing machine powered on, in the stage, during EFI BIOS initialization internal memory, memory allocating module 1 was divided into operating system memory and trouble analysis system internal memory with physical memory in pre-guiding (pro-boot).
After the electricity (Power on), EFI BIOS carries out initialization in pre-guiding (pre-boot) stage with internal memory on computers, starts memory allocating module 1, and memory allocating module 1 is divided into two parts with internal memory:
A part is an operating system memory, be used for Windows operating system and entering operating system installation (OS load) after the stage, operating system is controlled this partial memory, each process program use of distributing to operating system and moving on operating system.
Another part is for being left the trouble analysis system internal memory, it is initialized as trouble analysis system environment reservation memory headroom when EFI BIOS starts, the trouble analysis system environment is only distributed in this reserved memory space, and this segment space can not be found and use to Windows operating system after startup.And in this headspace (runtime) service module 3 working time of operation EFI BIOS, running time service module 3 is set up and is obtained the necessary operating environment of failure message, as load the driving of EFI NIC, load analysis tool (diagnostic tool) 6 and preserve the content of Windows operating system failure information.
Step B: during operating system, start and park operation system monitoring module 2, collect the operating system field data, and regularly rewrite counter 4; When operating system collapse, with the running time service module 3 of the event notice EFI BIOS of operating system collapse.
Step B1: during operating system, start and park operation system monitoring module 2, collect operating system field data and preservation, and regularly rewrite counter 4.
As shown in Figure 2, behind the Windows os starting, under the situation of normal operation, the field data of Windows operating system is collected in operating system monitor module 5 operation and when residing in operating system, comprises cpu busy percentage information, memory usage information, content of registers information, progress information, and be written in the internal memory fixed space of operating system management and go.
Simultaneously, operating system monitor module 2 is regularly rewritten the counter register of south bridge counter 4, guarantees that counter 4 can not overflow and produces and overflow interruption.
Step B2: when operating system collapse, when entering EFI BIOS System Management Mode, Dongle module 5 is system program pointed working time (runtime) service module 3, starts (runtime) service module 3 working time.
As shown in Figure 3, if operating system collapse, as system in case of system halt blue screen etc., monitor module 2 then resident and that operate in the operating system also can't normally move, therefore can not regularly write the counter register of south bridge counter 4, thereby cause counter 4 regularly not reset, counter 4 overflows, produce and interrupt, triggered EFI BIOS interruption supervisor, enter into EFI BIOS System Management Mode (SystemManagement, SM), and with (runtime) service module 3 working time in the system program pointed trouble analysis system memory headroom, thereby the system control of making forwards the working time of EFI BIOS in (runtime) service module 3.
Step C: running time service module 3 is set up the trouble analysis system environment that obtains the operating system failure field data, obtains the operating system failure field data.
After running time service module 3 started, its upload device drove, and set up network and connected, and started analysis tool, selected and the storage failure field data.
When operating system monitor module 2 collapses in operating system, can not regularly write the counter register of counter 4, timer 4 overflows, trigger the interruption supervisor of EFI BIOS, enter into the System Management Mode (SM) of EFI BIOS, start Dongle module 5, Dongle module 5 is with (runtime) service module 3 pointed working time of system program.
The device drives that working time, (runtime) service module 3 was at first uploaded based on EFI, comprise network interface card (NIC), the IDE/SCSI device drives, USB device drives, the PCI device drives, by uploading above driving, make EFI BIOS obtain the control of relevant device, can carry out the operation of I/O equipment;
Then, running time service module 3 loads EFI web service (service), set up connection Network Based, preferably, the network that is based on http connects, after loading network interface card (NIC) driving, send system's warning by network to a certain control end, notice remote side administration person system this operating system current state;
At last, analysis tool (diagnostic tool) 6 instructions according to remote console, as the memory content of selective dump (dump), the memory location, as the USB storage, the network storage, perhaps local hard drive storage, record is also preserved on-the-spot information.
The present invention utilizes EFI BIOS in working time (runtime) and pre-guiding (pre-boot) running environment, finish and do not rely on operating system when operating system collapsed and obtain field failure information, its operation and the monitor module 2 that resides in the operating system obtain the operating system field data, decision operation system current state, when taking place, the operating system collapse enters analysis environments, do not destroy the region of memory of windows, and under analysis environments, the region of memory of operating system is analyzed also in conjunction with the bottom diagnosis to hardware system, obtain the information of mechanical disorder, and can find out the reason that operating system breaks down in conjunction with the existing operating system diagnostic tool, the stability when guaranteeing operating system.
Present embodiment is in order to understand the detailed description that the present invention carries out better; it is not the qualification of scope that the present invention is protected; therefore, those of ordinary skills do not break away from purport of the present invention without creative work to this bright change of making in protection scope of the present invention.

Claims (10)

1. the system that computer operating system fault in-situ information is obtained operates with the operating system that operates on the EFI BIOS based on EFI BIOS, it is characterized in that:
Comprise operating system monitor module (2), described EFI BIOS comprises memory allocating module (1) and running time service module (3);
Described memory allocating module (1) is used for after computer system powers on, and EFI BIOS is in the pre-vectoring phase, and EFI BIOS is divided into operating system memory and trouble analysis system internal memory with physical memory during to internal memory initialization;
Described operating system monitor module (2) moves and resides in the described operating system, is used for collecting the operating system field data and preserve when operating system normally being moved; Simultaneously when operating system collapse, the running time service module (3) of the event notice EFI BIOS that operating system is collapsed;
Described running time service module (3), run on the trouble analysis system memory headroom, be used for the trouble analysis system file that obtains operating system failure information is carried out initialization, the trouble analysis system environment of operating system failure field data is obtained in foundation, the analytic system support environment is provided, obtain the operating system failure field data, selection operation system failure field data and preservation position, and preserve the information content to this position.
2. the system that computer operating system fault in-situ information according to claim 1 is obtained is characterized in that, also comprises counter (4) and Dongle module (5);
Described counter (4) is used for regularly computer running time; Operating system monitor module (2) is regularly rewritten counter (4), prevents that counter (4) from overflowing; When counter (4) when overflowing, then produce and interrupt, trigger EFI BIOS interruption supervisor, start Dongle module (5);
Dongle module (5) is used for the pointer of system program is adjusted to the running time service module (3) of trouble analysis system memory headroom, thereby the control of computer system is forwarded in the running time service module (3) of EFI BIOS.
3. the system that computer operating system fault in-situ information according to claim 1 and 2 is obtained is characterized in that described operating system is Windows operating system.
4. the system that computer operating system fault in-situ information according to claim 1 and 2 is obtained, it is characterized in that, described field data comprises cpu busy percentage information, memory usage information, class in content of registers information and the progress information or multiclass information combination.
5. the method that computer operating system fault in-situ information is obtained is characterized in that, comprises the following steps:
Steps A) after this computing machine powered on, during EFI BIOS initialization internal memory, memory allocating module (1) was divided into operating system memory and trouble analysis system internal memory with physical memory;
Step B) during operating system, start and park operation system monitoring module (2), operating system monitor module (2) is collected the operating system field data; When operating system collapse, with the running time service module (3) of the event notice EFI BIOS of operating system collapse;
Step C) running time service module (3) runs on the trouble analysis system memory headroom, sets up the trouble analysis system environment that obtains the operating system failure field data, obtains the operating system failure field data.
6. the method that computer operating system fault in-situ information according to claim 5 is obtained is characterized in that described step B) also comprise the following steps:
Step B1) during operating system, start and park operation system monitoring module (2), operating system monitor module (2) is regularly write counter (4);
Step B2) when the operating system collapse, when entering EFI BIOS System Management Mode, Dongle module (5) starts running time service module (3) with system program pointed running time service module (3).
7. the method for obtaining according to claim 5 or 6 described computer operating system fault in-situ information, it is characterized in that, described field data comprises cpu busy percentage information, memory usage information, class in content of registers information and the progress information or multiclass information combination.
8. the method for obtaining according to claim 5 or 6 described computer operating system fault in-situ information is characterized in that described step C) comprise the following steps:
Step C1) running time service module (3) is uploaded the device drives based on EFI;
Step C2) memory content, the memory location at analysis tool (6) selection operation system failure scene, record is also preserved on-the-spot information.
9. the method that computer operating system fault in-situ information according to claim 8 is obtained is characterized in that described step C1) also comprise the following steps:
Running time service module (3) loads EFI web service, sets up connection Network Based, after loading the network interface card driving, sends system's warning by network to control end, notice control end system this operating system current state.
10. the method that computer operating system fault in-situ information according to claim 8 is obtained is characterized in that, described device drives comprises that network interface card drives, the IDE/SCSI device drives, and USB device drives, the PCI device drives.
CNB2006100576026A 2006-02-22 2006-02-22 System and method for obtaining fault in-situ information for computer operating system Active CN100472471C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB2006100576026A CN100472471C (en) 2006-02-22 2006-02-22 System and method for obtaining fault in-situ information for computer operating system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB2006100576026A CN100472471C (en) 2006-02-22 2006-02-22 System and method for obtaining fault in-situ information for computer operating system

Publications (2)

Publication Number Publication Date
CN101025709A CN101025709A (en) 2007-08-29
CN100472471C true CN100472471C (en) 2009-03-25

Family

ID=38744028

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2006100576026A Active CN100472471C (en) 2006-02-22 2006-02-22 System and method for obtaining fault in-situ information for computer operating system

Country Status (1)

Country Link
CN (1) CN100472471C (en)

Families Citing this family (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102567550A (en) * 2011-12-31 2012-07-11 曙光信息产业股份有限公司 Method and device for collecting data of emergency event in operating system (OS)
CN102622322B (en) * 2012-02-24 2015-09-09 华为技术有限公司 A kind of method, black box and server utilizing black box to obtain crash info
CN104699615B (en) * 2012-03-31 2017-09-22 北京奇虎科技有限公司 A kind for the treatment of method and apparatus of the system failure
CN102637144B (en) * 2012-03-31 2015-05-06 北京奇虎科技有限公司 System fault processing method and device
CN103631706B (en) * 2012-08-27 2018-09-04 腾讯科技(深圳)有限公司 A kind of browser testing method and apparatus
CN104035871B (en) * 2014-06-27 2016-04-13 腾讯科技(深圳)有限公司 Based on fault handling method and the device of the application program in geographic position
CN105204977A (en) * 2014-06-30 2015-12-30 中兴通讯股份有限公司 System exception capturing method, main system, shadow system and intelligent equipment
CN105512000B (en) * 2014-09-24 2020-04-24 中兴通讯股份有限公司 Operating system abnormal information collection method and device and computer
US20170196029A1 (en) * 2016-01-05 2017-07-06 Gentex Corporation Communication system for vehicle
CN106997315B (en) * 2016-01-25 2021-01-26 阿里巴巴集团控股有限公司 Method and device for memory dump of virtual machine
CN107025146B (en) * 2016-01-30 2019-10-18 华为技术有限公司 A kind of document generating method, device and system
CN106681771B (en) * 2016-12-30 2020-12-29 阿里巴巴(中国)有限公司 System reinstallation method and device
CN108319530A (en) * 2018-02-06 2018-07-24 合肥联宝信息技术有限公司 Diagnostic method, device, terminal and the medium of computer hardware
CN111158982B (en) * 2019-12-26 2022-06-28 联想(北京)有限公司 Electronic device, first operating system, data processing method, and storage medium
CN111341434B (en) * 2020-03-02 2024-05-28 北京医维星科技有限公司 Remote fault diagnosis and maintenance system for medical equipment and construction method thereof
CN113064747B (en) * 2021-03-26 2022-10-28 山东英信计算机技术有限公司 Fault positioning method, system and device in server starting process
CN114064132B (en) * 2021-09-30 2023-07-21 中科创达软件股份有限公司 Method, device, equipment and system for recovering system downtime
CN116302646B (en) * 2023-02-24 2024-03-29 荣耀终端有限公司 Fault positioning method, system, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN101025709A (en) 2007-08-29

Similar Documents

Publication Publication Date Title
CN100472471C (en) System and method for obtaining fault in-situ information for computer operating system
JP6530774B2 (en) Hardware failure recovery system
CN100356335C (en) Method and apparatus to preserve trace data
CN105938450B (en) The method and system that automatic debugging information is collected
US9158628B2 (en) Bios failover update with service processor having direct serial peripheral interface (SPI) access
Kim et al. Availability modeling and analysis of a virtualized system
US8135985B2 (en) High availability support for virtual machines
US8468389B2 (en) Firmware recovery system and method of baseboard management controller of computing device
CN100388231C (en) Method for analyzing data obtained from crash and data processing system
JP5120664B2 (en) Server system and crash dump collection method
CN105659215A (en) Fault processing method, related device and computer
US7523346B2 (en) Systems and methods for CPU repair
CN101377750A (en) System and method for cluster fault toleration
CN104254840A (en) Memory dump and analysis in a computer system
US20070174689A1 (en) Computer platform embedded operating system backup switching handling method and system
CN101025701A (en) Memory dump method, memory dump program and computer system
CN100472468C (en) Computer system, computer network and method
JP2002014939A (en) Multi-partition computer system
US7434042B2 (en) Apparatus, method and recording medium for starting up data processing system
CN102521105A (en) Output method of power on self test information, virtual machine manager and processor
CN109032901A (en) A kind of monitoring method, device and the controlled terminal of the outer SSD of remote band
US20190384666A1 (en) Capturing video data and serial data during an information handling system failure
CN104216771A (en) Restarting method and device for software program
US7168002B2 (en) Preservation of error data on a diskless platform
JP2009104412A (en) Storage apparatus and method controlling the same

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant