WO2015131549A1 - Method and device for collecting operating system fault information, and computer - Google Patents

Method and device for collecting operating system fault information, and computer Download PDF

Info

Publication number
WO2015131549A1
WO2015131549A1 PCT/CN2014/091189 CN2014091189W WO2015131549A1 WO 2015131549 A1 WO2015131549 A1 WO 2015131549A1 CN 2014091189 W CN2014091189 W CN 2014091189W WO 2015131549 A1 WO2015131549 A1 WO 2015131549A1
Authority
WO
WIPO (PCT)
Prior art keywords
operating system
information
abnormal
original
original operating
Prior art date
Application number
PCT/CN2014/091189
Other languages
French (fr)
Chinese (zh)
Inventor
蒋彪
王亮
蔡衢
谈虎
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2015131549A1 publication Critical patent/WO2015131549A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0706Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0766Error or fault reporting or storing
    • G06F11/0787Storage of error reports, e.g. persistent data storage, storage using memory protection

Definitions

  • the invention relates to the technical field of computer operating systems, in particular to a method, a device and a computer for collecting operating system abnormal information.
  • the operating system can also respond to interrupts, but cannot handle any other transactions normally.
  • the cause of this type of crash may be:
  • a deadlock in the operating system prevents normal scheduling or processing of other transactions.
  • the fault environment at this time is like an island, and the accurate running state of the system cannot be known.
  • the existing means cannot collect valid information and is used for analysis and positioning of such faults.
  • the existing anomaly collection methods are as follows: the machine is connected to the keyboard and mouse login system to collect information, and the remote control tool is used to log in to the system to collect information and pass the operating system. Its own log function collects information, and in this case, no valid fault information can be collected.
  • the machine can be connected to the display to collect abnormal printing information of the system through the display, but the following problems exist:
  • Another method of information collection is to configure a specialized monitoring device, such as a serial device, to capture abnormal information of the monitored device in real time.
  • a specialized monitoring device such as a serial device
  • this method is not versatile due to the additional configuration of monitoring equipment; at the same time, the information collected by such equipment is limited, and only the system printing information before the failure occurs can be collected, and more detailed information cannot be collected, such as the system. Memory mirroring information at the time of failure is not conducive to the analysis and location of the problem.
  • the purpose of the embodiments of the present invention is to provide a method, a device, and a computer for collecting operating system abnormal information, which solve the problem that collecting abnormal information is inaccurate, is not universal, and cannot collect detailed information.
  • an operating system abnormality information collecting method includes:
  • the step of acquiring a triggering instruction that is interrupted by the original operating system process triggered by the power button of the computer when the original operating system is abnormal includes:
  • a trigger instruction that is triggered by the power button is reported to the system control interrupt SCI of the basic input/output system BIOS.
  • the step of acquiring the trigger command triggered by the power button to report to the system control interrupt SCI of the basic input/output system BIOS includes:
  • the step of acquiring a triggering instruction that is interrupted by the original operating system process triggered by the power button of the computer when the original operating system is abnormal includes:
  • the trigger command of the original operating system process interrupt triggered by the power button of the computer being directly pressed is directly acquired.
  • collecting the abnormal information of the original operating system according to the preset configuration information, and storing the abnormal information into the specified storage medium includes:
  • synchronously interrupting a program running in the original operating system and acquiring an instruction to start the new operating system to collect abnormal information of the original operating system, and performing the step of collecting the abnormal information includes:
  • the control pointer points to the new operating system and starts the new operating system, collects an instruction of abnormal information of the original operating system, and collects the abnormal information.
  • collecting the abnormal information of the original operating system according to the preset configuration information, and storing the abnormal information into the specified storage medium includes:
  • the writing manner of the preset configuration information is bare write or writing through a file system interface
  • collecting the abnormal information of the original operating system according to the preset configuration information, and storing the abnormal information into the specified storage medium includes:
  • the abnormal information of the original operating system is collected according to the preset configuration information, and the abnormal information is stored in a storage medium that starts a preset remote host in advance.
  • the operating system abnormal information collecting method further includes: controlling shutdown or reset according to the preset configuration information.
  • the preset configuration information includes: information for starting/closing the abnormality information collection function of the control system and time limit information for collecting the information.
  • an operating system abnormal information collecting device which includes:
  • a first acquiring module configured to acquire preset configuration information of a new operating system preset in the original operating system, where the new operating system is a physics of a computer running the original operating system
  • An operating system kernel pre-stored in memory
  • a second obtaining module configured to acquire a triggering instruction that is interrupted by the original operating system process triggered by a power button of the computer when the original operating system is abnormal;
  • the processing module is configured to collect abnormal information of the original operating system according to the preset configuration information according to the triggering instruction, and store the abnormal information into a specified storage medium.
  • the second obtaining module includes:
  • the first obtaining submodule is configured to acquire a triggering instruction that is triggered by the power button to be reported to the system control interrupt SCI of the basic input/output system BIOS.
  • the first obtaining submodule includes:
  • an obtaining unit configured to: when the original operating system is abnormal and the current power button supports the ACPI, acquire a trigger instruction that is triggered by the power button to report to the SCI of the BIOS.
  • the second obtaining module includes:
  • a second obtaining submodule configured to directly acquire a trigger instruction of the original operating system process interrupt triggered by the power button of the computer being pressed when the original system is abnormal and the current power button does not support ACPI.
  • steps of the processing module include:
  • a first processing submodule configured to synchronously interrupt a program running in the original operating system according to the triggering instruction, and acquire an instruction to start the new operating system, collect abnormal information of the original operating system, and perform the Collection of abnormal information.
  • the first processing submodule includes:
  • a processing unit configured to synchronously interrupt a program running in the original operating system according to the triggering instruction, control a pointer to the new operating system, start the new operating system, and collect an instruction of abnormal information of the original operating system And collecting the abnormal information.
  • processing module includes:
  • a second obtaining submodule configured to obtain, according to the triggering instruction, a writing manner of the preset configuration information as a bare write or a file system interface
  • the second processing submodule is configured to collect abnormal information of the original operating system according to the preset configuration information, and store the abnormal information into a specified local storage medium.
  • processing module includes:
  • a third acquiring submodule configured to obtain a pre-launch preset remote host that obtains the preset configuration information according to the triggering instruction
  • the third processing sub-module is configured to collect abnormal information of the original operating system according to the preset configuration information, and store the abnormal information into a storage medium that starts a preset remote host in advance.
  • the operating system abnormal information collecting device further includes: a control module, configured to control shutdown or reset according to the preset configuration information.
  • the preset configuration information includes: information for starting/closing the abnormality information collection function of the control system and time limit information for collecting the information.
  • an embodiment of the present invention further provides a computer, including an operating system kernel pre-stored in a physical memory of a running computer, the operating system kernel having an operating system abnormality information collecting device as described above.
  • the interrupt is triggered by the hardware power button, and the hardware trigger interrupt is reported to the new operating system, and then the new operating system acquires the trigger command of the hardware trigger interrupt, according to the preset configuration.
  • the information is collected and stored for abnormal information, thereby completing the collection of abnormal information.
  • the universal hardware power button is used to trigger another pre-configured new operating system to complete the effective collection of abnormal information for subsequent fault analysis and positioning, without additional hardware equipment, and without special hardware design support. Strong universal Sexuality, and can effectively enhance the maintainability of the system, effectively collecting abnormal information.
  • FIG. 1 is a schematic diagram of steps of implementing an operating system abnormal information collection method according to an embodiment of the present invention
  • FIG. 2 is a schematic diagram of steps of power button interrupt processing in an environment supporting ACPI according to an embodiment of the present invention
  • FIG. 3 is a schematic diagram of steps of power button interrupt processing in an environment that does not support ACPI according to an embodiment of the present invention
  • FIG. 4 is a schematic diagram of a workflow of system exception information collection according to an embodiment of the present invention.
  • FIG. 5 is a schematic diagram of a workflow through local storage according to an embodiment of the present invention.
  • FIG. 6 is a schematic diagram of a workflow of storing through a network according to an embodiment of the present invention.
  • FIG. 7 is a flowchart of steps of an implementation process of an operating system abnormal information collecting method according to an embodiment of the present invention.
  • FIG. 8 is a schematic structural diagram of an operating system abnormality information collecting apparatus according to an embodiment of the present invention.
  • the usual use of the power button in the conventional hardware platform is to press the power button to implement the shutdown function, which is usually implemented by hardware, and the software cannot intercept the processing;
  • the usual use of the power button in the traditional hardware platform of the solution is to short-press the power button to generate an interrupt, and then the software interrupts the software to implement the software shutdown operation, usually stopping the various processes in the system and then synchronizing the file system. Finally, the shutdown operation is initiated.
  • the present invention is directed to the abnormal information that the system fails to collect an effective fault in the prior art, and the same
  • the traditional collection method is not universal, it is unable to collect detailed information, and provides an operating system abnormal information collection method, device and computer, through the preset configuration information of the new operating system, in the original operating system.
  • the new operating system is triggered by the general hardware, and the abnormal information is collected by the new operating system.
  • the additional information can be effectively collected without additional hardware equipment or special hardware design support. Versatility and can effectively enhance the maintainability of the system.
  • the operating system abnormality information collecting method of the embodiment of the present invention includes:
  • Step 101 Obtain preset configuration information of a new operating system preset in the original operating system, where the new operating system is an operating system kernel pre-stored in a physical memory of a computer running the original operating system;
  • the pre-configuration information includes: the preset configuration information includes: information about controlling system abnormal information collection function activation/deactivation, information collection time limit information, configuration files, system or library function calls, and system startup parameters.
  • the preset configuration information includes: information about controlling system abnormal information collection function activation/deactivation, information collection time limit information, configuration files, system or library function calls, and system startup parameters.
  • the configuration includes but is not limited to: a) whether to enable the system abnormality information collection function based on the power button; b) the action after the information collection is completed; c) the time limit for information collection; d) the physical reserved for the new kernel for collecting information The starting address and size of the memory; e) the way the system exception information is stored.
  • the optional action is to shut down or restart.
  • the default operation of the power button is ultimately to initiate a shutdown, but when it is necessary to improve business continuity, it is possible It will be necessary to automatically resume the service when there is an abnormality in the system. Therefore, the option to shut down and restart is provided for the user to select.
  • Time limit for information collection To improve the continuity of the service and minimize the time of service interruption, this configuration option is provided for the user to control the service interruption time when the system is abnormal. If the information is not collected within the specified time, the shutdown or reset is directly initiated.
  • the starting address and size of the physical memory reserved by the new kernel used to collect information. Used to specify the physical memory area loaded by the new kernel. Since the physical address space distribution of different hardware environments may be different, and the memory size occupied by different operating system kernels may also be different, providing this configuration item can make the present invention more flexible and more versatile.
  • Local storage means to save information to the storage medium in the local environment.
  • the pressing of the power button refers to short pressing the power button, and the short refers to a length of time greater than one second and less than three seconds, which is usually set according to requirements, which is different from the prior art button length and can
  • the technical effects of the triggering of the embodiment of the present invention and the reporting of the new operating system are all within the protection scope of the embodiment of the present invention.
  • the physical memory pre-storing a region for loading new memory used for collecting information, after the original operating system is started, loading the new kernel into the reserved memory region, and then jumping after the original operating system exception Run the new kernel and complete the information collection in the new kernel.
  • Step 102 Acquire a trigger instruction that is interrupted by the original operating system process triggered by a power button of the computer when the original operating system is abnormal.
  • the process of interrupting the original operating system process triggered by the power button being pressed is divided into two cases. If the current environment supports ACPI (abbreviation of Advanced Configuration and Power Management Interface, advanced configuration and power management interface), the power button Trigger The interrupt will be reported to the BIOS (abbreviation of Basic Input Output System, basic input and output system), and then by the BIOS through SCI (System Control Interrupt shorthand, system control interrupt.
  • An IRQ dedicated to ACPI power management (Interrupt Request The short-circuit, interrupt request) interrupt is reported to the operating system; if the current environment does not support ACPI, the interrupt is directly reported to the operating system.
  • the triggering instruction can be obtained by using a pre-registered interrupt service program to collect information.
  • Step 103 Collect, according to the triggering instruction, abnormal information of the original operating system according to the preset configuration information, and store the abnormal information into a specified storage medium.
  • step 101 After obtaining the related preset configuration information in step 101, when the original operating system is abnormal, then the triggering command for triggering the power button is acquired through step 102, the new operating system is triggered, and then the storage of the abnormal information is collected through step 103.
  • the exception information is sent to the specified storage medium.
  • the scheduling of the new operating system ensures that when the original system is abnormal, the required information can be effectively collected, and the power button of the hardware is used. Interrupting the original operating system triggers the new operating system to be versatile, effectively collecting critical information, and greatly assisting in analyzing and locating such faults.
  • step 102 includes:
  • Step 120 Acquire a trigger instruction that is triggered by the power button to be reported to the system control interrupt SCI of the BIOS of the basic input/output system.
  • the step 102 includes:
  • Step 121 When the original operating system is abnormal and the current power button supports the ACPI, The trigger triggered by the power button is reported to the trigger command in the SCI of the BIOS.
  • the triggering instruction accurately interrupts the original operating system, triggering the opening of the new operating system to collect system abnormality information.
  • an example of the implementation process when the specific power button of the present invention supports ACPI is as follows.
  • step 21 the user manually presses the power button to trigger an interrupt.
  • Step 22 According to the ACPI specification, the corresponding interrupt is reported to the BIOS.
  • step 23 after the BIOS performs corresponding processing, the SCI interrupt is reported to the new operating system.
  • step 24 the new operating system performs corresponding processing in the SCI interrupt process.
  • the step 102 includes: Step 122: When the original system is abnormal and the current power button does not support ACPI, directly acquiring the power button of the computer is pressed. The triggered trigger of the original operating system process interrupt triggered.
  • the trigger command is directly obtained, which saves the time for triggering the new operating system and improves the flexibility of collecting abnormal information of the system.
  • the triggering can be implemented in various ways.
  • an example of an implementation process in which a specific power button of the present invention does not support ACPI is as follows.
  • step 31 the interrupt vector and the interrupt service routine of the interrupt triggered by the power button are pre-registered in the new operating system.
  • step 32 the user manually presses the power button to trigger an interrupt.
  • step 33 the new operating system intercepts the corresponding interrupt and performs corresponding processing in the corresponding interrupt service program.
  • the step of step 103 includes:
  • Step 131 Synchronously interrupt the running of the program in the original operating system according to the triggering instruction, and acquire an instruction to start the new operating system, collect abnormal information of the original operating system, and collect the abnormal information.
  • the step 131 includes: Step 1311, synchronously interrupting the running of the program in the original operating system according to the triggering instruction, and the control pointer points to the new The operating system starts the new operating system, collects an instruction of the abnormal information of the original operating system, and collects the abnormal information.
  • Step 41 When the original operating system starts, a piece of physical memory is reserved for storing a new kernel for collecting information.
  • the new kernel includes all the hardware drivers and function modules related to the collection of abnormal information. For example, if you need to use the hard disk to save the abnormal information, the new kernel needs to include the corresponding hard disk drive. Similarly, if you need to use the network. If you save the exception information, you need to include the corresponding NIC driver in the new kernel.
  • the starting address and size of the reserved physical memory area can be configured.
  • Step 42 After the original operating system is started, the new kernel for collecting information is loaded into the previously reserved physical memory.
  • step 43 after the original operating system is abnormal, the user manually triggers the corresponding interrupt by short pressing the power button. The corresponding interrupt is then intercepted by the new operating system.
  • step 44 in the corresponding interrupt processing flow, synchronization processing is performed, for example, an inter-core interrupt is sent to all other CPUs to notify them to stop running.
  • step 45 the current instruction pointer is jumped to the start address of the kernel previously loaded into the reserved memory, thereby implementing loading of the new kernel.
  • Step 46 After the new kernel is started, collect system exception information, such as system memory image information, and store it on a specified storage medium, such as through a local disk or a network.
  • system exception information such as system memory image information
  • the step 103 includes: Step 311, according to the triggering instruction, obtaining a write mode of the preset configuration information as a bare write or writing through a file system interface;
  • Step 312 Collect abnormal information of the original operating system according to the preset configuration information, and store the abnormal information into a specified local storage medium.
  • Local storage is convenient and fast, saving time and facilitating the collection of system abnormal information.
  • Step 51 Before configuring the control module, the configuration information storage mode is local storage, and the specific write mode is set at the same time: bare write or through the file system interface.
  • Step 52 The new kernel for collecting the original operating system exception information includes the corresponding hardware driver of the local storage, and the file system module supports, and loads into the specified memory area.
  • Step 53 After the original operating system is abnormal, in the interrupt processing flow of the power button, jump to the new kernel to run, load the corresponding driver after the new kernel is started, and create a corresponding device file. After the system exception information is collected, the corresponding interface (naked or file system) provided by the new operating system is called to write the information to the specified storage medium.
  • the step 103 includes:
  • Step 313 Acquire a pre-launch preset remote host that obtains the preset configuration information according to the triggering instruction.
  • the method of storing the network through the network by pre-starting the preset remote host requires another host as the server of the network storage, and the local machine acts as the client. Save the information over the network to a storage medium on the remote host.
  • network transmission such as: ftp (referred to as File Transfer Protocol, file transfer protocol), nfs (short for Network File System, network file system), httpd (global network server software, server, Apache super
  • HTTP text transfer protocol
  • related configuration includes network transmission mode, IP address of network storage server and related routing information.
  • Step 314 Perform abnormal information of the original operating system according to the preset configuration information. Collecting and storing the abnormality information into a storage medium that starts the preset remote host in advance.
  • step 62 the related service (ftp, nfs or httpd) of another host is pre-configured and started as a server of the network storage.
  • Step 63 The new kernel for collecting the original operating system exception information includes the network card hardware driver used by the network storage, and the corresponding client module support, and is loaded into the specified memory area.
  • Step 64 After the original operating system is abnormal, in the interrupt processing flow of the power button, jump to the new kernel to run, load the corresponding driver after the new kernel is started, and create a corresponding device file. After the system exception information is collected, the client interface of the corresponding network transmission mode is invoked, and the information is written into the storage medium of the preset remote host.
  • the default operation of the power button is to initiate a shutdown, but when it is necessary to improve business continuity, it may be necessary to automatically resume the service when the system is abnormal. Therefore, the option to shut down and restart is provided for the user. select.
  • an example of an implementation flow of a specific operating system abnormality information collecting method of the present invention is as follows.
  • Step 71 The configuration control module is configured to pre-set the relevant configuration to control the original operating system abnormal information collection process.
  • Step 72 After the original operating system is abnormal, the user manually triggers the power button by triggering Broken.
  • step 73 the interrupt is reported to the new operating system and processed by the interrupt service program pre-registered in the new operating system.
  • step 74 the original operating system exception information collection is performed in the interrupt service routine.
  • step 75 the collected information is stored in the designated storage medium through the information storage module.
  • step 76 a shutdown or reset is initiated according to the configuration in the configuration control module.
  • the embodiment of the present invention further provides an operating system abnormal information collecting apparatus, which includes:
  • the first obtaining module 801 is configured to acquire preset configuration information of a new operating system preset in the original operating system, where the new operating system is an operation pre-stored in a physical memory of the computer running the original operating system.
  • System kernel
  • a second obtaining module 802 configured to acquire a trigger instruction that is interrupted by the original operating system process triggered by a power button of the computer when the original operating system is abnormal;
  • the processing module 803 is configured to collect abnormal information of the original operating system according to the preset configuration information according to the triggering instruction, and store the abnormal information into a specified storage medium.
  • the exception information stored here can be completed by the information storage module.
  • the second obtaining module 802 triggers a trigger command of the power button to trigger a new operating system, and then passes through the processing module 803.
  • the abnormal information is collected and stored in the specified storage medium.
  • an operating system kernel pre-stored in the physical memory of the computer running by the original operating system passes through the new operating system. Scheduling ensures that when the original system is abnormal, it can effectively collect the required information and press the power of the hardware.
  • the key interrupts the original operating system to trigger the new operating system to be versatile, effectively collecting key information, and greatly improving the analysis and positioning of such faults.
  • the second obtaining module 802 includes:
  • the first obtaining submodule is configured to acquire a triggering instruction that is triggered by the power button to be reported to the system control interrupt SCI of the basic input/output system BIOS.
  • the first obtaining submodule includes:
  • an obtaining unit configured to: when the original operating system is abnormal and the current power button supports the ACPI, acquire a trigger instruction that is triggered by the power button to report to the SCI of the BIOS.
  • the second obtaining module 802 includes:
  • a second obtaining submodule configured to directly acquire a trigger instruction of the original operating system process interrupt triggered by the power button of the computer being pressed when the original system is abnormal and the current power button does not support ACPI.
  • the processing module 803 includes:
  • a first processing submodule configured to synchronously interrupt a program running in the original operating system according to the triggering instruction, and acquire an instruction to start the new operating system, collect abnormal information of the original operating system, and perform the Collection of abnormal information.
  • the first processing submodule includes:
  • a processing unit configured to synchronously interrupt a program running in the original operating system according to the triggering instruction, and control a pointer to the new operating system and start the new operating system, and collect the The instruction of the abnormal information of the original operating system is performed to collect the abnormal information.
  • the processing module 803 includes:
  • a second obtaining submodule configured to obtain, according to the triggering instruction, a writing manner of the preset configuration information as a bare write or a file system interface
  • the second processing submodule is configured to collect abnormal information of the original operating system according to the preset configuration information, and store the abnormal information into a specified local storage medium.
  • the processing module 803 includes:
  • a third acquiring submodule configured to obtain a pre-launch preset remote host that obtains the preset configuration information according to the triggering instruction
  • the third processing sub-module is configured to collect abnormal information of the original operating system according to the preset configuration information, and store the abnormal information into a storage medium that starts a preset remote host in advance.
  • the operating system abnormality information collecting apparatus of the embodiment of the present invention further includes: a control module, configured to control shutdown or reset according to the preset configuration information.
  • the preset configuration information includes: information for controlling the startup/shutdown of the system abnormal information collecting function and time limit information for collecting information.
  • the device provided by the present invention is a device that applies the foregoing operating system abnormal information collecting method, and all embodiments of the operating system abnormal information collecting method are applicable to the device, and both can achieve the same or similar beneficial effects. .
  • an embodiment of the present invention further provides a computer, including an operating system kernel pre-stored in a physical memory of a running computer, the operating system kernel having an operating system abnormality information collecting device as described above.
  • the operating system abnormal information collecting device of the embodiment of the present invention is applied to a computer. Therefore, the computer provided by the embodiment of the present invention, wherein the implementation embodiments of the operating system abnormal information collecting device are applicable to the computer In the embodiment, the same technical effect can also be achieved.
  • the disclosed apparatus and method may be implemented in other manners.
  • the device embodiments described above are merely illustrative.
  • the division of the unit is only a logical function division.
  • there may be another division manner such as: multiple units or components may be combined, or Can be integrated into another system, or some features can be ignored or not executed.
  • the coupling, or direct coupling, or communication connection of the components shown or discussed may be indirect coupling or communication connection through some interfaces, devices or units, and may be electrical, mechanical or other forms. of.
  • the units described above as separate components may or may not be physically separated, and the components displayed as the unit may or may not be physical units, that is, may be located in one place or distributed to multiple network units; Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
  • each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may be separately used as one unit, or two or more units may be integrated into one unit;
  • the unit can be implemented in the form of hardware or in the form of hardware plus software functional units.
  • the foregoing program may be stored in a computer readable storage medium, and the program is executed when executed. Including the steps of the above method embodiments;
  • the foregoing storage medium includes: a removable storage device, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk, and the like, which can store program codes.
  • the above-described integrated unit of the present invention may be stored in a computer readable storage medium if it is implemented in the form of a software function module and sold or used as a standalone product.
  • the technical solution of the embodiments of the present invention may be embodied in the form of a software product in essence or in the form of a software product stored in a storage medium, including a plurality of instructions.
  • a computer device (which may be a personal computer, server, or network device, etc.) is caused to perform all or part of the methods described in various embodiments of the present invention.
  • the foregoing storage medium includes: a removable storage device, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk, and the like, which can store program codes.
  • the invention discloses a method, a device and a computer for collecting operating system abnormal information, which can trigger an interrupt through a hardware power button, and the hardware trigger interrupt is reported to the new operating system, and then the new operating system acquires a trigger command of the hardware trigger interrupt, according to the pre-
  • the configuration information is used to collect and store the abnormal information, thereby completing the collection of the abnormal information.
  • the universal hardware power button is used to trigger another pre-configured new operating system to complete the effective collection of abnormal information for subsequent fault analysis and positioning, without additional hardware equipment, and without special hardware design support. Strong versatility, and can effectively enhance the maintainability of the system, effectively collecting abnormal information.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Debugging And Monitoring (AREA)

Abstract

Provided are a method and a device for collecting operating system fault information, and a computer. The invention solves the problem of fault information collection being imprecise, lacking universal compatibility, and not including the collection of detailed information. The method comprises: obtaining pre-set configuration information of a new operating system pre-established in an underlying operating system, the new operating system being an operating system kernel pre-stored in the physical memory of a computer on which the underlying operating system runs; obtaining, upon a fault of the underlying operating system, a trigger command for ending a process of the underlying operating system, said command being triggered when the computer's power button is pressed; and, on the basis of the trigger command, and in accordance with said pre-set configuration information, collecting fault information of the underlying operating system and storing same to a designated storage medium.

Description

一种操作***异常信息收集方法、装置及计算机Operating system abnormal information collecting method, device and computer 技术领域Technical field
本发明涉及计算机操作***技术领域,特别是一种操作***异常信息收集方法、装置及计算机。The invention relates to the technical field of computer operating systems, in particular to a method, a device and a computer for collecting operating system abnormal information.
背景技术Background technique
随着计算机应用程序的日渐复杂和数量剧增,而作为这些应用程序的运行环境,操作***的运行压力也随之与日俱增,加上支撑操作***运行的硬件环境也日趋复杂,在实际应用中,***死机故障经常发生,导致***死机的原因有很多,相应的外在表现也不尽相同,其中如下***死机故障是非常典型的一种类型,此类***死机故障发生时,具有如下表现:With the increasing complexity and the increasing number of computer applications, as the operating environment of these applications, the operating pressure of the operating system has also increased, and the hardware environment that supports the operating system is becoming more and more complex. In practical applications, System crashes often occur, causing system crashes for a number of reasons, and the corresponding external performance is not the same. The following system crashes are a very typical type. When such a system crash occurs, it has the following performance:
1)本机接键盘、鼠标无响应或者无法正常登陆。1) The machine is connected to the keyboard, the mouse does not respond or cannot log in normally.
2)无法通过远程控制工具登陆问题主机。2) The problem host cannot be logged in via the remote control tool.
3)***日志无法记录到有效的故障信息3) System log cannot record valid fault information
4)操作***还能响应中断,但无法正常处理其它任何事务。4) The operating system can also respond to interrupts, but cannot handle any other transactions normally.
导致此类的死机故障的原因可能为:The cause of this type of crash may be:
1)操作***发生死锁导致无法正常调度或处理其它事务。1) A deadlock in the operating system prevents normal scheduling or processing of other transactions.
2)操作***负荷过重导致***挂死(比如内存不足)。2) The operating system is overloaded and the system hangs (such as insufficient memory).
3)硬盘硬件或固件故障导致操作***挂死。此时由于硬盘故障导致***I/O(输入/输出)挂死,无法记录日志,同时也无法正常登陆***。3) The hard disk hardware or firmware failure causes the operating system to hang. At this time, the system I/O (input/output) hangs due to a hard disk failure, and the log cannot be recorded, and the system cannot be logged in normally.
此时的故障环境犹如一个孤岛,无法获知***的准确运行状态,现有手段无法收集到有效信息,用于此类故障的分析定位。The fault environment at this time is like an island, and the accurate running state of the system cannot be known. The existing means cannot collect valid information and is used for analysis and positioning of such faults.
针对此类***死机故障,现有的异常收集方法如:本机接键盘和鼠标登陆***收集信息、通过远程控制工具登陆***收集信息、通过操作*** 自身的日志功能收集信息,在这种情况下都无法收集到有效的故障信息。For such system crashes, the existing anomaly collection methods are as follows: the machine is connected to the keyboard and mouse login system to collect information, and the remote control tool is used to log in to the system to collect information and pass the operating system. Its own log function collects information, and in this case, no valid fault information can be collected.
此外,可以在本机接显示器,通过显示器来收集***的异常打印信息,但是存在如下问题:In addition, the machine can be connected to the display to collect abnormal printing information of the system through the display, but the following problems exist:
1)在此类故障发生时,多数情况下,由于显示器显示相关的处理流程太长,导致显示器也无法正常记录到异常信息。1) When such a fault occurs, in most cases, the display process cannot be normally recorded due to the display process being too long.
2)如果显示器休眠或者***当前控制台处于图形模式下,显示器也无法记录异常信息。2) If the monitor is asleep or the current console of the system is in graphics mode, the monitor cannot record abnormal information.
3)即使在文本模式下且显示器不休眠,显示器对应的控制台能记录到异常信息,但由于此时无法滚屏,记录的信息也严重不足,通常不足以定位此类死机故障。3) Even in the text mode and the display does not sleep, the corresponding console of the display can record the abnormal information, but since the scrolling cannot be performed at this time, the recorded information is also seriously insufficient, and it is usually insufficient to locate such a crash.
另外一种可用的信息收集方法为配置一个专业化的监测设备,比如串口设备,用于实时捕获被监测设备的异常信息。但是,该方法由于额外配置了监测设备,因此不具备通用性;同时,此类设备收集的信息也比较有限,只能收集到故障发生前的***打印信息,无法收集更详细的信息,比如***故障时的内存镜像信息等,不利于问题的分析定位。Another method of information collection is to configure a specialized monitoring device, such as a serial device, to capture abnormal information of the monitored device in real time. However, this method is not versatile due to the additional configuration of monitoring equipment; at the same time, the information collected by such equipment is limited, and only the system printing information before the failure occurs can be collected, and more detailed information cannot be collected, such as the system. Memory mirroring information at the time of failure is not conducive to the analysis and location of the problem.
发明内容Summary of the invention
本发明实施例的目的在于提供一种操作***异常信息收集方法、装置及计算机,解决收集异常信息不准确,不具备通用性且无法收集详细信息的问题。The purpose of the embodiments of the present invention is to provide a method, a device, and a computer for collecting operating system abnormal information, which solve the problem that collecting abnormal information is inaccurate, is not universal, and cannot collect detailed information.
为了解决上述技术问题,本发明实施例提供的一种操作***异常信息收集方法,其中,包括:In order to solve the above technical problem, an operating system abnormality information collecting method provided by an embodiment of the present invention includes:
获取原始操作***中预先设定的新操作***的预设配置信息,所述新操作***是所述原始操作***所运行的计算机的物理内存中预先存储的一操作***内核;Obtaining preset configuration information of a new operating system preset in the original operating system, where the new operating system is an operating system kernel pre-stored in a physical memory of a computer running the original operating system;
获取在所述原始操作***异常时,所述计算机的电源按键被按下所触 发的所述原始操作***进程中断的触发指令;Obtaining that when the original operating system is abnormal, the power button of the computer is pressed and touched The triggering instruction of the original operating system process interrupted;
根据所述触发指令,按照所述预设配置信息对所述原始操作***的异常信息进行收集,并存储所述异常信息到指定的存储介质中。And collecting, according to the triggering instruction, abnormal information of the original operating system according to the preset configuration information, and storing the abnormal information into a specified storage medium.
进一步的,获取在所述原始操作***异常时,所述计算机的电源按键被按下所触发的所述原始操作***进程中断的触发指令的步骤包括:Further, the step of acquiring a triggering instruction that is interrupted by the original operating system process triggered by the power button of the computer when the original operating system is abnormal includes:
获取通过所述电源按键触发的中断上报到基本输入输出***BIOS的***控制中断SCI中的触发指令。A trigger instruction that is triggered by the power button is reported to the system control interrupt SCI of the basic input/output system BIOS.
进一步的,获取通过所述电源按键触发的中断上报到基本输入输出***BIOS的***控制中断SCI中的触发指令的步骤包括:Further, the step of acquiring the trigger command triggered by the power button to report to the system control interrupt SCI of the basic input/output system BIOS includes:
在原始操作***异常且当前电源按键支持所述ACPI时,获取通过电源按键触发的中断上报到所述BIOS的所述SCI中的触发指令。When the original operating system is abnormal and the current power button supports the ACPI, an interrupt command triggered by the power button is reported to the trigger in the SCI of the BIOS.
进一步的,获取在所述原始操作***异常时,所述计算机的电源按键被按下所触发的所述原始操作***进程中断的触发指令的步骤包括:Further, the step of acquiring a triggering instruction that is interrupted by the original operating system process triggered by the power button of the computer when the original operating system is abnormal includes:
在所述原始***异常时且当前电源按键不支持ACPI时,直接获取所述计算机的电源按键被按下所触发的所述原始操作***进程中断的触发指令。When the original system is abnormal and the current power button does not support ACPI, the trigger command of the original operating system process interrupt triggered by the power button of the computer being directly pressed is directly acquired.
进一步的,根据所述触发指令,按照所述预设配置信息对所述原始操作***的异常信息进行收集,并存储所述异常信息到指定的存储介质中的步骤包括:Further, according to the triggering instruction, collecting the abnormal information of the original operating system according to the preset configuration information, and storing the abnormal information into the specified storage medium includes:
根据所述触发指令,同步中断所述原始操作***中的程序运行并获取一启动所述新操作***,收集所述原始操作***的异常信息的指令,进行所述异常信息的收集。And synchronously interrupting the running of the program in the original operating system according to the triggering instruction, and acquiring an instruction to start the new operating system, collecting abnormal information of the original operating system, and collecting the abnormal information.
进一步的,根据所述触发指令,同步中断所述原始操作***中的程序运行并获取一启动所述新操作***收集所述原始操作***的异常信息的指令,进行所述异常信息的收集的步骤包括: Further, according to the triggering instruction, synchronously interrupting a program running in the original operating system and acquiring an instruction to start the new operating system to collect abnormal information of the original operating system, and performing the step of collecting the abnormal information Includes:
根据所述触发指令,同步中断所述原始操作***中的程序运行,Synchronizing interrupting the running of the program in the original operating system according to the triggering instruction,
控制指针指向所述新操作***并启动所述新操作***,收集所述原始操作***的异常信息的指令,进行所述异常信息的收集。The control pointer points to the new operating system and starts the new operating system, collects an instruction of abnormal information of the original operating system, and collects the abnormal information.
进一步的,根据所述触发指令,按照所述预设配置信息对所述原始操作***的异常信息进行收集,并存储所述异常信息到指定的存储介质中的步骤包括:Further, according to the triggering instruction, collecting the abnormal information of the original operating system according to the preset configuration information, and storing the abnormal information into the specified storage medium includes:
根据所述触发指令,获取到所述预设配置信息的写入方式为裸写或者通过文件***接口写入;Obtaining, according to the triggering instruction, the writing manner of the preset configuration information is bare write or writing through a file system interface;
按照所述预设配置信息对所述原始操作***的异常信息进行收集,并存储所述异常信息到指定的本地的存储介质中。Collecting abnormal information of the original operating system according to the preset configuration information, and storing the abnormal information into a specified local storage medium.
进一步的,根据所述触发指令,按照所述预设配置信息对所述原始操作***的异常信息进行收集,并存储所述异常信息到指定的存储介质中的步骤包括:Further, according to the triggering instruction, collecting the abnormal information of the original operating system according to the preset configuration information, and storing the abnormal information into the specified storage medium includes:
根据所述触发指令,获取到所述预设配置信息的预先启动预设远端主机;And pre-launching the preset remote host that obtains the preset configuration information according to the triggering instruction;
按照所述预设配置信息对所述原始操作***的异常信息进行收集,并存储所述异常信息到预先启动预设远端主机的存储介质中。The abnormal information of the original operating system is collected according to the preset configuration information, and the abnormal information is stored in a storage medium that starts a preset remote host in advance.
其中,所述的操作***异常信息收集方法还包括:根据所述预设配置信息,控制关机或者复位。The operating system abnormal information collecting method further includes: controlling shutdown or reset according to the preset configuration information.
进一步的,所述预设配置信息包括:控制***异常信息收集功能启动/关闭的信息及信息收集的时限信息。Further, the preset configuration information includes: information for starting/closing the abnormality information collection function of the control system and time limit information for collecting the information.
为了解决上述技术问题,本发明实施例还提供一种操作***异常信息收集装置,其中,包括:In order to solve the above technical problem, the embodiment of the present invention further provides an operating system abnormal information collecting device, which includes:
第一获取模块,用于获取原始操作***中预先设定的新操作***的预设配置信息,所述新操作***是所述原始操作***所运行的计算机的物理 内存中预先存储的一操作***内核;a first acquiring module, configured to acquire preset configuration information of a new operating system preset in the original operating system, where the new operating system is a physics of a computer running the original operating system An operating system kernel pre-stored in memory;
第二获取模块,用于获取在所述原始操作***异常时,所述计算机的电源按键被按下所触发的所述原始操作***进程中断的触发指令;a second obtaining module, configured to acquire a triggering instruction that is interrupted by the original operating system process triggered by a power button of the computer when the original operating system is abnormal;
处理模块,用于根据所述触发指令,按照所述预设配置信息对所述原始操作***的异常信息进行收集,并存储所述异常信息到指定的存储介质中。The processing module is configured to collect abnormal information of the original operating system according to the preset configuration information according to the triggering instruction, and store the abnormal information into a specified storage medium.
进一步的,所述第二获取模块包括:Further, the second obtaining module includes:
第一获取子模块,用于获取通过所述电源按键触发的中断上报到基本输入输出***BIOS的***控制中断SCI中的触发指令。The first obtaining submodule is configured to acquire a triggering instruction that is triggered by the power button to be reported to the system control interrupt SCI of the basic input/output system BIOS.
进一步的,所述第一获取子模块包括:Further, the first obtaining submodule includes:
获取单元,用于在原始操作***异常且当前电源按键支持所述ACPI时,获取通过电源按键触发的中断上报到所述BIOS的所述SCI中的触发指令。And an obtaining unit, configured to: when the original operating system is abnormal and the current power button supports the ACPI, acquire a trigger instruction that is triggered by the power button to report to the SCI of the BIOS.
进一步的,所述第二获取模块包括:Further, the second obtaining module includes:
第二获取子模块,用于在所述原始***异常时且当前电源按键不支持ACPI时,直接获取所述计算机的电源按键被按下所触发的所述原始操作***进程中断的触发指令。And a second obtaining submodule, configured to directly acquire a trigger instruction of the original operating system process interrupt triggered by the power button of the computer being pressed when the original system is abnormal and the current power button does not support ACPI.
进一步的,所述处理模块的步骤包括:Further, the steps of the processing module include:
第一处理子模块,用于根据所述触发指令,同步中断所述原始操作***中的程序运行并获取一启动所述新操作***,收集所述原始操作***的异常信息的指令,进行所述异常信息的收集。a first processing submodule, configured to synchronously interrupt a program running in the original operating system according to the triggering instruction, and acquire an instruction to start the new operating system, collect abnormal information of the original operating system, and perform the Collection of abnormal information.
进一步的,所述第一处理子模块包括:Further, the first processing submodule includes:
处理单元,用于根据所述触发指令,同步中断所述原始操作***中的程序运行,控制指针指向所述新操作***并启动所述新操作***,收集所述原始操作***的异常信息的指令,进行所述异常信息的收集。 a processing unit, configured to synchronously interrupt a program running in the original operating system according to the triggering instruction, control a pointer to the new operating system, start the new operating system, and collect an instruction of abnormal information of the original operating system And collecting the abnormal information.
进一步的,所述处理模块包括:Further, the processing module includes:
第二获取子模块,用于根据所述触发指令,获取到所述预设配置信息的写入方式为裸写或者通过文件***接口写入;a second obtaining submodule, configured to obtain, according to the triggering instruction, a writing manner of the preset configuration information as a bare write or a file system interface;
第二处理子模块,用于按照所述预设配置信息对所述原始操作***的异常信息进行收集,并存储所述异常信息到指定的本地的存储介质中。The second processing submodule is configured to collect abnormal information of the original operating system according to the preset configuration information, and store the abnormal information into a specified local storage medium.
进一步的,所述处理模块包括:Further, the processing module includes:
第三获取子模块,用于根据所述触发指令,获取到所述预设配置信息的预先启动预设远端主机;a third acquiring submodule, configured to obtain a pre-launch preset remote host that obtains the preset configuration information according to the triggering instruction;
第三处理子模块,用于按照所述预设配置信息对所述原始操作***的异常信息进行收集,并存储所述异常信息到预先启动预设远端主机的存储介质中。The third processing sub-module is configured to collect abnormal information of the original operating system according to the preset configuration information, and store the abnormal information into a storage medium that starts a preset remote host in advance.
其中,所述的操作***异常信息收集装置还包括:控制模块,用于根据所述预设配置信息,控制关机或者复位。The operating system abnormal information collecting device further includes: a control module, configured to control shutdown or reset according to the preset configuration information.
进一步的,所述预设配置信息包括:控制***异常信息收集功能启动/关闭的信息及信息收集的时限信息。Further, the preset configuration information includes: information for starting/closing the abnormality information collection function of the control system and time limit information for collecting the information.
为了解决上述技术问题,本发明实施例还提供一种计算机,包括在所运行的计算机的物理内存中预先存储的一操作***内核,所述操作***内核具有如上述的操作***异常信息收集装置。In order to solve the above technical problem, an embodiment of the present invention further provides a computer, including an operating system kernel pre-stored in a physical memory of a running computer, the operating system kernel having an operating system abnormality information collecting device as described above.
本发明实施例的上述技术方案的有益效果如下:The beneficial effects of the above technical solutions of the embodiments of the present invention are as follows:
本发明实施例的方案中,在原始操作***发生异常后,通过硬件电源按键触发中断,硬件触发中断上报到新操作***中,然后新操作***获取到硬件触发中断的触发指令,按照预设配置信息进行异常信息的收集及存储,从而完成异常信息的收集。这样通过通用硬件电源按键来触发另一预先配置好的新操作***完成异常信息的有效收集,用于后续的故障分析和定位,不需额外硬件设备,也不需特殊的硬件设计支持,具有很强的通用 性,并能有效增强***的可维护性,有效地收集异常信息。In the solution of the embodiment of the present invention, after the abnormality occurs in the original operating system, the interrupt is triggered by the hardware power button, and the hardware trigger interrupt is reported to the new operating system, and then the new operating system acquires the trigger command of the hardware trigger interrupt, according to the preset configuration. The information is collected and stored for abnormal information, thereby completing the collection of abnormal information. In this way, the universal hardware power button is used to trigger another pre-configured new operating system to complete the effective collection of abnormal information for subsequent fault analysis and positioning, without additional hardware equipment, and without special hardware design support. Strong universal Sexuality, and can effectively enhance the maintainability of the system, effectively collecting abnormal information.
附图说明DRAWINGS
图1为本发明实施例的操作***异常信息收集方法进行实施的步骤示意图;FIG. 1 is a schematic diagram of steps of implementing an operating system abnormal information collection method according to an embodiment of the present invention;
图2为本发明实施例在支持ACPI的环境中电源按键中断处理的步骤示意图;2 is a schematic diagram of steps of power button interrupt processing in an environment supporting ACPI according to an embodiment of the present invention;
图3为本发明实施例在不支持ACPI的环境中电源按键中断处理的步骤示意图;3 is a schematic diagram of steps of power button interrupt processing in an environment that does not support ACPI according to an embodiment of the present invention;
图4为本发明实施例的***异常信息收集的工作流程示意图;4 is a schematic diagram of a workflow of system exception information collection according to an embodiment of the present invention;
图5为本发明实施例的通过本地存储的工作流程示意图;FIG. 5 is a schematic diagram of a workflow through local storage according to an embodiment of the present invention; FIG.
图6为本发明实施例通过网络存储的工作流程示意图;6 is a schematic diagram of a workflow of storing through a network according to an embodiment of the present invention;
图7为本发明实施例操作***异常信息收集方法的实现流程的步骤流程图;FIG. 7 is a flowchart of steps of an implementation process of an operating system abnormal information collecting method according to an embodiment of the present invention;
图8为本发明实施例的操作***异常信息收集装置的结构示意图。FIG. 8 is a schematic structural diagram of an operating system abnormality information collecting apparatus according to an embodiment of the present invention.
具体实施方式detailed description
为使本发明要解决的技术问题、技术方案和优点更加清楚,下面将结合附图及具体实施例进行详细描述。The technical problems, the technical solutions, and the advantages of the present invention will be more clearly described in the following description.
为了供下文更详细的描述容易参考,需要说明的是传统硬件平台中自带的电源按键通常的用途是长按电源按键可实现关机功能,该功能通常由硬件实现,软件无法截获处理;而本方案的传统硬件平台中自带的电源按键通常的用途是短按电源按键可以产生中断,然后由软件进行相关中断,实现软件关机操作,通常是先停止***中的各种进程、然后同步文件***,最后发起关机操作。For ease of reference in the following detailed description, it should be noted that the usual use of the power button in the conventional hardware platform is to press the power button to implement the shutdown function, which is usually implemented by hardware, and the software cannot intercept the processing; The usual use of the power button in the traditional hardware platform of the solution is to short-press the power button to generate an interrupt, and then the software interrupts the software to implement the software shutdown operation, usually stopping the various processes in the system and then synchronizing the file system. Finally, the shutdown operation is initiated.
本发明针对现有技术中***死机无法收集到有效故障的异常信息,同 时传统的收集方法不具备通用性,无法收集详细的信息的问题,提供一种操作***异常信息收集方法、装置及计算机,通过预先设定的新操作***的预设配置信息,在原始操作***发生异常时,通过通用硬件触发新操作***,由新操作***完成异常信息的收集,不需额外硬件设备,也不需特殊的硬件设计支持,即可实现异常信息的有效收集,具有很强的通用性,并能有效增强***的可维护性。The present invention is directed to the abnormal information that the system fails to collect an effective fault in the prior art, and the same When the traditional collection method is not universal, it is unable to collect detailed information, and provides an operating system abnormal information collection method, device and computer, through the preset configuration information of the new operating system, in the original operating system. When an abnormality occurs, the new operating system is triggered by the general hardware, and the abnormal information is collected by the new operating system. The additional information can be effectively collected without additional hardware equipment or special hardware design support. Versatility and can effectively enhance the maintainability of the system.
如图1至图8所示,本发明实施例的操作***异常信息收集方法,包括:As shown in FIG. 1 to FIG. 8 , the operating system abnormality information collecting method of the embodiment of the present invention includes:
步骤101,获取原始操作***中预先设定的新操作***的预设配置信息,所述新操作***是所述原始操作***所运行的计算机的物理内存中预先存储的一操作***内核;Step 101: Obtain preset configuration information of a new operating system preset in the original operating system, where the new operating system is an operating system kernel pre-stored in a physical memory of a computer running the original operating system;
其中所述预先配置信息包括:所述预设配置信息包括:控制***异常信息收集功能启动/关闭的信息、信息收集的时限信息,配置文件、***或库函数调用和***启动参数。通过用户配置,控制***异常信息收集过程,同时能提升本发明提出方法的灵活性。向用户提供的接口可以采用多种方式,The pre-configuration information includes: the preset configuration information includes: information about controlling system abnormal information collection function activation/deactivation, information collection time limit information, configuration files, system or library function calls, and system startup parameters. Through the user configuration, the system abnormal information collection process is controlled, and at the same time, the flexibility of the proposed method of the present invention can be improved. The interface provided to the user can be in a variety of ways.
配置内容包括但不限于:a)是否开启基于电源按键的***异常信息收集功能;b)信息收集完成后的动作;c)信息收集的时限;d)为用于收集信息的新内核保留的物理内存的起始地址和大小;e)***异常信息存储方式等。The configuration includes but is not limited to: a) whether to enable the system abnormality information collection function based on the power button; b) the action after the information collection is completed; c) the time limit for information collection; d) the physical reserved for the new kernel for collecting information The starting address and size of the memory; e) the way the system exception information is stored.
具体的,a)是否开启基于电源按键的***异常信息收集功能。开启后启用相关功能,关闭时,仍按***原有电源按键流程处理,通过此开关能提高该功能的灵活性,可以根据实际情况,由用户按需决定是否开启。Specifically, a) whether to enable the system abnormal information collection function based on the power button. After the function is enabled, the related function is enabled. When it is turned off, it still processes according to the original power button process of the system. This switch can improve the flexibility of the function. According to the actual situation, the user can decide whether to turn it on or off according to the actual situation.
b)信息收集完成后的动作,可选动作为关机或重启。通常情况下,电源按键的默认操作最终都是发起关机,但在需要提高业务连续性时,可能 会需要在***出现异常时,能自动恢复业务,所以,这里提供关机和重启的选项,供用户选择。b) After the information collection is completed, the optional action is to shut down or restart. Normally, the default operation of the power button is ultimately to initiate a shutdown, but when it is necessary to improve business continuity, it is possible It will be necessary to automatically resume the service when there is an abnormality in the system. Therefore, the option to shut down and restart is provided for the user to select.
c)信息收集的时限。为提高业务的连续性,尽量减少业务中断的时间,提供此配置选项,供用户控制***异常时业务中断时间,如果指定时间内信息未收集完成,则直接发起关机或复位。c) Time limit for information collection. To improve the continuity of the service and minimize the time of service interruption, this configuration option is provided for the user to control the service interruption time when the system is abnormal. If the information is not collected within the specified time, the shutdown or reset is directly initiated.
d)用于收集信息的新内核保留的物理内存的起始地址和大小。用于指定新内核加载的物理内存区域。由于不同硬件环境的物理地址空间分布可能不同,同时,不同的操作***内核占用的内存大小也可能不同,提供此配置项,能使本发明更灵活、更具通用性。d) The starting address and size of the physical memory reserved by the new kernel used to collect information. Used to specify the physical memory area loaded by the new kernel. Since the physical address space distribution of different hardware environments may be different, and the memory size occupied by different operating system kernels may also be different, providing this configuration item can make the present invention more flexible and more versatile.
e)***异常信息存储方式。可选的方式如本地存储和网络存储方式。e) System exception information storage method. Optional methods such as local storage and network storage.
本地存储意为将信息保存到本机环境中的存储介质中,常见的如本地磁盘,同时,写磁盘的具体方式也可有多种选择,比如裸写或者通过文件***写的方式,这些均可通过配置决定。Local storage means to save information to the storage medium in the local environment. Commonly, such as local disk, at the same time, there are many options for writing the disk, such as bare writing or writing through the file system. Can be determined by configuration.
其中所述电源按键被按下是指短按电源按键,所述短是指大于一秒小于三秒的时间长度,通常是根据需求进行设定的,有别于现有技术的按键长度且能实现本发明实施例的触发中断,上报到新操作***的技术效果均属于本发明实施例的保护范围。The pressing of the power button refers to short pressing the power button, and the short refers to a length of time greater than one second and less than three seconds, which is usually set according to requirements, which is different from the prior art button length and can The technical effects of the triggering of the embodiment of the present invention and the reporting of the new operating system are all within the protection scope of the embodiment of the present invention.
其中所述物理内存中预先保留一段区域,用于加载用作收集信息的新内存,在原始操作***启动后,将新内核加载到保留的内存区域中,然后在原始操作***异常后,跳转到新内核运行,在新内核中完成信息收集。The physical memory pre-storing a region for loading new memory used for collecting information, after the original operating system is started, loading the new kernel into the reserved memory region, and then jumping after the original operating system exception Run the new kernel and complete the information collection in the new kernel.
步骤102,获取在所述原始操作***异常时,所述计算机的电源按键被按下所触发的所述原始操作***进程中断的触发指令;Step 102: Acquire a trigger instruction that is interrupted by the original operating system process triggered by a power button of the computer when the original operating system is abnormal.
其中电源按键被按下所触发的所述原始操作***进程中断的流程分为两种情况,如果当前环境支持ACPI(Advanced Configuration and Power Management Interface的简写,高级配置和电源管理接口),则电源按键触发 的中断将先上报到BIOS(Basic Input Output System的简写,基本输入输出***),然后由BIOS通过SCI(System Control Interrupt的简写,***控制中断。专门用于ACPI电源管理的一个IRQ(Interrupt Request的简写,中断请求)中断的方式上报到操作***;如果当前环境不支持ACPI,则中断直接上报给操作***。The process of interrupting the original operating system process triggered by the power button being pressed is divided into two cases. If the current environment supports ACPI (abbreviation of Advanced Configuration and Power Management Interface, advanced configuration and power management interface), the power button Trigger The interrupt will be reported to the BIOS (abbreviation of Basic Input Output System, basic input and output system), and then by the BIOS through SCI (System Control Interrupt shorthand, system control interrupt. An IRQ dedicated to ACPI power management (Interrupt Request The short-circuit, interrupt request) interrupt is reported to the operating system; if the current environment does not support ACPI, the interrupt is directly reported to the operating system.
其中通过预先注册的中断服务程序可以获取所述的触发指令,进行信息的收集。The triggering instruction can be obtained by using a pre-registered interrupt service program to collect information.
步骤103,根据所述触发指令,按照所述预设配置信息对所述原始操作***的异常信息进行收集,并存储所述异常信息到指定的存储介质中。Step 103: Collect, according to the triggering instruction, abnormal information of the original operating system according to the preset configuration information, and store the abnormal information into a specified storage medium.
通过步骤101获取到相关的预设配置信息后,在原始操作***异常时,然后通过步骤102获取到触发电源按键的触发指令,触发新操作***,再然后经过步骤103收集异常信息的存储所述异常信息到指定的存储介质中。这样在原始***异常信息收集工作不能依赖于原有的***日志模块进行异常信息收集时,通过新操作***的调度保证在原始***异常时,能有效收集到需要的信息,并且通过硬件的电源按键中断原始操作***触发新操作***具有通用性,有效收集到关键信息,对此类故障的分析定位有极大的帮助。After obtaining the related preset configuration information in step 101, when the original operating system is abnormal, then the triggering command for triggering the power button is acquired through step 102, the new operating system is triggered, and then the storage of the abnormal information is collected through step 103. The exception information is sent to the specified storage medium. In this way, when the original system abnormal information collection work cannot rely on the original system log module for abnormal information collection, the scheduling of the new operating system ensures that when the original system is abnormal, the required information can be effectively collected, and the power button of the hardware is used. Interrupting the original operating system triggers the new operating system to be versatile, effectively collecting critical information, and greatly assisting in analyzing and locating such faults.
获取中断的触发信令的方式可以设置为多种,如通过其他介质进行获取,因此本发明实施例的操作***异常信息收集方法中,步骤102包括:The method for obtaining the interrupted triggering signaling may be set to multiple types, such as obtaining by using other media. Therefore, in the operating system abnormal information collecting method of the embodiment of the present invention, step 102 includes:
步骤120,获取通过所述电源按键触发的中断上报到基本输入输出***BIOS的***控制中断SCI中的触发指令。Step 120: Acquire a trigger instruction that is triggered by the power button to be reported to the system control interrupt SCI of the BIOS of the basic input/output system.
在电源按键具备特定的条件时才能通过其他介质,让新操作***获取触发信令,因此本发明实施例的操作***异常信息收集方法中,步骤102包括:When the power button has a specific condition, the new operating system can obtain the triggering signaling through the other media. Therefore, in the operating system abnormal information collecting method of the embodiment of the present invention, the step 102 includes:
步骤121,在原始操作***异常且当前电源按键支持所述ACPI时,获 取通过电源按键触发的中断上报到所述BIOS的所述SCI中的触发指令。Step 121: When the original operating system is abnormal and the current power button supports the ACPI, The trigger triggered by the power button is reported to the trigger command in the SCI of the BIOS.
为了保证中断原始操作***的有效性,触发指令的准确中断原始操作***,触发新操作***的开启来收集***异常信息。In order to ensure the validity of the original operating system is interrupted, the triggering instruction accurately interrupts the original operating system, triggering the opening of the new operating system to collect system abnormality information.
如图2所示,本发明的具体的电源按键支持ACPI时实现流程的举例如下。As shown in FIG. 2, an example of the implementation process when the specific power button of the present invention supports ACPI is as follows.
步骤21,用户手工短按电源按键,触发中断。In step 21, the user manually presses the power button to trigger an interrupt.
步骤22,根据ACPI规范,相应中断上报给BIOS。Step 22: According to the ACPI specification, the corresponding interrupt is reported to the BIOS.
步骤23,BIOS进行相应处理后,向新操作***上报SCI中断。In step 23, after the BIOS performs corresponding processing, the SCI interrupt is reported to the new operating system.
步骤24,新操作***在SCI中断流程中进行相应处理。In step 24, the new operating system performs corresponding processing in the SCI interrupt process.
本发明的又一实施例的操作***异常信息收集方法中,步骤102包括:步骤122,在所述原始***异常时且当前电源按键不支持ACPI时,直接获取所述计算机的电源按键被按下所触发的所述原始操作***进程中断的触发指令。In the operating system abnormality information collecting method of the embodiment of the present invention, the step 102 includes: Step 122: When the original system is abnormal and the current power button does not support ACPI, directly acquiring the power button of the computer is pressed. The triggered trigger of the original operating system process interrupt triggered.
如果不支持ACPI时,直接获取触发指令,节约了触发新操作***的时间,同时提高了该***异常信息收集的灵活性,可以通过多种方式实现触发。If ACPI is not supported, the trigger command is directly obtained, which saves the time for triggering the new operating system and improves the flexibility of collecting abnormal information of the system. The triggering can be implemented in various ways.
如图3所示,本发明的具体的电源按键不支持ACPI的实现流程的举例如下。As shown in FIG. 3, an example of an implementation process in which a specific power button of the present invention does not support ACPI is as follows.
步骤31,在新操作***中预先注册电源按键触发的中断的中断向量和中断服务程序。In step 31, the interrupt vector and the interrupt service routine of the interrupt triggered by the power button are pre-registered in the new operating system.
步骤32,用户手工短按电源按键,触发中断。In step 32, the user manually presses the power button to trigger an interrupt.
步骤33,新操作***截获相应中断,并在相应的中断服务程序中进行相应处理。In step 33, the new operating system intercepts the corresponding interrupt and performs corresponding processing in the corresponding interrupt service program.
本发明的又一实施例的操作***异常信息收集方法中,步骤103的步骤包括: In the operating system abnormality information collecting method of the embodiment of the present invention, the step of step 103 includes:
步骤131,根据所述触发指令,同步中断所述原始操作***中的程序运行并获取一启动所述新操作***,收集所述原始操作***的异常信息的指令,进行所述异常信息的收集。Step 131: Synchronously interrupt the running of the program in the original operating system according to the triggering instruction, and acquire an instruction to start the new operating system, collect abnormal information of the original operating system, and collect the abnormal information.
具体的,本发明的又一实施例的操作***异常信息收集方法中,步骤131包括:步骤1311,根据所述触发指令,同步中断所述原始操作***中的程序运行,控制指针指向所述新操作***并启动所述新操作***,收集所述原始操作***的异常信息的指令,进行所述异常信息的收集。Specifically, in the operating system abnormality information collecting method of the embodiment of the present invention, the step 131 includes: Step 1311, synchronously interrupting the running of the program in the original operating system according to the triggering instruction, and the control pointer points to the new The operating system starts the new operating system, collects an instruction of the abnormal information of the original operating system, and collects the abnormal information.
如图4所示,本发明的具体的收集过程的实现流程的举例如下。As shown in FIG. 4, an example of the implementation flow of the specific collection process of the present invention is as follows.
步骤41,原始操作***启动时,保留一段物理内存,用于存放收集信息用的新内核。新内核中,包括所有跟异常信息收集相关的硬件驱动和功能模块,比如:如果最终需要使用硬盘来保存异常信息的话,则新内核中需要包括相应的硬盘驱动,同理,如果需要通过网络来保存异常信息的话,则新内核中需要包括相应的网卡驱动。此外,保留的物理内存区域的起始地址和大小可以配置。Step 41: When the original operating system starts, a piece of physical memory is reserved for storing a new kernel for collecting information. The new kernel includes all the hardware drivers and function modules related to the collection of abnormal information. For example, if you need to use the hard disk to save the abnormal information, the new kernel needs to include the corresponding hard disk drive. Similarly, if you need to use the network. If you save the exception information, you need to include the corresponding NIC driver in the new kernel. In addition, the starting address and size of the reserved physical memory area can be configured.
步骤42,原始操作***启动后,将用于收集信息的新内核加载到之前保留的物理内存中。Step 42: After the original operating system is started, the new kernel for collecting information is loaded into the previously reserved physical memory.
步骤43,在原始操作***异常后,由用户手工通过短按电源按键触发相应中断。然后由新操作***截获相应的中断。In step 43, after the original operating system is abnormal, the user manually triggers the corresponding interrupt by short pressing the power button. The corresponding interrupt is then intercepted by the new operating system.
步骤44,在相应的中断处理流程中,进行同步处理,比如向所有其它的CPU发送核间中断,通知它们停止运行。In step 44, in the corresponding interrupt processing flow, synchronization processing is performed, for example, an inter-core interrupt is sent to all other CPUs to notify them to stop running.
步骤45,使当前指令指针跳转到之前加载到保留内存中的内核的起始地址,从而实现加载新内核运行。In step 45, the current instruction pointer is jumped to the start address of the kernel previously loaded into the reserved memory, thereby implementing loading of the new kernel.
步骤46,新内核启动后,收集***异常信息,比如***的内存镜像信息,并将其存储到指定的存储介质上,比如通过本地磁盘或者网络。Step 46: After the new kernel is started, collect system exception information, such as system memory image information, and store it on a specified storage medium, such as through a local disk or a network.
本发明的又一实施例的操作***异常信息收集方法中,步骤103包括: 步骤311,根据所述触发指令,获取到所述预设配置信息的写入方式为裸写或者通过文件***接口写入;In the operating system abnormality information collecting method of the embodiment of the present invention, the step 103 includes: Step 311, according to the triggering instruction, obtaining a write mode of the preset configuration information as a bare write or writing through a file system interface;
步骤312,按照所述预设配置信息对所述原始操作***的异常信息进行收集,并存储所述异常信息到指定的本地的存储介质中。Step 312: Collect abnormal information of the original operating system according to the preset configuration information, and store the abnormal information into a specified local storage medium.
本地存储方便快捷,节约时间,方便实现***异常信息的收集。Local storage is convenient and fast, saving time and facilitating the collection of system abnormal information.
如图5所示,本发明的具体存储到本地的实现流程的举例如下。As shown in FIG. 5, an example of a specific implementation process of the present invention stored locally is as follows.
步骤51,预先通过配置控制模块,配置信息存储方式为本地存储,同时设置具体的写入方式:裸写或通过文件***接口。Step 51: Before configuring the control module, the configuration information storage mode is local storage, and the specific write mode is set at the same time: bare write or through the file system interface.
步骤52,用于收集原始操作***异常信息的新内核中包含本地存储相应的硬件驱动,和文件***模块支持,并加载到指定的内存区域。Step 52: The new kernel for collecting the original operating system exception information includes the corresponding hardware driver of the local storage, and the file system module supports, and loads into the specified memory area.
步骤53,原始操作***异常后,在电源按键的中断处理流程中,跳转到新内核运行,新内核启动后加载相应的驱动,并创建相应的设备文件。在完成***异常信息收集后,调用新操作***提供的相应接口(裸写或文件***),将信息写入指定存储介质。Step 53: After the original operating system is abnormal, in the interrupt processing flow of the power button, jump to the new kernel to run, load the corresponding driver after the new kernel is started, and create a corresponding device file. After the system exception information is collected, the corresponding interface (naked or file system) provided by the new operating system is called to write the information to the specified storage medium.
本发明的又一实施例的操作***异常信息收集方法中,步骤103包括:In the operating system abnormality information collecting method of the embodiment of the present invention, the step 103 includes:
步骤313,根据所述触发指令,获取到所述预设配置信息的预先启动预设远端主机;Step 313: Acquire a pre-launch preset remote host that obtains the preset configuration information according to the triggering instruction.
其中通过预先启动预设远端主机实现通过网络存储的方式,需要另一台主机作为网络存储的服务端,本机作为客户端。将信息通过网络保存到远程主机上的存储介质中。网络传输方式上也可有多种选择,如:ftp(File Transfer Protocol的简称,文件传输协议)、nfs(Network File System的简称,网络文件***)、httpd(全球网服务器软件、服务器,Apache超文本传输协议(HTTP)服务器的主程序方式等,相关配置包括网络传输方式,网络存储服务端的IP地址及相关路由信息等。The method of storing the network through the network by pre-starting the preset remote host requires another host as the server of the network storage, and the local machine acts as the client. Save the information over the network to a storage medium on the remote host. There are also many options for network transmission, such as: ftp (referred to as File Transfer Protocol, file transfer protocol), nfs (short for Network File System, network file system), httpd (global network server software, server, Apache super The main program mode of the text transfer protocol (HTTP) server, etc., related configuration includes network transmission mode, IP address of network storage server and related routing information.
步骤314,按照所述预设配置信息对所述原始操作***的异常信息进行 收集,并存储所述异常信息到预先启动预设远端主机的存储介质中。Step 314: Perform abnormal information of the original operating system according to the preset configuration information. Collecting and storing the abnormality information into a storage medium that starts the preset remote host in advance.
提供多种信息存储方式,提升操作***异常信息收集方法的可用性和灵活性。Provides a variety of information storage methods to improve the availability and flexibility of operating system exception information collection methods.
如图6所示,本发明的具体存储到本地的实现流程的举例如下。As shown in FIG. 6, an example of a specific implementation process of the present invention stored locally is as follows.
步骤61,预先通过配置控制模块,配置信息存储方式为网络存储,设置具体的网络传输方式:ftp、nfs或httpd,同时设置网络存储服务端的IP地址及相关路由信息等。Step 61: The configuration control module is configured in advance, and the configuration information storage mode is network storage, and the specific network transmission mode is set: ftp, nfs or httpd, and the IP address and related routing information of the network storage server are set at the same time.
步骤62,预先配置并启动另一台主机的相关服务(ftp、nfs或httpd),作为网络存储的服务端。In step 62, the related service (ftp, nfs or httpd) of another host is pre-configured and started as a server of the network storage.
步骤63,用于收集原始操作***异常信息的新内核中包含网络存储使用的网卡硬件驱动,和相应的客户端模块支持,并加载到指定的内存区域。Step 63: The new kernel for collecting the original operating system exception information includes the network card hardware driver used by the network storage, and the corresponding client module support, and is loaded into the specified memory area.
步骤64,原始操作***异常后,在电源按键的中断处理流程中,跳转到新内核运行,新内核启动后加载相应的驱动,并创建相应的设备文件。在完成***异常信息收集后,调用相应网络传输方式的客户端接口,将信息写入预设远端主机的存储介质中。Step 64: After the original operating system is abnormal, in the interrupt processing flow of the power button, jump to the new kernel to run, load the corresponding driver after the new kernel is started, and create a corresponding device file. After the system exception information is collected, the client interface of the corresponding network transmission mode is invoked, and the information is written into the storage medium of the preset remote host.
本发明的又一实施例的操作***异常信息收集方法中,还包括:The method for collecting operating system abnormal information according to still another embodiment of the present invention further includes:
根据所述预设配置信息,控制关机或者复位。Controlling shutdown or reset according to the preset configuration information.
通常情况下,电源按键的默认操作最终都是发起关机,但在需要提高业务连续性时,可能会需要在***出现异常时,能自动恢复业务,所以,这里提供关机和重启的选项,供用户选择。Normally, the default operation of the power button is to initiate a shutdown, but when it is necessary to improve business continuity, it may be necessary to automatically resume the service when the system is abnormal. Therefore, the option to shut down and restart is provided for the user. select.
如图7所示,本发明的具体的操作***异常信息收集方法的实现流程的举例如下。As shown in FIG. 7, an example of an implementation flow of a specific operating system abnormality information collecting method of the present invention is as follows.
步骤71,通过配置控制模块,预先设置好相关配置,控制原始操作***异常信息收集过程。Step 71: The configuration control module is configured to pre-set the relevant configuration to control the original operating system abnormal information collection process.
步骤72,原始操作***异常后,用户通过手工短按电源按键,触发中 断。Step 72: After the original operating system is abnormal, the user manually triggers the power button by triggering Broken.
步骤73,中断上报到新操作***,由新操作***中预先注册的中断服务程序处理。In step 73, the interrupt is reported to the new operating system and processed by the interrupt service program pre-registered in the new operating system.
步骤74,在中断服务程序中进行原始操作***异常信息收集。In step 74, the original operating system exception information collection is performed in the interrupt service routine.
步骤75,将收集到的信息通过信息存储模块,存储到指定的存储介质中。In step 75, the collected information is stored in the designated storage medium through the information storage module.
步骤76,根据配置控制模块中的配置,发起关机或复位。In step 76, a shutdown or reset is initiated according to the configuration in the configuration control module.
如图8所示,相应的,为了解决上述技术问题,本发明实施例还提供一种操作***异常信息收集装置,其中,包括:As shown in FIG. 8 , in order to solve the above technical problem, the embodiment of the present invention further provides an operating system abnormal information collecting apparatus, which includes:
第一获取模块801,用于获取原始操作***中预先设定的新操作***的预设配置信息,所述新操作***是所述原始操作***所运行的计算机的物理内存中预先存储的一操作***内核;The first obtaining module 801 is configured to acquire preset configuration information of a new operating system preset in the original operating system, where the new operating system is an operation pre-stored in a physical memory of the computer running the original operating system. System kernel
第二获取模块802,用于获取在所述原始操作***异常时,所述计算机的电源按键被按下所触发的所述原始操作***进程中断的触发指令;a second obtaining module 802, configured to acquire a trigger instruction that is interrupted by the original operating system process triggered by a power button of the computer when the original operating system is abnormal;
处理模块803,用于根据所述触发指令,按照所述预设配置信息对所述原始操作***的异常信息进行收集,并存储所述异常信息到指定的存储介质中。The processing module 803 is configured to collect abnormal information of the original operating system according to the preset configuration information according to the triggering instruction, and store the abnormal information into a specified storage medium.
其中此处存储异常信息可以通过信息存储模块完成。The exception information stored here can be completed by the information storage module.
通过第一获取模块801获取到相关的预设配置信息后,在原始操作***异常时,第二获取模块802通过获取到触发电源按键的触发指令,触发新操作***,然后再经过处理模块803,收集异常信息的存储所述异常信息到指定的存储介质中。这样在原始***异常信息收集工作不能依赖于原有的***日志模块进行异常信息收集时,通过所述原始操作***所运行的计算机的物理内存中预先存储的一操作***内核,通过新操作***的调度保证在原始***异常时,能有效收集到需要的信息,并且通过硬件的电源按 键中断原始操作***触发新操作***具有通用性,有效收集到关键信息,对此类故障的分析定位有极大的帮助。After the first acquisition module 801 obtains the related preset configuration information, when the original operating system is abnormal, the second obtaining module 802 triggers a trigger command of the power button to trigger a new operating system, and then passes through the processing module 803. The abnormal information is collected and stored in the specified storage medium. In this way, when the original system abnormal information collection work cannot rely on the original system log module for abnormal information collection, an operating system kernel pre-stored in the physical memory of the computer running by the original operating system passes through the new operating system. Scheduling ensures that when the original system is abnormal, it can effectively collect the required information and press the power of the hardware. The key interrupts the original operating system to trigger the new operating system to be versatile, effectively collecting key information, and greatly improving the analysis and positioning of such faults.
本发明的又一实施例的操作***异常信息收集装置中,所述第二获取模块802包括:In the operating system abnormality information collecting apparatus of the embodiment of the present invention, the second obtaining module 802 includes:
第一获取子模块,用于获取通过所述电源按键触发的中断上报到基本输入输出***BIOS的***控制中断SCI中的触发指令。The first obtaining submodule is configured to acquire a triggering instruction that is triggered by the power button to be reported to the system control interrupt SCI of the basic input/output system BIOS.
本发明的又一实施例的操作***异常信息收集装置中,所述第一获取子模块包括:In the operating system abnormality information collecting apparatus of the embodiment of the present invention, the first obtaining submodule includes:
获取单元,用于在原始操作***异常且当前电源按键支持所述ACPI时,获取通过电源按键触发的中断上报到所述BIOS的所述SCI中的触发指令。And an obtaining unit, configured to: when the original operating system is abnormal and the current power button supports the ACPI, acquire a trigger instruction that is triggered by the power button to report to the SCI of the BIOS.
本发明的又一实施例的操作***异常信息收集装置中,所述第二获取模块802包括:In the operating system abnormality information collecting apparatus of the embodiment of the present invention, the second obtaining module 802 includes:
第二获取子模块,用于在所述原始***异常时且当前电源按键不支持ACPI时,直接获取所述计算机的电源按键被按下所触发的所述原始操作***进程中断的触发指令。And a second obtaining submodule, configured to directly acquire a trigger instruction of the original operating system process interrupt triggered by the power button of the computer being pressed when the original system is abnormal and the current power button does not support ACPI.
本发明的又一实施例的操作***异常信息收集装置中,所述处理模块803的步骤包括:In the operating system abnormality information collecting apparatus of the embodiment of the present invention, the processing module 803 includes:
第一处理子模块,用于根据所述触发指令,同步中断所述原始操作***中的程序运行并获取一启动所述新操作***,收集所述原始操作***的异常信息的指令,进行所述异常信息的收集。a first processing submodule, configured to synchronously interrupt a program running in the original operating system according to the triggering instruction, and acquire an instruction to start the new operating system, collect abnormal information of the original operating system, and perform the Collection of abnormal information.
本发明的又一实施例的操作***异常信息收集装置中,所述第一处理子模块包括:In the operating system abnormality information collecting apparatus of the embodiment of the present invention, the first processing submodule includes:
处理单元,用于根据所述触发指令,同步中断所述原始操作***中的程序运行,控制指针指向所述新操作***并启动所述新操作***,收集所 述原始操作***的异常信息的指令,进行所述异常信息的收集。a processing unit, configured to synchronously interrupt a program running in the original operating system according to the triggering instruction, and control a pointer to the new operating system and start the new operating system, and collect the The instruction of the abnormal information of the original operating system is performed to collect the abnormal information.
本发明的又一实施例的操作***异常信息收集装置中,所述处理模块803包括:In the operating system abnormality information collecting apparatus of the embodiment of the present invention, the processing module 803 includes:
第二获取子模块,用于根据所述触发指令,获取到所述预设配置信息的写入方式为裸写或者通过文件***接口写入;a second obtaining submodule, configured to obtain, according to the triggering instruction, a writing manner of the preset configuration information as a bare write or a file system interface;
第二处理子模块,用于按照所述预设配置信息对所述原始操作***的异常信息进行收集,并存储所述异常信息到指定的本地的存储介质中。The second processing submodule is configured to collect abnormal information of the original operating system according to the preset configuration information, and store the abnormal information into a specified local storage medium.
本发明的又一实施例的操作***异常信息收集装置中,所述处理模块803包括:In the operating system abnormality information collecting apparatus of the embodiment of the present invention, the processing module 803 includes:
第三获取子模块,用于根据所述触发指令,获取到所述预设配置信息的预先启动预设远端主机;a third acquiring submodule, configured to obtain a pre-launch preset remote host that obtains the preset configuration information according to the triggering instruction;
第三处理子模块,用于按照所述预设配置信息对所述原始操作***的异常信息进行收集,并存储所述异常信息到预先启动预设远端主机的存储介质中。The third processing sub-module is configured to collect abnormal information of the original operating system according to the preset configuration information, and store the abnormal information into a storage medium that starts a preset remote host in advance.
本发明的又一实施例的操作***异常信息收集装置中还包括:控制模块,用于根据所述预设配置信息,控制关机或者复位。The operating system abnormality information collecting apparatus of the embodiment of the present invention further includes: a control module, configured to control shutdown or reset according to the preset configuration information.
本发明的又一实施例的操作***异常信息收集装置中,所述预设配置信息包括:控制***异常信息收集功能启动/关闭的信息及信息收集的时限信息。In the operating system abnormality information collecting apparatus of the embodiment of the present invention, the preset configuration information includes: information for controlling the startup/shutdown of the system abnormal information collecting function and time limit information for collecting information.
需要说明的是,本发明提供的装置是应用上述操作***异常信息收集方法的装置,则上述操作***异常信息收集方法的所有实施例均适用于该装置,且均能达到相同或相似的有益效果。It should be noted that, the device provided by the present invention is a device that applies the foregoing operating system abnormal information collecting method, and all embodiments of the operating system abnormal information collecting method are applicable to the device, and both can achieve the same or similar beneficial effects. .
为了解决上述技术问题,本发明实施例还提供一种计算机,包括在所运行的计算机的物理内存中预先存储的一操作***内核,所述操作***内核具有如上述的操作***异常信息收集装置。 In order to solve the above technical problem, an embodiment of the present invention further provides a computer, including an operating system kernel pre-stored in a physical memory of a running computer, the operating system kernel having an operating system abnormality information collecting device as described above.
相应的由于本发明实施例的操作***异常信息收集装置,应用于计算机,因此,本发明实施例提供的计算机,其中,上述操作***异常信息收集装置的所述实现实施例均适用于该计算机的实施例中,也能达到相同的技术效果。The operating system abnormal information collecting device of the embodiment of the present invention is applied to a computer. Therefore, the computer provided by the embodiment of the present invention, wherein the implementation embodiments of the operating system abnormal information collecting device are applicable to the computer In the embodiment, the same technical effect can also be achieved.
以上所述是本发明的优选实施方式,应当指出,对于本技术领域的普通技术人员来说,在不脱离本发明所述原理的前提下,还可以作出若干改进和润饰,这些改进和润饰也应视为本发明的保护范围。The above is a preferred embodiment of the present invention, and it should be noted that those skilled in the art can also make several improvements and retouchings without departing from the principles of the present invention. It should be considered as the scope of protection of the present invention.
在本申请所提供的几个实施例中,应该理解到,所揭露的设备和方法,可以通过其它的方式实现。以上所描述的设备实施例仅仅是示意性的,例如,所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,如:多个单元或组件可以结合,或可以集成到另一个***,或一些特征可以忽略,或不执行。另外,所显示或讨论的各组成部分相互之间的耦合、或直接耦合、或通信连接可以是通过一些接口,设备或单元的间接耦合或通信连接,可以是电性的、机械的或其它形式的。In the several embodiments provided by the present application, it should be understood that the disclosed apparatus and method may be implemented in other manners. The device embodiments described above are merely illustrative. For example, the division of the unit is only a logical function division. In actual implementation, there may be another division manner, such as: multiple units or components may be combined, or Can be integrated into another system, or some features can be ignored or not executed. In addition, the coupling, or direct coupling, or communication connection of the components shown or discussed may be indirect coupling or communication connection through some interfaces, devices or units, and may be electrical, mechanical or other forms. of.
上述作为分离部件说明的单元可以是、或也可以不是物理上分开的,作为单元显示的部件可以是、或也可以不是物理单元,即可以位于一个地方,也可以分布到多个网络单元上;可以根据实际的需要选择其中的部分或全部单元来实现本实施例方案的目的。The units described above as separate components may or may not be physically separated, and the components displayed as the unit may or may not be physical units, that is, may be located in one place or distributed to multiple network units; Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
另外,在本发明各实施例中的各功能单元可以全部集成在一个处理单元中,也可以是各单元分别单独作为一个单元,也可以两个或两个以上单元集成在一个单元中;上述集成的单元既可以采用硬件的形式实现,也可以采用硬件加软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may be separately used as one unit, or two or more units may be integrated into one unit; The unit can be implemented in the form of hardware or in the form of hardware plus software functional units.
本领域普通技术人员可以理解:实现上述方法实施例的全部或部分步骤可以通过程序指令相关的硬件来完成,前述的程序可以存储于一计算机可读取存储介质中,该程序在执行时,执行包括上述方法实施例的步骤; 而前述的存储介质包括:移动存储设备、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、磁碟或者光盘等各种可以存储程序代码的介质。A person skilled in the art can understand that all or part of the steps of implementing the above method embodiments may be completed by using hardware related to the program instructions. The foregoing program may be stored in a computer readable storage medium, and the program is executed when executed. Including the steps of the above method embodiments; The foregoing storage medium includes: a removable storage device, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk, and the like, which can store program codes.
或者,本发明上述集成的单元如果以软件功能模块的形式实现并作为独立的产品销售或使用时,也可以存储在一个计算机可读取存储介质中。基于这样的理解,本发明实施例的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机、服务器、或者网络设备等)执行本发明各个实施例所述方法的全部或部分。而前述的存储介质包括:移动存储设备、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、磁碟或者光盘等各种可以存储程序代码的介质。Alternatively, the above-described integrated unit of the present invention may be stored in a computer readable storage medium if it is implemented in the form of a software function module and sold or used as a standalone product. Based on such understanding, the technical solution of the embodiments of the present invention may be embodied in the form of a software product in essence or in the form of a software product stored in a storage medium, including a plurality of instructions. A computer device (which may be a personal computer, server, or network device, etc.) is caused to perform all or part of the methods described in various embodiments of the present invention. The foregoing storage medium includes: a removable storage device, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk, and the like, which can store program codes.
以上所述,仅为本发明的具体实施方式,但本发明的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本发明揭露的技术范围内,可轻易想到变化或替换,都应涵盖在本发明的保护范围之内。因此,本发明的保护范围应以所述权利要求的保护范围为准。The above is only a specific embodiment of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art can easily think of changes or substitutions within the technical scope of the present invention. It should be covered by the scope of the present invention. Therefore, the scope of the invention should be determined by the scope of the appended claims.
工业实用性Industrial applicability
本发明公开了一种操作***异常信息收集方法、装置及计算机,能够通过硬件电源按键触发中断,硬件触发中断上报到新操作***中,然后新操作***获取到硬件触发中断的触发指令,按照预设配置信息进行异常信息的收集及存储,从而完成异常信息的收集。这样通过通用硬件电源按键来触发另一预先配置好的新操作***完成异常信息的有效收集,用于后续的故障分析和定位,不需额外硬件设备,也不需特殊的硬件设计支持,具有很强的通用性,并能有效增强***的可维护性,有效地收集异常信息。 The invention discloses a method, a device and a computer for collecting operating system abnormal information, which can trigger an interrupt through a hardware power button, and the hardware trigger interrupt is reported to the new operating system, and then the new operating system acquires a trigger command of the hardware trigger interrupt, according to the pre- The configuration information is used to collect and store the abnormal information, thereby completing the collection of the abnormal information. In this way, the universal hardware power button is used to trigger another pre-configured new operating system to complete the effective collection of abnormal information for subsequent fault analysis and positioning, without additional hardware equipment, and without special hardware design support. Strong versatility, and can effectively enhance the maintainability of the system, effectively collecting abnormal information.

Claims (21)

  1. 一种操作***异常信息收集方法,包括:A method for collecting operating system abnormal information includes:
    获取原始操作***中预先设定的新操作***的预设配置信息,所述新操作***是所述原始操作***所运行的计算机的物理内存中预先存储的一操作***内核;Obtaining preset configuration information of a new operating system preset in the original operating system, where the new operating system is an operating system kernel pre-stored in a physical memory of a computer running the original operating system;
    获取在所述原始操作***异常时,所述计算机的电源按键被按下所触发的所述原始操作***进程中断的触发指令;Obtaining a trigger instruction that is interrupted by the original operating system process triggered by a power button of the computer when the original operating system is abnormal;
    根据所述触发指令,按照所述预设配置信息对所述原始操作***的异常信息进行收集,并存储所述异常信息到指定的存储介质中。And collecting, according to the triggering instruction, abnormal information of the original operating system according to the preset configuration information, and storing the abnormal information into a specified storage medium.
  2. 根据权利要求1所述的操作***异常信息收集方法,其中,获取在所述原始操作***异常时,所述计算机的电源按键被按下所触发的所述原始操作***进程中断的触发指令,包括:The operating system abnormality information collecting method according to claim 1, wherein a triggering instruction for interrupting the original operating system process triggered by the power button of the computer being pressed when the original operating system is abnormal is acquired, including :
    获取通过所述电源按键触发的中断上报到基本输入输出***BIOS的***控制中断SCI中的触发指令。A trigger instruction that is triggered by the power button is reported to the system control interrupt SCI of the basic input/output system BIOS.
  3. 根据权利要求2所述的操作***异常信息收集方法,其中,获取通过所述电源按键触发的中断上报到基本输入输出***BIOS的***控制中断SCI中的触发指令,包括:The operating system abnormality information collecting method according to claim 2, wherein the triggering instruction triggered by the power button is reported to the system control interrupt SCI of the basic input/output system BIOS, and includes:
    在原始操作***异常且当前电源按键支持所述ACPI时,获取通过电源按键触发的中断上报到所述BIOS的所述SCI中的触发指令。When the original operating system is abnormal and the current power button supports the ACPI, an interrupt command triggered by the power button is reported to the trigger in the SCI of the BIOS.
  4. 根据权利要求1所述的操作***异常信息收集方法,其中,获取在所述原始操作***异常时,所述计算机的电源按键被按下所触发的所述原始操作***进程中断的触发指令,包括:The operating system abnormality information collecting method according to claim 1, wherein a triggering instruction for interrupting the original operating system process triggered by the power button of the computer being pressed when the original operating system is abnormal is acquired, including :
    在所述原始***异常时且当前电源按键不支持ACPI时,直接获取所述计算机的电源按键被按下所触发的所述原始操作***进程中断的触发指令。 When the original system is abnormal and the current power button does not support ACPI, the trigger command of the original operating system process interrupt triggered by the power button of the computer being directly pressed is directly acquired.
  5. 根据权利要求1所述的操作***异常信息收集方法,其中,根据所述触发指令,按照所述预设配置信息对所述原始操作***的异常信息进行收集,并存储所述异常信息到指定的存储介质中,包括:The operating system abnormality information collecting method according to claim 1, wherein the abnormal information of the original operating system is collected according to the preset configuration information according to the triggering instruction, and the abnormal information is stored to a specified Storage media, including:
    根据所述触发指令,同步中断所述原始操作***中的程序运行并获取一启动所述新操作***,收集所述原始操作***的异常信息的指令,进行所述异常信息的收集。And synchronously interrupting the running of the program in the original operating system according to the triggering instruction, and acquiring an instruction to start the new operating system, collecting abnormal information of the original operating system, and collecting the abnormal information.
  6. 根据权利要求5所述的操作***异常信息收集方法,其中,根据所述触发指令,同步中断所述原始操作***中的程序运行并获取一启动所述新操作***收集所述原始操作***的异常信息的指令,进行所述异常信息的收集,包括:The operating system abnormality information collecting method according to claim 5, wherein, according to the triggering instruction, the program running in the original operating system is synchronously interrupted and an abnormality of starting the new operating system to collect the original operating system is acquired. The instruction of the information, the collection of the abnormal information, including:
    根据所述触发指令,同步中断所述原始操作***中的程序运行,控制指针指向所述新操作***并启动所述新操作***,收集所述原始操作***的异常信息的指令,进行所述异常信息的收集。And synchronously interrupting the running of the program in the original operating system according to the triggering instruction, the control pointer points to the new operating system, starts the new operating system, collects an instruction of the abnormal information of the original operating system, and performs the abnormality. Collection of information.
  7. 根据权利要求1所述的操作***异常信息收集方法,其中,根据所述触发指令,按照所述预设配置信息对所述原始操作***的异常信息进行收集,并存储所述异常信息到指定的存储介质中,包括:The operating system abnormality information collecting method according to claim 1, wherein the abnormal information of the original operating system is collected according to the preset configuration information according to the triggering instruction, and the abnormal information is stored to a specified Storage media, including:
    根据所述触发指令,获取到所述预设配置信息的写入方式为裸写或者通过文件***接口写入;Obtaining, according to the triggering instruction, the writing manner of the preset configuration information is bare write or writing through a file system interface;
    按照所述预设配置信息对所述原始操作***的异常信息进行收集,并存储所述异常信息到指定的本地的存储介质中。Collecting abnormal information of the original operating system according to the preset configuration information, and storing the abnormal information into a specified local storage medium.
  8. 根据权利要求1所述的操作***异常信息收集方法,其中,根据所述触发指令,按照所述预设配置信息对所述原始操作***的异常信息进行收集,并存储所述异常信息到指定的存储介质中,包括:The operating system abnormality information collecting method according to claim 1, wherein the abnormal information of the original operating system is collected according to the preset configuration information according to the triggering instruction, and the abnormal information is stored to a specified Storage media, including:
    根据所述触发指令,获取到所述预设配置信息的预先启动预设远端主机; And pre-launching the preset remote host that obtains the preset configuration information according to the triggering instruction;
    按照所述预设配置信息对所述原始操作***的异常信息进行收集,并存储所述异常信息到预先启动预设远端主机的存储介质中。The abnormal information of the original operating system is collected according to the preset configuration information, and the abnormal information is stored in a storage medium that starts a preset remote host in advance.
  9. 根据权利要求1所述的操作***异常信息收集方法,其中,所述方法还包括:根据所述预设配置信息,控制关机或者复位。The operating system abnormality information collecting method according to claim 1, wherein the method further comprises: controlling shutdown or reset according to the preset configuration information.
  10. 根据权利要求1所述的操作***异常信息收集方法,其中,所述预设配置信息包括:控制***异常信息收集功能启动/关闭的信息及信息收集的时限信息。The operating system abnormality information collecting method according to claim 1, wherein the preset configuration information comprises: information for controlling activation/deactivation of the system abnormal information collecting function and time limit information for collecting information.
  11. 一种操作***异常信息收集装置,包括:An operating system abnormal information collecting device includes:
    第一获取模块,配置为获取原始操作***中预先设定的新操作***的预设配置信息,所述新操作***是所述原始操作***所运行的计算机的物理内存中预先存储的一操作***内核;a first acquiring module configured to acquire preset configuration information of a new operating system preset in the original operating system, where the new operating system is an operating system pre-stored in a physical memory of the computer running the original operating system Kernel
    第二获取模块,配置为获取在所述原始操作***异常时,所述计算机的电源按键被按下所触发的所述原始操作***进程中断的触发指令;a second obtaining module, configured to acquire a triggering instruction that is interrupted by the original operating system process triggered by the power button of the computer being pressed when the original operating system is abnormal;
    处理模块,配置为根据所述触发指令,按照所述预设配置信息对所述原始操作***的异常信息进行收集,并存储所述异常信息到指定的存储介质中。The processing module is configured to collect abnormal information of the original operating system according to the preset configuration information according to the triggering instruction, and store the abnormal information into a specified storage medium.
  12. 根据权利要求11所述的操作***异常信息收集装置,其中,所述第二获取模块包括:The operating system abnormality information collecting apparatus according to claim 11, wherein the second obtaining module comprises:
    第一获取子模块,配置为获取通过所述电源按键触发的中断上报到基本输入输出***BIOS的***控制中断SCI中的触发指令。The first obtaining submodule is configured to acquire a triggering instruction in the system control interrupt SCI of the basic input/output system BIOS that is triggered by the power button.
  13. 根据权利要求12所述的操作***异常信息收集装置,其中,所述第一获取子模块包括:The operating system abnormality information collecting apparatus according to claim 12, wherein the first obtaining submodule comprises:
    获取单元,配置为在原始操作***异常且当前电源按键支持所述ACPI时,获取通过电源按键触发的中断上报到所述BIOS的所述SCI中的触发指令。 And an obtaining unit configured to acquire, when the original operating system is abnormal and the current power button supports the ACPI, acquire a trigger instruction reported by the power button to the SCI of the BIOS.
  14. 根据权利要求11所述的操作***异常信息收集装置,其中,所述第二获取模块包括:The operating system abnormality information collecting apparatus according to claim 11, wherein the second obtaining module comprises:
    第二获取子模块,配置为在所述原始***异常时且当前电源按键不支持ACPI时,直接获取所述计算机的电源按键被按下所触发的所述原始操作***进程中断的触发指令。The second obtaining submodule is configured to directly acquire a trigger instruction of the original operating system process interrupt triggered by the power button of the computer being pressed when the original system is abnormal and the current power button does not support ACPI.
  15. 根据权利要求11所述的操作***异常信息收集装置,其中,所述处理模块包括:The operating system abnormality information collecting apparatus according to claim 11, wherein the processing module comprises:
    第一处理子模块,配置为根据所述触发指令,同步中断所述原始操作***中的程序运行并获取一启动所述新操作***,收集所述原始操作***的异常信息的指令,进行所述异常信息的收集。a first processing submodule configured to synchronously interrupt a program running in the original operating system according to the triggering instruction, and acquire an instruction to start the new operating system, collect abnormal information of the original operating system, and perform the Collection of abnormal information.
  16. 根据权利要求15所述的操作***异常信息收集装置,其中,所述第一处理子模块包括:The operating system abnormality information collecting apparatus according to claim 15, wherein the first processing submodule comprises:
    处理单元,配置为根据所述触发指令,同步中断所述原始操作***中的程序运行,控制指针指向所述新操作***并启动所述新操作***,收集所述原始操作***的异常信息的指令,进行所述异常信息的收集。a processing unit, configured to synchronously interrupt a program running in the original operating system according to the triggering instruction, control a pointer to the new operating system, start the new operating system, and collect an instruction of abnormal information of the original operating system And collecting the abnormal information.
  17. 根据权利要求11所述的操作***异常信息收集装置,其中,所述处理模块包括:The operating system abnormality information collecting apparatus according to claim 11, wherein the processing module comprises:
    第二获取子模块,配置为根据所述触发指令,获取到所述预设配置信息的写入方式为裸写或者通过文件***接口写入;The second obtaining sub-module is configured to obtain, according to the triggering instruction, that the writing manner of the preset configuration information is bare write or write through a file system interface;
    第二处理子模块,配置为按照所述预设配置信息对所述原始操作***的异常信息进行收集,并存储所述异常信息到指定的本地的存储介质中。The second processing submodule is configured to collect abnormal information of the original operating system according to the preset configuration information, and store the abnormal information into a specified local storage medium.
  18. 根据权利要求11所述的操作***异常信息收集装置,其中,所述处理模块包括:The operating system abnormality information collecting apparatus according to claim 11, wherein the processing module comprises:
    第三获取子模块,配置为根据所述触发指令,获取到所述预设配置信息的预先启动预设远端主机; a third acquiring sub-module, configured to acquire a preset remote host that is preset to the preset configuration information according to the triggering instruction;
    第三处理子模块,配置为按照所述预设配置信息对所述原始操作***的异常信息进行收集,并存储所述异常信息到预先启动预设远端主机的存储介质中。The third processing sub-module is configured to collect the abnormal information of the original operating system according to the preset configuration information, and store the abnormal information into a storage medium that starts the preset remote host in advance.
  19. 根据权利要求11所述的操作***异常信息收集装置,其中,还包括:控制模块,配置为根据所述预设配置信息,控制关机或者复位。The operating system abnormality information collecting apparatus according to claim 11, further comprising: a control module configured to control shutdown or reset according to the preset configuration information.
  20. 根据权利要求11所述的操作***异常信息收集装置,其中,所述预设配置信息包括:控制***异常信息收集功能启动/关闭的信息及信息收集的时限信息。The operating system abnormality information collecting apparatus according to claim 11, wherein the preset configuration information comprises: information for controlling activation/deactivation of the system abnormality information collecting function and time limit information for collecting information.
  21. 一种计算机,包括在所运行的计算机的物理内存中预先存储的一操作***内核,所述操作***内核包括如权利要求11至20任一项的所述的操作***异常信息收集装置。 A computer comprising an operating system kernel pre-stored in a physical memory of a running computer, the operating system kernel comprising the operating system exception information collecting device according to any one of claims 11 to 20.
PCT/CN2014/091189 2014-09-24 2014-11-14 Method and device for collecting operating system fault information, and computer WO2015131549A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201410495257.9 2014-09-24
CN201410495257.9A CN105512000B (en) 2014-09-24 2014-09-24 Operating system abnormal information collection method and device and computer

Publications (1)

Publication Number Publication Date
WO2015131549A1 true WO2015131549A1 (en) 2015-09-11

Family

ID=54054434

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2014/091189 WO2015131549A1 (en) 2014-09-24 2014-11-14 Method and device for collecting operating system fault information, and computer

Country Status (2)

Country Link
CN (1) CN105512000B (en)
WO (1) WO2015131549A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111367769A (en) * 2020-03-30 2020-07-03 浙江大华技术股份有限公司 Application fault processing method and electronic equipment
CN112015600A (en) * 2019-05-28 2020-12-01 阿里巴巴集团控股有限公司 Log information processing system, log information processing method and device and switch
CN114219231A (en) * 2021-11-19 2022-03-22 广东工业大学 Performance evaluation method of power system, electronic device, and storage medium

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107783853A (en) * 2016-08-26 2018-03-09 中兴通讯股份有限公司 A kind of method and device that abnormal information is collected in the os starting stage
CN107480052B (en) * 2017-07-21 2020-11-13 广东虹勤通讯技术有限公司 Method and device for positioning BIOS (basic input output System) codes during downtime
CN108021407B (en) * 2017-12-06 2021-01-15 锐捷网络股份有限公司 Service processing method and device based on network equipment

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1942859A (en) * 2003-10-01 2007-04-04 扎鲁纳股份有限公司 Operating systems
KR20080039136A (en) * 2006-10-31 2008-05-07 에스케이 텔레콤주식회사 Trouble ticket operating system of communication system
CN103136046A (en) * 2011-11-24 2013-06-05 林振慧 Operation system
CN103902316A (en) * 2012-12-27 2014-07-02 联想(北京)有限公司 Switching method and electronic device
CN103986623A (en) * 2014-05-28 2014-08-13 山东超越数控电子有限公司 Automatic hardware equipment monitoring system based on domestic operating system

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100472468C (en) * 2005-09-30 2009-03-25 联想(北京)有限公司 Computer system, computer network and method
CN100472471C (en) * 2006-02-22 2009-03-25 联想(北京)有限公司 System and method for obtaining fault in-situ information for computer operating system
CN100517246C (en) * 2006-03-14 2009-07-22 联想(北京)有限公司 Computer remote control method and system
CN102163167A (en) * 2011-03-02 2011-08-24 奇智软件(北京)有限公司 Method and device for rescuing system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1942859A (en) * 2003-10-01 2007-04-04 扎鲁纳股份有限公司 Operating systems
KR20080039136A (en) * 2006-10-31 2008-05-07 에스케이 텔레콤주식회사 Trouble ticket operating system of communication system
CN103136046A (en) * 2011-11-24 2013-06-05 林振慧 Operation system
CN103902316A (en) * 2012-12-27 2014-07-02 联想(北京)有限公司 Switching method and electronic device
CN103986623A (en) * 2014-05-28 2014-08-13 山东超越数控电子有限公司 Automatic hardware equipment monitoring system based on domestic operating system

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112015600A (en) * 2019-05-28 2020-12-01 阿里巴巴集团控股有限公司 Log information processing system, log information processing method and device and switch
CN111367769A (en) * 2020-03-30 2020-07-03 浙江大华技术股份有限公司 Application fault processing method and electronic equipment
CN111367769B (en) * 2020-03-30 2023-07-21 浙江大华技术股份有限公司 Application fault processing method and electronic equipment
CN114219231A (en) * 2021-11-19 2022-03-22 广东工业大学 Performance evaluation method of power system, electronic device, and storage medium
CN114219231B (en) * 2021-11-19 2022-09-23 广东工业大学 Performance evaluation method of power system, electronic device, and storage medium

Also Published As

Publication number Publication date
CN105512000A (en) 2016-04-20
CN105512000B (en) 2020-04-24

Similar Documents

Publication Publication Date Title
WO2015131549A1 (en) Method and device for collecting operating system fault information, and computer
JP6530774B2 (en) Hardware failure recovery system
US7574627B2 (en) Memory dump method, memory dump program and computer system
US9912535B2 (en) System and method of performing high availability configuration and validation of virtual desktop infrastructure (VDI)
WO2017063505A1 (en) Method for detecting hardware fault of server, apparatus thereof, and server
US10540498B2 (en) Technologies for hardware assisted native malware detection
JP2010086181A (en) Virtual machine system, method for managing thereof, program, and recording medium
WO2016000298A1 (en) System exception capturing method, main system, shadow system and intelligent device
US20140189103A1 (en) System for monitoring servers and method thereof
JP2010086364A (en) Information processing device, operation state monitoring device and method
WO2013112159A1 (en) Image-based application automation
US11249771B2 (en) Terminal input invocation
US20160259578A1 (en) Apparatus and method for detecting performance deterioration in a virtualization system
TW201640351A (en) Resource monitoring system and method thereof
GB2458005A (en) Shutting down unnecessary programs in a computer system
TW201500919A (en) System and method of remote debugging BMC
CN114184885A (en) Fault detection method, device and storage medium
JP6264879B2 (en) Information processing apparatus, monitoring program, and monitoring method
KR102137891B1 (en) Server managing Method, Server, and Recording medium using User Specialized Operating Mechanism on BMC environment
JP2011028438A (en) Application verification system and computer program
JP2009205208A (en) Operation management device, method and program
JP5429171B2 (en) Information processing apparatus and hangup cause investigation information acquisition method
US20050210329A1 (en) Facilitating system diagnostic functionality through selective quiescing of system component sensor devices
JP5668836B2 (en) Information processing apparatus, information acquisition method, and information acquisition program
WO2018036318A1 (en) Method and device for collecting abnormality information during startup of operating system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14885047

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 14885047

Country of ref document: EP

Kind code of ref document: A1