CN105511972A - Method for reducing faults of independent redundant disk array - Google Patents

Info

Publication number
CN105511972A
Authority
CN
China
Prior art keywords
array
raid
operating strategy
vibrations
impact
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201410505314.7A
Other languages
Chinese (zh)
Other versions
CN105511972B (en)
Inventor
辛伯勇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Lenovo Beijing Ltd
Original Assignee
Lenovo Beijing Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lenovo Beijing Ltd filed Critical Lenovo Beijing Ltd
Priority to CN201410505314.7A priority Critical patent/CN105511972B/en
Publication of CN105511972A publication Critical patent/CN105511972A/en
Application granted granted Critical
Publication of CN105511972B publication Critical patent/CN105511972B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Landscapes

  • Debugging And Monitoring (AREA)

Abstract

The invention provides a method for reducing faults of a redundant array of independent disks (RAID). The method comprises: detecting, at a certain frequency, vibration or shock experienced by the RAID array; each time vibration or shock is detected, managing the RAID array with a first management strategy if the vibration or shock does not exceed a first threshold; calling a second management strategy to manage the RAID array if the vibration or shock exceeds the first threshold; and, after the second management strategy has been called, taking a disk offline from the RAID array if that disk does not respond under the second management strategy. The method reduces the number of disks that the storage system takes offline by mistake.

Description

A method for reducing RAID array faults
Technical field
The present invention relates to a method for reducing faults, and more specifically to a method for reducing faults of a redundant array of independent disks (RAID).
Background
A redundant array of independent disks (RAID) is a storage technology that combines multiple disks into one large disk group. Data is distributed across the disks, and storage performance is higher than that of a single disk. RAID arrays are divided into several types, of which the more common are RAID 0, RAID 1, RAID 0+1, and RAID 5. Through the relevant hardware and software, a user-configured RAID level, and multiple hard disks, RAID can meet a user's demand for high performance and high reliability. It can also monitor and manage the hard disks in the array: when a hard disk runs at low performance for a long time or stops responding, the disk can be "kicked out" of the RAID array to better protect the array's data and to limit the impact on the performance of the whole array.
At present, vibration and shock remain key factors affecting the performance, reliability, and lifetime of mechanical hard disks. When a running hard disk is subjected to vibration or shock, its performance can be significantly affected, and it may even stop responding, either actively or passively. Statistics indicate that, among hard disks judged to be bad and "kicked out", a relatively high proportion were kicked out by mistake because the system suffered an unexpected shock that degraded disk performance.
Therefore, a problem to be solved is how to optimize existing storage systems that have a RAID array so that, when vibration or shock degrades the performance of a running hard disk, the storage system less often "kicks out" that disk by mistake, which makes the storage system more practical and improves the user experience.
Summary of the invention
To solve the above technical problem in the prior art, according to one aspect of the present invention, a method for reducing RAID array faults is provided. The method comprises: detecting vibration or shock experienced by the RAID array; when vibration or shock is detected, determining whether the vibration or shock is greater than a first threshold; if the vibration or shock is not greater than the first threshold, managing the RAID array with a first management strategy; if the vibration or shock is greater than the first threshold, calling a second management strategy to manage the RAID array; after the second management strategy has been called, determining whether any hard disk in the RAID array fails to respond under the second management strategy; and, if a particular hard disk in the RAID array does not respond under the second management strategy, taking that hard disk offline from the RAID array. The performance expectation for the RAID array under the second management strategy is lower than the performance expectation under the first management strategy.
In addition, according to one embodiment of the present invention, the method further comprises: if all hard disks of the RAID array respond under the second management strategy, continuing to manage the RAID array according to the second management strategy.
In addition, according to one embodiment of the present invention, the time spent waiting for the RAID array to respond when it is managed with the second management strategy is longer than the time spent waiting when it is managed with the first management strategy.
In addition, according to one embodiment of the present invention, the number of retries issued to the RAID array when it is managed with the second management strategy is greater than the number of retries issued when it is managed with the first management strategy.
In addition, according to one embodiment of the present invention, detecting the vibration or shock experienced by the RAID array comprises detecting it at a certain frequency by using an acceleration sensor.
According to a further aspect of the present invention, a device for reducing RAID array faults is provided. The device comprises: a detecting unit configured to detect vibration or shock experienced by the RAID array; a first judging unit configured to determine, when vibration or shock is detected, whether the vibration or shock is greater than a first threshold; a first management unit configured to manage the RAID array with a first management strategy if the vibration or shock is not greater than the first threshold; a calling unit configured to call a second management strategy to manage the RAID array if the vibration or shock is greater than the first threshold; a second judging unit configured to determine, after the second management strategy has been called, whether any hard disk in the RAID array fails to respond under the second management strategy; and a fault processing unit configured to take a particular hard disk offline from the RAID array if that hard disk does not respond under the second management strategy. The performance expectation for the RAID array under the second management strategy is lower than the performance expectation under the first management strategy.
In addition, according to one embodiment of the present invention, the device further comprises a second management unit configured to continue managing the RAID array according to the second management strategy if all hard disks of the RAID array respond under the second management strategy.
In addition, according to one embodiment of the present invention, the time spent waiting for the RAID array to respond when it is managed with the second management strategy is longer than the time spent waiting when it is managed with the first management strategy.
In addition, according to one embodiment of the present invention, the number of retries issued to the RAID array when it is managed with the second management strategy is greater than the number of retries issued when it is managed with the first management strategy.
In addition, according to one embodiment of the present invention, the detecting unit is further configured to detect, at a certain frequency, the vibration or shock experienced by the RAID array by using an acceleration sensor.
According to another aspect of the present invention, a data storage system is provided. The system comprises: a RAID array for storing data; an acceleration sensor for sensing vibration or shock experienced by the RAID array; and a device for reducing RAID array faults. The device comprises: a detecting unit configured to detect vibration or shock experienced by the RAID array; a first judging unit configured to determine, when vibration or shock is detected, whether the vibration or shock is greater than a first threshold; a first management unit configured to manage the RAID array with a first management strategy if the vibration or shock is not greater than the first threshold; a calling unit configured to call a second management strategy to manage the RAID array if the vibration or shock is greater than the first threshold; a second judging unit configured to determine, after the second management strategy has been called, whether any hard disk in the RAID array fails to respond under the second management strategy; and a fault processing unit configured to take a particular hard disk offline from the RAID array if that hard disk does not respond under the second management strategy. The performance expectation for the RAID array under the second management strategy is lower than the performance expectation under the first management strategy.
In addition, according to one embodiment of the present invention, the device for reducing RAID array faults further comprises a second management unit configured to continue managing the RAID array according to the second management strategy if all hard disks of the RAID array respond under the second management strategy.
In addition, according to one embodiment of the present invention, the time spent waiting for the RAID array to respond when it is managed with the second management strategy is longer than the time spent waiting when it is managed with the first management strategy.
In addition, according to one embodiment of the present invention, the number of retries issued to the RAID array when it is managed with the second management strategy is greater than the number of retries issued when it is managed with the first management strategy.
In addition, according to one embodiment of the present invention, the detecting unit is further configured to detect, at a certain frequency, the vibration or shock experienced by the RAID array by using the acceleration sensor.
As can be seen, the method and device for reducing RAID array faults provided by the present invention optimize an existing storage system that has a RAID array: when vibration or shock degrades the performance of a running hard disk, the storage system less often "kicks out" that disk by mistake, which makes the storage system more practical and improves the user experience.
Brief description of the drawings
To illustrate the technical solutions of the embodiments of the present invention more clearly, the drawings used in the description of the embodiments are briefly introduced below. The drawings described below are merely exemplary embodiments of the present invention:
Fig. 1 shows a flowchart of a method 100 for reducing RAID array faults, applied to an information processing system, according to an embodiment of the present invention;
Fig. 2 shows an exemplary block diagram of a device 200 for reducing RAID array faults according to an embodiment of the present invention;
Fig. 3 shows an exemplary block diagram of a data storage system 300 according to an embodiment of the present invention.
Detailed description of the embodiments
Hereinafter, preferred embodiments of the present invention are described in detail with reference to the drawings. Note that, in this specification and the drawings, substantially identical steps and elements are denoted by the same reference numerals, and repeated explanation of them is omitted.
Reference throughout this specification to "one embodiment" or "an embodiment" means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one described embodiment. Therefore, appearances of the phrase "in one embodiment" or "in an embodiment" in this specification do not necessarily all refer to the same embodiment. In addition, the particular features, structures, or characteristics may be combined in any suitable manner in one or more embodiments.
The terms "disk" and "hard disk" used in the embodiments of the present invention have essentially the same meaning. A disk is a device that reads and writes data magnetically; it is a non-volatile storage medium, so saved files are not lost after power-off. A hard disk seals the disk platters inside a rigid metal enclosure, which protects them better.
Below, a method for reducing RAID array faults according to an embodiment of the present invention is described with reference to Fig. 1. Fig. 1 shows a flowchart of a method 100 for reducing RAID array faults, applied to an information processing system, according to an embodiment of the present invention, where the information processing system includes a disk array system for storing data. As shown in Fig. 1, the method 100 may include at least the following steps.
In step S110, vibration or shock experienced by the RAID array is detected. Specifically, in one embodiment of the present invention, the vibration or shock may be detected at a certain frequency by using an acceleration sensing device such as a gravity sensor. For example, an acceleration sensor may be provided in the information processing system that has the RAID array; the acceleration sensor may be communicatively connected to the RAID array and may sense, in real time, the vibration or shock the array experiences.
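Detection at a certain frequency, as in step S110, can be sketched as a fixed-rate polling loop. This is a minimal sketch rather than the patented implementation: the name sample_at_frequency, the read_sensor callback, and the three-axis sample format are assumptions made for illustration, and the patent does not state a sampling rate.

```python
import time
from typing import Callable, Iterator, Tuple

Sample = Tuple[float, float, float]  # assumed format: acceleration along x, y, z


def sample_at_frequency(read_sensor: Callable[[], Sample],
                        frequency_hz: float,
                        count: int,
                        sleep: Callable[[float], None] = time.sleep) -> Iterator[Sample]:
    """Yield `count` sensor readings, pausing 1/frequency_hz seconds between them.

    `read_sensor` is a hypothetical callback standing in for the real
    acceleration sensor driver.
    """
    if frequency_hz <= 0:
        raise ValueError("frequency_hz must be positive")
    period = 1.0 / frequency_hz
    for i in range(count):
        yield read_sensor()
        if i < count - 1:
            sleep(period)  # wait one sampling period before the next reading
```

The sleep function is passed in as a parameter so that the loop can be exercised without real delays.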
In step S120, when vibration or shock of the RAID array is detected, it is determined whether the vibration or shock is greater than the first threshold. Specifically, in one embodiment of the present invention, the vibration or shock sensed by the acceleration sensor may be compared with a predetermined vibration protection threshold. Typically, an acceleration sensor senses the acceleration of a moving object by surface micro-electromechanical technology, and the magnitude of the vibration or shock on the object is then calculated from that acceleration using the relevant physical laws. In embodiments of the present invention, the calculated magnitude of the vibration or shock experienced by the RAID array may be compared with the predetermined vibration protection threshold.
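The patent does not say which physical relation turns the sensed acceleration into a vibration or shock magnitude. One simple possibility, shown here purely as an assumption, is to take the deviation of the acceleration vector's length from 1 g (the reading of a sensor at rest) and compare it with the first threshold. The function names and the unit (g) are illustrative.

```python
import math
from typing import Tuple

GRAVITY_G = 1.0  # assumption: a sensor at rest reads about 1 g


def shock_magnitude(sample: Tuple[float, float, float]) -> float:
    """Deviation of the acceleration magnitude from 1 g, in g (assumed metric)."""
    x, y, z = sample
    return abs(math.sqrt(x * x + y * y + z * z) - GRAVITY_G)


def exceeds_threshold(sample: Tuple[float, float, float],
                      first_threshold_g: float) -> bool:
    """True only when the shock is strictly greater than the first threshold."""
    return shock_magnitude(sample) > first_threshold_g
```

The comparison is strict because the text routes a shock that is "not greater than" the threshold to the first management strategy.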
In step S130, if the vibration or shock is not greater than the first threshold, the RAID array is managed with the first management strategy. Specifically, in one embodiment of the present invention, when the vibration or shock sensed by the acceleration sensor is not greater than the predetermined vibration protection threshold, the vibration or shock experienced by the RAID array can be judged to be within the normal range. In this case, the RAID array may continue to be managed with the existing normal management strategy.
In step S140, if the vibration or shock is greater than the first threshold, the second management strategy is called to manage the RAID array. The performance expectation for the RAID array under the second management strategy is lower than the performance expectation under the first management strategy. Specifically, in one embodiment of the present invention, the time spent waiting for the RAID array to respond when it is managed with the second management strategy may be longer than the time spent waiting when it is managed with the first management strategy. In another embodiment of the present invention, the number of retries issued to the RAID array when it is managed with the second management strategy may be greater than the number of retries issued when it is managed with the first management strategy.
Specifically, when the vibration or shock sensed by the acceleration sensor is greater than the predetermined vibration protection threshold, the vibration or shock experienced by the RAID array can be determined to be outside the normal range. Such vibration or shock may degrade the performance of individual hard disks and may interfere with the normal operation of the hard disks in the RAID array. However, a hard disk whose performance has dropped may only be running at low performance for a period of time and is not necessarily a "bad" hard disk that cannot work. If the normal disk management strategy continued to be used in this case, the degraded hard disk might be wrongly judged to be a "bad" hard disk and kicked out. Calling a separate management strategy with a lower expectation of hard disk performance to manage the RAID array can therefore prevent degraded hard disks from being kicked out by mistake. In one embodiment of the present invention, the management strategy with the lower performance expectation may extend the time spent waiting for the RAID array to respond. In another embodiment of the present invention, that management strategy may also increase the number of retries issued to the hard disks in the managed RAID array, which likewise prevents degraded hard disks from being kicked out by mistake.
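The effect of a longer wait time and more retries can be illustrated with a small worked example. The latencies, timeouts, and retry counts below are hypothetical numbers chosen for illustration; the patent gives no concrete values, and the function name responds is not from the source.

```python
from typing import Sequence


def responds(attempt_latencies_s: Sequence[float],
             timeout_s: float, max_retries: int) -> bool:
    """True if any allowed attempt completes within the timeout.

    attempt_latencies_s[i] is how long the disk would take to answer
    attempt i; one initial attempt plus max_retries retries are allowed.
    """
    allowed = attempt_latencies_s[: 1 + max_retries]
    return any(latency <= timeout_s for latency in allowed)


# Hypothetical latencies (seconds) of a disk slowed down by a shock.
slowed_disk = [8.0, 6.0, 3.0, 0.5]

# First (normal) strategy: short wait, few retries.
normal = responds(slowed_disk, timeout_s=1.0, max_retries=1)
# Second (relaxed) strategy: longer wait, more retries.
relaxed = responds(slowed_disk, timeout_s=4.0, max_retries=3)
```

With these numbers the slowed disk is judged unresponsive under the first strategy (both allowed attempts exceed 1 s) but responsive under the second (the third attempt finishes within 4 s), which is the misjudgment the second strategy is meant to avoid.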
In step S150, after the second management strategy has been called to manage the RAID array, it is determined whether any hard disk in the RAID array fails to respond under the second management strategy. Specifically, when the vibration or shock experienced by the RAID array is large, the separate management strategy with the lower expectation of hard disk performance may be called to manage the hard disks of the RAID array. While the RAID array is managed with that strategy, it can be determined whether any hard disk in the array still fails to respond.
In step S160, if a particular hard disk in the RAID array does not respond under the second management strategy, that hard disk is taken offline from the RAID array. Specifically, if a hard disk still fails to respond while the RAID array is managed with the lower-expectation strategy, the hard disk is probably not merely degraded. It can therefore be judged to be a "bad" hard disk that cannot work, and in this case it can be "kicked out" of the RAID array, that is, taken offline from the RAID array.
In addition, in one embodiment of the present invention, if all hard disks of the RAID array respond under the second management strategy, the RAID array continues to be managed according to the second management strategy. Specifically, suppose the lower-expectation strategy has been called and the hard disks of the RAID array respond under it: for example, after the number of retries issued to the hard disks is increased, the hard disks respond to the additional access attempts, or, after the response wait time is extended, the hard disks respond to access requests within the extended wait time. This shows that the hard disks are merely degraded and are not bad. In this case, the lower-expectation strategy can continue to be used to manage the RAID array.
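Steps S120 to S160, together with the continuation case just described, can be put together in a short decision function. This is a sketch under stated assumptions, not the patented implementation: the Strategy fields, the concrete timeout and retry values, the probe callback, and the disk names in the usage note are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List, Tuple


@dataclass(frozen=True)
class Strategy:
    """Illustrative management strategy; field names are assumptions."""
    name: str
    timeout_s: float
    max_retries: int


# Hypothetical values: the text only requires that the second strategy
# expects less performance (longer wait and/or more retries).
FIRST = Strategy("first", timeout_s=1.0, max_retries=1)
SECOND = Strategy("second", timeout_s=5.0, max_retries=4)

# probe(disk, strategy) -> True if the disk responds under that strategy.
Probe = Callable[[str, Strategy], bool]


def handle_shock(shock: float, first_threshold: float,
                 disks: Iterable[str], probe: Probe) -> Tuple[Strategy, List[str]]:
    """Return (strategy now in force, disks taken offline)."""
    # S120/S130: shock within the normal range, keep the first strategy.
    if shock <= first_threshold:
        return FIRST, []
    # S140: shock above the threshold, call the second strategy.
    # S150/S160: take offline only disks that do not respond under it.
    offline = [d for d in disks if not probe(d, SECOND)]
    # If every disk responded, `offline` is empty and management simply
    # continues under the second strategy.
    return SECOND, offline
```

For example, with a probe that reports a hypothetical disk "sdb" as unresponsive, a shock above the threshold returns the second strategy and takes only "sdb" offline, while a shock at or below the threshold leaves every disk in the array.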
As can be seen, the method 100 for reducing RAID array faults provided by the present invention optimizes an existing storage system that has a RAID array: when vibration or shock degrades the performance of a running hard disk, the storage system less often "kicks out" that disk by mistake, which makes the storage system more practical and improves the user experience.
Below, a device 200 for reducing RAID array faults according to the present invention is described with reference to Fig. 2. Fig. 2 shows an exemplary block diagram of the device 200 according to an embodiment of the present invention. The device 200 may comprise: a detecting unit 210, a first judging unit 220, a first management unit 230, a calling unit 240, a second judging unit 250, and a fault processing unit 260.
Specifically, the detecting unit 210 may be configured to detect vibration or shock experienced by the RAID array. In one embodiment of the present invention, the detecting unit 210 may detect the vibration or shock at a certain frequency by using an acceleration sensing device such as a gravity sensor. For example, an acceleration sensor may be provided in the information processing system that has the RAID array; the acceleration sensor may be communicatively connected to the RAID array and may sense, in real time, the vibration or shock the array experiences.
The first judging unit 220 may be configured to determine, when vibration or shock of the RAID array is detected, whether the vibration or shock is greater than the first threshold. Specifically, in one embodiment of the present invention, the first judging unit 220 may be configured to compare the vibration or shock sensed by the acceleration sensor with a predetermined vibration protection threshold. Typically, an acceleration sensor senses the acceleration of a moving object by surface micro-electromechanical technology, and the magnitude of the vibration or shock on the object is then calculated from that acceleration using the relevant physical laws. In embodiments of the present invention, the first judging unit 220 may be configured to compare the calculated magnitude of the vibration or shock experienced by the RAID array with the predetermined vibration protection threshold.
The first management unit 230 may be configured to manage the RAID array with the first management strategy if the vibration or shock is not greater than the first threshold. Specifically, in one embodiment of the present invention, when the vibration or shock sensed by the acceleration sensor is not greater than the predetermined vibration protection threshold, the vibration or shock experienced by the RAID array can be judged to be within the normal range. In this case, the first management unit 230 may continue to manage the RAID array with the existing normal management strategy.
The calling unit 240 may be configured to call the second management strategy to manage the RAID array if the vibration or shock is greater than the first threshold. The performance expectation for the RAID array under the second management strategy is lower than the performance expectation under the first management strategy. Specifically, in one embodiment of the present invention, the time spent waiting for the RAID array to respond when it is managed with the second management strategy may be longer than the time spent waiting when it is managed with the first management strategy. In another embodiment of the present invention, the number of retries issued to the RAID array when it is managed with the second management strategy may be greater than the number of retries issued when it is managed with the first management strategy.
The second judging unit 250 may be configured to determine, after the second management strategy has been called to manage the RAID array, whether any hard disk in the RAID array fails to respond under the second management strategy. Specifically, when the vibration or shock experienced by the RAID array is large, the separate management strategy with the lower expectation of hard disk performance may be called to manage the hard disks of the RAID array. While the RAID array is managed with that strategy, the second judging unit 250 may be configured to determine whether any hard disk in the array still fails to respond.
The fault processing unit 260 may be configured to take a particular hard disk offline from the RAID array if that hard disk does not respond under the second management strategy. Specifically, if a hard disk still fails to respond while the RAID array is managed with the lower-expectation strategy, the hard disk is probably not merely degraded. It can therefore be judged to be a "bad" hard disk that cannot work, and in this case the fault processing unit 260 may be configured to "kick out" the hard disk, that is, to take it offline from the RAID array.
In addition, in one embodiment of the present invention, the device 200 for reducing RAID array faults may further comprise a second management unit configured to continue managing the hard disks of the RAID array according to the second management strategy if all hard disks of the RAID array respond under the second management strategy. Specifically, suppose the calling unit has called the lower-expectation strategy and the hard disks in the RAID array respond to access requests under it: for example, after the number of retries issued to the hard disks is increased, the hard disks respond to the additional access attempts, or, after the response wait time is extended, the hard disks respond to access requests within the extended wait time. This shows that the hard disks are merely degraded and are not bad. In this case, the second management unit may continue to use the lower-expectation strategy to manage the RAID array.
As can be seen, the device 200 for reducing RAID array faults provided by the present invention optimizes an existing storage system that has a RAID array: when vibration or shock degrades the performance of a running hard disk, the storage system less often "kicks out" that disk by mistake, which makes the storage system more practical and improves the user experience.
In another aspect, the present invention further provides a data storage system 300. Fig. 3 shows an exemplary block diagram of the data storage system 300 according to an embodiment of the present invention. As shown in the figure, the data storage system 300 may comprise a RAID array 310, an acceleration sensor 320, and a fault reduction device 330.
Specifically, the RAID array 310 may be used to store data. In one embodiment of the present invention, the RAID array 310 may virtualize multiple disks or hard disks into a single large-capacity disk or hard disk. Reading and writing the member disks in parallel increases overall storage speed, so the array provides higher storage performance and better data backup capability than a single disk or hard disk of equivalent capacity.
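How several disks can appear as one large disk that is read and written in parallel can be illustrated with a striping address map. The patent does not specify a layout, so the following is a sketch assuming a simple RAID 0 style round-robin stripe; the function and parameter names are hypothetical.

```python
from typing import Tuple


def locate_block(logical_block: int, num_disks: int,
                 blocks_per_stripe_unit: int = 1) -> Tuple[int, int]:
    """Map a logical block to (disk index, block offset on that disk).

    Assumes RAID 0 style striping: consecutive stripe units are placed
    on consecutive disks, wrapping around after the last disk.
    """
    if num_disks < 1 or blocks_per_stripe_unit < 1 or logical_block < 0:
        raise ValueError("invalid geometry")
    unit, within = divmod(logical_block, blocks_per_stripe_unit)
    row, disk = divmod(unit, num_disks)
    return disk, row * blocks_per_stripe_unit + within
```

Because neighbouring stripe units land on different disks, a large sequential request is spread over all member disks and can be served by them at the same time.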
Acceleration sensor 320, may be used for sensing the vibrations that receive of raid-array or impact.Particularly, in one embodiment of the invention, vibrations suffered by raid-array or impact can be detected by using the acceleration sensing device of such as gravity sensor.Speed sensor can be connected mutually communicatedly with raid-array, and can sense vibrations or impact that raid-array suffers from real time.Usually, acceleration sensor 320 by the acceleration of surperficial micro-electromechanical technology object sensing motion, then can pass through related physical law, calculates the size of vibrations suffered by object or impact further.
The RAID array fault reduction device 330 may be configured to reduce faults of the RAID array. The RAID array fault reduction device 330 may further comprise: a detecting unit 210, a first judging unit 220, a first management unit 230, a calling unit 240, a second judging unit 250, and a fault processing unit 260.
Specifically, the detecting unit 210 may be configured to detect the vibration or impact to which the RAID array is subjected. In particular, the detecting unit 210 may be communicatively connected with the acceleration sensor 320 and may use the acceleration sensor 320 to detect, at a certain frequency, the vibration or impact to which the RAID array is subjected.
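Detection "at a certain frequency" can be pictured as a fixed-rate polling loop. The sketch below is hypothetical: the patent does not specify a sampling rate or a sensor interface, so `read_sensor`, `frequency_hz`, and `samples` are assumed names, and the `sleep` parameter exists only so the loop can be exercised without real delays.

```python
import time
from typing import Callable, Iterator


def sample_shocks(read_sensor: Callable[[], float],
                  frequency_hz: float,
                  samples: int,
                  sleep: Callable[[float], None] = time.sleep) -> Iterator[float]:
    """Yield `samples` sensor readings, pausing 1/frequency_hz seconds after each."""
    if frequency_hz <= 0:
        raise ValueError("frequency_hz must be positive")
    period = 1.0 / frequency_hz
    for _ in range(samples):
        # read_sensor is a hypothetical callback returning the current magnitude.
        yield read_sensor()
        sleep(period)
```

Each yielded reading would then be handed to the first judging unit for comparison with the first threshold.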
The first judging unit 220 may be configured to judge, when it is detected that the RAID array is subjected to vibration or impact, whether the vibration or impact is greater than a first threshold. Specifically, in one embodiment of the invention, the first judging unit 220 may be configured to compare the vibration or impact sensed by the acceleration sensor with a predetermined acceleration protection threshold.
The first management unit 230 may be configured to manage the RAID array with the first management strategy if the vibration or impact is not greater than the first threshold. Specifically, in one embodiment of the invention, when the vibration or impact sensed by the acceleration sensor is not greater than the predetermined acceleration protection threshold, the vibration or impact on the RAID array can be judged to be within the normal range. In this case, the first management unit 230 may continue to manage the RAID array with the existing normal management strategy.
The calling unit 240 may be configured to call the second management strategy to manage the RAID array if the vibration or impact is greater than the first threshold. The performance expectation for the RAID array in the second management strategy is lower than that in the first management strategy. Specifically, in one embodiment of the invention, the time spent waiting for the RAID array to respond under the second management strategy may be greater than the time spent waiting for it to respond under the first management strategy. In another embodiment of the invention, the number of retries on the RAID array under the second management strategy may be greater than the number of retries under the first management strategy.
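The choice between the two management strategies can be illustrated with a short sketch. The class name `ManagementStrategy`, the field names, and all numeric values are assumptions made for illustration; the patent only requires that the second strategy wait longer or retry more than the first.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ManagementStrategy:
    response_timeout_s: float  # how long to wait for a disk to respond
    max_retries: int           # how many times to retry an access


# Assumed example values; the patent gives no concrete numbers.
FIRST_STRATEGY = ManagementStrategy(response_timeout_s=5.0, max_retries=2)
SECOND_STRATEGY = ManagementStrategy(response_timeout_s=30.0, max_retries=8)


def select_strategy(shock: float, first_threshold: float) -> ManagementStrategy:
    """Use the relaxed second strategy only when the shock exceeds the threshold."""
    if shock > first_threshold:
        return SECOND_STRATEGY
    return FIRST_STRATEGY
```

A shock exactly equal to the threshold keeps the first strategy, matching the "not greater than" wording of the text.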
The second judging unit 250 may be configured to judge, after the second management strategy is called to manage the RAID array, whether any hard disk in the RAID array fails to respond under the second management strategy. Specifically, when the vibration or impact to which the RAID array is subjected is large, a separate management strategy with reduced hard disk performance expectations may be called to manage the hard disks of the RAID array. While that strategy is in use, the second judging unit 250 may be configured to judge whether any hard disk in the RAID array still fails to respond under it.
The fault processing unit 260 may be configured to take a particular hard disk offline from the RAID array if that hard disk does not respond under the second management strategy. Specifically, when the management strategy with reduced hard disk performance expectations is used to manage the RAID array, a hard disk that still fails to respond under that strategy is probably not merely a hard disk with degraded performance. It can therefore be judged that this hard disk may be a "bad" hard disk that cannot work. In this case, the fault processing unit 260 may be configured to "kick out" this hard disk from the RAID array, that is, to take the hard disk offline from the RAID array.
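The decision described above, in which only disks that stay silent under the relaxed strategy are dropped, might look like the following sketch. The `probe` callback stands in for whatever access mechanism a real storage system uses; it, the function name, and the parameters are hypothetical.

```python
from typing import Callable, List


def disks_to_take_offline(disks: List[str],
                          probe: Callable[[str, float], bool],
                          timeout_s: float,
                          max_retries: int) -> List[str]:
    """Return the disks that never respond under the relaxed strategy.

    probe(disk, timeout_s) is a hypothetical callback that returns True if
    the disk answers an access within timeout_s seconds.
    """
    failed = []
    for disk in disks:
        # One initial attempt plus max_retries retries; stop at first response.
        responded = any(probe(disk, timeout_s) for _ in range(max_retries + 1))
        if not responded:
            failed.append(disk)
    return failed
```

Disks that answer on any attempt are treated as merely degraded and remain in the array under the second strategy.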
In addition, in one embodiment of the invention, the device 200 for reducing RAID array faults may further include a second management unit. If all hard disks of the RAID array respond under the second management strategy, the second management unit is configured to continue managing the hard disks according to the second management strategy. Specifically, after the calling unit calls the management strategy with reduced hard disk performance expectations to manage the RAID array, if the hard disks in the RAID array can respond to hard disk accesses under that strategy, this indicates that the performance of the hard disks has merely degraded and that they are not bad disks. In this case, the second management unit may continue to use the management strategy with reduced hard disk performance expectations to manage the RAID array.
It can thus be seen that the data storage system 300 provided by the invention can optimize an existing storage system that has a RAID array: when a running hard disk is subjected to vibration or impact that degrades its performance, the system reduces the cases in which the hard disk is mistakenly "kicked out", which makes the storage system more practical and improves the user experience.
Those of ordinary skill in the art will recognize that the units and algorithm steps of the examples described in connection with the embodiments disclosed herein can be implemented in electronic hardware, computer software, or a combination of the two. To clearly illustrate the interchangeability of hardware and software, the composition and steps of each example have been described above generally in terms of function. Whether these functions are performed in hardware or software depends on the particular application and the design constraints of the technical solution. Those skilled in the art may use different methods to implement the described functions for each particular application, but such implementations should not be considered to go beyond the scope of the present invention.
The embodiments of the present invention have been described in detail above. However, those skilled in the art should understand that various modifications, combinations, or sub-combinations may be made to these embodiments without departing from the principle and spirit of the present invention, and that such modifications fall within the scope of the present invention.

Claims (15)

1. A method for reducing faults of a redundant array of independent disks (RAID array), the method comprising:
detecting vibration or impact to which the RAID array is subjected;
when it is detected that the RAID array is subjected to vibration or impact, judging whether the vibration or impact is greater than a first threshold;
if the vibration or impact is not greater than the first threshold, managing the RAID array with a first management strategy;
if the vibration or impact is greater than the first threshold, calling a second management strategy to manage the RAID array;
after the second management strategy is called to manage the RAID array, judging whether any hard disk in the RAID array fails to respond under the second management strategy; and
if a particular hard disk in the RAID array does not respond under the second management strategy, taking that hard disk offline from the RAID array,
wherein the performance expectation for the RAID array in the second management strategy is lower than the performance expectation for the RAID array in the first management strategy.
2. The method of claim 1, further comprising:
if all hard disks of the RAID array respond under the second management strategy, continuing to manage the RAID array according to the second management strategy.
3. The method of claim 1, wherein the time spent waiting for the RAID array to respond when the second management strategy is used to manage the RAID array is greater than the time spent waiting for the RAID array to respond when the first management strategy is used to manage the RAID array.
4. The method of claim 1, wherein the number of retries on the RAID array when the second management strategy is used to manage the RAID array is greater than the number of retries on the RAID array when the first management strategy is used to manage the RAID array.
5. The method of claim 1, wherein detecting the vibration or impact to which the RAID array is subjected comprises using an acceleration sensor to detect, at a certain frequency, the vibration or impact to which the RAID array is subjected.
6. A device for reducing faults of a RAID array, the device comprising:
a detecting unit configured to detect vibration or impact to which the RAID array is subjected;
a first judging unit configured to judge, when it is detected that the RAID array is subjected to vibration or impact, whether the vibration or impact is greater than a first threshold;
a first management unit configured to manage the RAID array with a first management strategy if the vibration or impact is not greater than the first threshold;
a calling unit configured to call a second management strategy to manage the RAID array if the vibration or impact is greater than the first threshold;
a second judging unit configured to judge, after the second management strategy is called to manage the RAID array, whether any hard disk in the RAID array fails to respond under the second management strategy; and
a fault processing unit configured to take a particular hard disk offline from the RAID array if that hard disk does not respond under the second management strategy,
wherein the performance expectation for the RAID array in the second management strategy is lower than the performance expectation for the RAID array in the first management strategy.
7. The device of claim 6, further comprising:
a second management unit configured to continue managing the RAID array according to the second management strategy if all hard disks of the RAID array respond under the second management strategy.
8. The device of claim 6, wherein the time spent waiting for the RAID array to respond when the second management strategy is used to manage the RAID array is greater than the time spent waiting for the RAID array to respond when the first management strategy is used to manage the RAID array.
9. The device of claim 6, wherein the number of retries on the RAID array when the second management strategy is used to manage the RAID array is greater than the number of retries on the RAID array when the first management strategy is used to manage the RAID array.
10. The device of claim 6, wherein the detecting unit is further configured to use an acceleration sensor to detect, at a certain frequency, the vibration or impact to which the RAID array is subjected.
11. A data storage system, comprising:
a RAID array for storing data;
an acceleration sensor for sensing vibration or impact to which the RAID array is subjected; and
a RAID array fault reduction device, comprising:
a detecting unit configured to detect vibration or impact to which the RAID array is subjected;
a first judging unit configured to judge, when it is detected that the RAID array is subjected to vibration or impact, whether the vibration or impact is greater than a first threshold;
a first management unit configured to manage the RAID array with a first management strategy if the vibration or impact is not greater than the first threshold;
a calling unit configured to call a second management strategy to manage the RAID array if the vibration or impact is greater than the first threshold;
a second judging unit configured to judge, after the second management strategy is called to manage the RAID array, whether any hard disk in the RAID array fails to respond under the second management strategy; and
a fault processing unit configured to take a particular hard disk offline from the RAID array if that hard disk does not respond under the second management strategy,
wherein the performance expectation for the RAID array in the second management strategy is lower than the performance expectation for the RAID array in the first management strategy.
12. The data storage system of claim 11, wherein the fault reduction device further comprises:
a second management unit configured to continue managing the RAID array according to the second management strategy if all hard disks of the RAID array respond under the second management strategy.
13. The data storage system of claim 11, wherein the time spent waiting for the RAID array to respond when the second management strategy is used to manage the RAID array is greater than the time spent waiting for the RAID array to respond when the first management strategy is used to manage the RAID array.
14. The data storage system of claim 11, wherein the number of retries on the RAID array when the second management strategy is used to manage the RAID array is greater than the number of retries on the RAID array when the first management strategy is used to manage the RAID array.
15. The data storage system of claim 11, wherein the detecting unit is further configured to use an acceleration sensor to detect, at a certain frequency, the vibration or impact to which the RAID array is subjected.
CN201410505314.7A 2014-09-26 2014-09-26 A method of reducing raid-array failure Active CN105511972B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410505314.7A CN105511972B (en) 2014-09-26 2014-09-26 A method of reducing raid-array failure

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410505314.7A CN105511972B (en) 2014-09-26 2014-09-26 A method of reducing raid-array failure

Publications (2)

Publication Number Publication Date
CN105511972A true CN105511972A (en) 2016-04-20
CN105511972B CN105511972B (en) 2018-12-14

Family

ID=55719976

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410505314.7A Active CN105511972B (en) 2014-09-26 2014-09-26 A method of reducing raid-array failure

Country Status (1)

Country Link
CN (1) CN105511972B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040260967A1 (en) * 2003-06-05 2004-12-23 Copan Systems, Inc. Method and apparatus for efficient fault-tolerant disk drive replacement in raid storage systems
CN1959647A (en) * 2005-11-04 2007-05-09 英业达股份有限公司 Method for establishing stable memory mechanism
CN101196845A (en) * 2007-12-17 2008-06-11 山东超越数控电子有限公司 Hard disk protection method
CN103455448A (en) * 2012-05-30 2013-12-18 鸿富锦精密工业(深圳)有限公司 Shake resisting method and system for server

Also Published As

Publication number Publication date
CN105511972B (en) 2018-12-14

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant