FR2602891B1 - Error correction system of a multiprocessor system for correcting an error in a processor by placing the processor in a check condition after completion of the microprogram restart from a checkpoint
- Publication number
- FR2602891B1 (application FR8711676A, also written FR878711676A)
- Authority
- FR
- France
- Prior art keywords
- processor
- error
- restart
- microprogram
- instruction
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/16—Error detection or correction of the data by redundancy in hardware
- G06F11/20—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
- G06F11/202—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
- G06F11/2023—Failover techniques
- G06F11/2028—Failover techniques eliminating a faulty processor or activating a spare
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/14—Error detection or correction of the data by redundancy in operation
- G06F11/1402—Saving, restoring, recovering or retrying
- G06F11/1405—Saving, restoring, recovering or retrying at machine instruction level
- G06F11/1407—Checkpointing the instruction stream
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/16—Error detection or correction of the data by redundancy in hardware
- G06F11/20—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
- G06F11/202—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
- G06F11/2038—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant with a single idle spare processing component
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/16—Error detection or correction of the data by redundancy in hardware
- G06F11/20—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
- G06F11/202—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
- G06F11/2043—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant where the redundant components share a common memory address space
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/16—Error detection or correction of the data by redundancy in hardware
- G06F11/20—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/16—Error detection or correction of the data by redundancy in hardware
- G06F11/20—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
- G06F11/202—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
- G06F11/2023—Failover techniques
- G06F11/2025—Failover techniques using centralised failover control functionality
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/22—Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Quality & Reliability (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Retry When Errors Occur (AREA)
- Hardware Redundancy (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP19354986 | 1986-08-18 | | |
JP19355086 | 1986-08-18 | | |
Publications (2)
Publication Number | Publication Date |
---|---|
FR2602891A1 (fr) | 1988-02-19 |
FR2602891B1 (fr) | 1990-12-07 |
Family
ID=26507941
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR878711676A (FR2602891B1, Expired - Fee Related) | 1986-08-18 | 1987-08-18 | Error correction system of a multiprocessor system for correcting an error in a processor by placing the processor in a check condition after completion of the microprogram restart from a checkpoint |
Country Status (2)
Country | Link |
---|---|
US (1) | US4852092A (fr) |
FR (1) | FR2602891B1 (fr) |
Families Citing this family (32)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4926320A (en) * | 1987-04-07 | 1990-05-15 | Nec Corporation | Information processing system having microprogram-controlled type arithmetic processing unit |
US5065311A (en) * | 1987-04-20 | 1991-11-12 | Hitachi, Ltd. | Distributed data base system of composite subsystem type, and method fault recovery for the system |
US5043866A (en) * | 1988-04-08 | 1991-08-27 | International Business Machines Corporation | Soft checkpointing system using log sequence numbers derived from stored data pages and log records for database recovery |
US4912707A (en) * | 1988-08-23 | 1990-03-27 | International Business Machines Corporation | Checkpoint retry mechanism |
KR930001056B1 (ko) * | 1988-11-10 | 1993-02-13 | 미쯔비시덴끼 가부시끼가이샤 | Hardware sorter unit and error collection method thereof |
JPH07113898B2 (ja) * | 1989-05-09 | 1995-12-06 | 株式会社日立製作所 | Fault detection system |
US5153881A (en) * | 1989-08-01 | 1992-10-06 | Digital Equipment Corporation | Method of handling errors in software |
US5247447A (en) * | 1990-10-31 | 1993-09-21 | The Boeing Company | Exception processor system |
US5214652A (en) * | 1991-03-26 | 1993-05-25 | International Business Machines Corporation | Alternate processor continuation of task of failed processor |
EP0529303A3 (en) * | 1991-08-29 | 1993-09-22 | International Business Machines Corporation | Checkpoint synchronization with instruction overlap enabled |
US5321698A (en) * | 1991-12-27 | 1994-06-14 | Amdahl Corporation | Method and apparatus for providing retry coverage in multi-process computer environment |
GB2263988B (en) * | 1992-02-04 | 1996-05-22 | Digital Equipment Corp | Work flow management system and method |
JPH05314075A (ja) * | 1992-05-07 | 1993-11-26 | Nec Corp | Online computer device |
US5715386A (en) * | 1992-09-30 | 1998-02-03 | Lucent Technologies Inc. | Apparatus and methods for software rejuvenation |
CA2106280C (fr) * | 1992-09-30 | 2000-01-18 | Yennun Huang | Apparatus and methods for fault-tolerant processing using a daemon monitoring process and a fault-tolerant library to provide varying degrees of fault tolerance |
US5504859A (en) * | 1993-11-09 | 1996-04-02 | International Business Machines Corporation | Data processor with enhanced error recovery |
JP2685712B2 (ja) * | 1994-03-30 | 1997-12-03 | 株式会社サンポウロック | Handle lock |
JP2774770B2 (ja) * | 1994-05-19 | 1998-07-09 | 株式会社東芝 | Debugging system |
US5551043A (en) * | 1994-09-07 | 1996-08-27 | International Business Machines Corporation | Standby checkpoint to prevent data loss |
CA2152329C (fr) * | 1994-09-08 | 1999-02-09 | N Dudley Fulton Iii | Apparatus and methods for software rejuvenation |
US5630047A (en) * | 1995-09-12 | 1997-05-13 | Lucent Technologies Inc. | Method for software error recovery using consistent global checkpoints |
US5678003A (en) * | 1995-10-20 | 1997-10-14 | International Business Machines Corporation | Method and system for providing a restartable stop in a multiprocessor system |
US5884021A (en) * | 1996-01-31 | 1999-03-16 | Kabushiki Kaisha Toshiba | Computer system having a checkpoint and restart function |
TW379298B (en) * | 1996-09-30 | 2000-01-11 | Toshiba Corp | Memory updating history saving device and memory updating history saving method |
JP3648892B2 (ja) * | 1996-12-16 | 2005-05-18 | 富士通株式会社 | Computer system |
US6115829A (en) * | 1998-04-30 | 2000-09-05 | International Business Machines Corporation | Computer system with transparent processor sparing |
US6189112B1 (en) * | 1998-04-30 | 2001-02-13 | International Business Machines Corporation | Transparent processor sparing |
DE102004037713A1 (de) * | 2004-08-04 | 2006-03-16 | Robert Bosch Gmbh | Method, operating system and computing device for executing a computer program |
US8090996B2 (en) * | 2006-03-31 | 2012-01-03 | Intel Corporation | Detecting soft errors via selective re-execution |
US20140223062A1 (en) * | 2013-02-01 | 2014-08-07 | International Business Machines Corporation | Non-authorized transaction processing in a multiprocessing environment |
US9858151B1 (en) | 2016-10-03 | 2018-01-02 | International Business Machines Corporation | Replaying processing of a restarted application |
CN113687986A (zh) * | 2021-08-31 | 2021-11-23 | 上海商汤阡誓科技有限公司 | Recovery method for a chip and a processing unit |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3736566A (en) * | 1971-08-18 | 1973-05-29 | Ibm | Central processing unit with hardware controlled checkpoint and retry facilities |
US4128203A (en) * | 1977-09-01 | 1978-12-05 | Eaton Corporation | Four-port thermally responsive valve |
JPS594054B2 (ja) * | 1979-04-17 | 1984-01-27 | 株式会社日立製作所 | Multiprocessor fault detection system |
AU539418B2 (en) * | 1980-05-05 | 1984-09-27 | Westinghouse Electric Corporation | Automatic fault detection of hardware |
DE3207633A1 (de) * | 1982-02-26 | 1983-09-15 | Siemens AG, 1000 Berlin und 8000 München | Circuit arrangement for monitoring a microprocessor |
JPS5958547A (ja) * | 1982-09-28 | 1984-04-04 | Fujitsu Ltd | Error processing system for a microprogram control device |
US4627054A (en) * | 1984-08-27 | 1986-12-02 | International Business Machines Corporation | Multiprocessor array error detection and recovery apparatus |
US4641305A (en) * | 1984-10-19 | 1987-02-03 | Honeywell Information Systems Inc. | Control store memory read error resiliency method and apparatus |
1987
- 1987-08-18: US application US07/086,638, patent US4852092A (not active, Expired - Fee Related)
- 1987-08-18: FR application FR878711676A, patent FR2602891B1 (not active, Expired - Fee Related)
Also Published As
Publication number | Publication date |
---|---|
FR2602891A1 (fr) | 1988-02-19 |
US4852092A (en) | 1989-07-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
FR2602891B1 (fr) | | Error correction system of a multiprocessor system for correcting an error in a processor by placing the processor in a check condition after completion of the microprogram restart from a checkpoint |
US5237700A (en) | | Exception handling processor for handling first and second level exceptions with reduced exception latency |
AU5394590A (en) | | Method and apparatus for detecting and correcting errors in a pipelined computer system |
JPS5462749A (en) | | Multiple information processing system |
JPS5764849A (en) | | Error recovering system of logic device |
CA2062771A1 (fr) | | Apparatus and method for emulating information processing systems |
EP0287295A3 (fr) | | Virtual broadcasting of programmed I/O instructions on multiple I/O bus |
KR900003743A (ko) | | Instruction pipeline type microprocessor |
DE3380643D1 (en) | | Method and apparatus for restoring data in a computing system |
DE69739608D1 (de) | | Host microprocessor with apparatus for temporarily holding the processor state of a target computer |
JPS57164343A (en) | | Check point save system |
EP0325226A3 (fr) | | Information processing apparatus and method |
JPS57178553A (en) | | Multiprocessor system |
JPS5717058A (en) | | Control system of microprogram |
JPH04288625A (ja) | | Microcomputer |
JPS57137949A (en) | | Error recovery system of logical device |
ES464591A1 (es) | | Interrupt handling apparatus in a multiprogram data processing system |
JP2814587B2 (ja) | | Watchdog timer |
JPS625449A (ja) | | Debugging system for an information processing device |
JPH03168844A (ja) | | Program debugging system |
JPS5455338A (en) | | Microprogram controller |
JPS5674758A (en) | | Retry control system of microprogram control processor |
JPS62113238A (ja) | | Program correction device in an electronic computer system |
BORGHESI et al. | | The debugging of real time multiprocessor systems: An integrated solution proposal |
JPS59128649A (ja) | | Microprocessor |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | ST | Notification of lapse | |