
US Patent No: 5,864,657
Main memory system and checkpointing protocol for fault-tolerant computer system
Stats
- Issued date: Jan 26, 1999
- Filing date: Nov 29, 1995
- Serial no: 08/564,021
- Status: In Force
Abstract
A mechanism for maintaining a consistent, periodically updated state in main memory without constraining normal computer operation is provided, thereby enabling a computer system to recover from faults without loss of data or processing continuity. In a typical computer system, a processor and input/output elements are connected to a main memory subsystem. A checkpoint memory element, which may include one or more buffer memories and a shadow memory, is also appended to this main memory subsystem. During normal processing, an image of data written to primary memory is captured by the checkpoint memory element. When a new checkpoint is desired, thereby establishing a consistent state in main memory to which all executing applications can safely return following a fault, the data previously captured is used to establish that checkpoint. This structure and protocol can guarantee a consistent state in main memory, thus enabling fault-tolerant operation.
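The protocol in the abstract can be illustrated with a small software model. This is only a minimal sketch under stated assumptions, not the patented hardware design: the class name `CheckpointedMemory`, its method names, and the use of a plain list as the buffer are all hypothetical choices made for illustration. It shows the three ideas the abstract names: writes to primary memory are captured as they happen, a checkpoint applies the captured writes to a shadow copy, and recovery returns primary memory to the last checkpointed state.

```python
class CheckpointedMemory:
    """Toy model (hypothetical): primary memory, a write buffer, and a shadow copy."""

    def __init__(self, size):
        self.primary = [0] * size   # memory the processor reads and writes
        self.shadow = [0] * size    # consistent state as of the last checkpoint
        self.buffer = []            # (address, value) writes captured since then

    def write(self, addr, value):
        # Normal processing: update primary memory and capture an image of the write.
        self.primary[addr] = value
        self.buffer.append((addr, value))

    def read(self, addr):
        return self.primary[addr]

    def checkpoint(self):
        # Apply captured writes in order, so the shadow matches primary memory.
        for addr, value in self.buffer:
            self.shadow[addr] = value
        self.buffer.clear()

    def recover(self):
        # After a fault: discard uncheckpointed writes and restore the shadow state.
        self.buffer.clear()
        self.primary = list(self.shadow)


mem = CheckpointedMemory(4)
mem.write(0, 11)
mem.write(1, 22)
mem.checkpoint()      # the state [11, 22, 0, 0] is now safe to return to
mem.write(0, 99)      # written after the checkpoint, never committed
mem.recover()         # simulated fault: roll back to the checkpoint
state_after_recovery = mem.primary
```

In this model the write that happened after the checkpoint is lost on recovery, while everything committed by the checkpoint survives. Normal reads and writes never wait on the shadow copy, which loosely mirrors the abstract's point that checkpointing should not constrain normal operation.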
Cited Art
| Patent Info | # Cites | Year | |
|---|---|---|---|
| 4,435,762 Buffered peripheral subsystems | 100 | 1981 | |
| 4,823,261 Multiprocessor system for updating status information through flip-flopping read version and write version of checkpoint data | 56 | 1986 | |
| 4,958,273 Multiprocessor system architecture with high availability | 97 | 1987 | |
| 4,965,719 Method for lock management, page coherency, and asynchronous writing of changed pages to shared external store in a distributed computing system | 110 | 1988 | |
| 4,924,466 Direct hardware error identification method and apparatus for error recovery in pipelined processing areas of a computer system | 35 | 1988 | |
| 5,325,517 Fault tolerant data processing system | 66 | 1989 | |
| 5,327,532 Coordinated sync point management of protected resources | 24 | 1990 | |
| 5,418,916 Central processing unit checkpoint retry for store-in and store-through cache systems | 52 | 1990 | |
| 5,235,700 Checkpointing mechanism for fault-tolerant systems | 74 | 1991 | |
| 5,214,652 Alternate processor continuation of task of failed processor | 37 | 1991 | |
| 5,276,848 Shared two level cache including apparatus for maintaining storage consistency | 122 | 1991 | |
| 5,269,017 Type 1, 2 and 3 retry and checkpointing | 74 | 1991 | |
| 5,293,613 Recovery control register | 28 | 1991 | |
| 5,394,542 Clearing data objects used to maintain state information for shared data at a local complex when at least one message path to the local complex cannot be recovered | 21 | 1992 | |
| 5,398,331 Shared storage controller for dual copy shared data | 87 | 1992 | |
| 5,485,585 Personal computer with alternate system controller and register for identifying active system controller | 9 | 1992 | |
| 5,418,940 Method and means for detecting partial page writes and avoiding initializing new pages on DASD in a transaction management system environment | 31 | 1993 | |
| 5,568,380 Shadow register file for instruction rollback | 74 | 1993 | |
| 5,504,861 Remote data duplexing | 134 | 1994 | |
| 5,566,297 Non-disruptive recovery from file server failure in a highly available file system for clustered computing environments | 134 | 1994 | |
| 5,495,587 Method for processing checkpoint instructions to allow concurrent execution of overlapping instructions | 13 | 1994 | |
| 5,463,733 Failure recovery apparatus and method for distributed processing shared resource control | 26 | 1994 | |
| 5,495,590 Checkpoint synchronization with instruction overlap enabled | 25 | 1995 | |
| 4,373,179 Dynamic address translation system | 45 | 1978 | |
| 5,123,099 Hot standby memory copy system | 56 | 1988 | |
| 5,530,801 Data storing apparatus and method for a data processing system | 20 | 1994 | |
| 5,644,742 Processor structure and method for a time-out checkpoint | 52 | 1995 | |
| 5,649,136 Processor structure and method for maintaining and restoring precise state at any instruction boundary | 62 | 1995 | |
| 5,239,637 Digital data management system for maintaining consistency of data in a shadow set | 49 | 1989 | |
| 5,263,144 Method and apparatus for sharing data between processors in a computer system | 20 | 1990 | |
| 5,247,618 Transferring data in a digital data processing system | 25 | 1992 | |
| 5,488,716 Fault tolerant computer system with shadow virtual processor | 80 | 1994 | |
| 5,408,636 System for flushing first and second caches upon detection of a write operation to write protected areas | 21 | 1994 | |
| 4,484,273 Modular computer system | 130 | 1982 | |
| 4,654,819 Memory back-up system | 181 | 1985 | |
| 4,819,154 Memory back up system with one cache memory and two physically separated main memories | 122 | 1986 | |
| 4,819,232 Fault-tolerant multiprocessor arrangement | 35 | 1986 | |
| 4,905,196 Method and storage device for saving the computer status during interrupt | 38 | 1987 | |
| 4,703,481 Method and apparatus for fault recovery within a computing system | 87 | 1985 | |
| 4,740,969 Method and apparatus for recovering from hardware faults | 68 | 1986 | |
| 5,301,309 Distributed processing system with checkpoint restart facilities wherein checkpoint data is updated only if all processors were able to collect new checkpoint data | 44 | 1990 | |
| 5,420,996 Data processing system having selective data save and address translation mechanism utilizing CPU idle period | 9 | 1991 | |
| 4,459,658 Technique for enabling operation of a computer system with a consistent state of a linked list data structure after a main memory failure | 63 | 1982 | |
| 4,751,639 Virtual command rollback in a fault tolerant data processing system | 68 | 1985 | |
| 5,271,013 Fault tolerant computer system | 82 | 1990 | |
| 5,363,503 Fault tolerant computer system with provision for handling external events | 28 | 1992 | |
| 4,959,774 Shadow memory system for storing variable backup blocks in consecutive time periods | 164 | 1989 | |
| 5,583,987 Method and apparatus for initializing a multiprocessor system while resetting defective CPU's detected during operation thereof | 56 | 1994 | |
| 4,941,087 System for bumpless changeover between active units and backup units by establishing rollback points and logging write and read operations | 51 | 1987 | |
| 5,530,946 Processor failure detection and recovery circuit in a dual processor computer system and method of operation thereof | 99 | 1994 | |
| 5,649,152 Method and system for providing a static snapshot of data stored on a mass storage system | 195 | 1994 | |
| 4,393,500 Method of modifying data stored in non-volatile memory and testing for power failure occurring during modification | 34 | 1980 | |
| 4,426,682 Fast cache flush mechanism | 50 | 1981 | |
| 5,381,544 Copyback memory system and cache memory controller which permits access while error recovery operations are performed | 11 | 1992 | |
| 4,996,687 Fault recovery mechanism, transparent to digital system function | 37 | 1988 | |
| 4,734,855 Apparatus and method for fast and stable data storage | 14 | 1985 | |
| 4,503,534 Apparatus for redundant operation of modules in a multiprocessing system | 39 | 1982 | |
| 5,313,647 Digital data processor with improved checkpointing and forking | 64 | 1991 | |
| 4,912,707 Checkpoint retry mechanism | 93 | 1988 | |
| 5,630,047 Method for software error recovery using consistent global checkpoints | 80 | 1995 | |
| 4,964,126 Fault tolerant signal processing machine and method | 54 | 1988 | |
| 5,557,735 Communication system for a network and method for configuring a controller in a communication network | 8 | 1994 | |
| 5,157,663 Fault tolerant computer system | 284 | 1990 | |
| 5,369,757 Recovery logging in the presence of snapshot files by ordering of buffer pool flushing | 156 | 1991 | |
| 4,590,554 Backup fault tolerant computer system | 112 | 1982 | |
| 4,566,106 Electronic postage meter having redundant memory | 24 | 1985 | |
| 5,408,649 Distributed data access system including a plurality of database access processors with one-for-N redundancy | 103 | 1993 | |
| 4,817,091 Fault-tolerant multiprocessor system | 132 | 1987 | |
| 4,228,496 Multiprocessor system | 296 | 1976 | |
| 4,403,284 Microprocessor which detects leading 1 bit of instruction to obtain microcode entry point address | 34 | 1980 | |
| 5,325,519 Fault tolerant computer with archival rollback capabilities | 84 | 1991 | |
| 4,413,327 Radiation circumvention technique | 36 | 1970 | |
| 5,574,874 Method for implementing a checkpoint between pairs of memory locations using two indicators to indicate the status of each associated pair of memory locations | 24 | 1995 | |
| 5,488,719 System for categorizing character strings using acceptability and category information contained in ending substrings | 36 | 1991 | |