
US Patent No: 5,787,243
Number of patents in Portfolio can not be more than 2000
Main memory system and checkpointing protocol for fault-tolerant computer system
Stats
-
Jul 28, 1998
Issued date -
Jul 2, 1996
filing date -
08/674,660
serial no -
In Force
status
Importance
Abstract
A mechanism for maintaining a consistent state in main memory without constraining normal computer operation is provided, thereby enabling a computer system to recover from faults without loss of data or processing continuity. In a typical computer system, a processor and input/output elements are connected to a main memory via a memory bus. A shadow memory element, which includes a buffer memory and a main storage element, is also attached to this memory bus. During normal processing, data written to primary memory is also captured by the buffer memory of the shadow memory element. When a checkpoint is desired (thereby establishing a consistent state in main memory to which all executing applications can safely return following a fault), the data previously captured in the buffer memory is then copied to the main storage element of the shadow memory element. This structure and protocol can guarantee a consistent state in main memory, thus enabling fault-tolerant operation.
First Claim
Related Publications
International Classification(s)
- [Classification Symbol]
- [Patents Count]
Cited Art
| Patent Info | (Count) | # Cites | Year |
|---|---|---|---|
|
|
|||
| 4,435,762 Buffered peripheral subsystems | 100 | 1981 | |
| 4,823,261 Multiprocessor system for updating status information through flip-flopping read version and write version of checkpoint data | 56 | 1986 | |
| 4,958,273 Multiprocessor system architecture with high availability | 97 | 1987 | |
| 4,965,719 Method for lock management, page coherency, and asynchronous writing of changed pages to shared external store in a distributed computing system | 110 | 1988 | |
| 4,924,466 Direct hardware error identification method and apparatus for error recovery in pipelined processing areas of a computer system | 35 | 1988 | |
| 5,325,517 Fault tolerant data processing system | 66 | 1989 | |
| 5,327,532 Coordinated sync point management of protected resources | 24 | 1990 | |
| 5,418,916 Central processing unit checkpoint retry for store-in and store-through cache systems | 52 | 1990 | |
| 5,235,700 Checkpointing mechanism for fault-tolerant systems | 74 | 1991 | |
| 5,214,652 Alternate processor continuation of task of failed processor | 37 | 1991 | |
| 5,276,848 Shared two level cache including apparatus for maintaining storage consistency | 122 | 1991 | |
| 5,269,017 Type 1, 2 and 3 retry and checkpointing | 74 | 1991 | |
| 5,293,613 Recovery control register | 28 | 1991 | |
| 5,394,542 Clearing data objects used to maintain state information for shared data at a local complex when at least one message path to the local complex cannot be recovered | 21 | 1992 | |
| 5,398,331 Shared storage controller for dual copy shared data | 87 | 1992 | |
| 5,485,585 Personal computer with alternate system controller and register for identifying active system controller | 9 | 1992 | |
| 5,418,940 Method and means for detecting partial page writes and avoiding initializing new pages on DASD in a transaction management system environment | 31 | 1993 | |
| 5,568,380 Shadow register file for instruction rollback | 74 | 1993 | |
| 5,504,861 Remote data duplexing | 134 | 1994 | |
| 5,566,297 Non-disruptive recovery from file server failure in a highly available file system for clustered computing environments | 134 | 1994 | |
| 5,495,587 Method for processing checkpoint instructions to allow concurrent execution of overlapping instructions | 13 | 1994 | |
| 5,463,733 Failure recovery apparatus and method for distributed processing shared resource control | 26 | 1994 | |
| 5,495,590 Checkpoint synchronization with instruction overlap enabled | 25 | 1995 | |
|
|
|||
| 5,239,637 Digital data management system for maintaining consistency of data in a shadow set | 49 | 1989 | |
| 5,263,144 Method and apparatus for sharing data between processors in a computer system | 20 | 1990 | |
| 5,247,618 Transferring data in a digital data processing system | 25 | 1992 | |
| 5,448,719 Method and apparatus for maintaining and retrieving live data in a posted write cache in case of power failure | 106 | 1992 | |
| 5,488,716 Fault tolerant computer system with shadow virtual processor | 80 | 1994 | |
| 5,408,636 System for flushing first and second caches upon detection of a write operation to write protected areas | 21 | 1994 | |
|
|
|||
| 4,373,179 Dynamic address translation system | 45 | 1978 | |
| 5,123,099 Hot standby memory copy system | 56 | 1988 | |
| 5,530,801 Data storing apparatus and method for a data processing system | 20 | 1994 | |
| 5,644,742 Processor structure and method for a time-out checkpoint | 52 | 1995 | |
| 5,649,136 Processor structure and method for maintaining and restoring precise state at any instruction boundary | 62 | 1995 | |
|
|
|||
| 4,484,273 Modular computer system | 130 | 1982 | |
| 4,654,819 Memory back-up system | 181 | 1985 | |
| 4,819,154 Memory back up system with one cache memory and two physically separated main memories | 122 | 1986 | |
|
|
|||
| 4,819,232 Fault-tolerant multiprocessor arrangement | 35 | 1986 | |
| 4,905,196 Method and storage device for saving the computer status during interrupt | 38 | 1987 | |
|
|
|||
| 4,703,481 Method and apparatus for fault recovery within a computing system | 87 | 1985 | |
| 4,740,969 Method and apparatus for recovering from hardware faults | 68 | 1986 | |
|
|
|||
| 5,301,309 Distributed processing system with checkpoint restart facilities wherein checkpoint data is updated only if all processors were able to collect new checkpoint data | 44 | 1990 | |
| 5,420,996 Data processing system having selective data save and address translation mechanism utilizing CPU idle period | 9 | 1991 | |
|
|
|||
| 4,459,658 Technique for enabling operation of a computer system with a consistent state of a linked list data structure after a main memory failure | 63 | 1982 | |
| 4,751,639 Virtual command rollback in a fault tolerant data processing system | 68 | 1985 | |
|
|
|||
| 5,271,013 Fault tolerant computer system | 82 | 1990 | |
| 5,363,503 Fault tolerant computer system with provision for handling external events | 28 | 1992 | |
|
|
|||
| 4,959,774 Shadow memory system for storing variable backup blocks in consecutive time periods | 164 | 1989 | |
|
|
|||
| 5,583,987 Method and apparatus for initializing a multiprocessor system while resetting defective CPU's detected during operation thereof | 56 | 1994 | |
|
|
|||
| 4,941,087 System for bumpless changeover between active units and backup units by establishing rollback points and logging write and read operations | 51 | 1987 | |
|
|
|||
| 5,530,946 Processor failure detection and recovery circuit in a dual processor computer system and method of operation thereof | 99 | 1994 | |
|
|
|||
| 5,649,152 Method and system for providing a static snapshot of data stored on a mass storage system | 195 | 1994 | |
|
|
|||
| 4,393,500 Method of modifying data stored in non-volatile memory and testing for power failure occurring during modification | 34 | 1980 | |
|
|
|||
| 4,426,682 Fast cache flush mechanism | 50 | 1981 | |
|
|
|||
| 5,381,544 Copyback memory system and cache memory controller which permits access while error recovery operations are performed | 11 | 1992 | |
|
|
|||
| 4,996,687 Fault recovery mechanism, transparent to digital system function | 37 | 1988 | |
|
|
|||
| 4,734,855 Apparatus and method for fast and stable data storage | 14 | 1985 | |
|
|
|||
| 4,503,534 Apparatus for redundant operation of modules in a multiprocessing system | 39 | 1982 | |
|
|
|||
| 5,313,647 Digital data processor with improved checkpointing and forking | 64 | 1991 | |
|
|
|||
| 4,912,707 Checkpoint retry mechanism | 93 | 1988 | |
|
|
|||
| 5,630,047 Method for software error recovery using consistent global checkpoints | 80 | 1995 | |
|
|
|||
| 5,557,735 Communication system for a network and method for configuring a controller in a communication network | 8 | 1994 | |
|
|
|||
| 5,157,663 Fault tolerant computer system | 284 | 1990 | |
|
|
|||
| 5,369,757 Recovery logging in the presence of snapshot files by ordering of buffer pool flushing | 156 | 1991 | |
|
|
|||
| 4,566,106 Electronic postage meter having redundant memory | 24 | 1985 | |
|
|
|||
| 5,408,649 Distributed data access system including a plurality of database access processors with one-for-N redundancy | 103 | 1993 | |
|
|
|||
| 4,817,091 Fault-tolerant multiprocessor system | 132 | 1987 | |
|
|
|||
| 4,228,496 Multiprocessor system | 296 | 1976 | |
|
|
|||
| 4,403,284 Microprocessor which detects leading 1 bit of instruction to obtain microcode entry point address | 34 | 1980 | |
|
|
|||
| 5,325,519 Fault tolerant computer with archival rollback capabilities | 84 | 1991 | |
|
|
|||
| 4,413,327 Radiation circumvention technique | 36 | 1970 | |
|
|
|||
| 5,574,874 Method for implementing a checkpoint between pairs of memory locations using two indicators to indicate the status of each associated pair of memory locations | 24 | 1995 | |
|
|
|||
| 4,509,554 High and low pressure, quick-disconnect coupling | 27 | 1983 | |