Over the past few years we have been developing a program to assist the physician in reasoning about the diagnosis and management of patients with cardiovascular disease characterized by manifestations of heart failure[16]. Since heart failure just means that the disease process makes cardiac output inadequate for the demands of the body, there are many possible causes. This domain is particularly rich in opportunities to reason from causal models because the manifestations are primarily the result of the compensatory mechanisms of the cardiovascular system. When cardiac output becomes inadequate, the system alters the capacitance and volume of fluid compartments to increase the heart input pressure (preload) in an attempt to increase cardiac output. This increased preload, propagated back to the lungs and venous system may lead to the pulmonary congestion or peripheral edema clinically recognized as heart failure. Since a number of disease states can produce the same general picture, the determination of the source of the problem in a particular patient fits very naturally into a paradigm of causal reasoning - in this case linking causes to observed effects to produce a causal explanation. Similarly, reasoning about the effects of an intervention involves causal reasoning because the compensatory mechanisms that produced the manifestations are also affected by the interventions and the overall result depends on how these mechanisms change the physiologic state of the patient.
Reasoning about the causes or effects of the patient state involves reasoning about the mechanisms of the cardiovascular system and the pathophysiology of the diseases. The mechanisms are sufficiently important that we have organized the reasoning of the Heart Failure Program around an integrated physiologic model of the cardiovascular system. The model has probabilistic causal relations and a case base for reasoning diagnostically and quantitative relations for predicting the effects of therapy. This model acts as the organizing structure for the various reasoning operators. The operators use the appropriate part of the model as the knowledge base of medical knowledge to carry out their function and as a template for building the patient specific model (PSM) that records the accumulating conclusions about the patient. The organization of the knowledge base and the operators is sketched in figure 1. The operators are identified by boxes and the data they use and produce are the ovals. The three parts of the model are the ovals with heavy borders. This diagram identifies the main interactions but is not intended to be complete. For example, the diagnostic causal model is actually used to some extent by all of the operators.
Figure 1 goes here.
When the program is used to reason about a patient, the diagnostic causal model is the repository for the knowledge to understand the findings. As input data is gathered into the initial PSM by the evidence generator, the model is used to organize and evaluate that data as evidence for the physiologic state of the patient. Some of the input data can be used directly to assert aspects of the physiologic state of the patient. The state determining operator uses logical implication to find such instances and assert them as part of the PSM. However, most aspects of the patient state remain unknown, with multiple possible explanations for the findings.
The next step is reasoning about a differential diagnosis. There are two operators that can do this. The first, the probabilistic diagnostic reasoner, uses the findings and known states in the PSM to identify the likely overall causes and then builds suitable causal explanations for the findings from those causes. These causal hypotheses can be compared by computing their probabilities. The set of ranked causal hypotheses is the differential diagnosis for the case.
The second method of diagnosis is the case-based diagnosis program, CASEY, developed by Phyllis Koton[10][9]. This makes use of the data base of cases to find a close match to the findings and then uses the physiologic model to resolve the differences, if they are minor. This produces a diagnosis, but depends on there being a close match to the case in the case base.
The differential diagnosis presents alternate explanations for the same findings. Even a single diagnostic hypothesis leaves many physiologic parameter states unknown because there is no need to hypothesize a state to explain the findings. However, it is often important to know the state of these parameters to manage the patient. The diagnosis refinement part of the information suggestion operator uses individual hypotheses and the differential to determine what further input data would be useful for refining the diagnosis.
Once the gaps in the causal disease description have been filled, the next step is to select therapies to manage the patient. This is a two stage process. First, the information suggestion operator looks through the PSM and diagnosis and determines what therapies might be appropriate. Since many of the diseases that cause heart failure are not themselves curable (short of a heart transplant), the appropriate approach is to look for therapies that have the potential to break the causal chains that are producing the undesirable effects.
Since the desired effects of therapies may not be the only effects and even the desired effects may trigger multiple compensatory mechanisms, the second stage of therapy selection is to predict the overall effect of the therapies. This is done using quantitative constraint equations that relate the values of the parameters and using signal flow analysis to predict the effects of changes in the parameter values. Thus, the therapy prediction operator is able to predict the likely changes in the major parameters in the model given a change in one or more of the possible therapies.
The final operator produces the explanations. This provides graphical explanations for both disease hypotheses and therapy effects. The method for explaining a disease hypothesis is simply to graph the causal relations among the physiologic state nodes and the abnormal input findings in the hypothesis. This displays all of the causal mechanisms involved in producing the input findings and is an effective way of reviewing the hypothesis for logical consistency. Explaining the therapy predictions is done by identifying the pathways through the model that have the greatest influence on the predicted change of a parameter of interest. In the process, the explanation identifies parameters that are important in determining the change.
Thus, these operators work together with the various aspects of the knowledge base to provide mechanisms to assist the physician in the diagnosis of the patient, the refinement of that diagnosis, the selection of therapy, predicting what the therapy might do, and understanding the justifications for the reasoning.
The Heart Failure Program is still in an active state of development, with some parts more thoroughly developed than others. The diagnosis algorithm has been developed over a couple of years and has been applied to a hundred or more cases with considerable success. The therapy prediction algorithm has also undergone considerable development, although the testing has been less extensive, primarily because it is difficult to collect suitable cases to really test it. The case based reasoner has been tested with a case base of forty cases and collection efforts have been completed to test it on a case set of 240 cases. The information and therapy selection operators are simpler and have not been tested in any formal way. The program is implemented on a Symbolics 3650 in Lisp.
This paper will discuss the structure of the knowledge base and then discuss the different operators, how they make use of the information in the knowledge base, and how they interface with one another.