5P41_RR00785 13_SUMEX_Annual_Report Year_11_Jun84 5P41 RR00785 13 SUMEX Annual Report Year 11 Jun84

5P41_RR00785-13_SUMEX_Annual_Report-Year_11_Jun84 5P41_RR00785-13_SUMEX_Annual_Report-Year_11_Jun84

User Manual: 5P41_RR00785-13_SUMEX_Annual_Report-Year_11_Jun84

Open the PDF directly: View PDF PDF.
Page Count: 233

Download5P41_RR00785-13_SUMEX_Annual_Report-Year_11_Jun84 5P41 RR00785-13 SUMEX Annual Report-Year 11 Jun84
Open PDF In BrowserView PDF
SUMEX
SUMEX
STANFORD UNIVERSITY
UNIVERSITY
STANFORD
MEDICAL EXPERIMENTAL
EXPERIMENTAL COMPUTER
COMPUTER RESOURCE
RESOURCE
MEDICAL
RR - 00785
00785

ANNUAL REPORT
ANNUAL
REPORT - YEAR 11
Submitted
Submitted to
BIOTECHNOLOGY
BIOTECHNOLOGY RESOURCES
RESOURCES PROGRAM
PROGRAM
NATIONAL
NATIONAL INSTITUTES
INSTITUTES OF
OF HEALTH
HEALTH

June
June 1,1984
1,1984

STANFORD
STANFORD UNIVERSITY
UNIVERSITY SCHOOL
SCHOOL OF MEDICINE
MEDICINE
Edward
Edward A. Feigenbaum,
Feigenbaum, Principal Investigator
Investigator
Edward
Edward H. Shortliffe, Co-Principal
Co-Principal Investigator
Investigator

DEPARTMENT
OF
DEPARTMENT
OF HEALTH
HEALTH AND
AND HUMAN
HUMAN SERVICES
SERVICES
PUBLIC
HEALTH
SERVICE
PUBLIC HEALTH SERVICE
NATIONAL
INSTITUTES
OF
NATIONAL
INSTITUTES
OF HEALTH
HEALTH
OF
DIVISION
DMSION
OF RESEARCH
RESEARCH RESOURCES
RESOURCES
BIOTECHNOLOGY
RESOURCES
BIOTECHNOLOGY
RESOURCES PROGRAM
PROGRAM
ANNUAL
ANNUAL PROGRESS
PROGRESS REPORT
REPORT
PART
PART I.,
I., TITLE
TITLE PAGE
PAGE

1.
1. PHS GRANT
GRANT NUMBER:
NUMBER:

6P41RR00786-11
SP41RR00785-11

2. TITLE
TITLE OF GRANT:
GRANT:

SUMEX
SUMEX
Stanford
University
Medical
Stanford
University
Medical
Experimental
Computer
Resource
Experimental
Computer
Resource

3.
3.

NAME
NAME OF
OF RECIPIENT
RECIPIENT INSTITUTION:
INSTITUTION:

4.
4. HEALTH
HEALTH PROFESSIONAL
PROFESSIONAL SCHOOL:
SCHOOL:

Stanford
University
Stanford
University

School
of Medicine
Medicine
School of

S.
5. REPORTING
REPORTING PERIOD:
PERIOD:
Sa.
5a.
Sb.
5b.

FROM:
FROM:
TO:
TO:

08-01-83
08-01-83
07-31-84
07-31-84

6. PRINCIPAL
PRINCIPAL INVESTIGATOR:
INVESTIGATOR:

6a.
6b.

NAME:
NAME:
TITLE:
TITLE:

6c.

SIGNATURE:
SIG NATURE:

7. DATE
DATE SIGNED:
SIGNED:

8. TELEPHONE:
TELEPHONE:

Edward A.
A. Feigenbaum,
Feigenbaum, Ph.D.
Ph.D.
Edward
Professor of
of Computer
Computer Science
Science
Professor

?%.1
AIM Projects
Projects
II.A.4.1.
Project
11.,4.4.1. PATHFINDER
PATHFINDER
Project
II.A.4.2.
II.A.4.2. RXDX
RXDX Project
Project
II.B.
1I.B. Books, Papers, and Abstracts
Abstracts
H.C.
J1.C. Resollrce
Resource Summary
Summary Table
Table

Appendix
Committee
Membership
Appendix A.
A. AIM
AIM Management
Management
Committee
Membership
Appendix
Growth
Appendix B.
B. Community
Community
Growth and
and Project
Project Abstracts
Abstracts

E. A. Feigenbaum

ii

106
106
121
129
130
130
136
136
142
149
149
159
159
166
166
167
175
179
179
183
183
185
190
191
191
197
202
202
202
202

203
203
209
209

Appendix
Appendix

5P·n
SF’41 RR0078S-11
RR0078S-11

List
List of
of Figures
Figures
Figure
Figure
Figure
Figure
Figure
Figure
Figure
Figure
Figure
Figure
Figure
Figure
Figure
Figure
Figure
Figure
Figure
Figure
Figure
Figure
Figure
Figure
Figure
Figure
Figure
Figure
Figure
Figure
Figure
Figure

1:
2:
3:
3:
4:
4:
5:
5:
6:
6:
7:
8:
9:
10:
10:
11:
11:
12:
12:
13:
13:
14:
14:
15:
15:

Current
Current SUMEX-AIM
SUMEX-AIM Decsystem
Decsystem 2060 Computer
Computer Configuration
Configuration
Current
Current SUMEX-AIM
SUMEX-AIM 2020 Computer
Computer Configuration
Configuration
Current
AX Computer
Current Shared V
VAX
Computer Configuration
Configuration
SUMEX-AIM
SUMEX-AIM Ethernet
Ethernet Configuration
Configuration
SUIV1EX-AIM
SUMEX-AIM File
File Server
Server {SAFE}
{SAFE}
SUt\.tEX-AIM
Development
A....X}
SUMEX-AIM Development Vax
Vax {ARDV
{ARDVAX}
Total
by Month
Total CPU
CPU Time
Time Consumed by
Month
Peak Number
Number of Jobs by
by Month
Month
Peak Load Average
by Month
Average by
Month
Monthly
by Community
Monthly CPU
CPU Usage
Usage by
Community
Monthly
Terminal
Connect
by Community
Monthly Terminal Connect Time
Time by
Community
Cumulative
by Project
Cumulative CPU
CPU Usage
Usage Histogram
Histogram by
Project and Community
Community
TYMNET
TYMNET Terminal
Terminal Connect
Connect Time
Time
ARP
ANET Terminal
ARPANET
Terminal Connect
Connect Time
Time
SUMEX-AIM
SUMEX-AIM Growth
Growth by Community
Community

...
in
Ill

10
10
11
12
13
13
14
15
31
32
32
34
35
37
45
45
209

E. A. Feigenbaum

5P41 RR00785-U
RR00785-11

Annual
Annual Report
Report

I.
Description
I. Narrative
Narrative
Description
This
This third
third year
year of
of the 5-year
5-year renewal
renewal of
of the
the SUMEX
SUMEX resource
resource grant
grant has been an
active year
year not
not only
only for
for the
the SUMEX
SUMEX staff,
staff, but
but for
for the SUMEX-AIM
SUMEX-AIM community
community involved
involved
active
in developing
Successes
developing expert
expert systems.
systems.
Successes in developing
developing such systems,
systems, many
many of
of them
them
stemming
stemming from
from projects
projects in the
the SUMEX-AIM
SUMEX-AIM community,
community, continue
continue to
to stimulate
stimulate strong
strong and
growing
growing interest
interest in AI
AI research
research on many
many educational,
educational, governmental,
governmental, and industrial
industrial fronts.
fronts.

This is an annual
annual report
report for
for the Stanford
Stanford University
University Medical
Medical EXperimental
Experimental
This
computer
computer resource
resource for
for applications
applications of
of Artificial
Artificial Intelligence
Intelligence in Medicine
Medicine (SU1vfEX-AIM)
(SUMEX-AIM).
It
It covers the period
period between
between May
May 1, 1983 and April
April 30, 1984.
This
This third
third year
year of
of the 5-year
5-year renewal
renewal of
of the
the SUMEX
SUMEX resource grant
grant has been an
active
active year
year not
not only
only for
for the
the SUMEX
SUMEX start,
staff, but
but for
for the SUMEX-AIM
SUMEX-AIM community
community involved
involved
Successes
in developing
developing expert
expert systems.
systems.
Successes in developing
developing such systems, many
many of
of them
them
stemming
stemming from
from projects
projects in the
the SUMEX-AIM
SUMEX-AIM community,
community, continue
continue to
to stimulate
stimulate strong
strong and
growing
growing interest
interest in AI
AI research
research on many
many educational,
educational, governmental,
governmental, and industrial
industrial fronts.
fronts.
In addition,
addition, this
this past
past year
year has seen concurrent
concurrent development
development of
of SUMEX-AIM
SUMEX-AIM as a
distributed
distributed scientific
scientific resource.
resource. Our
Our approved
approved project
project goals focus principally
principally on the
merging of
of state-of-the-art
state-of-the-art community
community research
research in biomedical
biomedical AI
AI applications
applications with
with new
merging
computing
computing tools
tools and on the challenges they
they will
will bring
bring to
to the
the SUMEX-AIM
SUMEX-AIM community
community and
resource. The
The SUMEX
SUMEX start
staff continues
continues to
to exploit
exploit these advances in professional
professional
resource.
workstations and communication
communication technology,
technology, while
while at
at the
the same time
time maintaining
maintaining our
our high
high
worksta.tions
standards for
for a computing
computing resource.
resource.
sta.ndards
This
This third
third year
year also saw the initiation
initiation of
of a number
number of
of SUMEX-AIM
SUMEX-AIM pilot
pilot projects.
projects.
pilot projects
projects provide
provide new activities
activities and research
research directions
directions for
for the community
community to
to
These pilot
replace
replace existing
existing projects
projects which
which have matured
matured and moved
moved off
off the
the SUMEX-AIM
SUMEX-AIM resource.
resource.
The
The earlier
earlier phases of
of the SUMEX-AIM
SUMEX-AIM resource were characterized
characterized by
by the building
building
of a3 national
national community
community of
of biomedical
biomedical AI
AI collaborators
collaborators around
around a central
central resource located
located
of
St,nnford University.
University.
Beginning with
with 5 projects
projects in 1973, the AIM
AIM community
community grew
grew to
at Stanford
Beginning
11
11 major
major projects
projects at our
our renewal
renewal in 1978. This
This past
past year
year saw the
the completion
completion of
of two
t.wo long
term
DENDRAL
and PUF'F
jVM. There
term and successful projects
projects on SUMEX-AIM;
SUMEX-AIM;
DENDRAL
PUFF/VM.
There
currently
currently are 13 fully-authorized
fully-authorized projects
projects plus seven pilot
pilot efforts.
efforts.
t-.lany
Many of
of the computer
computer programs
programs under
under development
development by
by these groups
groups are maturing
maturing
into
useful
intro tools
tools increasingly
increaingiy
useful to the respective
respective research or
or clinical
clinical communities.
communities. \Ve
We
continue to
to seek out
out new AI
AI applications
applications in our
our community
community of
of biomedical
biomedical and computer
comput.er
continue
scientists
The
community
scientists who
who interact
interact through
through electronic
electronic media.
media.
The SUMEX-AIM
SUMEX-AIM
community is
beginning to evolve as a highly
highly distributed
distributed resource,
resource, with
with the
the SUMEX
SUMEX staff
staff and computer
computer
beginning
facilit.y
facility serving
serving a.<;
as the
t,he backbone
backbone to electronic
electronic communication
communication and systems
systems support.
support. The
The
community is becoming
becoming more and more
more involved
involved in personal
personal computers
computers and professional
professional
commnnity
workststiors,
more heavily
heavily dependent
dependent on network
network communication
communication facilities
facilities for
for
workstations,
and more
in
teractions, collaborations,
interactions,
collaborations, and sharing.
sharing.

The following
following sections
sections cover
cover the activities
activities of
of the SUMEX-AIM
SUMEX-AIM resource
resource this
this past
The
year, including
including brief
brief summaries
summaries of
of the
the our
our objectives,
objectives, a characterization
characterization of
of biomedical
biomedical AI
AI
year,
rese:1fch,
resesrch, resource organization
organization and operating
operating procedures,
procedures, recent
recent core progress
progress in system
system
development
development and basic AI
AI research, and progress
progress in the collaborative
collaborative projects.
projects.

1

Feigenbaum
E. A. Feigenbaum

5P41
5P41 RR00785-11
RR00785-11

Summary
Summary of
of Research
Research Progress
Progress

I.A.
of
Progress
I.A. Summary
Summary
of Research
Research
Progress
LA.1.
of
and
I.A.1. Overview
Overview
of Objectives
Objectives
and Rationale
Rationale
SUMEX-AIM
SUMEX-AIM ("SUMEX")
(“SUMEX”) isis aa national
national computer
computer resource
resource with
with aa dual mission:
mission: 1)
1)
promoting
promoting applications
applications of
of computer
computer science
science research
research in
in artificial
artificial intelligence
intelligence (AI)
(AI) to
biological
biological and
and medical
medical problems,
problems, and
and 2)
2) demonstrating
demonstrating computer
computer resource sharing
sharing within
within aa
national
community
of
health
research
projects.
The
central
SUMEX-AIM
facility
The central SUMEX-AIM facility is
is
nationa. community
health
projects.
located
physically
in
the
Stanford
University
Medical
located physically
Stanford University Medical School and serves as
as aa nucleus for aa
community
community of medical
medical AI
AI projects
projects at universities
universities around
around the country.
country. SUMEX
SUMEX provides
provides
computing
computing facilities
facilities tuned
tuned to the needs
needs of AI
AI research and communication
communication tools to
facilitate
facilitate remote
remote access,
access, interinter- and intra-group
intra-group contacts,
contacts, and the demonstration
demonstration of
developing
developing computer
computer programs
programs to biomedical
biomedical research collaborators.
collaborators.

I.A.1.1.
is
Intelligence
LA. 1.1. What
What
is Artificial
Artificial
Intelligence
The
The subfield
subfield of computer
computer science known
known as
as Artificial
Artificial Intelligence,
Intelligence, or
or AI,
AI, deals with
with
symbolic
reasoning
using
large
amounts
of
heuristic
knowledge.
Many
of
world’s
symbolic re-aoning using
amounts
heuristic knowledge. Many of the world's
difficult
problems
electronic
electronic or
or mechanical
mechanical
difficult
problems are symbolic,
symbolic, such as troubleshooting
troubleshooting
equipment,
medical
diagnosis
and
therapy
planning,
and
configuring
elemental
equipment, medical diagnosis
therapy planning,
configuring elemental parts
parts into
into a
whole system.
system. For
For these kinds
kinds of
of problems,
problems, AI
AI offers
offers new opportunities
opportunities for
for developing
developing
computerbased solutions.
computer-based
solutions.
AI also uses heuristic
heuristic
In addition
addition to
t.o using
using symbolic
symbolic representations
representations of
of knowledge,
knowledge, AI
methods
thumb, judgmental
judgmental rules
mles that
that
met,hods for
for processing
processing information.
information. Heuristics
Heuristics are rules
rules of
of thumb,
aid in finding
plausible
other areas of
of computing
computing in its
its
bk solutions.
solutions. AI
AI is distinguished
distinguished from
from other
finding plauso’
att,ention
heuristic (non-algorithmic)
(non-algorithmic)
att.ent,ion to
to both
both symbolic
symbolic (non-numeric)
(non-numeric) information
information and heuristic
methods
methods for
for solving
solving problems.
problems.

Placing AI
AI in
in Computer
Science
Placing
Computer
Science
The major
major focus of
of AI
AI is understanding
understanding intelligence
intelligence through
through construction
construction (or
(or
The
programming) of
of machines
machines that
that behave intelligently.
intelligently. That
That is a grand
grand goal. In
In the
the shortshortprogramming)
term, AI
AI research
research focuses on non-numerical
non-numerical problem
problem solving
solving in
in order
order to
to build
build experience
experience
term,
with problem
problem solving
solving methods,
methods, techniques
techniques for
for representing
representing various
various kinds
kinds of
of knowledge,
knowledge,
with
interfaces with
with users.
users, and numerous
numerous other
other issues.
issues.
interfaces
of t,he
the distinguishing
distinguishing features
features of
of problems
problems for
for which
which AI
AI methods
methods have
have been
One of
t,hat the problems
problems are not
not well-structured.
well-structured. That
That is,
is. one does not
not already
already
developed is t*hat
developed
know in advance
advance (from
(from the
the problem
problem description
description alone)
alone) what
what the
the best
best method
method is for
for solving
solving
know
the problem.
problem, In
In short,
short, there
there are no algorithms.
algorithms. Broadly
Broadly speaking,
speaking, AI
AI substitutes
substitutes
the
exploratory search
search for
for precise,
precise, algorithmic
algorithmic solution
solution methods.
methods.
explorat.ory

Expert Systems
Systems and
and Applications
Applications
Expert
The national
national SUMEX-AIM
SUMEX-AIM resource
resource is an outgrowth
outgrowth of
of a long,
long, interdisciplinary
interdisciplinary line
line
The
of artificial
artificial intelligence
int.elligence research
research at
at Stanford
Stanford and
and elsewhere
elsewhere concerned
concerned with
with the
the
of
development
of
concepts
and
techniques
for
building
"expert
systems"
[1].
An
"expert
development of concepts and techniques for building “expert systems” [l]. An “expert
system"” is an inteliigent
intelligent computer
computer program
program that
that uses knowledge
knowledge and
and inference
inference procedures
procedures
sy,item
to solve
solve problems
problems that
that, are difficult
difficult enough
enough to
to require
require significant
significant human
human expertise
expertise for
for their
their
to
solution. For
For some fields
fields of
of work,
work, the
the knowledge
knowledge necessary
necessary to
to perform
perform at
at such
such aa level,
level,
solution.

E. A.
A. Feigenbaum
Feigenbaum
E.

2

Overview
Overview of Objectives
Objectives and Rationale
Rationale

5P41
SP41 RR00785-11
RR00785-11

plus the inference
inference procedures
procedures used,
used, can be thought
thought of as
as aa model of the expertise
expertise of the
expert
expert practitioners
practitioners of that
that field.
Two
features
Two important
important
features that
that distinguish
distinguish expert
expert systems from
from conventional
conventional
programs
Expert
flexible in the sense
Expert systems are flexible
sense
programs are flexibility
flexibility and understandability.
understandability.
that
in the sense
that they
they can be
be changed and extended
extended easily, and they
they are understandable
understandable
sense
that
that they
they can explain
explain the contents
contents of
of their
their own
own knowledge
knowledge bases
bases and their
their own
own lines of
reasoning
features are especially
especially important
important in medicine,
medicine, where
where knowledge
knowledge is
is
reasoning [10].
[lo]. These features
changing
changing rapidly
rapidly and where practitioners
practitioners have to understand
understand the reasons for
for a program's
program’s
decisions because
because they
they have to accept responsibility
responsibility for following
following (or
(or not
not. following)
following) those
decisions.
The
The application
application areas range from
from medicine
medicine to electronics,
electronics, from
from machinery
machinery to
software.
software. The problems
problems ra.nge
range from
from diagnosis and troubleshooting
troubleshooting (analysis)
(analysis) problems
problems to
planning
pla,nning and configuration
configura.tion (synthesis)
(synthesis) problems.
problems. Knowledge
Knowledge bases
bases for
for expert
expert systems are
built
built iteratively
iteratively -- usually
usually through
through long interactions
interactions over
over many
many months
months between
between a human
human
specialist.
specialist who
who underst.ands
understands the details
details of
of the domain
domain and a knowledge
knowledge engineer
engineer who
who
understands
understands the programming
programming details
details of
of the system.
system.
The
The knowl-edge
knowledge of
of an expert
expert system
system consists of
of facts and heuristics.
heuristics. The
The "facts"
“facts”
constitute
constit,ute a body
body of
of irrformation
&formation that
that is widely
widely shared, publicly
publicly available,
available, and generally
generally
agreed upon by experts
experts in a field.
field. The
The "heuristics"
“heuristics” are the mostly-private,
mostly-private, little-discussed
little-discussed
judgment (rules
rult's
rules of good judgment
(rules of
of plausible
plausible reasoning,
reasoning, rules
rules of
of good guessing) that
that
charact.erize
charact,erize expert-level
expert,-level decision making
making in the field.
field. The
The performance
performance level of
of an expert
expert
system
system is primarily
primarily a function
funct.ion of
of the size and quality
quality of
of the knowledge
knowledge base that
that it
it
possesses.
is the
possesses. One of
of the key
key ideas in maintaining
maintaining flexibility
flexibility and understandability
understandability
clean separation
program that
separation of
of elements
elements of
of the knowledge
knowledge base from
from elements
elements of
of the program
that
interpret
base.
interpret the knowledge
knowledge base.
The
building expert
The major
major issues
issues in building
expert systems,
systems, at the moment,
moment, are:
•l selecting
problem (in
importance,
selecting an appropriate
appropriate problem
(in terms
terms of
of size, difficulty,
difficulty,
importance,
decomposability,
decomposability, risk)
risk)
•l selecting
selecting a representation
representation and control
control structure
structure (or
(or framework
framework system
system that
that
sllpplies
both),
supplies both),
•l set.t.ling
problem,
set,t.ling on an
a.11appropriate
appropriate vocabulary
vocabulary and conceptualization
conceptualization for
for the
the problem,
•l finding
finding an available
available expert,
expert,
transferring the expert’
expert's
knowledge into
into the
the program
program (knowledge
(knowledge engineering),
engineering),
l• transferring
s knowledge
refining t,he
the knowledge
knowledge base with
with feedback
feedback from
from test
test cases,
cases,
l• refining
packaging t,he
the system
system in a form
form that
that is acceptable
acceptable to
to end-users,
l• packaging
l• validating
validating the quality
quality of
of the
the program’
program's
s advice.
advice.
of the best
best known
known expert
expert systems
systems is MYCIN
MYCIN [3], a program
program in which
which the
the
One of
separation of
of knowledge
knowledge (of
(of medicine)
medicine) from
from the rest
rest of
of the
the program
program was carefully
carefully
sepa.rstion
engineered. (The
(The abstracted
abstracted ca.se
case of
of an arbitrary
arbitrary knowledge
knowledge base and a framework
framework
engineered.
interpreter, plus
plus auxiliary
a.uxiliary programs,
programs, was achieved
achieved in the EMYCIN
EMYCIN system
system [16],
to which
which
interpreter,
[lS], to
other domains
domains can be added to
to build
build a diagnostic
diagnostic system
system in those domains.)
domains.)
knowledge of
of other
knowledge
Cllrrent.ly authorized
authorized projects
projects in
in the
the SUMEX
SUMEX community
community are concerned
concerned in
in some
Currently
way with
with the
the application
application of
of AI
AI to
to biomedical
biomedical research*.
research·. The
The tangible
tangible objective
objective of
of this
this
way
approach is the
t.he development
development of
of computer
computer programs
programs that
that will
will be more
more general
general and effective
effective
approach

•

Bri€'f abstracts
abstracts of
of the
the various
various projects
projects can
can be
be found
found in
in Appendix
Appendix BB on
on page
page 209
200 and
and more
more detailed
detailed progress
progress
Brief
summaries in
in Section
Section IIII on
on page
page IHI.
60.
summaries

3

A. Feigenbaum
Feigenbaum
E. A.

RR00785-111
5P41 RR007851

Overview of
of Objectives
Objectives and
and Rationale
Rationale
Overview

consultative tools
tools for
for the
the clinician
clinician and
and medical
medical scientist.
scientist. There
There already
already have
have been
consultative
promising result*s
results in
in areas such as chemical
chemical structure
structure elucidation
elucidation and
and synthesis,
synthesis, diagnostic
diagnostic
promising
consultation, molecular
molecular biology,
biology, and
and modeling
modeling of
of psychological
psychological processes.
consult,ation,
much is yet
yet to
to be learned
learned in the
t.he process of
of fashioning
fashioning a coherent
coherent
Needless t.o say, much
scientific discipline
discipline out
out of
of the
the assemblage of
of personal
personal intuitions,
intuitions, mathematical
mathematical procedures,
procedures,
scientific
emerging theoretical
theoretical structure
st.ructure comprising
comprising artificial
artificial intelligence
intelligence research.
research. State-of-theState-of-theand emerging
art programs
programs are far
far more
more narrowly-specialized
narrowly-specialized and
and inflexible
inflexible than
than the
the corresponding
corresponding
art
of human
human intelligence
intelligence they
they emulate;
emulate; however,
however, in
in special
special domains
domains they
they may
may be of
of
aspects of
comparable or
or greater
greater power,
power, e.g., in
in the
the solution
solution of
of formal
formal problems
problems in
in organic
organic chemistry.
chemistry.
comparable

I.A.1.2. Impact
Impact of
of AI
AI in
in Biomedicine
Biomedicine
I.A.1.2.
There is a certain
certain inevitability
inevitability to
to the
the field
field of
of Artificial
Artificial Intelligence
Intelligence and
and its
its
There
applications, in particular,
particular, to
to medicine
medicine and biosciences. The
The cost
cost of
of computers
computers will
will
applications,
continue to
to fall
fall drastically
drastically during
during the
the coming
coming two
two decades. As itit does, many
many more
more of
of t,he
the
continue
practitioners of
ot the
t.he world’
world's
professions will
will be persuaded
persuaded to
to turn
turn to
to economical
economical automatic
automatic
practit,ioners
s professions
information processing
processing for
for assistance in managing
managing the
the increasing
increasing complexity
complexity of
of their
their daily
daily
information
They will
will find,
find, from
from most
most of
of computer
computer science, help only
only for
for those problems
problems that
that
tasks. They
have a mathematical
mat.hematical or
or statistical
statistical core, or
or are of
of a routine
routine data-processing
data-processing nature.
nature. But
But
ha.ve
such problems
problems will
will be relatively
relatively rare,
rare, except
except in
in engineering
engineering and
and physical
physical science. In
In
management, indeed
indeed in most
most of
of the
the world’
world's
work, the
the daily
daily tasks
tasks are
medicine, biology,
biology, management,
medicine,
s work,
requiring symbolic
symbolic reasoning
reasoning with
with detailed
detailed professional
professional knowledge.
knowledge. The
The computers
computers
those requiring
that will
wiII act
act as intelligent
intelligent alrsivtants
assistants for
for these professionals
profeSSionals must
must be endowed
endowed with
with
that
symbolic
symbolic reasoning
reasoning capabilities
capabilities and knowledge.
knowledge.
The
The growth
growth in medical
medical knowledge
knowledge has far
far surpassed the
the ability
ability of
of a single
practitioner to master
practitioner
master it
it all,
all, and the computer's
computer’s superior
superior information
information processing
processing capacity
capacity
there
by offers
thereby
offers a natural
natural appeal. Furthermore,
Furthermore, the reasoning
reasoning processes
processes of
of medical
medical experts
experts
are poorly
poorly understood;
understood; attempts
attempts to
to model
model expert
expert decision-making
decision-making necessarily
necessarily require
require a
degree of
of introspection
introspection and a structured
structured experimentation
experimentation that
that may,
may, in turn,
turn, improve
improve the
the
quality
quaiity of
of the
the physician's
physician’s own
own clinical
clinical decisions, making
making them
them more
more reproducible
reproducible and
defensible.
New insights
defensible. New
insights that
that result
result may
may also allow
allow us more
more adequately
adequately to
to teach m~dical
medical
students
students and house staff
staff the techniques
techniques for
for reaching
reaching good decisions, rather
rst,her than
than merely
merely to
to
offpr
offer ;).n. collection
collect ion of
of facts
facts which
which they
they must
must independently
independently learn
learn to
to utilize
utilize coherently.
coherently.
The
The kno'wledge
knowledge that
that must
must be used is a combination
combination of
of factual
factual knowledge
knowledge and
heuristic
The latter
1atLer is especially
especially hard
hard to
to obtain
obtain and represent
represent since the
heuristic knowledge.
knowledge. The
experts
expertIs providing
providing it
it are mostly
mostly unaware
unaware of
of the heuristic
heuristic knowledge
knowledge they
they are using.
~ledical
hledical and scientific
scientific communities
communities currently
currently face many
many widely-recognized
widely-recognized problems
problems
relating
relating to the rapid
rapid accumulation
accumulation of
of knowledge,
knowledge, for example:
example:
•l codifying
codifying theoretical
theoretical and heuristic
heuristic knowledge
knowledge
•l effectively
effectively using the wealth
wealth of
of information
information implicitly
implicitly available
available from
from textbooks,
textbooks,
journal
articles
and
other
practitioners
journal articles
other practitioners
•l disseminating
disseminating that
that knowledge
knowledge beyond
beyond the intellectual
intellectual centers where
where it
it is
is
collected
collected
•l customizing
customizing the presentation
presentation of that
that knowledge
knowledge to individual
individual practitioners
practitioners as
as
well
well as
‘3s customizing
customizing the
t,he application
application of the information
information to individual
individual cases
cases
\Ve
We believe that
t.hat computers
computers are an inevitable
inevitable technology
technology for
for helping
helping to overcome
overcome

E. A. Feigenbaum
Feigenbaum

4

Overview of
of Objectives
Objectives and Rationale
Rationale
Overview

5P41 RR00785-11
RR00785-11

While recogOlzmg
recognizing the
the value
value of
01 mathematical
mathematical modeling,
modeling, statistical
statistical
these problems.
While
problems.
classification, decision
decision theory
theory and other
other techniques,
techniques, we believe
believe that
that effective
effective use
use of
of such
classification,
methods depends on using
using them
them in conjunction
conjunction with
with less
less formal
formal knowledge,
knowledge, including
including
methods
contextual
contextual and strategic
strategic knowledge.
knowledge.
Artificial inklligence
intelligence offers
offers advantages
advantages for
for representing
representing and using
using information
information that
that
Artificial
will allow
allow physicians
physicians and scientists
scientists to
to use
use computers
computers as
as intelligent
intelligent assistants.
assistants. In
In this
this way
way
will
envision a significant
significant extension
extension to
to the
the decision-making
decision-making powers
powers of
of specific
specific practitioners
practit.ioners
we envision
without
without reducing
reducing the importance
importance of
of those individuals
individuals in that
that process.
Knowledge is power,
power, in the profession
profession and in the intelligent
intelligent agent.
agent. As we proceed
Knowledge
to model
model expertise
expertise in medicine
medicine and its
its related
related sciences,
sciences, we find
find that
that the power
power of
of our
our
to
programs derives
derives mainly
mainly from
from the
the knowledge
knowledge that
that we are able to
to obtain
obtain from
from our
our
programs
collaborating practitioners,
practitioners, not
not from
from the sophistication
sophist,ication of
of the inference
inference processes
processes we
collaborating
observe them
them using. Crucially,
Crucially, the knowledge
knowledge that
that gives power
power is not
not merely
merely the
the
knowledge of
of the textbook,
textbook, the lecture
lecture and the
the journal,
but the knowledge
knowledge of
of good
knowledge
journal, but
good
practice--the
experiential knowledge
knowledge of
of good
knowledge
practice--tlle experiential
good judgment
judgment and good
good guessing,
gue88ing, the knowledge
of the practitioner's
practitioner’s art
art that
that is orten
often used in lieu
lieu of
of facts
facts and rigor.
rigor.
of
This
This heuristic
heuristic
knowledge
knowledge is mostly
mostly private,
private, even in the
the very
very public
public practice
practice of
of science. It
It is almost
almost never
never
taught
taught explicitly,
explicitly, is almost
almost never
never discussed and critiqued
critiqued among
among peers, and most
most orten
often is
not even in the
t.he moment-by-moment
moment-by-moment awareness of
01 the
the practitioner.
practitioner.
not
Perhaps the the most
most expansive
expansive view
view of
of the significance
significance of
of the work
work of
of the
the SUMEXSUMEXPerhaps
AIhl community
communit.y is that
that a methodology
methodology is emerging
emerging for
for the systematic
systematic explication,
explication, testing,
testing,
AIM
dissemination, and teaching
teaching of
of the heuristic
heuristic knowledge
knowledge of
of medical
medical practice
practice and scientific
scientific
dissemination,
performance.
performa,nce. It
It may
may be less
less important
important that
that computer
computer programs
programs can be organized
organized to
to use
use
this lcnowledge than
Lhan that
that the knowledge
knowledge itself
itself can be organized
organized for
for the use of
of the
the human
human
this
practitioners
practitioners of
of today
today and tomorrow.
tomorrow.
Evidence of
of the impact
impact of
of SUMEX-AIM
SUMEX-AIM in promoting
promoting ideas such as
as these, and
Evidence
developing the pertinent
pertinent specific
specific techniques,
techniques, has been the explosion
explosion of
of interest
interest in medical
medical
developing
artificial intelligence
intelligence and the specific
specific research efforts
efforts of
of the SUMEX
SUMEX community.
community.
artificial
As
SU?rlEX has
ha.s entered
entered its second decade, we have found
found that
that the small
small community
community of
of
SU!\fEX
researchers
970'ss has now
researchers that
t.hat characterized
characterized the AIM
AIM field
field in the
the early
early 11970’
now grown
grown to a large,
accomplished, and respected
respected research community.
community. The
The American
American Association
Association for
for Artificial
Artificia.1
accomplished,
Intelligence
Intelligence (AAAI),
(A,4.41), the principal
principal scientific
scientific membership
membership organization
organization for
for the AI
AI field,
field, has
members, over
over 1000 of
of whom
whom are members
members of
of the medical
medical special interest
interest group
group
4000 members,
known as
as t.he
the AAAI-M.
AAAI-M.
This subgroup
subgroup was founded
founded by
by members
members of
of the SUMEX-AIM
SUMEX-AIM
known
This
community who
who were active
active in AAAI
AAAI and is the only
only active
active subgroup
subgroup in the Association.
Association.
community
The organization
organization distributes
distributes semiannual
semiannual newsletters
newsletters on medical
medical AI
AI and provides
provides a focus
The
for co-sponsoring
levan t medical
co-sponsoring re
relevant
medical computing
computing meetings
meetings with
with other
other societies
societies (such as
as the
American Association
Association for
for Medical
Medical Systems
Systems and Informatics
Informatics -- J\.-\MSI).
k4MSI).
Medical AI
AI papers
American
Medical
prominently featured
featured at
at’ both
both medical
medical computing
computing and artificial
artificial intelligence
intelligence meetings,
meetings,
are prominently
artificial int.elligence
intelligence is now
now routinely
routinely featured
featured as
as a specific
specific subtopic
subtopic for
for specialized
specialized
and artificial
sessions at
a,t medical
mcdicsl computing
computing and other
other medical
medical professional
professional meetings.
meetings. For
sessions
For example,
example,
members of
of the AIM
.41bf community
communit,y have represented
represented the field
field to
to physicians
physicians at the American
American
members
of Pathology
Pathology and American
American College
College of
of Physicians
Physicians meetings
meetings for
for the last
last several
several
College of
A mere decade ago, the words
words "artificial
“artificial intelligence"
intei!igence” were never
never uttered
uttered at such
years. A
The growing
growing interest
interest and recognition
recognition are largely
largely due to
to the activities
activities of
of the
the
conferences. The
SU?.lES-.4IM community.
community.
SU!'.fEX-AIM
Another indication
indication of
of the growing
growing impact
impact of
of the SUMEX-AIM
SUMEX-AIM community
community is its
its
Another
effect, on medical
medical education.
education.
effect
For
For reasons such as
as those outlined
outlined above, there
there is an
increasing recognition
recognition of
of the
t,he need for
for a revolution
revolution in the way
way medicine
medicine is taught
taught and
increasing

5

E. A.
A. Feigenbaum
Feigenbaum

Overview
Overview of Objectives
Objectives and Rationale
Rationale

5P41 RR00785-11
RR00785- I I

medical
Computing
medical students
students organize
organize and access
access information.
information.
Computing technology
technology isis routinely
routinely
cited as
as part
part of this
this revolution,
revolution, and artificial
artificial intelligence
intelligence (and SUMEX-AIM
SUMEX-AIM research)
research)
generally
generally figures
figures prominently
prominently in such discussions. Such diverse organizations
organizations as
as the
Na.tional
Na.tional Library
Library of Medicine,
Medicine, the American
American College of Physicians,
Physicians, the Association
Association of
American
American Medical
Medical Colleges, and the Medical
Medical Library
Library Association
Association have all called for
for
sweeping changes in medical
medical education,
education, increased educational
educational use
use of computing
computing
technology,
technology, enhanced research in medical
medical computer
computer science, and career development
development for
people working
working at the interface
interface between
between medicine
medicine and computing;
computing; reports
reports of all four
four
organizations
orga,nizations have
ha.ve specifically
specifically cited
cited the role of
of artificial
artificial intelligence
intelligence techniques
techniques in future
future
medical
programs
medical practice
practice and have used SUMEX-AIM
SUMEX-AIM
programs as
as examples of where
where the
technology
technology is
is gradually
gradually heading.
heading.
In summary,
summary, the logic which
which mandates
mandates that
that artificial
artificial intelligence
intelligence playa
play a key
key role in
enhancing
enhancing knowledge
knowledge management
management and access
access for biomedicine
biomedicine -- a logic in which
which we have
long believed
gradually become evident
evident to much
much of the biomedical
biomedical community.
community. We
believed -- has gradually
are encouraged
encouraged by
by this
this increased recognition,
recognition, but
but realistic
realistic about
about the significant
significant research
cha.llenges
cha.llenges that
that remain.
remain. Our
Our goals are accordingly
accordingly both
both scientific
scientific and educational.
educational. We
We
continue
continue t.o
to pursue
pursue the research objectives
objectives that
that have always
always guided
guided SUMEX-AIM,
SUMEX-AIM, but
but
must
must also undertake
undertake educational
educational efforts
efforts designed to inform
inform the biomedical
biomedical community
community of
of
our
our results
results while
while cautioning
cautioning it
it about
about the challenges remaining.
remaining.

A. Feigenbaum
Feigenbaum
E. A.

6

Details of
of Technical
Technical Progress
Progress
Details

5p·H
5P4 1 HH00785-11
R.R00785- 11

I.A.2.
Details
of
Progress
I.A.2.
Details
of Technical
Technical
Progress
I.A.2.1.
Facility
Management
and
I.A.2.1.
Facility
Management
and Operation
Operation
The
The following
following materia.l
materia.1 covers the SUMEX-AIM
SUMEX-AIM resource activities
activities over
over the past
past
year
year in greater
greater deta.il.
detail. Individual
Individual sections
sections cover
cover progress
progress in ;
•l Facility
Facility Management
Management and Operation
Operation

Timesharing Systems
Systems
•l Timesharing
•l Professional
Professional \Vorkstations
Workstations
•l Networking
Networking and Communications
Communications
These sections
sect.ions outline
outline accomplishments
accomplishments in the context
context of
of the resource staff
staff and resource
management.
management. Details
Details of
of the progress and plans
plans for
for our
our external
external collaborative
collaborative projects
projects are
presented
presented in Section
Section II
II beginning
beginning on page 69.

I.A.2.2.
Facility
Management
and
I.A.2.2.
Facility
Management
and Operation
Operation
SUMEX-AIM
continues
SUMEX-AIM
continues to
to manage and
and operate
operate it's
it’s computing
computing resources in a
effect.ive
effect,ive and efficient
efficient manner
manner conducive
conducive to
to providing
providing a reliable
reliable and robust
robust computing
computing
environment.
environment.
While
While the
the previous
previous year
year (Year
(Year 10) involved
involved a major
major move from
from the KIlO
KIlO Tenex
Tenex
syst.em
system to a new DECsystem
DECsystem 2060, this
this year
year saw more emphasis
emphasis to
to our
our gradual
gradual move to
to
distributed
distributed processing,
processing, while
while continuing
continuing to
to improve
improve our
our excellent
excellent timesharing
timesharing environment
environment
on the 2060. This
This development
development is covered
covered in full
full in section
section I.A.2.2
I.A.2.2 starting
starting on page 15.
Ollr
Our continued
continued movement
movement to
to professional
professional workstations
workstations has taken
taken on several
several forms.
forms.
We
We have continued
continued to
to acquire
acquire Lisp
Lisp machines
machines for
for use
use by
by the SUMEX
SUMEX community
community while
while at
the same time
time investigating
investigating t.he
the use of
01 remote
remote virtual
virtual graphics
graphics and new lower
lower cost
workstat,ions
workstations such as
as the Apple
Apple Macintosh,
Macintosh, Sun workstations,
workstations, and others
others that
that are
appearing
The
The development
development of
of professional
professional workstations
workstations is covered
covered in
apprnring on the market.
market.
more
more detail
detail in section
section I.A.2,3
I.A.2.3 starting
starting on page 21.
SUMEX
SUMEX cont,inues
continues to
to expend
expend a great
great deal of
of effort
effort in the
the support
support and development
development
of
Key
communications facilities.
facilities.
Key to
to our
our ability
ability to provide
provide the
the
of our
our networking
networking and communications
maximum
maximum computing
comput,ing power
power available
available to
to the
t,he greatest
greatest number
number of
of users is a mechanism
mechanism for
for
making
making it
it, irrelevant
irrelevant where
where that
that user is physically
physically located.
located. By
By having
having a robust
robust networking
networking
and communications
communications environment,
environment, we are able to extend
extend our
our facility
facility to
to any
any user or
or group
group
of
of users, thereby
thereby making
making available
available to
to them
them the power
power and convenience
convenience of
of SUMEX.
SUMEX.
Further
Further information
information on the
the progress made in networking
networking and communications
communications can be
found
found in section
section I.A.2.4
I.A.2.4 starting
starting on page 23.
In
In the area of
of fa.cility
fa.ci!ity management
management and operation,
operation, several
severa.! noteworthy
noteworthy events
events
occurred
occurred over
over the past
past year
year which
which will
will be explained
explained in more detail
detail here.

7

E. A. Feigenbaum
Feigenbaum

Details
Details of Technical
Technical Progress

5P41 RR00785-11
RR00785-1 I

SUA/EX/IIPP
Facility
SUhdEX/HPP Welch
Welch Road
Road Computing
Computing
Facility
A major
major development
development this
this past year
year at SUMEX
SUMEX was the move of the Heuristic
Heuristic
Programming
Project
to
their
new
location
at
701
Programming Project
their
location
701 Welch
Welch Road, adjacent
adjacent to the Stanford
Stanford
Medical
Medical Center.
Center. Since this
this group
group is a major
major user of the SUMEX-AIM
SUMEX-AIM resource and the
focus for most
most of the core AI
‘41 research, a good deal of effort
effort was expended to provide
provide a
robust
robust computing
computing environment
environment at their
their new location.
location. This
This development
development involved
involved several
stages and levels of technical
technical deveL:;pment,
devei;;pment, ranging
ranging from
from construction
construction of the machine
machine
room,
room, new cable and wiring
wiring installation,
installation, procurement
procurement and setup of networking
networking hardware,
hardware,
to major
major new developments
developments in the networking
networking software
software and a twisted
twisted pair
pair Ethernet
Ethernet
communication
communicat.ion link
link between
between this
this site and the main
main SUMEX
SUMEX Computer
Computer room.
room. All
A!! of the
hardware
hardware and facilities
facilities purchases were funded
funded from
from sources other
other than
than SUMEX.
SUMEX.
\Ve
We setup the general
genera! communications
communications capabilities
capabilities for
for the two
two buildings
buildings occupied
occupied by
by
the HPP.
HPP. This
This involved
involved wiring
wiring up local terminals,
terminals, installing
installing local Ethernets
Ethernets (both
(both 3 and 10
10
mt'gabit
megabit capability),
capability), and acquiring
acquiring and installing
installing networking
networking hardware
hardware such as
as terminal
terminal
interface
interface processors (TIP)
(TIP) and gateways,
gateways, as
as well
we!! as
as extending
extending the current
current SUMEX
SUMEX TIP
TIP
and GATE\\,
A
Y
software
to
handle
both
3
and
10
megabit
network
traffic.
GATEWAY software
handle both
megabit network traffic.
But
But the most
most important
important and most
most interesting
interesting development
development in this
this process was the
twisted pair'
“twist,ed
pair’ ethernet
ethernet developed
developed by
by the SUMEX
SUMEX engineering
engineering staff
stall to allow
allow high
high speed
reliable
reliable communications
communications between
between this
this Welch
Welch Road facility
facility and the SUMEX
SUMEX machine
machine room.
room.
Further
Further information
information on this
this new ethernet
ethernet can be found
found in Section
Section I.A.2.4
I.A.2.4 on page 23.
II

HPP
HPP researchers are routinely
routinely using this
this link
link to
to communicate
communicate with
with SUMEX
SUMEX and the
the
central
central university
university network.
network. In addition,
addition, various
various Lisp
Lisp machines
machines and printers
printers located
located in the
HPP
HPP facility
facility and connected
connected to
to a local network
network are able to communicate
communicate with
with the
the
university
university network.
network.
The
The end result
result is that
that we have successfully
successfully been able to
to extend
extend the
the SUMEX
SUMEX
compnting
providing a high
computing environment
environment to a remote
remote site,
site, providing
high speed link
link to
to the facilities
facilities of
of
SUMEX
processing. We
SUMEX While
while also allowing
allowing for
for local
local distributed
distributed processing.
We see
see this
this experience
experience has
being most
most valuable
valuable in the future
future as
as we move further
further into
into a distributed
distributed environment,
environment,
while
provided by
by large
while still
still needing
needing the sharing
sharing of
of resources and communication
communication links
links provided
t.imesharing
timesharing systems
systems and local area networks.
networks.

Dig£tal Equipment
Equipment Corporation
stops development
0/86-6it
product line
Digitnl
Corporation
stops
development
of S&bit product
line
Digital Equipment
Equipment Corporation,
Corporation, a long
long time
time supplier
supplier of
of high
high speed 36-bit
36-bit
Digital
timesharing computers
computers to
to the
the Artificial
Artificial Intelligence
Intelligence community,
community, announced
announced that
that itit was
timesharing
stopping all
all development
development of
of future
future 36bit
36-bit products,
products, and instead
instead sta.rting
starting a program
program to
to
stopping
provide a migration
migration path
path to
to its
its line
line of
of VAX
V A..,{ minicomputers.
minicomputers.
provide
Many DEC
DEC 20 customers
customers had
had been anticipating
anticipating a new yet
yet unannounced
unannounced machine
machine
Many
from DEC
DEC code named
named the
the'‘Jupiter’
Jupiter'.
which had been reported
reported to
to be a order
order of
of magnitude
magnitude
from
, which
faster than
than the
t.he current
current KLlO
KLlO processor
processor used in DEC20’
DEC20's
DEClO's.
However, DEC’
DEC's
fa.ster
s and DEClO’
s. However,
s
announcement means this
this effort
effort has stopped,
stopped, and we can expect
expect no more
more 36bit
36-bit products
products
announcement
from Digits!
Digital Equipment
Equipment Corporation.
Corporation.
from
The effect
effect of
of t’
this
announcement to
to the
the AI
AI community
community is disappoint,ing,
disappointing, alt,hough
although
The
his announcement
not totally
totally unexpected.
unexpected. The
The DECsystem20
DECsystem20 has been the
the predominant
predominant timesharing
timesharing machine
machine
not
to support
support Artificial
Artificial Intelligence
Intelligence based research,
research, but
but yet
yet researchers
researchers have been in
in
used to
of more
more processing
processing power
power and
and larger
larger address spaces for
for quite
quite a few years.
years. DEC
DEC has
need of
clearly decided
decided to
to devot,e
devote their
their resources
resources to
to VAX
VAX development.
development. For
For those
tho~e in
in need of
of
clearly
greater 36bit
3u-bit processing
processing power
power or
or address space, you
you must
must now
now look
look to
to newer
newer less
less
greater

E. A.
A. Feigenbaum
Feigenbaum

88

RR00785-11
5P41 RR0078S-11

Details of
of Technical
Technical Progress
Progress
Details

experienced
experienced companies
companies such as
as Foonley
Foonley and Systems
Systems Concepts
Concepts for
for follow
follow on 36-bit
36bit
products
products which
which those firms
firms are preparing.
preparing.
The
The impact
impact of
of this
this decision
decision on the
the SUMEX-AIM
SUMEX-AIM community
community must
must be examined
examined in
conjunclion
conjunclion with
with our
our development
development of
of AI
AI systems
systems on personal
personal Lisp
Lisp machines.
machines. We
We have
outlined
outlined very
very clearly
clearly our
our plans
plans to
to move
move to
to distributed
distributed Lisp-based
Lisp-based workstations
workstations for
for AI
AI
t,his is clearly
clearly where
where we see
see the
the AI
AI computing
computing market
market heading.
heading. These
Research, and this
ma.chines
machines offer
offer much
much better
better cost./performance
cost/performance ratios
ratios than
than timesharing
timesharing machines,
machines, high
high
resolution bit-mapped
bit-mapped screens, and powerful
powerful Lisp
Lisp programming
programming environments
environments for
for the
resolution
development
development, and eventual
eventual dissemination
dissemination of
of AI
AI based systems. However,
However, this
this is not
not too
too say
we still
still do not
not see
see a role for
for the large timesharing
timesharing machine
machine in our
our environment.
environment. We
We still
still
of a large central
central mainframe
mainframe computer
computer as the
the anchor
anchor for
for a large
believe in the use of
con1munit.y of
of users. The
The mainframe
mainframe also functions
functions as
as a central
central facility
facility for
for communication
communication
community
collaboration, and provides
provides fast
fast Lisp
Lisp cycles for
for program
program development
development where
where the
and collaboration,
application
application is not
not in need of
of a specialized
specialized workstation.
workstation.
Other SUA/EX
SUMEX
C’omputing
Facilities
Other
Computing
Facilities

SUMEX continues
continues to
to support
support other
other mainframe
mainframe computers,
computers, file servers, professional
professional
SUMEX
workstations,
workstations, and assorted
assorted printers
printers and terminals
terminals for
for use
use by
by the
the SUMEX-AIM
SUMEX-AIM community.
community.
1. The
The SUMEX-AIM
SUMEX-AIM File
File server,
server, based on a VAX
VAX 11/750
11/750 computer,
computer, continues
continues to
to

of the workstation
workstation users within
within SUMEX-AIM.
SUMEX-AIM. The
The use
use of
of
serve the needs of
SAFE by
by users of
of our
our 2060 is minimal.
minimal. We
We plan
plan to extend
extend the
the use
use of
of SAFE
SAFE in
SAFE
future by
by providing
pro\-iding more
more convenient
convenient access
access by
by 2060 users than
than is currently
current,ly
the future
available.
available.
The VAX
VAX 11
11,/780
computer system,
system, originally
originally purchased
purchased with
with DARPA
DARPA funds
funds
2. The
/780 computer
and previously
previous!y located
located in Margaret
Margaret Jacks Hall
Hall on campus,
campus, has become a
SUMEX-AIM resource this
this past
past year.
year. The
The system
system was moved
moved to a new location
location
SUMEX-AIM
Stanford campus
campus which
which provides
provides a better
better environment
environment for
for a computer
computer
on the Stanford
of this
this size. This
This VAX
VAX is now
now shared
shared between
between the
the Computer
Computer Science
of
Department
Department and the SUMEX-AIM
SUMEX-AIM community.
community.
3. SUMEX
SUMEX continues
continues to support
support a wide
wide range of
of professional
professional workstations
workstations from
from
vendors as
as Xerox,
Xerox, Symbolics,
Symbolics, and Hewlett
Hewlett Packard
Packard for
for the development
development
such vendors
testing of
of AI
AI applications.
applications. Additional
and testing
Additional work
work has been started
started to explore
explore
use of
of the Apple
Apple Macintosh
Macintosh and Apple
Apple Lisa
Lisa within
within SUMEX.
SUMEX. More
More
the use
information on these developments
developments can be found
found in section
section I.A.2.3.
I.A.2.3.
information

9

E. A. Feigenbaum
Feigenbaum

5P41 RR00785-11
RIl00785-11

Details of
of Technical
Technical Progress
Progress
Details

7
G3nmil
Central Proceaor
Processor
DECKLlo-E
DEC
KLlO-E
2M words
warns of
of memoly,
memory, c¶cho
Cache
2M

RH20
RH2o
L

I

II

RH20
RtKo

MassBus

" . .sus

DisK colltr0ller
Controller
D&k
and Dnvc
Onve
and
DECRP07
DEC
UP07

I

Disk Controlkr
ContrOller
Disk
and Dnve
Drive
MCI
DECRP07
DEC
R PO7

I

RH20 I
I
RH20

IJNIEUS
UNIBUS

MusSus
M&s&m

L

4
OIB20
DIE20

DEC AN20
ARPAnet
Interlace

~

and OnY8

DECRPOe

I

MEIS
MassBus
Ettlemet
Interlace

~

~

~

H

K1.INIKUne

~glnQTTY

--

DEC LP·26
LP-26
DEC
urm Pnnta
UnePnntar
t-

6 Une Scannens

DEC OH·',
96 Unes total

III

6L.ines

TYMNET
Interface

Figure
1:
Figure
1:

f>. .A. Feigenballm

~

IZ

Current SUMEX-AIM
SUME?(-AIM Decsystem
Decsystem 2050
2060 Computer
Computer Configuration
Configuration
Current

10

7I

DEC
DEC LPlO
LP10
Line
Une Printer
Printer

Tape Drives
and Controller

I

~

~

DEC BAlD

L

Console TTY

20ECTU·78
2 DEC TU-78

J

l/O
bus
1I0bua

Disk ContrOller
~

t

fo-'

11/40FROUTEND
111.-0 FRONT END

,

.

RR00785-11
5P41 RR00785-11

Details
Details of
of Technical
Technical Progress
Progress

Central
Central Processor
Processor
DEC
KS·10
DECKS-10
512K
512K words
words Memory
Memory
LA-36
LA-36 Console
Console

I

I

I

Unibus
Unibus Adapter
Adapter

Unibus
Unibus Adapter
Adapter

I

II

1

I

UNIBUS
UNIBUS

I

I

Massbus
Massbus Adapter
Adapter

I

Massbus Adapter

Ethemet
UNIBUS
Interface

I
Line Scanner
DEC OZ·"
f-16 Lines

MassBus

I
Disk
Disk Controller
Controller
and
and Drive
Drive
DEC
DEC RP·06
RP-06

Figure
2:
Figure
2:

UNli3US
UNIBUS

MassBus

r-l

Tape
Tape Controller
Controller
and
and Drive
Drive
DECTU....s
DEC TlJ-45

Current
Current SUMEX-AIM
SUMEX-AIM 2020 Computer
Computer Configuration
Configuration

11
11

E. A. Feigenbaum
Feigenbaum

RR00785- 11
5P41 RR00785-11

Details
Details of
of Technical
Technical Progress
Progress

Central
Central Processor
Processor
DEC
DEC VAX
VAX 11nOO
11 f70O
4 Mbytes
Mbytes Memory
Memory
Floating
Floating Pt
Pt Unit
Unit
LA·36
LA-36 Console
Console

MassBus
MSSBUS

Disk Controller
and Drive
DEC RP·06

Line Scanner
DEC DZ·"
16 Lines

Disk
Disk Controller
Controller
and
and Drive
Drive
DEC
DEC RP·07
RP-07

Ethernet
UNIBUS
Interface

Tape Controller
and Drive
DEC TU·n

Figure
3:
Figure
3:

E. .r\.
A. P('ig('nbaum
Feigenbsum

UNIBUS
UNIBUS

I-

Chaosnet
Chaosnet
UNIBUS
UNIBUS
Interface
Interface

,

Current
AX Comput,er
Current Shared
Shared V
VAX
Comput,er Configuration
Configuration

12

DeWIs
Details of
or Technical
Technical Progress
Progress

SP41
RR0078S-11
SF’
41 RR0078.5-11

Margaret Jacks
Jacks
Margaret
Heuristic Prog
Prog Prof
PrOI
Heunstic

Electrical
Electrical
Engineering
Engmeenng

Pine Hall
Hall
Pine
CSLI

10 Mb.t ethernet
I

SCORE 2060
2060
SCORE
Ether TIP
TIP
Ether
Dover Pnnter
Printer
Dover
Other CSD
CSD Eoulp
Equip
Other

r

I

I

.
I

Sumex VAX
CSLl2060
Oandeltons
Dandelions
NS
NS file
file server
server

G

Oncocin
OnCOCin Offices
Offices

piiiJq

SUMEX Offices
Offices
SUMEX

Medical
Center
Medical Center
SUMEX Mach
SUMEX
Mach Rm
Rm

~

El

Xerox Alto
Alto
Xerox
Dolph.n
Dolphm
Dandelions
Dandelions
SUN.
Devel
SUN· Devel
Ether TIP
Ether
TIP
Imprmt-10
Printer
Imprint·10
Printer

2060
2060
2020
2020
VAX·
VAX. File
File Server
Server
VAX·
VAX. Devel
Devel
Ether
Ether TIP
TIP
PDP·11
POP-1 1 . Devel
Oevel

Repeater
Repeater

~
M Gateway
Gateway

Ether TIP
Dandelions
HP 9836's

10
ethernet
10 Mbit
Mbit ethernet

Whelan Building
(Welch Road)
Whelan
Buflding (Welch
Road)
Heuristic
Heuristic Prog
Prog Proj
Proi
1.5
1.5 Mbit
Mbit
phone
phone line
line

10
10 Mbit
Mbit ethernet
ethernet
/I

I

'-====l~~ I

11
I

Dandelions
Dandelfons
LM·3600's
LM-3600’S
Ether
Ether TIP
TIP
Dolphins
Dorado
Dorado
!=laven Printer

3 Mb.t
Mbtt ethernet
ethernet
3

Figure
4:
Figure
4:

SUMEX-AIM
SUhlES-AIhl Ethernet
Ethernet Configuration
Configuration

13
13

E. .\. Feigcnhnrim

Det3
Dehilsils of
of Technical
Technical Progress
Progress

5P41
5P41 RR00785-11
RR00785-11

Central
Central Processor
Procesor
DEC
DEC VAX
VAX llnSO
I t n!x
22 Mbytes
Mbytes Memory
Memory

MassBus
MassBus 1

I

I

TUSS

H

UNIBUS
UNIBUS

Kenne<1y
600/1600 bpI

CDC 256 Mbyte
Removable·
MedIa
Disk Drive

Fujitsu Eagle
414 Mbyte
Disk Drive

I1

It
System
System Industries
Industries
9900
8900 Disk
Disk Controller
Controller

f-

~

I

I

Fujitsu Eagle
Eagle
Fuji&u
414 Mbyte
Mbyte
414
Disk Drove
Drive
Disk

Fujitsu Eagle
Eagle
Fujitsu
414 Mbyre
Mbyte
414
Disk Drive
Drive
Disk

f-

f-

TIlDe Drove
& Emulex
Controller

Disk Controller
and Drive
DEC RK07

Ethernet
UNIBUS
Interface

OZ·11
I--

Figure 5:5:
Figure

E. ;\.

Ft'i~l'nbd'lrn

Line Scanner
BLines

SUMEX-AIM File
File Server
SerVt'r {SAFE}
{SAFE}
suhim-AI?.1

14
14

Console TTY

I

Details
Details of Technical
Technical Progress

5P41
5P41 RROOi85-11
RR00785-11

Central Processor
DEC VAX 111750
2 Mbytes Memory

"-

I

Console TTY

TU 58

UNIBUS
UNIBUS

Disk Controller
and Drive
DEC RK07

Ethernet
UNIBUS
Interface

Disk Controller
and Drive
DEC RK07

DEC OZ-11
Line Scanner
8 Lines

Figure 6:
6:
Figure

SUMEX-AIM Development
Development Vax
Vax {ARDVAX}
SUMEX-AIM
{ARDVAX}

15

A. Feigenbsum
Feigenbaum
E. A.

Progress
Progress -- Timesharing
Timesharing Systems
Systems

5P41
5P41 RR00785-11
RR00785- 11

I.A.2.3.
Timesharing
Systems
I.A.2.3.
Timesharing
Systems
Continued
Continued support
support and
and development
development ot
of our
our timesharing
timesharing systems
systems this
this past
past year
year has
has
concentrated
concentrated on
on several
several areas,
areas, including
including improvement
improvement of
of user
user services
services such
such as
as printing
printing
spoolers
spoolers and
and archiving
archiving support,
support, implementation
implementation of
of features
features from
from our
our KilO
KIlO Tenex
Tenex system,
system,
enhancing
interface
network
interface service,
service, correcting
correcting encountered
encountered system
system bugs, and
and
enhancing network
implementing
In
implementing new features
features for
for better
better user community
community support.
support.
In addition,
addition, we have
invested
system
invested further
further effort
effort in
in supporting
supporting the VAX/UNIX
VAX/UNIX
system in
in conjunction
conjunction with
with the
SUMEX-AIM
SUMEX-AIM file
file server
server installation.
installation.

DECsystem
2060/I'OPS-20
System
DECsystem
206O/TOPS-20
System
Support
Support of our
our main
main timesharing
timesharing machine,
machine, the DECsystem
DECsystem 2060,
2060, has continued
continued
during
during grant
grant year
year II.
11.

•l Hardware
development
Hardware
development
I.
1. The
The DECsystem
DECsystem 2060 system
system now in operation
operation at SUMEX
SUMEX differs
differs greatly
greatly
from
from our
our previous
previous KilO
KIlO Tenex
Tenex system.
system. Whereas
Whereas before, the KilO
KIlO system
system
and TENEX
TENEX software
software required
required much
much in-house development
development and support,
support,
life is easier with
with the 2060. Being
Being at Stanford
Stanford University,
University, where
where there
there
are at least 7 other
DECsystem
20's
with
similar
hardware
and
software
other DECsystem 20’s with similar hardware
software
is a great
great advantage.
advantage. We
We are able to
to share our
our experiences with
with other
other
sites, and have become an integral
integral part
part of
of the Stanford
Stanford DEC
DEC community.
community.
In addition,
addition, the DEC2060
DEC2060 hardware
hardware has been more
more reliable
reliable and easier to
maintain
maintain than
than the
the KilO
KIlO system.
system.
2. Additional
provide support
Additional modems
modems were added to
to the
the 2060 to
to provide
support for
for BELL
BELL
212A
baud protocols.
protocols. This
baud
212A 1200 baud
This adds an alternative
alternative to Vadic
Vadic 1200 baud
service.
use the Bell
service. Modems
Modems which
which use
Bell 212A
212A standard
standard are more
more widely
widely
available
available for
for much
much less
less cost than
than Vadic
Vadic modems
modems..

•a TOPS-20
TOPS-20 Monitor
AIonitor Software
Software Enhancements
Enhancements
1. A
A significant
significant enhancement
enhancement to
to our
our TOPS-20
TOPS-20 monitor
monitor occurred
occurred this
this year
year

when we implemented
implemented the
the software
software from
from our
our Tenex
Tenex system
system which
which allowed
allowed
when
extended support
support for
for the
the ‘'?'
feature of
of TOPS-20
TOPS-20 when
when parsing
parsing filenames.
filenames.
extended
?’ feature
This feature
feature allows
allows a user at
at any
any time
time to
to get
get a list
list of
of possible choices
This
when needing
needing to
to input
input a file
file name by
by just
just typing
typing ‘'?'.
This returns
returns an
?‘. This
when
actual list
list of
of file
file names, whereas
whereas in the
the standard
standard TOPS-20
TOPS-20 monitor,
monitor, just
just
actual
the st.ring
string ‘'input
filespec' was returned.
returned. This
This is a very
very significant
significant and
and
the
input filespec’
useful change to
to our
our TOPS-20
TOPS-20 system.
system.
useful
We continued
continued to
to keep up-to-date
up-to-date on
on the
the various
various bug
bug fixes
fixes and
and monitor
monitor
2. We
improvements that
that we received
received from
from DEC
DEC and
and other
other TOPS-20
TOPS-20 sites.
improvements
These included
included several
several fixes
fixes and
and rewrites
rewrites to
to the
the Internet
Internet IP/TCP
IP /TCP code
These
which went
went under
under a major
major revision
revision this
this past
past year.
year.
which
\Ve installed
installed the
the capability
capability for
for users to
to access
access their
their subdirectories
subdirectories as ifif
3. \Ye
they were
were the
the owners
owners of
of such.
such. While
While this
this may
may seem to
to be the
the logical
logical way
way
they
to implement
implement subdirectories
subdirectories to
to begin
begin with,
with, DEC’
DEC's
models of
of
to
s models
subdirectories was
was a bit
bit different.
different. Our
Our changes have
have since been installed
installed
subdirectories
on other
other DEC20’
DEC20's
at Stanford
Stanford and
and elsewhere.
elsewhere.
on
s at
4. We
'We inst,slled
installed the
the capability
capability to
to vary
vary the
the allocation
allocation of
of windfall
windfall cycles
cycles in
in
-1.

E. ,4.
A. Feigenbaum
Feigenbaum
E.

16

Progress - Timesharing
Timesharing Systems
Systems
Progress

5P41 RR00785-11
RR00785-11

accordance
accordance with
with user classes.
classes. This
This will
will allow
allow us more
more flexibility
flexibility in
assigning
jobs enough cycles to
assigning jobs
to run
run comfortably
comfortably while
while limiting
limiting their
their
usage
usage to a strict
strict percentage
percentage of
of the
the machine.
machine.
5. vVe
We installed
installed several
several new features
features in our
our TOPS-20
TOPS-20 EXEC
EXEC to
to facilitate
facilitate the
use
use of
of the system
system by
by the users. Among
Among these features
features was the ability
ability to
to
edit
edit any
any previous
previous command
commai.nd you
you had entered,
entered, and then
then re-execute
re-execute that
that
command.
command. This
This saves extra
extra keystrokes
keystrokes and has proven
proven to
to be very
very useful.
useful.
The
The code to
to do this
this came from
from the University
University of
of Texas
Texas at Austin.
Austin.
6. We
We switched
switched our
our system
system this
this year
year to
to using
using encrypted
encrypted passwords.
passwords. This
This
means that
that passwords
passwords are not
not stored
stored in any
any readable
readable form
form on t.he
the
computer
computer system,
system, and if
if an illegal
illegal user should
should gain access
access to
to the system,
system,
he/she
he/she would
would not
not be able to
to find
find out
out the passwords
passwords of
of any
any other
other users.
We
We feel this
this feature
feature is quite
quite important
important as the frequency
frequency of
of computer
computer
break-ins/attempts
break-ins/attempts increases.
7. Software
Software was added to
to our
our monitor
monitor in order
order to
to record
record the last
last reader
reader of
of a
file. Previously,
date of
of the last
last read was recorded,
recorded, while
while both
both
Previously, only
only the date
the writer
writer and date
date were recorded
recorded for
for creating
creating and writing
writing a file. This
This
gives users the ability
ability to
to determine
determine which
which other
other persons may
may have been
read
ing files
reading
files..

•a Printer
Printer Support
Support
The
The support
support on the 2060 for
for various
various printers
printers in the SUMEX
SUMEX community
community has
been greatly
greatly enhanced this
this past
past year.
year.
1. Support
Support was added for
for the Xerox
Xerox Raven
Raven printer
printer at the
the Welch
Welch Road
Road

facility
facility to allow
allow spooling
spooling and direct
direct output
output to
to the printer.
printer. In
In addition,
addition,
code was added to
to the spooler
spooler to print
print out
out a header
header page identifying
identifying the
user, filename,
job.
filename, and date
date for
for each job.
2. Similar
Similar spooler
spooler support
support was added for
for the
the Xerox
Xerox Dover
Dover printer
printer in
Margaret
hlargsret Jacks Hall.
Hall.
3. SUMEX
SUMEX installed
installed a Printronix
Printronix line printer
printer at
at Welch
Welch Road to allow
allow users
print out
out files remotely
remotely from
from the 2060. The
The Printronix
Printronix is connect.ed
connected to
to print
SUMEX
SUh?EX via
via a twisted
twisted pair
pair serial
serial line.

We transformed
transformed TENEX
TENEX software
software to the normal
normal TOPS-20
TOPS-20 line printer
printer
4. \Ve
spooler
spooler program
program to look
look out
out for
for users who
who had accidentally
accidentally printed
printed an
'unprintable
‘unprintable file',
file’, meaning
meaning a binary
binary file
file of
of some sort
sort which
which does not
not
cont:1in
contain legible
legible characters.
characters. We
We do this
this both
both by
by counting
counting the number
number of
of
binary
binary characters
characters in the first
first page of
of the file, and by
by not
not printing
printsing the file
if t.he
the count
count exceeds
exceeds a certain
certain threshold.
threshold. A
A similiar
similiar scheme is also used
taking
taking int.o
into account
account the vertical
vertical motion
motion of
of the first
first page.
5. Additional
Additional modifications
modifications were made to the LPlO
LPlO line printer
printer driver
driver
software in the TOPS-20
TOPS-20 monitor
monitor to
to improve
improve the reliability
reliability of
of using
using this
this
software
line printer,
printer, which
which came from
from our
our KilO
KIlO system.
system.
6. \Ve
We greatly
greatly enhanced our
our support
support of
of the IMAGEN
IMAGEN Imprint-lO
Imprint-10 laser
printa
/TCP Ethernet
printer this
this past
past year.
year. A
A new IP
IP/TCP
Ethernet interface
interface was installed
installed on
the prin
ter (discussed further
printer
further in Section
Section LA.2.4
I.A.2.4 replacing
replacing the existing
existing
serial
serial interface.
int,erface. This
This new interface
interface allows
allows for
for more
more efficient
efficient printer
prinber

17

Feigenbaum
E. A. Feigenbaum

Timesharing Systems
Progress - Timesharing

5P41 RR00785-11

operation, and greater
greater flexibility
flexibility in choosing output
output modes, number
number of
operation,
pages, and other
other features.
features. We have also implemented
implemented aa
copies, header pages,
TOPS-20 spooler for the Imagen
Imagen as
as well.
well.
TOPS-20
•l User
User System
System Software
Software

1. We have continued
continued to assemble and maintain
maintain a broad
broad range of utilities
utilities
1.
support software
software on the 2060. These include
and user support
include operational
operational aids,
statistical packages, DEC-supplied
DEC-supplied programs,
programs, text
text editors,
editors, text
text search
statistical
programs, file space
space management
management programs,
programs, graphics
graphics support,
support, texttextprograms,
formatting and justification
justification assistance, magnetic
magnetic tape conversion
conversion aids,
formatting
many more. We also are importing
importing software
soft,ware tools and packages
and many
wherever necessary to avoid
avoid reinventing
reinventing the wheel and wasting
wasting our
our own
own
wherever
efforts. Packages have been imported
imported from
from Texas Instruments,
Instruments, Columbia
Columbia
efforts.
University, the University
University of Texas at Austin,
Austin, Yale University,
University, and other
other
University,
Stanford sites.
Stanford
SUMEX has continued
continued to provide
provide to
to its users the latest
latest releases
releases of
2. SUMEX
various Lisp
Lisp dialects
dialects that
that run
run under
under TOPS-20.
TOPS-20. This
This past
past year
year we agreed
various
provide disk
disk space to store
store the 'official'
‘official’ version
version of
of Interlisp-20
Interlispfrom
to provide
from
XEROX due to the fact
fact that
that the machine
machine at Xerox
Xerox used to support
support
XEROX
Interlisp before
before was being removed.
removed. Interlisp-lO
Interlisp
is now officially
Interlispofficially
maintained at SUMEX
SUMEX by our
our staff
staff and XEROX
XEROX personnel.
personnel. In addition,
addition,
maintained
continue to support
support the full
full variet.y
variety of
of LISPUSERS
LISPUSERS packages.
we continue
Portable Standard
Standard Lisp
Lisp (PSL)
(PSL) developed at the University
University of Utah
Utah has
Portable
a.lso been installed
installed on SUMEX.
SUMEX.
also
We continue
continue to use
use MM,
MM, a very
very powerful
flexible mail
mail system,
system, on the
3. We
powerful and flexible
2060.
Electronic bulletin
bulletin boards
boards are also extensively-supported
extensively-supported
at
2050.
Electronic
at
SUMEX. These provide
rather informal
informal mechanism
mechanism for
for community
community
SUMEX.
provide a rather
Other bulletin
bulletin boards,
boards, read and contributed
contributed to
to
discussions and debates. Other
throughout the INTERNET
INTERNET
community, are available
throughout
community,
available for
for perusal
perusal at
SUMEX. These bulletin
bulletin boards
boards cover
cover such topics
topics as AI
AI Discussions,
Discussions, Micro
Micro
SUMEX.
Computers, Terminals,
Terminals, and \Vorkstations.
Workstations.
Computers,
SIJhIEX participates
part,icipates with
with other
other Stanford
Stanford sites in a general
general license for
for
4. SUMEX
access to
to the SCRIBE
SCRIBE text-formatting
text-formatting system
system from
from UNILOGIC,
UNILOGIC, including
including
access
versions
too run
run under
under TENEX,
TENEX, TOPS-20,
TOPS-20, and UNIX.
UNIX. SCRIBE
SCRIBE is the
the
versions t’
preferred
preferred tool
tool for
for text
text preparation
preparation at
at SUMEX.
SUMEX.
5.
S. Versions
Versions of
of various
various user utilities
utilities and system
system utilities
utilities were updated
updated
throughout
throughout the
the year.
year. These programs
programs included
included network
network server
server processes,
processes,
statistical
stat istical packages,
packages. system
system daemon
daemon programs,
programs, and several
several programs
programs for
for
processing
processing electronic
electronic mail.
mail.
6.
5. Various
Various system
system network
network tables
tables and networking
networking softwltre
softwa.re were updated
updated to
to
sccommoda,te
accommodate the ARPANET
ARPANET split
split that
that occurred
occurred this
this year.
yea.r. The
The network
network
change effectively
effectively split
split the
the ARPANET
ARPANET into
into two
two networks,
networks, the
the MILNET,
MILNET,
which
which is a secure private
private part,
part, and the
the rest
rest of
of the
the ARPANET,
ARPANET, which
which
operates
operates as the
the original
original ARPANET
ARPANET did
did before.
before .
• Docermentc?tion
Documentation and
and Education
Education

l

\Ve
\Vt> have expended
expended considerable
considerable effort
effort to
to develop,
develop, maintain,
maintain, a.nd
and facilitate
facilitate
access
access to
to our
our documentation
documentation so to
to accurately
accurately reflect
reflect available
available software.
software. The
The

Feigenbaum
E. A. Feigenbaum

18

Progress
Progress - Timesharing
Timesharing Systems
Systems

5P41 RR00785-11
RR00785-11

HELP
IHELP and Bulletin
Bulletin Board
Board subsystems
subsystems have
hazTebeen important
important in this
this effort.
effort. As
As
subsystems
publish a bulletin
bulletin or
subsystems are updated,
updated, we generally
generally publish
or small
small document
document
describing
more and more
more changes occur,
occur, it
it becomes more
more
describing the changes. As more
difficult
pointers. Within
difficult, for
for users to
to track
track down
down all of
of the
the change pointers.
Within manpower
manpower
limits,
limits, we are in a continual
continual process of
of reviewing
reviewing the existing
existing documentation
documentation
system
programs now
system for
for compatibility
compabibility with
with the programs
now on line and to integrate
integrate
cha.nges
This also will
will be done with
with a view
view toward
toward
changes into
into the main
main documents.
documents. This
developing
better tools
developing better
tools for
for maintaining
maintaining up-ta-date
up-to-date documentation
documentation, .

•l Software
Sharing
Software
Sharing
stated
previously, we firmly
firmly believe in importing
importing rather
rather than
than
stated previously,
reinventing
reinventing software
software where
where possible. As
As noted
noted above, a number
number of
of the
the
brought up are from
packages we have brought
from outside
outside groups.
groups. Many
Many a.venues
avenues
between the system
projects, other
exist
exist for
for sharing
sharing between
system staff,
staff, various
various user projects,
other
facilities,
and vendors.
The
facilities,
vendors.
The advent
advent of
of fast and
a.nd convenient
convenient
communication
communica.tion facilities
facilities coupling
coupling communities
communities of
of computer
computer facilities
facilities ha.s
has
ma.de
cooperation
made possible effective
effective intergroup
intergroup
cooperation and decentralized
decent,ralized
maintenance
maintenance of
of software
software packages.

1. As

2. The
The TENEX,
TENEX, TOPS-20,
TOPS-20, and UNIX
UNIX sites on the ARPANET
ARPANET have been a
good model
model for
for this
this kind
kind of
of exchange based on a functional
functional division
division of
of
labor
by-product
labor and expertise.
expertise. The
The other
other major
major advantage
advantage is that
that as a by-product
of
particular software,
personal
of the constant
cons.tant communication
communicat.ion about
about particular
soft.ware, personal
relat.ionships
between staff
relationships between
staff members
members of
of the various
various sites develop.
develop. These
collegial
collegial interactions
interactions serve to
to pass general
general information
information about
about software
software
tools
tools and to encourage
encourage the
the exchange of
of ideas among
among the sites. Certain
Certain
common
problems are now
common problems
now regularly
regularly discussed on a mUlti-site
multi-sit,e level.
3. vVe
We continue
continue to
to draw
draw significant
significant amounts
amounts of
of system
system software
software from
from other
other
ARP
ANET sites, reciprocating
ARPANET
developments.
reciprocating with
with our
our own
own local developments.
Interactions
backup support,
Interactions have included
included mutual
mutual backup
support, experience
experience with
with
various
various hardware
hardware configurations,
configurations, experience
experience with
with new types
types of
of computers
computers
and operating
operating systems,
systems, designs for
for local networks,
networks, operating
operating system
system
enhancements,
utility
or
language
software,
and
user project
project
enhancements,
software,
utility
or
collaborations.
We
collaborations.
We have been able to
to import
import many
many new pieces of
of
software
soft.wsre and improvements
improvements to
to existing
existing ones in this
this way.
way. Examples
Examples of
of
imported
program MM,
imported software
software include
include the message
message manipulation
ma.nipulat,ion program
Mhf, SAIL,
SAIL,
PASCAL,
A.X Ethernet
PASCAL, SOS, INTERLISP,
INTERLISP, the C compiler,
compiler, V
VAX
Ethernet code, the
PHOTO
program, ARPANET
host
PHOTO program,
ARPANET
host tables,
tables, various
various user utilities,
utilities, and
many
many others.
others.
4. Fin:.dly,
Finally, we also have assisted groups
groups that
that have interacted
interacted with
with SUMEX
SUMEX
user projects
projects in acquiring
acquiring access
access to
to software
software available
available in our
our community.
community.
\Ve
providing tape
preparation and copy
\Ve are repeatedly
repeatedly providing
tape preparation
copy service to
to many
many
SU~[EX-AIM
projects to
SUME.X-AIM projects
to aid in sharing
sharing their
their software
software with
with outside
outside
requestors.
requestors.

19

E. A.
A. Feigenbaum
Feigenbaum

5P41
5P 41 RR00785RR00785- 11

Progress - Timesharing
Timesharing Systems
Systems
Progress

DECsystem ZOL?O/TOPS-20
2020/I'OPS-20 System
System
DECsystem
Monitor Upgrade
Upgrade --- Our
Our 2020 system
system has continued
continued to
to run
run very
very reliably
reliably this
this
1. Monit.or
past year.
year. We
We have
have updated
updated the
the 2020 monitor
monitor with
with bug
bug fixes
fixes and
and performance
performance
past
improvements regularly.
regularly. There
There will
will likely
likely be few further
further monitor
monitor releases for
for
improvements
the 2020 since itit does not
not support
support extended
extended addressing
addressing and
and there
there are no plans
plans
the
to add
add this
this feature.
feature.
to
Demo Cont,rols
Controls --- We
\Ve continue
continue to
to use the
the 2020 system
system for
for demos of
of AI
AI systems
systems
2. Demo
developed at
at SUMEX.
SUMEX. This
This demo
demo system
system takes
takes advantage
advantage of
of the
the “pie-slice”
"pie-slice"
developed
scheduler in the
the TOPS-20
TOPS-20 release 4 monitor.
monitor. We
vVe now
now guarantee
guarantee dedicated
dedicated
scheduler
large fraction
fraction of
of the
the machine
machine but
but also allow
allow others
others to
to do useful
useful work
work
users a large
when the
the demo
demo demand
demand is low.
low. This
This system
system has nicely
nicely met
met the
the needs of
of both
both
when
groups.
groups.

v:4-YjUNIX Systems
We
11/780 system
\Ve continued
continued to
to provide
provide systems
systems support
support for
for the
the VAX/UNIX
VAX/UNIX 11/780
system
(named ‘'AIMVAX')
shared by
by the
the SUMEX-AIM
SUMEX-AIM community
community and Stanford
Computer
(named
AIMVAX’)
shared
Stanford Computer
Department. Various
Various efforts
efforts included
included supporting
supporting the
the UNIX
UNIX monitor,
monitor, installing
installing
Science Department.
network software,
software, and in bringing
bringing up various
various user subsystems.
subsystems,
new network
Further
Further development
development has continued
continued in support
support of
of the
the SUMEX-AIM
SUMEX-AIM File
File Server
Server
(SAFE) based on a VAX
V A..'C 11/750.
vVe successfully
successfully converted
converted SAFE
SAFE to
to Berkeley
Berkeley Unix
Unix 4.2
(SAFE)
llj7.50. We
server.
server, and with
with the help of
of the
the Computer
Computer Science Department,
Department, converted
converted the
the Ethernet
Ethernet
Pup software
software to
to run
run under
under UNIX
UNIX 4.2
Pup

E. A. Feigenbaum
Feigenbaum

20

Progress
Progress - Professional
Professional Workstations
Workstations

SP41
FjP41 RR0078S-11
RR00785-1 I

I.A.2.4.
Professional
Workstations
I.A.2.4.
Professional
Workstations
Our
Our ongoing
ongoing movement
movement to professional
professional workstations
workstations is taking
taking on several forms.
forms. We
We
continue
continue to carry
carry out
out our
our acquisition
acquisition plans
plans for
for acquiring
acquiring Lisp
Lisp machines
machines for
for use
use by
by the
SUMEX
SUhlEX community,
community, as well
well as investigating
investigating the
the use
use of
of remote
remote virtual
virtual graphics
graphics and new
This
lower
This work
work is prototypical
prototypical of
of what
what other
other
lower cost workstations
workstations in our
our environment.
environment.
groups
groups will
will face and we hope will
will serve to
to find
find effective
effective solutions
solutions to
to common
common problems.
problems.

Lisp-based Scientific
Workstations
Lisp-based
Scientific
Workstations
SUMEX
SUMEX carefully
carefully developed
developed and implemented
implemented our
our equipment
equipment acquisition
acquisition plan
plan for
for
year
year eleven by
by buying
buying seven Xerox
Xerox 1108 Lisp
Lisp machines
machines for
for use by
by SUMEX-AIM
SUMEX-AIM projects.
projects.
Two
purchased with
provide floating
Two of
of these machines
ma.chines were purchased
with special upgrade
upgrade packages to provide
floating
point. capability,
capability, expanded
expanded microcode,
microcode, and expanded
expanded memory.
memory. Our
Our experiences with
with these
machims
machines will
will be reported
reported in next
next years
years report.
report.
The
The XEnOX
XEROX Dolphin
Dolphin on loan to
to Rutgers
Rut*gers University
University was returned
returned to
to SUMEX
SUMEX this
this
year.
year. This
This Dolphin
Dolphin had effectively
effectively served the Rutgers-AIM
Rutgers-AIM community
community in setting
setting up their
their
Ethernet
provided initial
Now
Ethernet network
network and provided
initial exposure to
to the Lisp
Lisp machine
machine technology.
technology. Now
that
that that
that experiment
experiment is successfully
successfully completed,
completed, the Dolphins
Dolphins will
will be used for
for AI
AI system
system
development
development at Stanford.
Stanford.
SU~fEX
SUMEX installed
installed two
two SYMBOLICS
SYMBOLICS 3600 Lisp
Lisp machines,
machines, purchased
purchased with
with DARPA
DARPA
funding,
funding, for
for use
use within
within the Heuristic
Heuristic Programming
Programming Project
Project (HPP)
(HPP) at
at their
their new location
location at
Welch
Welch Road.
Road. \Ve
We are currently
currently awaiting
awaiting a new release of
of the Symbolics
Symbolics Operating
Operating System
System
software
before
we
can
provide
Ethernet
access
to
our
file
server
from
software before
provide Ethernet access to our file server from these machines.
machines.
The
The 3600'5
3600’s are used regularly
regularly by
by members
members of
of the HPP.
HPP.

We
We st.ill
st,ill are using 4 preproduction
preproduction models of
of the Dolphin
Dolphin workstations.
workstations. One
preproduct.ion
preproductzion model
model has been exchanged for
for a production
production system,
system, and we are on schedule
with
with XEROX
XEROX to exchange the remaining
remaining 4 machines
machines for
for production
production models at no extra
extra
cost.
cost,. This
This process is hampered
hampered by
by the rate
rate at which
which XEROX
XEROX themselves
themselves can get
prod
llction machines.
product,ion
machines.
\Ve studied
studied the benefits
benefits of
of buying
buying the extended
extended memory
memory and microstore
microstore upgrades
upgrades
to the Xerox
as
Xerox 1108 Dandelion
Dandelion announced
announced at AAAI-83
MI-83
as being
being "under
“under development."
development.” \Ve
We
concluded
concluded that
that some users would
would benefit
benefit greatly
greatly from
from these enhancements
enhancements and others
others not
not
a.t
a.t all. The
The most
most marked
marked improvements
improvements came from
from system
system which
which were extremely
extremely memory
memory
limit.ed.
NEOM)"CIN. SUMEX
limited, such as
as NEOMYCIN.
SUMEX will
will be acquiring
acquiring two
two 1108's
1108’s with
with the upgrades
upgrades for
for
full
full t.ime
time use
use and testing.
testing.
A close relationship
relationship between
between SUMEX
SUMEX and the newly-formed
newly-formed Center
Center for
for the Study
Study of
of
Langnage
Ln.ngusge and Information
Information (CSLI)
(CSLI) at Stanford
Stanford was established.
established. This
This has already
already benefit.('d
benefitsed
SU~1EX
SLJhlEX (and
(and the ONCOCIN
ONCOCIN project
project in particular)
particular) in the loan, by
by CSLI
CSLI to
to SUMEX,
SUMEX, of
of
two
two Xerox
Xerox 1108's
1108’s which
which have been in constant
constant use
use by
by researchers
researchers since January
January 12th.
12th. The
The
SUI,1EX
SUhlEX staff
staff assisted CSLI
CSLI in bringing
bringing their
their DecSystem20
DecSystem20 and network
network environment
environment online.
line. CSLI
CSLI has informally
informally expressed an interest
interest in working
working on the problem
problem of
of distributed
distributed
AI
AI computation
computation with
with SUMEX
SUMEX researchers.
researchers. CSLI
CSLI will
will have 110 1108's
1108’s on the Ethernet
Ethernet
wit.hin
within the ye:1r.
year. This
This resource suggests some exciting
exciting solutions
solutions to former
former compute-bound
compute-bound
problems.
problems. The
The ONCOCIN
ONCOCIN group
group has already
already implemented
implemented a preliminary
preliminary network-based
network-based
Interactor
to run
Int,eractor which
which permits
permits elements
elements of
of ONCOCIN
ONCOCIN
run concurrently
concurrently on different
different
machines.
of this
this writing,
writing, the Reasoner and its
its Debugger
Debugger have been made to
to run
run
ma.chines. As of
transparently
transparently in this
this mode, and to make good use of
of both
both processors.

21
21

E. A.
A. Feigenbaum
Feigenbaum

5P41 RR00785-11
RR00785- 11

Progress - Professional
Professional Workstations
Workstations

Virtual
Virtual Graphics
Graphic3
SU!'vlEX
SUMEX continued
continued the development
development of aa Virtual
Virtual Graphics
Graphics system
system written
written in
Interlisp-IO
and running
Interlisprunning on our 2060.
2060. Any
Any user running
running the V system
system on aa workstation
workstation
can then
then use
use the
t,he package on the 2060 to drive
drive the graphics
graphics display
display on the workstation.
workstation. A
A
current
application
is
to
take
nuclear
magnetic
resonance
data
on
the
2060
and display
current a.pplication is
nuclear magnetic
data
display
t,he
the atoms
atoms and
a.nd their
their bonds on a SUN
SUN workstation
workstation by using splines. This
This development
development is
is in
its infancy,
infancy, but
but, is
is opened ended and has great
great potential
potential with
with the price
price of workstations
workstations
capable
capa.ble of decent graphics
graphics reaching
reaching the two
two to three
three thousand
thousand dollar
dollar range. It
It allows
allows those
users who
who cannot
cannot afford
afford expensive lisp machines
machines to have full
full graphics
graphics power
power available
available to
them
them by doing
doing the actual
actual graphics
graphics applications
applications on a large time
time shared system,
system, and then
then
doing
doing t.he
the graphics
graphics itself
itself remotely
remotely on a less
less expensive workstation.
workstation. This
This development
development can
help users take
ta.ke advantage
advantage of the computing
computing power
power of the DECsystem
DECsystem 20, while
while providing
providing
many
many of the
t,he high
high speed
speed graphics
graphics advantages
advantages of the Lisp Machines.
Machines.

Apl)/e
Development
.4pple Workstation
Workstation
Development
SUMEX-AIM
SUMEX-AIM has initiated
initiated a development
development project
project to pursue
pursue the effectiveness
effectiveness and
possible use
of
low
cost
personal
workstations
within
our
environment.
use
personal workstations within our environment. After
After examing
examing a
number
number of new personal
personal computers
computers and workstations
workstations on the market,
market, such as
as the Hewlett
Hewlett
Packard
Packard 150.
150, IBM
IBM PC, Sun workstations,
workstations, and others,
others, we chose
chose the Apple
Apple Macintosh
Macintosh and
Apple
Apple Lisa on which
which to begin our
our work
work in this
this area. These machines
machines were chosen
technically
t*echnically due to their
their built
built in graphics,
graphics, networking,
networking, mouse, windows,
windows, and menu support.
support.
vVe
We also considered
considered the very
very beneficial
beneficial relationship
relationship formed
formed between
between Apple
Apple and Stanford
Stanford
University
which provides
provides us direct
direct access
access to Macintosh
Macintosh hardware
hardware and software
software
University which
documentation
plan to
documentation which
which is a necessity
necessity for
for the type
type of
of work
work we plan
to do.
Our
Our Macintosh
Macintosh development
development encompasses several areas ;
1. INFO-MAC
INFO-AIAC Discussion
Di.'3cu8sion List
List

An
An electronic
electronic discussion
discussion list
list was originated
originated at SUMEX,
SUMEX, and is currently
currently
maintained
maintained here, to
to foster
foster sharing
sharing and communication
communication among
among research groups
groups
and
pursuing the serious use
a.nd universities
universities that
that are interested
interested is pursuing
use of
of the Macintosh
Macintosh
within
within their
their respective
respective environments.
environments. This
This list
list has been highly
highly successful in
collaborating
collaborating on Macintosh
Macintosh development
development and the sharing
sharing of
of ideas. The
The
discussion
over 50 sites, and well
well over
over 1000 participants.
participants.
discussion list
list currently
currently contains
contains over

C Development
Development Environment
Environment
2. C
A vital
vital link
link in our
our development
development of
of Macintosh
Macintosh software
software is creating
creating a C based
A
development system
system on our
our VAX
VAX computers
computers for
for the
the coding
coding and downloading
downloading of
of
development
software. Utilizing
Utilizing existing
existing MC68000
MC68000 C cross compilers
compilers on our
our VAX,
VA.,X, we are
software.
developing the
the necessary linkages
linkages in order
order to
to make
make the
the appropriate
appropriate system
system
developing
to rout,ines
rOll tines in the
the Macint.osh
Macintosh ROM’
ROM's
for sophisticated
sophisticated graphics
graphics and system
system
calls to
s for
rela,t,ed functions.
functions.
related
3. Macintosh
Afacintosh print
print servers
servers

In order
order to
to effectively
effectively use the
the Macintosh
Macintosh as a stand-alone
stand-alone workstation,
workstation, we will
will
In
provide
the
ability
to
print
out
Macintosh
developed
files
on
our
IMAGEN
provide the ability to print out Macintosh developed files on our IMAGEN
printers.
laser printers.

Applebu8 to
to Ethernet
Ethernet Interface
Inter face
4. Applebus
This development
development involves
involves the
the hardware
hardware and
and software
software necessary to
to be able to
to
This

A. Feigenbaum
Feigenbaum
E. A.

22

Progress - Professional
Professional Workstations
Workstations

5P41
5P41 RR00785-11
RROO785-11

access
access various
various file and print
print servers on our
our Ethernet
Ethernet from
from aa Macintosh.
Macintosh.
Macintosh
blscintosh will
will be
be connected to the Apple
Apple network
network called Applebus.
Applebus.
hardware
hardware provides
provides an interrace
interface between
between Applebus
Applebus and 10MB
1OMB Ethernet.
Ethernet.
software
software necessary for this
this project
project involves
involves formulating
formulating Macintosh
Macintosh file
and block
block level I/O
I/O requests into
into properly
properly formatted
formatted Internet
Internet packets.
packets.

The
Our
Our
The
level

5. Virtual
Virtual Graphics
Graphics on
on a Lisa
Lisa

The
The Virtual
Virtual Graphics
Graphics System, as
as previously
previously reported,
reported, is in great
great need
need of a low
cost workstation
workstation on which
which it
it can run.
run. We
We have started
started a project
project to port
port the
Virtual
Virtual Graphics
Graphics system
syst,em to a Apple
Apple Lisa in hopes of providing
providing to our
our users
high speed
will report
report further
further on this
this project
project
speed graphics
graphics at remote
remote locations.
locations. We will
in next
next year's
year’s report.
report.
Anticipating
Ant,icipating the popularity
popularity of our
our Macintosh
Macintosh developments,
developments, we are fully
fully prepared
prepared
to make our
our efforts
efforts available
available to other
other research sites,
sibes, Universities,
Universities, and non-profit
non-profit
institlltions
institutions on a.a royalty-free
royalty-free basis in hopes of fostering
fostering continued
continued development
development and
communal
communal sharing.
sharing.
In addit.ion
addition to Lisp-based
Lisp-based scientific
scientific workstations,
workstations, we believe the use
use of
of low
low cost
workst.at,ions,
workstnt,ions, which
which offer
offer suitable
suitable local processing
processing power,
power, high
high resolution
resolution screens with
with easy
to
t.o use
use user interfaces,
interfaces, and networking
networking and communications
communications abilities,
abilities, are vital
vital to the
future
future of
of our
our resource. Our
Our Macintosh
Macintosh and Lisa
Lisa development
development efforts
efforts will
will allow
allow us to use
use
and experiment
experiment with
w&h these workstations
workstations in our
our environment.
environment.

Hewlett
Hewlett Packard
Packard Development
Development
SUl\fEX
SUMEX assists the ONCOCIN
ONCOCIN Project
Project is developing
developing a computing
computing environment
environment for
for
developing
part
developing AI
AI applications
applications based on HP
HP 9836 workstations.
workstations. These workstations
workstations were part
of
peripherals for
of aa. gift
gift from
from Hewlett
Hewlett Packard
Packard to the Oncocin
Oncocin Project.
Project. Additional
Additional support
support peripherals
for
t.he
the 9835's
9836’s included
included large capacity
capacity disk
disk drives,
drives, color
color monitors,
monitors, graphic
graphic tablets,
tablets, and a laser
printer. \Vork
proceeding to
print.er.
Work is proceeding
to network
network these machines
machines onto
onto the SUMEX
SUMEX Ethernet
Ethernet as
as soon
as
as suitablke
suit.ablke networking
networking hardware
hardware is available
available from
from HP.
HP. These machines
machines will
will be used for
for
projects within
new and
a.nd existing
existing projects
within Oncocin.
Oncocin.

23

A. Feigenbaum
Feigenbaum
E. A.

RR00785-11
5P41 RR00785-11

Networking and Communications
Net,working
Communications

I.A.2.5.
Networking
and
I.A.2.5.
Networking
and Communications
Communications
A
aspect of
A highly-important
highly-important
of SUMEX-AIM
SUMEX-AIM is effective
effective communication
communication with
with remote
remote
users and between
between the growing
growing number
number of
of machines
machines available
available within
within the SUMEX
SUMEX resource.
In addition
addition t.o
t.o the
the economic
economic arguments
a.rguments for
for terminal
terminal access,
access, networking
networking offers
offers other
other
advantages
including improved
improved inter-user
inter-user communications
communications and
a.dvantages for
for shared
shared computing,
computing, including
more
more effective
effective software
software sharing.
sharing.
Users accessing a remote
remote computer
computer will
will use a hardline
hardline connection
connection to
to the computer
computer
as
as aa. standard
standard of
of comparison.
comparison. Local
Local networks
networks stand
stand up well
well in this
this comparison
comparison but
but remote
remote
network
network facilities
facilities do not.
not. Data
Data loss
loss is not
not a problem
problem in most
most network
network communications;
communications; in
fact,
fact, wit·h
with the
t,he more
more extensive
extensive error
error checking
checking schemes, data
data integrity
integrity is higher
higher than
than for
for a
long distance
distance phone link.
link. On
On the
the other
other hand,
hand, remote
remote networking
networking relies upon
upon shared
shared use
use of
of
communication
communication lines for
for widespread
widespread geographical
geographical coverage at
at substantially
substantially reduced
reduced cost.
However,
However, unless enough
enough total
total line capacity
capacity is provided
provided to meet
meet peak loads, substantial
substantial
queueing
jams result
queueing and traffic
traffic jams
result in the
the loss
loss of
of terminal
terminal responsiveness. We
We continually
continually
monitor
monit,or the load statistics
statistics for
for our
our direct,
direct, dialup,
dialup, and TYMNET
TYMNET lines to
to avert
avert logjam
logjam
situations.
situations.

n-AfNET
TYMNET
TYMNET provides
provides broad
broad geographic
geographic covera.ge
covera,ge for
for terminal
terminal access
access to
to SUMEX
SUMEX from
from
throughout
throughout the country
country and increasingly
increasingly from
from foreign
foreign countries.
countries. With
With the installation
installation of
of
our
our new DEC2050
DEC2060 computer
computer system
system in January
January of
of 1983, we installed
installed new TYMNET
TYMNET
equipment.
After
After the initia.l
initia,l debugging
debugging of
of the new equipment
equipment (called
(called TYMCOM)
TYMCOM) the
equipment.
equipment
However, some months
months after
after the installation
installation it
it was
equipment has been quite
quite reliable.
reliable. However,
discovered
that
the
XON/XOFF
protocol
between
the
Tymcom
and
the
2050
had
discovered t,hat
XON/XOFF
protocol between
Tymcom
2060 had not
not
been properly
The
The number
number of
of user
properly specified
specified in the Tymcom
Tymcom and was corrected.
correct.ed.
complaints
complaints about
about connection
connection problems
problems have been greatly
greatly reduced.
reduced. This
This is thought
thought to
to be
the result
result of
of improved
improved "backbone"
“backbone” lines within
within Tymnet
Tymnet and the installation
installation of
of triple-duty
triple-duty
modems
modems which
which simplify
simplify things
things for
for the users
users..

•-tRPA.NET
\Ve
of
We retain
retain our
our advantageous
advantageous connection
connection to
to the Department
Department
of Defense's
Defense’s
ARPANET.
now
Agency
This
ARPANET.
now ma.naged
ma.naged by
by the Defense Communications
Communications
Agency (DCA).
(DCA).
This
connection
connection has facilitated
facilitated close collaboration
collaboration with
with the Rutgers-AIM
Rutgers-AIM facility
facility and many
many
other
ot.her computer
computer science groups
groups that
that are also on the net. \Ve
We have maintained
maintained good
working
working relationships
relationships with
with other
other sites on the ARPANET
ARPANET for
for system
system backup
backup and software
software
interchange.
day-bo-day working
working interactions
interactions with
with remot.e
remote facilities
facilities would
would not
not be
interchange. Such day-t.o-day
possible without
wit,hout the integrated
integrated file transfer,
transfer, communication,
communication, and terminal-handling
terminal-handling
capabilities
The
capa.hilities nnique
lrnique to the ARPANET.
ARPANET.
The ARPANET
ARPANET is also key
key to
to maintaining
maintaining ongoing
tt' lIectual contact.s
going in
intellectual
c0ntact.s between
between SUMEX
SUMEX projects
projects such as
as the Stanford
Stanford Heuristic
Heuristic
Programming
Progra.m.ming Project
Project authorized
authorized to
to use
use the net
net and other
other active
active AI
AI research groups
groups in the
ARPANET
ARPANET community.
community.
This
This past
past year,
year, SUMEX-AIM
SUMEX-AIM participated
participated in the split
split of
of the ARPANET
ARPANET into
into two
two
networks;
networks: the MILNET,
MILNET, which
which is a highly
highly secure strictly
strictly DOD-related
DOD-related part
part of
of the network,
network,
and the ARPANET,
which
ARPANET,
which is the remainder
remainder of
of the ARPANET
ARPANET sites. This
This latter
latter net
net
functions
NET can only
functions as
as we knew
knew t,he
the ARPANET
ARPANET before.
before. The
The MIL
MILNET
only be accessed
accessed via
via mail
mail
gat.eways.
ga.t.eways. No TELNET
TELI”:ET or
or FTP
FTP to
to MILNET
MILNET sites is allowed.
allowed. In
In addition,
addition, access
access to
to the
ARPANET
AC's
ARP,4NET T
TAC’
s (Terminal
(Terminal Access Controllers)
Controllers) was restricted
restricted this
this past
past year
year to
to only
only
t.hose
t,hose users who
who were granted
granted TAC
TAC access
access cards, which
which meant
meant their
their username
username was
rt'gistered
Network Information
registered with
with the Network
Information Center,
Center, and they
they were given
given a password
password with
with

E. A. Feigenballm
Feigenbslim

24

5P4l
5P41 RR00785-l1
RRO0785-11

Networking
Networking and Communications
Communications

which
which they
they could dial
dial into
into the ARPANET.
ARPANET. SUMEX
SUMEX arranged
arranged for
lor guest cards for
for those
users
users who needed such access.
access.
\Ve
We continue
continue to be
be called upon to interact
interact with
with outside
outside organizations
organizations which
which are (or
(or
wish to be) connected
connected to our
our IMP.
IMP. The
The line to Advanced
Advanced Information
Information and Decision
Decision
Syst.ems
Systems occasionally
occasionally causes
causes trouble
trouble requiring
requiring diagnosis. The
The intended
intended connection
connection to
Perceptronics
Perceptronics Inc. has evidently
evidently been canceled.

EI'HERNEI'
ETHERNET
A sllbstantial
substantial portion
portion of our
our system
system effort
effort this
this past
past year
year went
went into
into continued
continued
development
development of local Ethernet
Ethernet facilities
facilities which
which link
link the SUMEX
SUMEX resource hardware
hardware with
with
other
other parts
parts of the campus,
campus, namely
namely to 701
701 Welch
Welch Road, which
which is
is the new location
location of the
Heuristic
Heuristic Programming
Progra.mming Project,
Project, and to the Computer
Computer Science
Science Department
Department building
building on
campus. \Ve
We have also invested
invested a great
great amount
amount of
of effort
effort this
this year
year to begin our
our transfer
transfer to
a 10 megabit
megabit Ethernet,
Ethernet, while
while continuing
continuing support
support of
of our
our current
current 3 MB
MB ethernet.
ethernet.
Specific
Specific areas of
of Ethernet
Ethernet development
development include:
include:

-- We
We continued
continued support
support of
of the Sequin reliable
reliable packet
packet protocol
protocol and
Leaf
Leaf byte-level
byte-level file transfer
transfer protocol
protocol to enable our
our Xerox
Xerox D machines
machines to access
access
files on our
our DEC20
DEC20 systems. The
The Leaf
Leaf server
server had to be modified
modified on the DEC20
DECPO
t,his
this past
past year
year when
when we switched
switched to using
using encrypted
encrypted passwords. The
The LEAF
LEAF
server
server implementation
implementation for
for the 4.2 BSD release of Unix
Unix was also debugged and
installed
installed at SUMEX.
SUMEX. This
This allows
allows us to
to access
access files stored
stored on our
our VAX
VAX file
servers
servers from
from either
either our
our 10MB
1OMB or
or 3MB
3MB networks.
networks.

1. Leaf
Leaf server
server

The
protocol is built
built into
The Leaf
Leaf protocol
into the lowest
lowest levels of
of the Dolphin
Dolphin I/O
I/O system,
system, and
allows
allows any
any file on a remote
remote file
file server
server to
to be accessed
accessed as
as easily
easily as
as a disk
disk file in
both paged or
both
or random
random access
access mode. The
The latest
latest updates
updates to
to the Sequin
transport
t,ransport level have made marked
marked improvements
improvements in efficiency.
efficiency. The
The 2020 now
now
performs Leaf
performs
Leaf file
file transfers
transfers with
with a speed approaching
approaching that
that of
of XEROX's
XEROX’s
dedicated
dedicated file server.
server.
2. TOPS-20
Ethernet Server
We continued
continued to
to maintain
maintain and improve
improve the
TOPS-20
Ethernet
Seroer -- We

Et.hernet
under the
Ethernet service under
to t.he
the TELNET
TELNET and FTP
FTP
mentioned Leaf
server, and
mentioned
Leaf server,

TOPS-20
updates
TOPS-20 operating
operating system.
system. This
This included
included updates
programs, as well
previously
programs,
well as
as mail
mail software,
software, the previously
network table
table maintanence
maintanence programs.
programs.
network

Ethernet Ciate.way
Gateway ---- Our
Our Ethernet
Ethernet gateway
gateway software
software has continued
continued to
to run
run
3. Eth,ernet
reliably and effectively.
effectively. The
The previous
previous problems
problems with
with lost
lost packets
packets and delayed
delayed
reliably
terminal response has been fixed,
fixed, the
the cause of
of which
which was a bad memory
memory board
board
t.erminsl
software bug
bug in the
the TOPS-20
TOPS-20 operating
operating system.
system. Serious
Serious problems
problems that
that
and a software
affected our
our net
net connectivity
connectivity to
to other
other parts
parts of
of campus
campus were also discovered
discovered
affected
repaired this
this past
past year,
year, thereby
thereby providing
providing us with
with over
over 99%
99% net
net
and repaired
connect.ivity to
to the rest
rest of
of campus.
campus. The
The changes involved
involved board
board repair
repair and
connect.ivit,y
modifications to
to the topology
topology of
of the
the campus
campus Ethernet.
Ethernet.
modifications
The gateway
gateway itself
itself was generalized
generalized
The
networks where
where previously
previously itit had
had
networks
currently have two
two gateways,
gateways, each
currently
networks. two
two of
of which
which are 3MB
3MB and
and
networks,

to handle
handle 3 or
or more
more directly
directly connected
connected
to
only dealt
dealt with
with 2 such networks.
networks. We
We
only
handling the
the traffic
traffic between
between three
three local
local
handling
10MB network.
network.
one a 1OMB

4. Ethernet
Ethernet TIP
TIP (EtherTIP)
(EtherTlp) --- The
The EtherTIP
EtherTIP provides
provides multiple
multiple terminal
terminal access
access to
to
the Ethernet.
Ethernet. A
A PUP
PUP ethernet
ethernet operating
operating syst,em
system was written
written for
for MC68000MC68000t.he
and a MC6800O-based
MC68000-based EtherTIP
EtherTIP was built
built base.d
based on
on this.
this.
based processors, and

25

E. A.
A. Feigenbaum
Feigenbaum
E.

Networking
Networking and Communications
Communications

5P41 RR00785-11
RR00785-11

The
The EtherTIP
EtherTIP software
software has undergone
undergone further
further enhancements
enhancements in the
Portions
of
this
work
was
done
in
conjunction
Portions
this work
conjunction with
with the Stanford
Stanford
Science Department.
Department. Among
Among those enhancements
enhancements are the following:
following:

past year.
year.
Computer
Computer

a. It
It now
now accepts
accepts incoming
incoming connections
connections to
‘to line printer
printer ports,
ports, and for
for remote
remote
system
system diagnosis.
diagnosis.
b. It
It can simulate
simulate the
the "old"
“old” Stanford
Stanford EtherTIP
EtherTIP for
for users who
who have not
not yet
yet
made the transition
transition to the new environment.
environment.
c. The
The user interface
interface is more
more flexible
flexible to suit
suit the needs of an increasingly
increasingly
diverse
diverse user community.
community.
The
The EtherTIP
EtherTIP software
software has developed
developed into
into a very
very stable
stable system,
system, and one
enjoying
enjoying good use
use within
within the SUMEX
SUMEX community.
community.
5. 10 AlB/SEC
- SUMEX
SUMEX made a major
major move this
this past
past
hfB/SEC Ethernet
Ethernet Development
Development --

year
year to begin our
our transfer
transfer to
to a 10 megabit/sec
megabit/set network.
network. While
While the current
current 3
megabit/sec
megabit/set network
network continues
continues to
to serve us well,
well, many
many new workstations
workstations and
printers
printers are coming
coming on the market
market with
with only
only 10 MB/SEC
MB/SEC interfaces,
interfaces, and in
addition,
a.ddition, since 3 MB/SEC
MB/SEC networks
networks were only
only used a very
very few selected
settings.
settings. it
it is becoming
becoming increasingly
increasingly difficult
difficult to find
find replacement
replacement parts
parts when
when
failures
do
occur.
failures
occur.
Therefore,
Therefore, this
this past
past year
year saw several efforts
efforts involved
involved in installing
installing and
supporting
supporting the SUMEX
SUMEX 10MB/SEC
lOMB/SEC Ethernet;
Ethernet ;
a. Reworking
Reworking the
the entire
entire Ethernet
Ethernet system
system software
software to
to handle
handle both
both 3 and
and 10
megabit
megabit link
link level
level standards,
standards, i.e., addressing
addressing and encapsulation
encapsulation are
transparent
transparent to
to the
the user
user levels. We
We similarly
similarly made the
the network
network link
link level
protocols transparent
protocols
transparent to
to the
the the
the user level software.
software. In
In this
this way
way one can
communicated
protocols on a 10MB/SEC
communicated using
using PUP
PUP protocols
lOMB/SEC ethernet
ethernet and the
the
user software
software does not
not have to
to change.
protocols for
b. Adding
Adding address resolution
resolution protocols
for PUP
PUP and
and IP
IP so that
that the
the
3~tB/SEC byte
byte addresses can be translated
3MB/SEC
translated to
to 10 MB/SEC
MB/SEC hardware
hardware
addresses for
for the
the link
link level. This
This enables one to
to communicate
communicate using
using PUP
PUP
or IP
IP between
between 3 and
and 10 megabit
megabit hosts.
or
Integrating XNS
XNS and
and IP
IP into
into the
the PUP
PUP routing
routing mechanism.
mechanism.
c. Integrating
Solving some rather
rather subtle
subtle software/hardware
software/hardware integration
integration problems
problems in
d. Solving
order to
to simulate
simulate “ethernet”
"ethernet" on
on the
the HPP/Welch
HPP/\Velch Road
Road “twisted
"twisted pair”
pair"
order
ethernet.
ethernet.
e. Bringing
Bringing up the
the 3 MB/SEC
MB/SEC EtherTIP
EtherTIP on the
the 10 MB/SEC
MB/SEC network
network was a
e.
proof tha,t
that the
the above worked.
worked. ItIt was done without
without any
any changes to
to the
the
proof
TIP software
software itself
itself by
by simply
simply relinking
relinking itit with
with the
the 10 MB/SEC
MB/SEC system
system
TIP
software. This
This required
required only
only one additional
additional piece of
of logic.
logic. When
When a 10
software.
MB/SEC host
host wants
wants to
to communicate
communicate using
using PUP
PUP which
which is a 3 MB/SEC
MB/SEC
MB/SEC
protocol. then
then itit must
must find
find its
its PUP
PUP address
address from
from some host
host on the
the 10
protocol,
MB/SEC network.
network. The
The gateway
gateway maintains
maintains a translation
translation table,
table, and
and listens
listens
MB/SEC
requests, thus
thus translating
translating the
the 10 MB/SEC
MB/SEC hardware
hardware address into
into
for such requests,
for
"soft PUP
PUP address,”
address," and
and replying
replying to
to the
the requesting
requesting host.
host.
a “soft

HPP-SUAIEX Communication
Communication link
link -- The
The Heuristic
Heuristic Programing
Programing Project
Project
6. HPP-SUrZ1E.Y

A. Feigenbaum
Feigenbaum
E. A.

26

Networking
Communications
Networking and Communications

5F41 RR00785-11
RR00785- 11
5P41

(HPP)
701 Welch
Welch Road, adjacent
adjacent to
to the
(HPP) relocated
relocated from
from its campus location
location to
to 701
Stanford
Stanford campus. Since this
this group
group is a primary
primary user of
of the SUMEX
SUMEX computer
computer
facility
facility and the principal
principal focus for
for core AI
AI research, a communication
communication link
link
location and SUMEX
SUMEX machine room
room was imperative.
imperative. Several
between the new location
communication
communication schemes
schemes for
for establishing
establishing a reliable
reliable and relatively
relatively fast link
link were
m icrowave, laser, infrared,
infrared, direct
direct ethernet
ethernet (by
(by trenching
trenching
considered, namely;
namely ; microwave,
and placing
placing a direct
direct ethernet
ethernet cable), ATT's
ATT’s Tl
Tl service and others.
others.
All
All of the above schemes
schemes would
would have necessitated
necessitat,ed large budgetary
budgetary outlays
outlays and
some
some would
would have imposed lengthy
lengthy time
time delays (getting
(getting permits
permits and the like)
like)
The idea of using bare copper telephone pair
pair
due to jurisdictional
jurisdictional boundaries. The
already in place looked very
very attractive
attractive especially if
if reasonable
reasonable speed
speed and
already
reliability
reliability could be achieved. The
The wire
wire distance between the above mentioned
mentioned
locations
locations is approximately
approximately 2000
2000 ft.
ft. A
A design goal was established to
to try
try to
to
communication link
link with
with Ethernet
Ethernet type
type speed
speed ( 3MB/SEC)
3MB/SEC ) between
develop a communication
these
these two
t.wo locations.
locations.
Utilizing
Ut.ilizing high
high driving
driving capacity
capacity drivers
drivers (differential)
(differential) and ultra
ultra high speed,
speed, high
high
sensitivity
sensitivity receivers a transceiver
transceiver was designed
designed and tested for
for maximum
maximum
transmission
transmission speed
speed with
with maximum
maximum reliability.
reliability. The
The final
final configuration
configuration resulted
resulted
in aa, half
half duplex
duplex transmission
transmission over
over a bare copper twisted
twisted pair
pair in each direction
direction
utilizing Manchester
Manchester coding at
at a reliable
reliable transmission
transmission speed
speed of
of 1.25MBs/sec.
1.25MBs/sec.
utilizing
each direction
direction for
for an aggregate
a.ggregate speed
speed of
of 2.5MBs/sec.
2.5MBs/sec. This
This communication
communication link
link
each
has been
been in operation
operation for
for about
about six months
months now without
without any appreciable
appreciable down
time
time or
or noticeable error
error rate
rate or
or data
data delays. Many
Many HPP
HPP researchers are
utilizing
utilizing t,his
t,his link
link to communicate
communicate with
with SUMEX
SUMEX and the University
University Ethernet
Ethernet
network.
and printers
network. In addition,
addition, various
various Lisp
Lisp machines
machines
printers located in the HPP
HPP
facility
facilit*y a.nd
and connected to
to a local network
network there are also able to communicate
communicate
with the University
University network.
network.
with

Ii'VTERNET SOF'1.1VARE
One major
major issue
issue we face
face at SUMEX-AIM
SUMEX-AIM in support
support of our
our network
network environment
environment. is
the lack of standardization
standardization in network
network protocols
protocols among various
various vendors. Currently,
Currently, many
many
vendors
/TCP) protocols.
vendors are adding
adding support
support to their
their products
products for the Internet
Internet (IP
(IP/TCP)
protocols.
SUMEX
/TCP protocols
SUMEX continues
cont.inues to support
support the
the IP
IP/TCP
protocols on the DEC2060, and we are
current,ly alpha-testing
alpha.-testing a release
release of
of Interlisp-D
Inter-lisp-D which
which also supports
supports IP
IP/TCP
protocols. In
currently
/TCP protocols.
addition,
/TCP software
nddit,ion, we sucessfully adapted the IP
IPjTCP
software to our
our VAX
VAX systems running
running UNIX
UNIX
4.2BSD.
L~BSD.
This
This Vax
Vax TCP
TCP adaptation
adaptation involved
involved provisions
provisions for subnet
subnet routing,
routing, 3 MB/SEC
MB/SEC
byte swap problems, encapsulation
encapsulation problems
problems and 10
10 MB/SEC
MB/SEC debugging with
with our
our
byte
gateways.
gntewa.y.5.

27
27

E. A. Feigenbaum

Progress - Progress in Core Research

5P41 RR00785-11

I.A.2.6.
Progress
in
I.A.2.6.
Progress
in Core
Core Research
Research
Over
Over the past year
year we have continued
continued to support
support several core research activities
activities
aimed at developing
developing information
information resources, basic AI
AI research, and tools
tools of general interest
interest
to the SUMEX-AIM
SUMEX
SUMEX is providing
providing only
only partial
partial support
support for
for these
SUMEX-AIM community.
community.
projects,
project,s, with
with complementary
complementary funding
funding coming
coming from
from ARPA,
ARPA, ONR,
ONR, NLM
NLM and NSF
contracts
contracts and grants
grants to the Stanford
Stanford Heuristic
Heuristic Programming
Programming Project.
Project.

Core Research
Research
Core Research at SUMEX-AIM
SUMEX-AIM focuses
focuses on understanding
understanding the roles of knowledge
knowledge in
symbolic
symbolic problem
problem solving
solving systems, its representation
represent.ation in software
software and hardware,
hardware, its use
use for
for
inference.
inference, and its acquisition.
acquisition. We are continuing
continuing to develop new tools
tools for system
system builders
builders
and to improve
improve old ones.
ones. The
The research crosses
crosses aa number
number of application
application domains,
domains, as
as
reflected
reflected in the subprojects
subprojects discussed earler,
earler, but
but the main
main issues
issues that
that we are addressing
addressing in
this
this rese3rch
research are
a.re those fundamental
funda.mental to all aspects of AI.
AI. We
We believe this
this core research is
broadening
broadening and deepening
deepening the groundwork
groundwork for
for the design and construction
construction of
of even more
more
capable and effective
effective biomedical
biomedical systems.
systems.
As mentioned
mentioned above, although
although our
our style
style of research is largely
largely empirical,
empirical, the
questions
questions we are addressing
addressing are fundamental.
fundamental. The
The three
three major
ma.jor research issues
issues in AI
AI have,
since its beginning,
beginning, been knowledge
knowledge representation,
representation, control
control of
of inference
inference (search), and
learning.
learning. \Vithin
Within these topics,
topics, we wiII
will be asking
asking the following
following kinds
kinds of
of questions. As our
our
work
work progresses, we hope to leave behind
behind several
several prototype
prototype systems
systems that
that can be developed
by others
others in the
t.he medical
medical community.
community.
1.
1. Knowledge
Knowledge

Representation
Representation -- How
How can we represent
represent causal models and
structural
What
benefits of
st.ructura,l information?
information?
What are the relative
relative benefits
of logic-based,
logic-based, rulerulehased,
based, and frame-hased
frame-based systems?
systems ? How
How can we represent
represent temporal
temporal relations
relations and
events
events so that
t,hat rea.'Soning
reasoning over
over time
time is efficient?
efficient?

2. Knowledge
Knowledge Acquisition
Acquisition -- How
How can an expert
expert system
system acquire
acquire new knowledge
knowledge
wit,hout consuming
without
consuming substantial
substantial time
time from
from experts? Can
Can we improve
improve the
knowledge
paradigm enough
knowledge engineering
engineering paradigm
enough to
to make
make a difference?
difference? Can
Can automatic
automatic
learning
programs
be
designed
that
will
work
across
many
disciplines?
lea,rning programs
that will work
many disciplines? Will
Will
cooperative
cooperat.ive man-machine
man-machine systems
systems be able to open the communication
communication channel
channel
het,ween
expert and expert
expert system?
bet,wern expert
Knowledge Utilization
Utilization -- By
By what
what inference
inference methods
methods can a variety
variety of
of sources of
of
3. Knowledge
knowledge of
of diverse
diverse types
types be made to
to contribute
contribute jointly
jointly and efficiently
efficiently
knowledge
towa.rd solut,ions?
sollltions? What
\Vhat is the
the nature
nature of
of strategy
strategy and control
control information?
information?
toward

Plans for
for the Coming
Coming Year
Plunv
Several syst,ems
systems have been developed
developed in recent
recent years
years to
to serve as vehicles
vehicles for
for
Severa,
knowledge engineering
engineering and research
research on knowledge
knowledge representation
representation and its
its use. Knowledge
Knowledge
knowledge
acquisition (including
(including machine
machine learning)
learning) and
and advanced
advanced architectures
architectures for
for AI
AI will
will be the
the two
two
acquisition
areas of
of most
most new activity
activity in the coming
coming year.
year. Research on these topics
topics obviously
obviously must
must
draw on on-going
on-going work
work in representation
representation and
and control.
control.
draw
In part,iculsr,
particular, we will
will focus on
In
• Induct,ive
Inductive learning
learning of
of MYCIN-like
MYCIN-Iike rules
rules from
from case
case data
data in the
the domain
domain of
of
diagnosing disorders
disorders where
where the
the chief
chief complaint
complaint is jaundice;
jaundice;
diagnosing

l

• Learning
Learning from
from experience
experience in domains
domains where
where the
the means for
for interpreting
interpreting new
new

l

A. Feigenbaum
Feigenbaum
E. A.

28

Progress
Progress - Progress
Progress in Core
Core Research

SP'l1
5P4 1 RR0078S-11
RROO785-11

data
data are largely
largely contained
contained in the
the emerging
emerging (and
(and thus
thus incomplete
incomplete and not
not
wholly
wholly correct,)
correct) theory;
theory;
•l Learning
by watching
Learning by
watching
NEOMYCIN;
NEOMYCIN;

a

medical
medical

expert
expert

diagnose

cases
presented
cases presented

by
by

•l Investigating
Invest,igating complex
complex signal
signal understanding
understanding systems
systems for
for ways
ways to exploit
exploit and
represent
represent concurrency
concurrency with
with a view
view toward
toward hardware
hardware and software
software architectures
architectures
that
that may
may be capable of
of several
several orders
orders of
of magnitude
magnitude improvement
improvement in
performance.
performance.
Further
and the Heuristic
Further information
information on the core research at
at SUMEX-AIM
SUMEX-AIM
Heurist,ic
Programming
Project
can
be
found
in
the
Projects
section
starting
on
page
89.
Programming Project
found
Projects section starting

29

E. A.
A. Feigenbaum
Feigenbaum

5Phl RR00785-11
RR00785- 11
5P~1

Progress
Progress - Resource Operations
Operations Statistics
Statistics

I.A.2.
Operations
Statistics
I.A.2.7. 7. Resource
Resource
Operations
Statistics
The
The following
following data
data give an overview
overview of
of various
various aspects of
of SUMEX-AIM
SUMEX-AIM resource
usage.
usage. There
There are 5 subsections
subsections containing
containing data
data respectively
respectively for:
for:
1.
1. Overall
Overall resource loading
loading data
data (page 31).

2. Relative
Rela,tive system
system loading
loading by
by community
community (page 33).
3. Individual
Individual project
project and community
community usage (page 36).
4. Network
Network usa.ge
usage data
data (page 44).
5. System
System reliability
reliability data
data (page 44).
For
For the most
most part,
part, the
the data
data used for
for these plots
plots covers the
the entire
entire span of
of the
SUMEX-AIM
SUMEX-AIM project.
project. This
This includes
includes data
data from
from both
both the TENEX
TENEX KIlO
KIlO system
system and the
curren
current t DECsystem
DECsystem 2060. At
At the point
point where
where the SUMEX-AIM
SUMEX-AIM community
community switched
switched over
over
to the 2060 (February,
(February, 1983), you
you will
will notice
notice severe changes in most
most of
of the graphs.
graphs. This
This is
due to many
rnany reasons
rea.sons which
which I will
will mentioned
mentioned briefly
briefly here;
here ;
1.
1. Even
Even t.ho1]gh
though the Tenex
Tenex operating
operating system
system used on the KilO
KIlO was a forerunner
forerunner of
of

t.he
the current
current Tops20
Tops20 operating
operating system,
system, the Tops20
Tops20 system
system is still
still different
different from
from
Tenex is many
many ways.
ways. Tops20
Tops20 uses
uses a radically
radically different
different job
scheduling
Tellex
job scheduling
mechanism,
mechanism, different
different methods
methods for
for computing
computing monitor
monitor statistics,
statistics, different
different I/O
I/O
routines,
routines, etc. In general.
general, it
it can not
not be assumed that
that statistics
statistics measured
measured on the
Tenex
Trnex syst·em
syst,em correlate
correlate one to one with
with similar
similar statistics
statistics under
under Tops20.
Tops20.
2.
3. The
The KLlO
KLlO processor
processor on the 2060 is a faster
faster processor
processor than
than the KilO
KIlO processor
processor
used previously.
job running
previously. Hence, a job
running on the KLlO
KLlO will
will use
use less
less CPU
CPU time
time than
than
the sa.me
job running
sa,me job
running on the KilO.
KIlO. This
This aspect is further
further complicated
complicated by
by the
fact
fact that
that the SUMEX
SUMEX KilO
KIlO system
system was a dual
dual processor
processor system.
system.
3. The
The SUMEX-AIM
SUMEX-AIM Community
Community was changing
changing during
during the time
time of
of the transfer
transfer to
the
just been
t,he 2060.
SOGO.The
The usage of
of the GENET
GENET community
community on SUMEX
SUMEX had just
pha:sed
phased out.
out. This
This part
part of
of the community
community accounted
accounted for
for much
much of
of the
the CPU
CPU time
time
used by
by the AIM
AIM community.
community. Since the
the purchase
purchase of
of the 2060 was partially
partially
funded
funded by
by the Heuristic
Heuristic Programming
Programming Project
Project (HPP),
(HPP), an additional
additional number
number of
of
HPP
HPP Core
Core Research Projects
Projects started
started using the 2060, increasing
increasing the Stanford
Stanford
communities usage
usage of
of the machine.
machine. And
And finally,
finally, the move to
to the 2050
2060 occurred
occurred
communities
during
during a pivotal
pivotal time
time in the community
community when
when more
more and more
more projects
projects were
either
eit,her moving
moving to their
their own
own local timesharing
timesharing machines,
machines, or
or onto
onto specialized
specialized Lisp
Lisp
workstations.
workstatJions. It
It also was the time
time for
for the closure of
of many
many long time
time SUMEXSUMEXAIM projects,
projects, like
like Dendral
Dendrsl and Puff/VM.
Puff/VM.
AIM
Any
,4ny conclusions
conclusions reached by comparing
comparing the
the data
data before
before and after
after February,
February, 1983
should be done with
with caution.
caution. The
The data
data is included
included in this
this years
years annual
annual report
report mostly
mostly for
for
should
casual comparison.
comparison. Starting
Starting next
next year,
year, only
only data
data from
from the 2060 will
will be recorded
recorded on the
annual
annual report.
report. Readers will
will be referred
referred to
to previous
previous annual
annual reports
reports (such as
as this
this one) for
for
data
da.ta from
from the KIlO
KIlO Tenex
Tenex system.
system.

Feigenbaum
E. A. Feigenba.um

30

Progress
Progress - Resource Operations
Operations Statistics
Statistics

5P41
5P4 1 RR00185-11
RR00785fI 1
,

Ot'erail
Overall Res~urce
Reshree Loading
Loading Data
Data
The
over the life
life
The following
following plots
plots display
display several different
different aspect.s
aspects of system
system loading
loading over
of t.he
the current
current
t,he project.
project. This
This data
data includes
includes usage of the
the Tenex
Tenex KilO
KIlO system
system and the
DECsystem
DECsystem ~050.
2060.
These plots
of jobs
jobs
plots include
include total
total CPU
CPU time
time delivered
delivered per
per month,
month, the peak number
number of
logged in, and the peak load average. The
variable is
The monthly
monthly "peak"
“peak” value
value of
of a given
given variable
the average of
Thus, these
of the daily
daily peak values
values for
for that
that variable
variable during
during the month.
month. Thus,
"peak"
not reflect
reflect
“peak” values are representative
representative of
of average monthly
monthly loading
loading maxima
maxima a.nd
a.nd do not
the
days, which
which are much
much higher.
higher.
t,he largest
largest excursions
excursions seen
seen on individual
individual days,

800
800r
*‘Or

Total
Total CPU
CPU Usage
Usage
Hours/Month
Hours/Month

600
600 -

400
400 -

200
200 -

1983
$74
1975
1976
1977
1978
1979
1980
1981
1982
1984
1985
4
o~--~----~----~----~--~~----~----~--~~--~----~----~
1975 1976 1977 1978 1979 1980 1981 1982 1983 1984 1985

1974

Figure
7:
Figure
7:

Total
Total CPU
CPU Time
Time Consumed
Consumed by
by Month
Month

31
31

E. A. Feigenbaum
Feigenbaum

5P41
5P41 RR00785-11
RROOTSS-11

Progress -- Resource
Resource Operations
Operations Statistics
Statistics
Progress

60
50
”

Peak Daily
Daily Jobs
Jobs
Peak.

4040
3030

‘.

2020
1010
O~--~----~----~--~----~----~--~-----L----~--~~--~I

I%74
1974

1975
1975

1976
1976

1977
1977

1978
1978

Figure
8:
Figure
8:

*r
8

7
7

1979
1979

1980
1980

1981
1981

1982
1982

1983
1983

1984
1984

1985
1985

Peak Number
of Jobs by
by Month
Month
Peak
Number of

Peak
Peak. Daily
Daily
Load
Load Average
Average

66
55
44
33
22
11
tI
OL-____
L -_ _ _ _L -_ _ _ _L -_ _

1974
p974

1975
1975

1976
1976

1977
1977

~~

Figure
Figure 9:9:

E.
E. A.
A. Fcigenb3.um
Feigenbaum

__

1978
1978

~~

__

1979
1979

~~

__

1980
1980

~~

____

1981
1981

Peak
Peak Load
Load Average
Average by
by Month
Month

32
32

~

__

1982
1982

~~

__

1983
1983

~~

__

1984
1984

I
1985
1985

~

Progress - Resource Operations
Operations Statistics
Statistics
Progress

RR00785-11
5P41 RR00785-11
Relative System
Loading by Community
Relative
System Loading
Community

The SUMEX
SUMEX resource is divided,
divided, for
for administrative
administrative purposes, into
into three
three major
major
The
projects based at the Stanford
Stanford Medical
Medical School (Stanford
(Stanford Projects),
communities:
Projects),
communit#ies: user projects
project’s based outside
outside of
of Stanford
Stanford (National
(Nutiomzl Alhf
common system
system
user projects
AlA! Projects),
Projects), and common
development
As defined
development efforts
efforts (System
(System Staff).
stajfi.
defined in the resource management
management plan
plan
approved
approved by the BRP
BRP at
at the
the start
start of
of the project,
project, the available
available system
system CPU
CPU capacity
capacity and
divided between
between these communities
communities as follows:
follows:
file space resources are divided
Stanford
Stanford
AIM
AIhI
Staff
Staff

40S;
40%
40S;
40%
20S;
20%

The
The "available"
“available” resources to be divided
divided up in this
this way
way are those remaining
remaining after
after
various
various monitor
monitor and community-wide
community-wide functions
functions are accounted
accounted for.
for. These include
include such
things as
as job
scheduling, overhead,
overhead, network
network service, file space for
for subsystems,
subsystems,
things
job scheduling,
documentation,
documenlation. etc.

The monthly
monthly usage
usage of
of CPU
CPU resources and terminal
terminal connect
connect time
time for
for each of
of these
The
three communities
communities relative
relative to
to their
their respective
respective aliquots
aliquots is shown
shown in the plots
plots in Figure
Figure
three
10 and Figure
Figure 11. As mentioned
mentioned on page 30, these plots
plots include
include both
both KilO
KIlO and 2060 usage
data.
data.

33

E. A. Feigenbaum
Feigenbaum

5P41
5P41 RRO0785-11
RR00785-11

Progress -- Resource
Resource Operations
Operations Statistics
Statistics
Progress

40
40-

National Projects
Projects
National

%CPUTime
%
CPU Time

30
30.

20
2010
10I%74
I

.A,
1975

1976

1977

1978

1979

P974

1975

1976

1977

1978

1979

40
JOT

% CPU
CPU Used
%
Used

1980
1980

1981

1982

1983

1984

1985
4

1981

1982

1983

1984

1985

Stanford Projects
Projects
Stanford

30
30-

20
2010
IOI

I
o~----~----~----~----~----~----~----~----~----~----~----~

1974
I%74

40
40

1975
1975

1977
1977

1976
1976

1978
1978

1979
1979

%
36 CPU
CPU Used
Used

1980
1980

1981
1981

1982
1982

1983
1983

19S4
1984

1985
1985

System
System Staff
Staff

30

20
20
10·
70 -

I

1

_I
b--L+

o.~----~--~~

p974 1975
1975
1974

__ ____ ____ ____ ____ _____

1976
197G

~

1977
1977

Figure
10:
Figure
10:

EE.. .-\.
‘-1. Feigenbaum
Feigcnb:Lum

~

~

~

~

1978
1978

1979
1979

1980
19SO

1981
1981

~

____

1932
1932

tvIonthly
Monthly CPU
CPU Usage
Usage by Community
Community

34
34

~

____

1£83
1553

.~

__

1984
19c&

~

1935

Progress
Progress -- Resource
Resource Operations
Operations Statistics
Statistics

5P41
5P4 1 RR00785-11
RR00785- 11

6000
5000

National
National Projects
Projects

Connect
Connect Time
Time
Hours/Month

4000
4000 3000
3000 2000
2000 1000
7000 -

---f
1975
,‘
974
I
1976
1977
1978
1979
1980
1981
1983
1985
4
1982
O~--~----~----~----~----~----~----~--~~--~----~----~
1975 1976 1977 1978 1979 1980 1981 1982 1983 .1984
1984 1985

1974

8000
Fizz
7000
6000
6000
5000
4000
3000
2000
1000

Connect
Connect Time
Time
Hours/Month
Hours/Month

Stanford
Stanford Projects
Projects

1975
lo974
I
1976
1977
1978
1979
1980
I
O~----~----~----~----~----~----~----~----~----~----~~--~
1975 1976 1977 1978 1979 1980 1981
1982
1983
1984
1985
1985
7983
1984
1991
7982

1974

6000 6000

Connect Time
Time
Connect
Hours/Month
5000
5000 - Hours/Month

System Staff
Staff
System

4000 4000
3000 3000
2000 2000
1000 1000
O~----~----~--~~--~~--~----~----~----~----~----~----~I

1974
IS74

1975
1975

1976
1976

1977
1977

Figure 11:
11:
Figure

1978
1978

1979
1979

1980
7980

1981
1981

1982
1982

1983
1983

1984
1984

Monthly Terminal
Terminal Connect
Connect Time
Time by
by Community
Community
Monthly

35
35

E. A.
A. Feigenbaum
Feigenbaum
E.

1985
1985

Progress
Progress -- Resource
Resource Operations
Operations Statistics
Statistics

5P41
5P41 RR00785-11
RR00785-11

Individual
Individual Project
Project and
and Community
Community U8age
Usage
The
The following
following histogram
histogram and
and table
table show
show cumulative
cumulative resource
resource usage
usage by
by collaborative
collaborative
project
and
community
during
the
past
grant
year.
The
histogram
displays
the project
project
project and community during the past grant year. The histogram displays the
distribution
of
the
total
CPU
time
consumed
between
May
1,
1983
and
April
3D,
distribution of the total CPU time consumed between May 1, 1983
April 30, 1984,
1984, on
the
t,he SUMEX-AIM
SUMEX-AIhl DECsystem2050
DECsyst.em2060 system.
system.
In
entries include
include aa text
text summary
summary of the funding
funding sources
In the
the table
table following,
following, entries
(outside
(outside of
of SUMEX-supplied
SUMEX-supplied computing
computing resources)
resources) for
for currently
currently active
active projects,
projects, total
total CPU
CPU
consumption
consumption by
by project
project (Hours),
(Hours), total
total terminal
terminal connect
connect time
time by project
project (Hours),
(Hours), and
average file space
space in use
use by project
project (Pages, 11 page
page = 512
512 computer
comput,er words).
words). These data
data
were accumulated
accumulated for each
each project
project for the months
months between May,
May, 1983 and May,
May, 1984.
1984.
Several of the projects
projects admitted
admitted to the National
National AIM
AIM community
community use
use the RutgersRutgersAIM
AIM resource as
as their
their home base.
base. We do not
not explicitly
explicitly list
list these projects
projects in this
this annual
annual
report
report covering
covering the St.anford
Stanford SUMEX-AIM
SUMEX-AIM resource. We do record
record information
information about
about the
Rutgers
Rutgers resource itself,
itself, however,
however, and note its separate resource status
status with
with the flag
"[Rutgers-AIM]"
.
” [Rut’gers-AIM] ‘I.

E. A.
A. Feigenbsurn
Feigenbaum
E.

35
36

Progress
Progress -- Resource
Resource Operations
OperatGons Statistics
Statistics

5P41
5P41 RR00785-11
RROOi85- 11

National
National AIM
AIM (24.4%
(24.4% Total)
Total)
AIM
AIM Administration
Administration
AIM
AIM Pilots
Pilots
AIM
AIM Users
Users
ACT
ACT

I
I

Caduceus
Caduceus
SEeS
SECS

1
I

Models
Models of
of Human
Human Cog
Cog
Solver
Solver
Puff-VM
Puff-VM
Rutgers
Rutgers

1
P

Stanford
Stanford (59.6%
(59.6% Total)
Total)
AGE
AGE
AI
Handbook
Al Handbook
DENDRAL
DENDRAL
EXPEX
EXPEX
Guidon
Guidon
Hpp
Research
Hpp Research
HPP
Assoc
HPP Assoc
Med
Med Info
Info Sci
Sci
MOLGEN
MOLGEN
Oncocin
Oncocin
Protein
Structure
Protein Structure
RX
Stanford Pilots
Pilots
Stanford
Stanford Assoc
Assoc
Stanford

~

.....

~-

P

Staff (16.0%
(16.0% Total)
Total)
Staff

Staff
Staff
Staff Assoc
Assoc
Staff

0o

Figure 12:
12:
Figure

p

P
I

I

55

10
10

15
20
25
25
15
20
Percent of
of Total
Total CPU
CPU Used
Used
Percent

Cumulative CPU
CPU Usage Histogram
Histogram by
by Project
Project and
and Community
Community
Cumulative

37
37

E. A.
A. Feigenbaum
Feigenbaum
E.

5P41
5P41 RR00785-11
RR00785-11

Progress -- Resource
Resource Operations
Operations Statistics
Statistics
Progress

Resource Use
Use by
by Individual
Individual Project
Project -- 5/83
5/83 through
through 4/84
4/84
Resource

National
A1A[ Community
Community
i\‘
ational .4IM

Connect
Connect
(Hours)
(Hours)

0.37
0.37

33.88
33.88

2866
2866

58.15
58.15

895.52

6852
6852

1.38
1.38

209.34

750
750

ACT Project
Project
1) ACT
"Acquisition of
of
“Acquisition
Cognitive Procedures”
Procedures"
Cognitive
John R. Anderson,
Anderson, Ph.D.
Ph.D.
John
Carnegie-Mellon Univ.
Univ.
Carnegie-Mellon
NSF IST-80IST-80-15357
NSF
15357
2/81-2/84
2/81-Z/84

File
File Space (**)
(**)
(Pages)
(Pages)

CPU
CPU
(Hours)
(Hours)

$186,000
$186,000

CADUCEUS
2) CADUCEUS
"Clinical Decision
Decision Systems
Systems
“Clinical
Resea.rch
Resource"
Resesrch Resource”
Jack D. Myers,
Myers, M.D.
M.D.
Jack
Harry E. Pople,
Pople, Jr.,
Jr., Ph.D.
Ph.D.
Harry
University of
of Pittsburgh
Pit,tsburgh
Universit,y
NIH RR-01
r·m-Ol101-07
NIH
101-07
$1,507,717
7/80-5/8S $1.607,717
7/80-6/&5
$369,484
7/83-6/84 $369,484
7,i83-f/84
NLM Lh1037
Lr-.W371O-04
NLM
IO-04
$817,884
7/80-5/8S
7/80-S/85
$817,884
$196,710
7/83-5/84
7/83-6/84
$196,710
NLt\l New
New Invest
NLhI
Invest L~103889-02
Lh¶03889-02
Gordon
Gordon E. Banks.
Banks. M.D.
M.D.
$107,675
4/82-3/8.5
4/‘82-3jS.S
$107,675
$35,975
4/83-3/84
4/83-3,/84
$35,975
$35,975
4/84-3/85
4/84-3185
$35.975
3) CLIPR
CLIPR Project
Project
"Hierarchical
“Hierarchical Models
Models
of Human
Hllman Cognition"
Cognition”
\Valter
\Valter Kint.sch.
Iiint.sch. Ph.D.
Ph.D.
Peter
Pet.er G. Polson,
Poison, Ph.D.
Ph.D.
University
Llniversity of Colorado
Colorado
NI!\'1H
NIXtH MH-15872-14-15
PIIH-1.5872-14-16 (Kintsch)
(Kintsch)
7/81-5/8'1
7/,81-6,/&l $281,085
77;83-6/8-1
/ 8~~-o/8·t $59,878
$60,878
NSF
NSF (Kintsch)
(E;int.sch)
$200,000
8/8a-7/85
8,‘83-7/‘86
$200,000
mr-.l
(Polson)
IBhi (Poison)
Da.vid
David Kieras
Kieras
lTniversity
Universit.y of
of Arizona
Arizona
$364,000
1/82-12/84
l/82-12/8.4
$364.000
$145,000
1/84-12/84
l/84-12/84
$145,000

E.
E. A.
A. Feigenbaum
Feigenbaum

38
38

Progress - Resource Operations
Operations Statistics
Stat.istics
Progress

SP41
5P41 RR0078S-11
RROO785-11

0.65

61.20

303

264.61

9877.34

10500

S.76
356.23
5.76
“Problem Solving
Solving
"Problem
Expertise”
Expertise"
Paul E. Johnson, Ph.D.
Ph.D.
P::J.1l1
v.,'illiam
William B. Thompson,
Thompson, Ph.D.
Ph.D.
Control
Control Data
Dat.a Corp.
Corp. (Johnson)
(Johnson)
$90,000
1983-85
$90,000
Microelect.
Microelect. and
a.nd Info.
Info. Ctr.
Ctr. Univ.
Univ. of
of rvlN
MN (Plus
(Plus Two
Two Colleagues)
Colleagues)
1984-1987 $800,000
$800,000
H18'i-1987

492

4) PUFF-VM
PUFF-VM Project
Project
“Biomedical Knowledge
Knowledge
"Biomedical
Engineering
Engineering in
Clinical
Clinica. Medicine"
Medicine”
John J. Osborn,
Osborn, M.D.
M.D.
John
Med. Research Inst.,
Inst.,
Med.
Francisco
San Francisco
Edward
Edward H. ShortIiffe,
Shortliffe, M.D.,Ph.D.
M.D.,Ph.D.
Stanford University
University
Stanford
Johnson &
k Johnson
Johnson
Johnson
1 year
$SO,OOO
$50,000 (*)
(*)
year
5) SECS Project
Project
"SiI1llllation
“Simulation &
Sr Evaluation
Evaluation
of Chemical
Chemical Synt.hesis"
Synthesis”
of
W.
W. Todd
Todd Wipke,
Wipke, Ph.D.
Ph.D.
U.
1J. California,
California., Santa
Santa Cruz
Cruz
NIHEHS
ES02845-02
NIHEHS ESO~845-02
4/82-3/85
4/82-3/8,5
$257.801
4j84-3i85
$89,140
4/84-3/85
$89,140
Evsns &
& SlItherland
Slltherland Corp.
Corp.
Evans
Equipment
Equipment gift
gift
Value
$9S,000
Value
$95,000
Stauffer
Sta,uffer Chemical
Chemical Co.
$5,000
$6,000
5)
6) SOLVER
SOLVER Project
Project

39

E. A.
A. Feigenbaum
Feigenbaum

Operations Statistics
Statistics
Progress - Resource Operations

5P41
5P41 RR00785-11
RR00785-11

*** [Rutgers-AIM]
[Rutgers-AIM] ***
***
7) ***
Rutgers Research Resource
Rutgers
“Computers in Biomedicine"
Biomedicine”
"Computers
Saul Amarel,
Amarel, D.Sc.
Ca.simir Kulikowski,
Kulikowski, Ph.D.
Ph.D.
Casimir
Shalom vVeiss,
Weiss, Ph.D
Ph.D
Shalom
Rutgers U
U.... New Brunswick
Brunswick
Rutgers
NIH RR-00543-12
RR-00643-12
(Amarel, Kulikowski)
Kulikowski)
NIH
(Amarel,
12/82-11/83
$405,304
12/82-11/83
$405,304
(Kulikowski, Weiss)
Weiss)
NIH RR-02230-01
RR-02230-01
NIH
(Kulikowski,

0.52
0.52

38.59
38.59

1117
1117

65.85
55.85

2227.48
2227.48

2461
2451

.93
.93

118.75
118.75

686
585

57.36
57.36

3836.19
3836.19

9649
9649

-----------------

-------------------

----m-w

455.56
455.56

17654.52
17554.52

35676
35576

12/83-11/87
$3,198,075
12/83-11/87
$3,198,075
12/83-l
l/84 $989,275
$989,276
12/83-11/84

AIM Pilot
Pilot Projects
Projects
8) AIM
9) AIM
AIM Administration
Administration
10) AIM
AIM Users
10)

Community Totals
Totals
Community

E. ‘A.
2. Feigenbaum
Feigenbaum

40
40

Progress - Resource Operations
Operations Statistics
Statistics

SP41
5P41 RR0078S-11
RR00785-11

CPU
CPU
(Hours)
(Hours)

Connect
Connect
(Hours)
(Hours)

File
File Space
Space
(Pages)
(Paws)

1) AGE
AGE Project.
Project (Core)
(Core)
"Attempt
“Attempt to
Generalize"
Generalize”
Edward
Ph.D.
Edward A. Feigenbaum,
Feigenbaum, Ph.D.
Dept.
Dept. Computer
Computer Science
Science
ARPA
(U*)
ARPA MDA903-8Q-C-0107
MDA903-80-C-0107
(***)
(partial
(partial support)
support)

11.80
11.80

845.30
845.30

4076
4076

2) AI
AI Handbook
Ha.ndbook Project
Project (Core)
(Core)
Edward
A.
Feigenbaum,
Ph.D.
Edward
Ph.D.
Dept.
Dept. Computer
Computer Science
Science
ARPA
(**)
hID,4903-80-C-0107
(**)
ARPA4 MDA903-8Q-C-0107
(partial
(partial support)
support)

1l.03
11.03

980.94
980.94

4425
4425

3.72
3.72

183.81
183.81

2980
2980

4) EXP
EX Proj
ect
EXPEX
Project
"Expert
“Expert Explanation"
Explanation”
Edward
Edward H. Shortliffe,
Shortliffe, M.D.,Ph.D.
hl.D.,Ph.D.
Dept.
Dept. Medicine
Medicine
ONR
NR 049-479
ONR NR
049-479
$-156,622
1/81-12/8:3
l/81-12,,‘8:3
$456,622
ONR
NR049-479
ONR NR049-479
Michael
hlichael Gt'nesereth
Genesereth
1/84-12/85
$312.070
l/84-12/‘86
NSF IST83-12H8
IST83-12148
Bruce G. Buchanan
Buchanan
$330,000
3/84-2/87
3/‘84-3,‘87
$330.000 (*)
(*)
$99,410
3/84-2/8.5
3 /8+2;‘8Fi
$99,410 (*)
(“)

53.75
53.75

239l.40
2391.40

4920
4920

5) GUIDON-NEOMYCIN
GUIDON-NEO~lYCIN Project
Project
"Exploration
“Exploration of Tutoring
Tutoring
S:
6 Problem-solving
Problem-solving
Strategies"
Strategies”
Bruce G. Buchanan,
Buchanan, Ph.D.
Ph.D.
William
William J. Clancey,
Clancey, Ph.D.
Ph.D.
Dept.
Dept. Computer
Computer Science
Science
ONR/
NOOOI4-79-C-0302
ONR/ARIARI N00014-79-C-0302
$583,892
3/79-3/85
3,‘79-3/85
$683,892

45.44
45.44

4418.68
4418.68

5967
5967

Stanford
Stan ford Community
Community

3) DENDRAL
DENDRAL Project
Project
"Resource
“Resource Related
Related Research:
Research:
Computers
Computers in Chemistry"
Chemistry”
Carl
Carl Djerassi,
Djerassi, Ph.D.
Ph.D.
Dennis H. Smith,
Smith, Ph.D.
Ph.D.
Dept.
Dept. Chemistry
Chemistry
NIH RR-00612-13
NIH
RR-00612-13
$170,710
5/82-4/83
5/82-d/83
$170,710

41
41

E. A. Feigenbaum

Progress
Progress - Resource Operations
Operations Statistics
Statistics

5P41
5P41 RR00785-11
RR00785- 11

6)
Project
6) fv[OLGEN
kIOLGEN
Project
"Applications
“Applications of
of Artificial
Artificial Intelligence
Intelligence

106.92
106.92

7734.34
7734.34

10448
10448

239.97

14404.62

14389

4.79

635.43
635.43

1296
1296

t.o
t,o Molecular
Molecular Biology"
Biology”
Edward
Edward A. Feigenbaum,
Feigenbaum, Ph.D.
Ph.D.
Peter
Peter Friedland,
Friedland, Ph.D.
Ph.D.
Charles
Charles Yanofsky,
Yanofsky, Ph.D.
Ph.D.
Depts.
Depts. Computer
Computer Science/
Science/
Biology
Biology
NSF MCS-8310236
NSF
MCS-8310236 (Feigenbaum,
(Feigenbaum, Yanofsky)
Ya.nofsky)
11/83-10/84
$139,215
(*)
11/83-lo/84
$139,315
(*)
7) ONCOCIN
ONCOCIN Project
Project
"Knowledge
“Knowledge Engineering
Engineering
for
for Med.
Med. Consult.ation
Consultation” II
EdWard
Edwa.rd H. Shortliffe,
Shortliffe, M.D.,Ph.D.
M.D.,Ph.D.
Dept.
~[edicine
Dept. hfedicine
NLM LM-03395
(Shortliffe/ONCOCIN)
NLhI
LM-03395
(Shortliffe/ONCOCIN)
Edward
A.
Feigenbaum,
Edward A. Feigenbaum, Ph.D.
Ph.D.
7/79-5/84
7/79-6184 $497,420
$497,420
7/83-5/84
$95,424
7,/83-6/84
$95,424
NLM LM-00048
NLhf
LbI-00048
$196,425
7/79-5/84
7/79-6/84
$196,425
7/83-5/84
$39,502
7,‘83-6184
$39,502
ONR
NR 049-479
ONR NR
$456,522
(*)
1/81-12/83
l/81-12/83
$456,622
(*)
NIH RR-01513
NIH
RR-01613
$524,455
7/83-6/85
7!83-6186
$624,455
$220,371
7/83-6/84
7/83-6/84
$220,371
NLM L~1-04136
NLM
Lhl-04136
$211,851
8/83-7/85
8/83-7/86
$211,851
$50.517
8/83-7/84
8/83-7/84
$60,517
H.J.
H.J. Kaiser
Kaiser Family
Family Fdn.
Fdn.
$150,000
7/83-6/86
7!‘83-6/86
$150,000
$50,000
7/83-6/84
7;83-6,/84
$50,000
ONR
NOOO14-81-K-0004
ONR N00014-81-K-0004
Michael
Michael R
R. Genesereth
Genesereth (Shortliffe)
(Shortiiffe)
1/84-12/86
$512.070
l/84-12/86
$512,070 (*)
(*)
NSF IST83-12148
NSF
IST83-12148
Bruce
Bruce G. Buchanan
Buchanan (Shortliffe)
(Shortliffe)
3/84-2/87
$330.000
(*)
3 /84-2/‘87
$330.000
(*)
$99,410
3/84-2/85
3/83-2/85
$99,410 (*)
(*)
8) PROTEIN
PROTEIN Project
Project
"Heuristic
“Heuristic Compo
Comp. Applied
Applied
to
to Prot.
Prot. Crystallog.
Crystallog.” II
Edward
Edward A.
A. Feigenbaum.
Feigenbaum, Ph.D.
Ph.D.
Dept.
Dept. Computer
Computer Science
NSF MCS-81-17330
NSF
MCS-81-17330
$28,976
1/82-1/83
l/82-1/83
$28,976

E. A. Feigenbaum
Feigenbaum

42
42

Progress - Resource Operations
Operations Statistics
Statistics

5P411 RR00785RR00785-11
5P4
11

79.44
RADIX Project
Project
9) RADIX
"Deriving Medical
Medical Knowledge
Knowledge from
from TimeTime“Deriving
Oriented Clinical
Clinical Databases”
Databases"
Oriented
Robert L. Blum.
Blum. M
M.D.
Robert
.D.
Gio C.hl.
C.I,I. VViederhold,
\Viederhold, Ph.D.
Ph.D.
Gio
Depts. Computer
Computer Science/
Science/
Depts.
Electrical Engrg.
Engrg.
Electrical
IST-8317858 (Blum)
(Blum)
NSF IST-8317858
(*)
3/84-3186
3/84-3/86 $89597 (*)
NLM (Wiederhold)
(Wiederhold)
NLM
$291,192
S/84-llj86
S/84-11/86 $291,192

3140.27
3140.27

8777

10) Stanford
Stanford Pilot
Pilot Projects
Projects

61.55
61.55

4115.02
4115.02

6097
6097

11)
HPP Core
Core AI
AI Research
Research
11) HPP

383.07
383.07

29073.96
29073.96

42202
42202

12)
HPP Associates
12) HPP

57.37
57.37

1600.31
1600.31

2997
2997

13) Stanford
Stanford Associates

27.01
27.01

1016.59
1016.59

1681
1681

5.62
5.62

1315.64
1315.64

587
587

---------

----------

_------

1091.46
1091.46

71856.29
71856.29

110842
110842

CPU
CPU
(Hours)
(Hours)

Connect
Connect
(Hours)
(Hours)

Space
File Space
(Pages)

288.21
288.21
16.65
16.65

17591.82
17591.82
1983.43
1983.43

23292
23292
7847
7847

---------

-------------------

_------

304.87
304.87

19575.25
19575.25

31139
31139

CPU
CPU
(Hours)
(Hours)

Connect
Connect
(Hours)
(Hours)

File Space
Space
File
(Pages)
(Pages)

530.54
530.54

67375.43
67375.43

167863
167863

----=====

----=====

----=====

2382.43
2382.43

176461.50
176461.50

345520
345520

14)
Medical Information
Information Sciences
Sciences
1-i) Medical

Community
Community Totals
Totals

SUMEY Stf'!!
St a f f
SUMEX
1) Staff
Staff
2)
2) System Associates
Associates

Community
Community Totals
Totals

System
System Operations
Operations
1)
1) Operations
Operations

Resource
Resource Totals
Totals
(*)
(*)

Award
Award includes indirect
indirect costs.
costs.

(**)
awarded
to
(**) Supported
Supported by aa larger
larger ARPA
ARPA contract
contract MDA-903-80-C-0107
MDA-903-80-C-0107
awarded
to
the Stanford
Stanford Computer
Computer Science
Science Department:
Department:

43
43

E. A. Feigenbaum

Progress
Progress - Resource Operations
Operations Statistics
Statistics

5P41
5P41 RR00785-11
RR00785- 11

System
Reliability
System Reliability
System
better than
System reliability
reliabilit,y for
for the DECsystem
DECsystem 2060 has been much
much better
than with
with our
our
previous KilO
periods of
particular hardware
previous
KIlO system.
system. "Ve
We have had very
very few periods
of particular
hardware or
or software
software
problems. The
below covers the
period in which
problems.
The data
data below
the entire
entire period
which the SUMEX-AIM
SUMEX-AIM community
community
has used the 2050.
2060. The
The actual
actual downtime
downtime was rounded
rounded to
to the nearest
nearest hour.
hour.

7

18

1
1

Feb

Mar

Apr

Table
per Month
Table 1 : System
System Downtime
Downtime Hours
Hours per
Month - February
February 83 through
through Apr
Apr 83

11

11

1

2

6

o0

11
11

15
15

26
26

13
13

16
16

28
28

May
hlay

Jun

Jul
Jul

Aug
Au

Sep
Sep

Oct
Ott

Nov
Nov

Dec
Dee

Jan

Feb

Mar
Mar

Apr
-Or

Table
per Month
Ta.ble 2 : System
System Downtime
Downtime Hours
Hours per
Month - May
May 83 through
through Apr
Apr 84

Reporting
period
Reporting period
Total
UpTime
Total Up Time
Pl\I
Phi Downtime
Downtitne
Act.ual
Act,ual Downtime
Downtime
Tot,al
Tor.al Downtime
Downtime
Mtbf
Mtbf
Uptime
Upt,ime Percentage
Percentage

462 days, 23 hours,
hours, 41 minutes,
minutes, and 42 seconds
454 days, 5 hours,
hours, 16 minutes,
minutes, and 57 seconds
1 days, 14 hours,
hours, 2 minutes,
minutes, and 55 seconds
7 days, 4 hours,
hours, 21
21 minutes,
minutes, and 50 seconds

8 days, 18 hours,
hours, 24 minutes,
minutes, and 45 seconds
2 days, 16 hours,
hours, 30 minutes,
minutes, and 16 seconds
98.45

Network Usage Statistics
Network
Statistics
The
plots in Figure
The plots
Figure 13 and Figure
Figure 14 show the monthly
monthly network
network terminal
terminal connect
connect
tim('
broader term
time for
for the TYMNET
TYMNET and the INTERNET
INTERNET usage. The
The INTERNET
INTERNET is a broader
term for
for
what
previously referred
usage. Since many
what was previously
referred to as
as Arpanet
Arpanet usage.
many vendors
vendors now
now support
support the
INTERNET
protocols (IP
JTCP) in addition
JTCP
INTERNET protocols
(IP/TCP)
addition to
to the Arpanet,
Arpanet, which
which converted
converted to
to IP
IP/TCP
in Janllary
198~3, it
between Arpanet
Janua.ry of
of 1083,
it is no longer
longer possible to
to distinguish
distinguish between
Arpanet usage and
Internet
Internet usage
usage on our
our 2060 system.
system.

E. A.
A. Feigenbaum
Feigenbaum

44
44

Progress
Progress- -Resource
Resource Operations
OperationsStatistics
Statistics

SP'll
RR00785-11
5P4
1 RR0078511

1400

TYMNETConnect
ConnectTime
Time
TYMNET
Hours/Month
Hours/Month

‘400

1200
7200
10001000
800 800
600 600
400 400
200
zoo-

jr
4

OL---~----~----~----~----~----~----~--~~--~----~----~
I?374
I
1975
1974
1975 1976
1976 1977
1977 1978
1978 1979
1979 1980
1980 1981
1981 1982
1982 7983
1983 1984
1984 7985
1985

Figure 13:
13:
Figure

TYMNET
TYMNET Terminal
Terminal Connect
Connect Time
Time

1200

ARPAnet Connect
Connect Time
Time
r ARPAnet
Hours/Month
Hours/Month
1000
1000 ‘200

1

800
800 600
600 400
400
t

200
200
t

,

O~----~----~----~----~----~----~----~----~----~----~~--~
1975
1976
1977
7978
1979
1980
1981
! 982
1983
1984
1985

It374
I
1974

1975

1976

1977

Figure
Figure 14:
14:

1978

1979

1980

1981

1982

1983

1984

ARP
ANET Termina.l
ARPANET
Terminal Connect
Connect Time
Time

45
45

E. A.
A. Feigenbaum
Feigenbaum
E.

1985

Progress
Progress -- SUMEX
SUMEX Staff
Staff Publications
Publications

5P41
5P41 RR00785-11
RR00785- 11

I.A.2.8.
SUMEX
Staff
I.A.2.8.
SUMEX
Staff Publications
Publications
The
The following
following are
are publications
publications for
for the
the SUMEX
SUMEX staff
staff and
and include
include papers
papers describing
describing
t,he
the SUMEX-AIM
SURIEX-AIM resource and
and on-going
on-going research as
as well as
as documentation
documentation of
of system
system and
and
program
EX-AIM community
program developments.
developments. Many
Many of
of the publications
publications documenting
documenting SUM
SUMEX-AIM
community
research are
are from
from the individual
individual collaborative
collaborative projects
projects and
and are
are detailed
detailed in
in their
their respective
respective
reports
(see
Section
II
on
page
69).
Publications
for
the
AGE
and
AI
Handbook
reports (see Section II on page 69). Publications for
AGE and AI Handbook core
core
research
resesrch projects
projects are
are given
given there.
1.
1. Carhart,
Carhsrt, R.E.,
R.E., Johnson, S.M.,
S.M., Smith,
Smith, D.H.,
D.H., Buchanan,
Buchanan, B.G.,
B.G., Dromey,
Dromey, R.G.,
R.G.,

and Lederberg,
A
Lederberg, J., Networking
Networking and
and aa Collaborative
Collaborative Research
Research Community:
Community:
A
Programs.
IN
Cllse
Case Study
Study Using
Using the DENDRAL
DENDRAL
Programs.
IN P. Lykos
Lykos (Ed.),
(Ed.),
CO~1PUTER
AND
COMPUTER NETWORKING
NETWORKING
AND CHEMISTRY,
CHEMISTRY, ACS
ACS Symposium
Symposium Series,
Series,
No. 19,
1975.
19, 1975.
2.
2. Levinthal,
Levinthal, E.C.,
E.C., Carhart,
Carhart, R.E.,
R.E., Johnson, S.M.,
S.M., and Lederberg,
Lederberg, J.:
Computer,'!
Computers Talk
Talk to Computers.
Computers. Industrial
Industrial Research, November,
November, 1975.
1975.

When

3. Wilcox,
- A Machine-Independent
Programming
Wilcox, C.R.,
C.R., MAINSAIL
MAINSAIL
Machine-Independent
Programming System.
Syatem.

Proc.
Proc. DECUS
DECUS Symposium
Symposium 2(4),
Z(4), Spring,
Spring, 1976.
1976.
4. Wilcox,
Project:
for Software
C.R.: The AL4INSAIL
MAINSAIL
Project: Developing
Wilcox, C.R.:
Developing Tools
Tools for
Software
Portability.
Portability. Proc.
Proc. SCAMC,
SCAMC, October,
October, 1977, pp. 76-83.
5. Lederberg,
and
Lederberg, J.L.:
J.L.: Digital
Digital Communications
Communication8
and the Conduct
Conduct of
of Science:
Science: The
New
New Literacy.
Literacy. Proc.
Proc. IEEE
IEEE 66(11), November,
November, 1978.
1978.
6. Wilcox,
MAINSAIL - Language
Language
Wilcox, C.R.,
C.R., Jirak,
Jirak, G.A.,
G.A., and Dageforde,
Dageforde, M.L.:
M.L.: MAINSAIL
Afanual. Stanford
Manual.
Stanford University
University Computer
Computer Science Report
Report STAN-CS-80-791,
STAN-CS-80-791,
1980.
1380.
C.R.,
Jirak,
G.A.,
and
Dageforde,
M.L.:
AIAINSAIL
C.R.,
Dageforde,
M.L.:
Jirak,
G.A.,
MAINSAIL
- Implementation
Implementation Overview.
Overview. Stanford
Stanford University
University Computer
Computer Science Report
Report
STAN-CS-80-792, 1980.
STAN-CS-80-793,

7. \Vilcox,
\Vilcox,

addition, a substantial
substantial continuing
continuing effort
effort has gone into
into developing,
developing, upgrading,
upgrading,
In addition,
extending documentation
documentation about
about the
the SUMEX-AIM
SUMEX-AIM resource.
resource. These efforts
efforts include
include user
and extending
help files, and introductory
introductory notes, an ARPANET
ARPANET Resource Handbook
Handbook entry,
entry, and
and
guides, help
policy guidelines.
guidelines.
policy

E. A.
A. Feigenbaum
Feigenbaum
E.

46
46

Progress
Progress - Future
Future Plans
Plans

5P41 RR00785-11
RR00785-11

I.A.2.9.
Future
Plans
I.A.2.9.
Future
Plans
Our
plans for
plans for
Our plans
for the next
next grant
grant year
year are based on the Council-approved
Council-approved plans
for our
our
5-year
plans for
5-year renewal
renewal that
that began in August,
August, 1980. In
In addition
addition to the specific
specific plans
for the
the next
next
grant
period to serve
grant year,
year, we include
include a summary
summary of
of the overall
overall objectives
objectives for
for this
this 5-year
5-year period
as
background. NearNear- and long-term
plans for
as a background.
long-term objectives
objectives and plans
for individual
individual collaborative
collaborative
projects are discussed in Section
beginning on page 69.
projects
Section II
II beginning

Ot.erall
Overall Goals
Goals
The
The goals of
of the SUMEX-AIM
SUMEX-AIM resource are long-term
long-term in supporting
supporting basic research
in artificial
broad range of
biomedical
applying these techniques
techniques to
to a broad
of biomedical
artificial intelligence,
intelligence, applying
problems, experimenting
with
technologies
promote scientific
with communication
communication
technologies to
to promote
scientific
experimenting
problems,
interchange,
better tools
interchange, and developing
developing better
tools and facilities
facilities to
to carryon
carry on this
this research.
research. Just
Just as
as
the tone of
proposal derives
of our
our renewal
renewal proposal
derives from
from the
the continuing
continuing long-term
long-term research
research objectives
objectives
of
philosophy
of the SU~fEX-AIM
SUMEX-AIM community,
community, our
our approach
approach derives
derives from
from the methods
methods and philosophy
already
We will
will continue
continue to
to develop useful
useful knowledge-based
knowledge-based
already established
established for
for the resource. We
software
biomedical research based on innovative,
softwa,re t,ools
t,ools for
for biomedical
innovative, yet
yet accessible computing
computing
t.echnologies.
t,echnologies.
For
For us it
it is important
important to
to make systems
systems that
that work
work and are exportable.
exportable. Hence, our
our
approach
is
to
integrate
available
state-of-the-art
hardware
technology
approach
to integrate available state-of-the-art hardware technology as a basis for the
the
underlying
software research and development
development necessary to support
support the AI
AI work.
work.
underlying software
SUMEX-AIM
SUMEX-AIM will
will retain
retain its broad
broad community
community orientation
orientation in choosing
choosing and implementing
implementing
its
its resources. \Ve will
will draw
draw upon
upon the expertise
expertise of
of on-going
on-going research efforts
efforts where
where possible
and
This orientation
orientation has
a.nd build
build on these where
where extensions
extensions or
or innovations
innovations are necessary. This
proved to be an effective
proved
effective way
way to build
build the current
current facility
facility and community.
community.
We
We have built
built ties to a broad
broad computer
computer science community;
community; have brought
brought the
the
rE'sults
results of
of their
t.heir work
work to
to the AIM
AIM users;
users: and have exported
exported results
results of
of our
our own
own work.
work. This
This
broader
particularly active
broader community
community is particularly
active in developing
developing technological
technological tools
tools in the form
form of
of
new machine
machine architectures,
architectures, language support,
support, and interactive
interactive modalities.
modalit.ies.

Toward
Distributed Resource
Resource
Toward a Afore
More Distributed
The
The initial
initial model
model for
for SUMEX
SUhLIEX as
as a centralized
centralized resource was based on the high
high cost
of
powerful computing
of powerful
computing facilities,
fa.cilit,ies, which
which were not
not readily
readily duplicated.
duplicated. This
This role is evolving,
evolving,
though,
though, with
with the
t.he introduction
introduction of
of more
more compact
compact and inexpensive
inexpensive computing
computing technology.
technology.
Our
by community
Olur fnture
future goals are guided
guided by
community needs for
for more
more computing
computing capacity
capacity and
improved
build more
improved tools
tools to build
more effective
effective expert
expert systems, and to
to test
test operational
operational versions
versions of
of
AI
programs in real-world
In order
order to
to meet
meet these needs, we must
must take
take advantage
advantage
AI programs
real-world settings.
settings. In
of
of a ralll,;e
range of
of newly-developing
newly-developing machine
machine architectures
architectures and systems. As a result,
result, SUIvIEXSUhsIEXAIM
AIM will
will become a more
more distributed
distributed community
community resource with
with heterogeneous
heterogeneous computing
computing
facilit,ies
fncilit.ies tE'thE'rE'd
tethered to
t,o each other
other through
through communications
communications media.
media. Many
Many of
of these machines
machines
will
physically near the projects
projects or
will be located
located physically
or biomedical
biomedical scientists
scientists using
using them.
them.

The C{Jntinuing
Role of
Continuing Role
of SUAfEX-Central
SUMEX-Central
Even
Even with
with more
more distributed
distribut.ed computing
computing resources, the central
central resource will
will continue
continue
to play
play an important
important role as
as a communications
communications crossroad,
crossroad, as
as a research
research group
group devoted
devoted to
integrating
integrating the new software
software and hardware
hardware technologies
technologies to
to meet
meet the needs of
of medical
medical AI
,41
projects, and as
applicat.ions,
applications, as a spawning
spawning ground
ground for
for new application
application projects,
as a base
base for
for local AI
AI
projects. A
projectas.
A key
key challenge
challenge will
will be to
to maintain
maintain the scientific
scientific community
community ties that
that grew
grew
naturally
previous colocation
naturally out.
out of
of the
t,he previous
colocation within
within a central
central facility.
facility.

47

E. A. Feigenbaum
Feigenbaum

5P41 RR00785-11
RR00785- 11

Progress - Future
Future Plans

Summary
Summary of
of Five-!lear
Five-year Objectives
Objectives
The long-term
long-term objectives
objectives of the SUMEX-AIM
SUMEX-AIM resource nucleus during
during the follow-on
follow-on
5 year
period
(of
which
we
are
in
the
third
year)
are
summarized
year period
which
third year)
summarized below. These are
broken
resource operations,
operations, training
training and
and education,
education, and core
broken into
into three
three categories:
categories: resource

research.
research.
Resource
Resource Operations
Operations
l.
will continue
continue to encourage
1. Maintain
Ma.inta.in the vitality
vitality of the AIM
AIM community
community -- We will

and explore
explore new applications
applications of AI
AI to biomedical
biomedical research and improve
improve
mechanisms
mechanisms for interinter- and intra-group
intra-group collaborations
collaborations and communications.
communications.
While
While AI
AI is
is our
our defining
defining theme,
theme, we may
may entertain
entertain exceptional
exceptional applications
applications
justified
justified by some other
other unique
unique feature
feature of SUMEX-AIM
SUMEX-AIM essential for
for important
important
biomedical
biomedical research. To
To minimize
minimize administrative
administrative barriers
barriers to the communitycommunityoriented
oriented goals of SUMEX-AIM
SUMEX-AIM and to direct
direct our
our resources toward
toward purely
purely
scientific
scientific goals, we plan
plan to retain
retain the current
current user funding
funding arrangements
arrangements for
for
projects
User projects
projects working
working on SUMEX
SUMEX facilities.
facilities.
projects will
will fund
fund their
their own
own
manpower
manpower and local needs; will
will actively
actively contribute
contribute their
their special expertise
expertise to
the SUMEX-AIM
SUMEX-AIM community;
community; and will
will receive an allocation
allocation of
of computing
computing
resources under
under the control
control of
of the
the AIM
AIM management
mana.gement committees.
committees. There
There will
will
be no "fee
“fee for
for service"
service” charges for
for community
community members.
members. We also will
will continue
continue
to exploit
exploit community
community expertise
expertise and sharing
sharing in software
software development,
development, and to
facilitate
facilitate more effective
effective information-sharing
information-sharing among
among projects.
projects.
2. Provide
Provide effective
effective computational
computational support
support for
for AIM
AIM community
community goals -- We
We will
will
continue
intelligence
continue to expand
expand support
support for
for artificial
artificial
intelligence research and new
applications
applicat.ions work,
work, to develop
develop new computational
computational tools
tools to support
support more
more mature
mature
proj ects, and to facilitate
projects,
facilitate testing
testing and research dissemination
dissemination of
of nearly
nearly
operational
programs. We
operational programs.
We will
will continue
continue to
to operate
operate and develop
develop the existing
existing
central
cent,ral facility
facility as
as the nucleus
nucleus of
of the resource. We
We will
will acquire
acquire additional
additional
11ipment to meet
eq
equipment
meet developing
developing community
community needs for
for more
more capacit,y,
capacity, larger
larger
program address
New computing
program
a.ddress spaces,
spaces, and improved
improved interactive
interactive facilities.
facilities. New
computing
hardware
becoming available
hardware technologies
t,echnologies becoming
available now
now and in the next
next few years
years will
will
play a key
play
key role in these developments,
developments, and we expect
expect to take
take the lead in this
this
community
biomedical AI
community for
for adapting
ada.pting these new tools
tools to biomedical
AI needs.
Provide effect.ive
effective and geographically
geographically accessible communication
communication facilities
facilities to
to the
the
3. Provide
SUMEX-AIM commiinit,y
community for
for effective
effective remote
remote collaborations,
collaborations, communications
communications
SUMEX-AIM
among dist,ribut,ed
distributed computing
computing nodes, and experimental
experimental testing
testing of
of AI
AI programs
programs
among
We will
will retain
retain the
the current
current ARPANET
ARPANET and TYMNET
TYMNET connections
connections for
for at
at
-- We
the near-term
near-term and
and will
will actively
actively explore
explore other
other advantageous
advantageous connections
COIJ.nections to
to
least the
communications networks
networks and to
to dedicated
dedicated links.
links.
new communications

Training
and Education
Education
Training and
l. Assist
Assist new and
and established
established projects
projects in
in the
the effective
effective use of
of the
the SlJMEX-AIM
SUMEX-AIM
1.
resource
Collaborative
projects
continue
to
be
responsible
for the
the
resource -- Collaborative
projects continue to
responsible for
development and
and dissemination
dissemination of
of their
their own
own AI
AI programs,
programs, but
but the
the resource
resource
development
staff will
will provide
provide general
general support
support and
and will
will work
work to
to make
make resource
resource goals and
and AI
AI
staff
systems known
known and
and available
available to
to appropriate
appropriate biomedical
biomedical scientists.
scientists. We
We will
will
systems
continue to
to exploit
exploit particular
particular areas of
of expertise
expertise within
within the
the community
community for
for
continue
deve loping pilot
pilot efforts
efforts in new application
application areas.
developing

E. A.
A. Feigenbaum
Feigenbaum

48

Progress
Progress - Future
Future Plans
Plans

5P41 RR00785-11
RR00785-11

2.
funds to
to qualifying
qualifying new and pilot
pilot
9. Continue
Continue to
to allocate
allocate "collaborative
“collaborative linkage"
linkage” funds
projects
projects to
to provide
provide for
for communications
communications and terminal
terminal support
support pending
pending formal
formal
approval
approval and funding
funding of
01 their
their projects
projects -- These funds
funds are allocated
allocated in
cooperation
prospective user
cooperation with
with the AIM
AIM Executive
Executive Committee
Committee reviews
reviews of
of prospective
projects.
projects.
3. Continue
Cont.inue to support
support workshop
workshop activities
activities including
including collaboration
collaboration with
with the
the
Rutgers
Rutgers Computers
Computers in Biomedicine
Biomedicine resource on the AIM
AIM Community
Community
\Vorkshop
Workshop and with
with individual
individual projects
projects for
for more specialized
specialized workshops
workshops
covering
covering specific
specific application
application areas or
or program
program dissemination
dissemination
Core
Research
Core Research
1.
1. Continue
Continue to
to explore
explore basic Artificial
Artificial Intelligence
Intelligence research issues
issues for
for knowledge
knowledge

acquisition.
reasoning
representation, and utilization;
utilization;
acquisition. representation,
reasoning in the presence of
of
uncertainty;
uncertainty; strategy
strategy planning;
planning; and explanations
explanations of
of reasoning
reasoning pathways
pathways with
with
particular
emphasis on biomedical
particular
biomedical applications
applications -- SUMEX
SUMEX core research
funding
to similar
funding is complementary
complementary
similar funding
funding from
from other
other agencies and
contributes
contributes to
to the
the long-standing
long-standing interdisciplinary
interdisciplinary effort
effort at
at Stanford
Stanford in basic AI
AI
research and expert
We expect
expect this
this work
work to
to provide
provide the
expert system
system design. We
foundation
foundation for
for increasingly
increasingly effective
effective consultative
consultative programs
programs in medicine
medicine and for
for
more
practical
adaptations
of
this
work
within
emerging
more practical adaptations 01 this work within emerging microelectronic
microelectronic
technologies.
technologies.
2. Support
Support community
community efforts
efforts to
to organize
organize and generalize
generalize AI
AI tools
tools that
that have been
developed
developed in the context
context of
of individual
individual application
application projects
projects -- This
This will
will include
include
work
work t,o
to organize
organize the present
present state-of-the-art
state-of-the-art in AI
AI techniques
techniques through
through the
development
development of
of practical
practical software
software packages for
for the acquisition.
acquisition, representation.
representation,
and utilization
utiiization of
of knowledge
knowledge in AI
AI programs.
programs. The
The objective
objective is to evolve a body
body
of
of software
software tools
tools that
that can be used to
to more
more easily build
build future
future knowledge-based
knowledge-based
systems
systems and explore
explore other
other bionmedical
bionmedical AI
AI applications.
applications.

49

E. A. Feigenbaum
Feigenbaum

5P41
5P41 RR00785-11
RR00785-11

Progress
Progress -- Future
Future Plans
Plans

Specific
Specific Plans
Plan8 for
for Year
Year 12
12
Specific
Specific plans
plans for
for the
the next
next grant
grant year
year (12)
(12) are
are summarized
summarized in
in the
the paragraphs
paragraphs below.
below.
The
The directions
directions and
and background
background for
for much
much of
of this
this work
work were
were given
given in
in earlier
earlier progress
progress report
report
sections
sections and
and are
are not repeated in
in detail
detail here.

Professional
Workstations
fiofessional
WOrk8tatiOn8
We see
see our
our major
major development
development efforts
efforts in
in year
year 12
12 to be
be in
in the area
area of
of professional
professional
workstat.ions,
workstat,ions, and
and specifically,
specifically, to fine
fine tune the integration
integration of
of these
these workstations
workstations into
into our
our
networking
networking environment.
environment. This
This involves
involves software
software integration,
integration, support
support of network
network protocols,
protocols,
general access
access to network
network printing
printing facilities,
facilities, telnet
telnet access
access to Lisp ma.chines,
ma.chines, and overall
overall
worksta.tion
workststion ma.intenance
maintenance and support.
support.
We will
will also continue
continue to explore the use
use of low cost workstations
workstations within
wit’hin our
our
environment,
environment, both
both as
as distributed
distributed processors for
for text
text editing
editing and electronic
electronic mail,
mail, and as
as
powerful
powerful graphic
graphic terminals
terminals for
for use
use with
with sophisticated
sophisticated programs
programs running
running on our
our
mainframes.
mainframes. We also see
see the use
use of virtual
virtual graphics
graphics interfaces
interfaces running
running on remote
remote
workstations
to
be
of
continued
importance
to
our
progress
in
the
future.
workstations
continued importance
our
future.

Continued
Continued Operation
Operation of
of Existing
Existing Hardware
Hardware
The
The current
current, SUMEX-AIM
SUMEX-AIM facilities
facilities represent
represent a large existing
existing investment.
investment. 'vVe
We plan
plan
to continue
continue development
deve!opment of
of our
our main
main timesharing
timesharing machine,
machine, the
t,he DEC2050/TOPS-20
DEC2060/TOPS-20
syst.ern,
syst.em, and the SUMEX-AIM
SUMEX-AIM file server
server (SAFE),
(SAFE), and make changes as
as necessary to
improve
improve the performance
performance of
of these machines.
machines. We
We do not
not propose any
any substantial
substantial changes
to the ot.her
ot,her hardware
hardware systems (2020, shared VAX,
VAX, and Lisp Machines).
Machines). We
We expect
expect them
them
to
provide effective
t.o continne
cont,inue to provide
effective community
community support
support and serve as
as a nucleus for
for our
our
distribut,ed
di,strihut.ed resource.

Communication
Networks
Communication
Network8
Networks have been centrally
Net,works
centrally important
important to the research goals of
of SUMEX-AIM
SUhlEX-AIM and
will
increasingly distributed
computing. Communication
will become more
more so in the context
context of
of increasingly
distributed computing.
Communication
will be crucial
crucial to
to mainta.in
maintain community
community scientific
scientific contacts,
contacts, t,o
to facilitate
facilitate shared
shared system
system and
will
software maintenance
maintenance based on regional
regional expertise,
expertise, to
to allow
allow necessary information
information flow
flow and
soft,ware
access at
at all
all levels, and to
to meet
meet the technical
technical requirements
requirements of
of shared
shared equipment.
equipment.
access
vVe have had reasonable
reasonable success
success at
at meeting
meeting the geographical
geographical needs of
of the
We
community during
during the
the early
early phases of
of SUMEX-AIM
SUMEX-AIM through
through our
our ARPANET
ARPANET and
and
cornmunit,y
TY~INET connections.
connections. These have allowed
allowed users from
from many
many locations
locations within
,vithin the
the United
United
T\-hINET
States and
and abroad
abroad to
to gain
gain terminal
terminal access
access to
to the
the AIhl
AIM resources and
and through
through ARPANET
ARPANET
States
links t,o
to communicat,e
communicate much
much more
more voluminous
voluminous file
file information.
information. Since many
many of
of our
our users do
links
not have ARPANET
ARPANET access
access privileges
privileges for
for technical
technical or
or administratQve
administrative reasons, a key
key
not
problem impeding
impt'ding remote
remote use has been the
the limited
limited communications
communications facilities
facilities (speed, file
file
problem
transfer. and
and terminal
terminal handling)
handling) offered
offered currently
currently by
by commercial
commercial networks.
networks. Commercial
Commercial
transfer,
improvements a.re
are slow
slow in coming
coming but
but may
may be expected
expected to
to solve the
the file
file transfer
transfer problem
problem
improvements
the next. few years.
years. A
A number
number of
of vendors
vendors (AT&T,
(AT&T, IBM,
IBM, XEROX,
XEROX, etc.)
etc.) have yet
yet to
to
in the
announce commercially-available
commercially-available facilities,
facilities, but
but TELENET
TELENET is actively
actively working
working in
in this
this
announce
direction. We
\-Ve plan
plan to
to continue
continue experimenting
experimenting with
with improved
improved facilities
facilities as offered
offered by
by
direction.
commercial
or
government
sources
in
the
next
grant
term.
We
have
budgeted
for
commercial or government sources in the next grant term.
We have budgeted for
cont.inued TYMNET
TY~vINET service
service and
and an additional
additional amount
amount annually
annually for
for experimental
experimental network
network
cont,inued
connections.
connections.
Higil-:':peed intera.ctive
interactive terminal
terminal support
support will
will continue
continue to
to be a problem
problem since one
High-speed
cannol. expect
expect to
to serve
serve 1200 to
to 9600
9500 baud
baud terminals
terminals effectively
effectively over
over shared
shared long-distance
long-distance
ca.nnoi.
trllnk lines
lines with
with gross
gross capacities
ca.pacities of
of only
only 9600
9500 to
to 19200 baud.
baud. We
We feel
feel this
this is
is aa problem
problem
trllnk

E. A.
A. Feigenbaum
Feigenbaum
E.

50
50

Progress - Future
Future Plans

SP41
5P4 1 RR0078S-11
RR00785- 11

that
that is
is best solved by distributed
distributed machines able to effectively
effectively support
support terminal
terminal
interactions
interactions locally
locally and coupled
coupled to other
other AIM
AIM machines and facilities
facilities through
through network
network or
telephone
t,elephone links.
links. As new machine
machine resources are introduced
introduced into
into the community,
community, we will
will
allocate
advice to assure
a,llocate budgeted
budgeted funds with
with Executive
Executive Committee
Committee
assure effective
effective
communications
communicat,ions links.
links.

Resource
Resource Software
So jtware
We will
will continue
continue to maintain
maintain the existing
existing system, language, and utility
utility support
support,
software
software on our
our systems at the most
most current
current release
release levels, including
including up-to-date
up-to-date
documentation.
We also will
will be
be extending
extending the facilities
facilities available
available to users where
documentation.
appropriate,
rely
appropriate, drawing
drawing upon
upon other
other community
community developments
developments where possible. We rely
heavily
heavily on the needs
needs of the user community
community to direct
direct system
system software
softwa.re development
development efforts.
efforts.
\Vithin
Within the AIM
AIM community
community we expect to serve as
as a center
center for software-sharing
software-sharing
between various
This will
will include
include contributing
contributing locallylocallyvarious distributed
distributed computing
computing nodes. This
developed programs,
those derived
programs, distributing
distributing
derived from
from elsewhere in the communit,y,
community,
maintaining
maintaining up-t.o-date
up-to-date information
information on subsystems
subsystems available,
available, and assisting
assisting in software
software
maintenance.
ma.inth the management
management committees.
committees. This
redistribut,ed
project members
by the individual
principal investigators.
redist,ribut.ed among
among project
members as directed
directed by
individual principal
investigators.
System
project allocations
Syst’em enforcement
enforcement of
of project
a.llocations is done on a weekly
weekly basis. As the weekly
weekly file
dump is done, if
the aggregate
aggregate space in use by
by a project
project exceeds
exceeds its
its allocation,
allocation, files are
dump
if the
archived from
from associated user directories
directories which
which are over
over allocation
allocation until
until the
the project
project is
archived
within its
its authorized
aut,horized lirnits.
limits.
within
vVe are using
using the TOPS-20
TOPS-20 class scheduler
scheduler to
to attempt
attempt to
to enforce
enforce the 40:40:20
We
bala,nce in terms
terms of
of CPU
CPU utilization
utilization and
and to
to avoid
avoid system
system and
and user’
user' inefficiencies
inefficiencies under
under
bal:-mce
overload conditions.
conditions. In practice,
practice, the
the 40:40 split
split between
between Stanford
Stanford and non-Stanford
non-Stanford
over1on.d
projects is fa.irly
fairly well
well realized
realized (see
(see Figure
Figure 10 on page 34 and the tables
tables of
of recent
recent project
project
projects
llsage on page 3Gj.
:36).
usage
Our job-scheduling
job-scheduling controls
controls bias the
the allocation
allocation of
of CPU
CPU time
time based on per
per cent
cent time
time
Our
consumed relative
relative to
to the
the time
time allocated
allocated according
according to
to the
the 40:40:20
40:40:20 community
community split.
split.
consumed
Howeyer, the
the controls
controls are “soft”
"soft" in
in that
that they
they do not
not waste
waste computer
computer cycles ifif users below
below
However,
their
allocated
percentages
are
not
on
the
system
to
consume
those
cycles.
In
the
early
their allocated percentages
not on the system to
In the early
years, the
the operating
operating dispa.rity
disparity in
in CPU
CPU use reflected
reflected a substantial
substantial difference
difference in demand
demand
years,
betwren the
the Stanford
Stanford community
community and
and the
the developing
developing national
national projects,
projects, rather
rather than
than
between
inequity of
of access.
access. For
For example,
example, the
the Stanford
Stanford utilization
utilization is spread
spread over
over a large
large part
part of
of the
the
inequity
24-hour cycle,
cycle, while
while national-AIM
national-AIM users tend
tend to
to be more
more sensitive
sensitive to
to local
local prime-time
prime-time
24-hour

A. Feigenbaum
Feigenbaum
E. A.

64

Resource Management
Management and Allocation
Allocation

5P41
5P41 RR00785-11

constraints.
(The 3-hour
3-hour time
time zone
zone phase
phase shift
shift across
across the continent
continent is
is of substantial
substantial help
constraints. (The
in load-balancing).
During
load-balancing).
During peak times under
under the overload
overload control
control system
system reported
reported
previously,
previously, the Stanford
St,anford community
community experienced mutual
mutual contentions
contentions and delays while
while the
Alr..l
AIM group
groap had relatively
relat.ively open access
access to the system.
This
This disparity
disparity in usage
usage has disappeared
disappeared in recent
recent years with
with the growth
growth of the
national
nat)ional user community,
community, and we enabled overload
overload controls
controls for
for the national
national community
community as
as
well. For
For the present.
present, we propose to continue
continue our
our policy
policy of "soft"
“soft” allocation
allocation enforcement
enforcement
for
for the fair
fair split
split of resource capacity.
capacity.
Our
Our system
system also categorizes
categorizes users
use,rs in terms
terms of access
access privileges.
privileges. These comprise
comprise
fully-authorized
users,
pilot
projects,
associates,
guests,
and
network
fully-authorized users, pilot projects, associates,
network visitors
visitors in descending
order
order of system
syst.em capabilities.
capabilities. \Ve
We want
want to encourage bona fide medical
medical and health
health research
people to experiment
experiment with
with the various
various programs
programs available
available with
with a minimum
minimum of red t.ape,
tape,
while
users to bypass the advisory
while Dot
not allowing
allowing unauthenticated
unauthenticated
advisory group
group screening
procedures
procedures by
by coming
coming on as
as guests. So far,
far, we have had relatively
relatively little
little abuse compared
compared
to that
that experienced
experienced by
by other
other network
network sites, perhaps
perhaps because of the personal
personal attention
attent,ion
directed
direct’ed by senior
senior staff
staff to logon records,
records, and to other
other security
security measures. However,
However, the
experience of most
most other
other computer
computer managers
managers behooves us to be cautious
cautious about
about being as
as
wide
wide open as
as might
might be preferred
preferred for
for informal
informal service to pilot
pilot efforts
efforts and demonstrations.
demonstrations,
\Ve
!Ye will
will continue
continue developing
developing this
this mechanism
mechanism in conjunction
conjunction with
with management
management committee
committee
policy
decisions.
policy
We
We also have encouraged
encouraged mature
mature projects
projects to apply
apply for
for their
their own
own machine
machine resources
in order
order to preserve
preserve the SUMEX-AIM
SUMEX-AIM resource for
for research and development
development efforts
efforts and
to
justify their
t,o support
support projects
projects unable
unable to justify
their own machines. The
The Rutgers
Rutgers resource has its
own
part of
project
own 2050
2060 machine,
machine, part
of which
which is allocated
allocated for
for AIM
AIM use,
use, and the CADUCEUS
CADUCEUS project
has instalit'd
planned development
program
installed a.a. VAX
V.4X 11/780
11/780 machine
machine to
t,o support
support its
its planned
development and program
test,ing work.
“Simulation of
of Cognitive
Cognitive Processes"
Processes” Project
Project
work. Profs.
Profs. Lesgold
Lesgold and Greeno's
Greeno’s "SimUlation
has
tire ly to their
h3.s moved
moved en
ent.irely
their own
own local VAX.
VAX.

65

A. Feigenbaum
Feigenbaum
E. A.

5P41 RR00785-11
RR00785- 11

Dissemination
Dissemination Efforts
Efforts

I.E.
Efforts
I.E. Dissemination
Dissemination
Efforts
Throughout
Throughout its existence, SUMEX-AIM
SUMEX-AIM has expended substantial
substantial effort
effort toward
toward
disseminating
disserninating information
information about
about its activities
activities as
as aa resource and about
about the work
work of
individual
We continue
continue to make many
many presentations
presentations at
individual collaborative
collaborative projects.
projects.
professional
professional meetings,
meetings, to provide
provide services to demonstrate
demonstrate developed AI
AI programs
programs to
interested
visitors, and to work
work in organizing
organizing
interested groups
groups and individuals,
individuals, to welcome visitors,
workshops
workshops within
within the SUMEX-AIM
SUMEX-AIM community
community to introduce
introduce our
our research to collaborating
collaborating
professional
We also directed
professional communities.
communities.
directed considerable
considerable effort
effort in the past
past toward
toward
working
working with
with the Research Resources Information
Information Center
Center to produce
produce the "Seeds
“Seeds of
Artificial
Art,ificial Intelligence"
Int.elligence” monograph
monograph and other
other publications
publications and press articles
articles to address aa
broader
broader community
community of technical
technical and lay people.

Software
Software Distribution
Distribution
SUr-.IEX
SUMEX continues
continues to support
support various
various projects
projects in the distribution
distribution of
of versions
versions of
their
their sonware
soft,wa.re to requesting
requesting individuals
individuals or
or groups.
groups. Following
Following is a summary
summary of
of software
software
dissemination
dissemination this
this past
past year:
year:
EMYCIN
EMYCIN

Both
“source” versions
versions of the EMYCIN
EMYCIN
Both the "executable"
“executable” and "source"
distribution
package were restructured
for
distribution
restructured
for clarity
clarity and ease
ease of
installation.
Thirty
copies of the EMYCIN
package have been
inst,allation.
Thirty
EMYCIN
generated
generated for
for distribution
distribution of which
which about
about 6 were sources only.
only. An
An
Interlisp-VAX
version
Interlisp-VAX
version of
of EMYCIN
EMYCIN is now available,
available, thanks
thanks to Ray
Ray
Bates of
of USC-lSI,
USC-ISI, who
who did
did the conversion.
conversion. This
This runs
runs under
under UNIX
UNIX and
VMS.
VMS.

AGE
AGE

Twenty-two
Nearly
Twenty-two copies of
of the AGE
AGE system
system have been distributed.
distributed.
Nearly
half
half of
of these have been copies requested
requested in ANSI
ANSI format
format indicating
indicating they
they
were evidently
evidently going
going to
to non-Tops20
non-Tops20 sites (probably
(probably Vaxes). As with
with the
EMYCIN
EMYCIN system,
system, Ray
Ray Bates
Bates at
at USC-lSI
USC-IS1 has converted
converted AGE
AGE to
to run
run
under
AX. A
under Interlisp-V
Interlisp-VAX.
A version
version is also available
available for
for the Xerox
Xerox 1108
series Lisp
Lisp workstations.
workstations.

GENET
GENET

In conjunction
phaseout of
conjunction with
with the phaseout
of the GENET
GENET community
community on
SUMEX,
programs and databases
SUMEX, a software
software package comprised
comprised of
of programs
developed by
by researchers
researchers at
at Stanford
Stanford and elsewhere was assembled for
for
developed
distribution to
to interested
interested GENET
GENET users. Versions
Versions of
of the software
software were
distribution
provided for
for use on both
both DEC-10
DEC-lO and DEC-20
DEC-20 systems
systems operating
operating under
under
provided
TOPS-lO, TENEX.
TENEX, and TOPS-20.
TOPS-20.
Installation procedures
procedures were
were
TOPS-lo,
Installation
documented, and a substantial
substantial amount
amount of
of telephone
telephone consultation
consultation was
documented,
provided. The
The package has been well-received
well-received and appears
appears to
to be in
provided.
active use at
at many
many of
of the 21 academic
academic sites to
to which
which itit was sent. Only
Only
active
copy of
of the
the complete
complete Genet
Genet system
system was set out
out in the
the past
past year.
year.
one copy
However, several
several sets of
of Genet
Genet related
related data
data files have been distributed.
distributed.
However,
This includes
includes several
several copies of
of the
the NIH
NIH and EMBL
EMBL Sequence
Sequence Libraries.
Libraries.
This
A limited
limited amount
amount of
of operations
operations support
support has been given
given to
to Brutlag’
Brutlag's
A
s
interaction with
with Sam
Sam Karlin
Karlin of
of the
the Math
Math department
department and a variety
variety of
of
interaction
other
groups.
other groups.

MRS
MRS

Twenty-two copies of
of MRS
MRS have
have been distributed
distributed through
through Sumex.
Sumex.
Twenty-two
Several others
others have also been distributed
distributed directly
directly by
by the
the HPP.
HPP. Most
Most
Several
have been sent
sent out
out to
to VAX/Unix
V A.XjUnix sites or
or Symbolics
Symbolics Lisp
Lisp machine
machine sites.
have

A. Feigenbaum
Feigenbaum
E. A.

66

Dissemination
Dissemination Efforts
Efforts

5P41 RR00785-11
RR00785-11
5P41

SACON
SACON

Two copies
copies of
of SACON
SACON have
have been
been prepared
prepared and
and distributed.
distributed.
Two

GLISP
GLISP

Two copies
copies of
of GLISP
GLISP were
were distributed.
distributed.
Two

67
67

E. A.
A. Feigenbaum
Feigenbaum
E.

Comments
Comments on
on the
the Biot,echnology
Biot,echnology Resources
Resources Program
Program

5P41
5P41 RR00785-11
RR00785- 11

I.F.
on
Resources
Program
I.F. Comments
Comments
on the
the Biotechnology
Biotechnology
Resources
Program
Resource
Resource Organization
Organization
We continue
continue to believe that
that the Biotechnology
Biotechnology Resources Program
Program isis one
one of
of the
most effective
vehicles
for
developing
and
disseminating
technological
tools
for
biomedical
effective
for developing and disseminating technological
for biomedical
research.
resea.rch. The
The goals
goals and
and methods
methods of
of the program
program are
are well-designed
well-designed to encourage
encourage building
building
of
groups
of the necessary multi-disciplinary
multi-disciplinary
groups and
and merging
merging of
of appropriate
appropriate technological
technological and
and
medical
medical disciplines.
disciplines. In
In our
our experience
experience with
with the SUMEX-AIM
SUMEX-AIM resource, several
several elements
elements of
of
this
this approach
approach seem
seem to emerge as
as key to the development
development and management
management of an effective
effective
resource:
1.
1. Effect,ive
Effective Management
Management Framework
Framework -- There
There needs
needs to be
be an explicit
explicit agreement
agreement

between the
t.he BRP
BRP and the resource principal
principal investigator
investigator which
which establishes aa
clear
c1ea.r mandate
mandate for the resource and its allocation,
allocation, provides
provides worthwhile
worthwhile
incentives
incentives for the host institution
institution and investigator
investigator to invest
invest the necessary
substantial
substantial professional
professional career time
time to develop and manage the resource, and
ensures equitable
equitable distribution
distribution of
of resource services to its target
target community.
community.
2.
2. Close \Vorking
Working Relationship
Relationship with
with the NIH
NIH -- A
A resource is a major
major and often
often
long-term
long-term investment
investment of
of money
money and human
human energy. A
A close
close and mutuallymutuallysupportive
supportive working
working relationship
relationship between
between resource management,
management, its advisory
advisory
committees,
is essential to assure healthy
NIH administration
administration
healthy
committees, and the NIH
development
development of
of the resource and its relationship
relationship to its user community.
community. We
We at
SUMEX-AHvl
have benefited
SUMEX-AIhl
benefited immensely
immensely from
from such a relationship
relationship with
with Dr.
Dr.
William
William R. Baker,
Baker, Jr.,
Jr., in the evolution
evolution of
of the SUMEX-AIM
SUMEX-AIM community.
community. We
We
look
beneficial relationship
look forward
forward to a continuing
continuing mutually
mutually beneficial
relationship with
with Dr.
Dr. Baker's
Baker’s
successor at the
NIH.
the NIH.
3. Freedom
by its
Freedom to Explore
Explore Resource Potential
Potential -- A
A resource, by
its nature,
nature, operates
operates
at the
“cutting edge"
edge” in developing
t)he "cutting
developing its
its characteristic
characteristic technology
technology and learning
learning
to effectively
effectively disseminate
disseminate itit to
to the
the biomedical
biomedical community
community at
at large. The
The BRP
BRP
t,o
should not
not impose artificial
artificial constraints
constraints on the
the resource for
for commercializing
commercializing its
its
should
efforts (fees for
for service)
service) or
or developing
developing its
its potential
potential (funding
(funding duration
duration limits
limits or
or
efforts
annual budget
budget ceilings).
ceilings).
artificial policy
policy impositions
impositions can serve to
to
annual
Such artificial
lIndermine the very
very goals central
central to
to the
the BRP’
BRP's
for existence.
existence.
llndermine
s reason for
Satisfactory policies
policies in this
this regard
regard have been worked
worked out
out and
and should
should be
Satisfact.ory
retained.
retained.

Electronic Communications
Communication,Oj
Electronic
SUMEX-AIM has pioneered
pioneered in developing
developing more
more effective
effective methods
methods for
for facilitating
facilitating
SUMES-AIM
scientific communication.
commllllication. Whereas
\Vhereas face-teface
face-to-face contacts
contacts continue
continue to
to play
playa
key role,
role, in
in
scient.ific
a key
the longer-term
longer-term we feel that
that computer-based
computer-based communications
communications will
will become increasingly
increasingly
the
important to
to the
the NIH
NIH and
and the
the biomedical
biomedical community.
community. We
We would
would like
like to
to see
see the
the BRP
BRP take
take
important
more adive
active role
role in promoting
promoting these tools
tools within
within the
the NIH
NIH and
and its
its grantee
grantee community.
community.
aa more

E. A.
A. Feigenbaum
Feigenbaum
E.

68
68

sp·a RROO785-11
RR00785-11
5P-11

Description
Description of
of Scientific
Scientific Subprojects
Subprojects

ll. Description
Description of
of Scientific
Scientific Subprojects
Subprojects
II.
il.A. Scientific
Scientific Subprojects
Subprojects
ILA.
The following
following subsections
subsections report
report on the
the AIM
AIM community
community of
of projects
projects and “pilot”
"pilot"
The
efforts including
including local
local and national
national users of
of the
the SUMEX-AIM
SUMEX-AIM facility
facility at
at Stanford.
Stanford.
efforts
However, those projects
projects admitted
admitted to
to the
the National
National AIM.
AIM community
community which
which use the
the RutgersRutgersHowever,
AIM resource
resource as their
their home base are not
not explicitly
explicitly reported
reported here.
AIhf
addition to
to these det;ailed
detailed progress
progress reports,
reports, abstracts
abstracts for
for
In a.ddition
individual users are submitted
submitted on a separate
separate Scientific
Scientific Subproject
Subproject
individual
have included
included here briefer
briefer summary
summary abstracts
abstracts of
of the
the fully-authorized
fully-authorized
B on page 209.
B

project and its
its
each project
Form. However,
However, we
Form.
projects in
in Appendix
Appendix
projects

The collaborative
collaborative project
project reports
reports and comments
comments are the
the result
result of
of a solicitation
solicitation for
for
The
contributions sent
sent to
to each of
of the
the project
project Principal
Principal Investigators
Investigators requesting
requesting the
the following
following
contriblltions
information:
information:
I. SUMhIARY
SUMMARY OF
OF RESEARCH
RESEARCH PROGRAM
PROGRAM

A. Project
Project rationale
rationale
A.
Medica.l relevance
relevance and collaboration
collaboration
B. Medical
Highlights of
of research
research progress
progress
C. Highlights
--Accomplishments this
this past
past year
year
--Accomplishments
--Research in progress
progress
--Research
D. List
List of
of relevant
relevant publications
publications
E. Funding
Funding support
support
II. INTERACTIONS
\\11TH
INTERACTIONS
WITH THE
THE SUMEX-AIM
SUMEX-AIM RESOURCE
RESOURCE
A. Medical
Medical collaborations
collaborations and program
program dissemination
dissemination via
via SUMEX
SUMEX
B. Sharing
Sharing and interactions
interactions with
with other
other SUMEX-AIM
SUMEX-AIM projects
projects
(via
(via comp1lting
computing facilities,
facilities, workshops,
workshops, personal
personal contacts,
contacts, etc.)
etc.)
C. Critique
Critique of
of resource management
management
(community
(community facilitation,
facilitation, computer
computer services, communications
communications
services, capacity,
capacity, etc.)
etc.)
III.
III. RESEARCH
RESEARCH PLANS
PLANS
A. Project
Project goals and plans
--Near-term
--Near-term
--Long-range
--Long-range
B. .Justification
.Justifica.tion and requirements
requirements for
for continued
continued SUMEX
SUMEX use
use
C. Needs
Nerds and plans for
for other
other computing
computing resources beyond
beyond SUMEX-AIM
SUMEX-AIM
D. Recommendations
Recommendations for future
future community
community and resource development
development
\'Ve
We believe that
that the reports
reports of the individual
individual projects
projects speak for
for themselves
themselves as
as
case, the reports
reports are recorded
recorded as
as submitted
submitted and are
rationales
In any case,
rationales for
for participation.
participation.
The only
only exceptions
exceptions are the respective
respective
the responsibility
responsibilit#y of the indicated
indicated project
project leaders. The
lists of relevant
relevant publications
publications which
which have been uniformly
uniformly formatted
formatted for parallel
parallel reporting
reporting
on the Scientific
Scientific Subproject
Subproject Form.
Form.

69
59

Feigenbaum
E. A. Feigenbaum

Stanford Projects
Projects
St.anford

RR00785-11
5P41 RR00785-11

II.A.!. Stanford
Stanford Projects
Projects
II.A.l.
The following
following group
group of
of projects
projects is formally
formally approved
approved for
for access
access to
to the
the Stanford
Stanford
The
aliquot of
of the
the S
SUMEX-AIM
resource. Their
Their access
access is based on
on review
review by
by the
the Stanford
Stanford
alicluot
UXIEX-AIM resource.
Advisory Group
Group and
and approval
approval by
by Professor
Professor Feigenbaum
Feigenbaum as Principal
Principal Investigator.
Investigator.
,4dvisory
In addition
addition to
to the
the progress
progress reports
reports presented
presented here, abstracts
abstracts for
for each project
project and
and
In
its individual
individual users are submitted
submitted on a separate
separate Scientific
Scientific Subproject
Subproject Form.
Form.
its

E. A.
,4. Feigenbaum
Feigenbaum

70

EXPEX
EXPEX - Expert
Expert Explanation
Explanation Project
Project

5P41
5P41 RR00785-11

IT.A.1.1.
EXPEX
- Expert
Explanation
Project
II.A.l.l.
EXPEX
Expert
Explanation
Project

EXPEX
Project
EXPEX - Expert
Expert Explanation
Explanation
Project
Edward
M.D.,
Edward H.
H. Shortliffe,
Shortliffe,
M.D., Ph.D.
Ph.D.
Departments
of
Science
Departments
of Medicine
Medicine and
and Computer
Computer
Science
Stanford
Stanford University
University

I.
I.

SUM:MARY
OF
PROGRAM
SUMMARY
OF RESEARCH
RESEARCH
PROGRAM

A.
A. Project
Project Rationale
Rationale
EXPEX
EXPEX is
is not
not a single project
project but
but a combination
combination of
of efforts
efforts that
that are directed
directed at
basic
ba.sic issues
issues in the development
development of
of representational
representational schemes to facilitate
facilitate knowledge
knowledge
acquisition
The
The work
work includes
includes not
not only
only the study
study of
of fundamental
fundamental
acquisition and explanation.
explanation.
representational
represent,ations.! formalisms
formalisms but
but also the encoding
encoding of various
various types
types of
of knowledge,
knowledge, such as
as
causal information
information and user models. In
In addition,
addition, to complement
complement these research directions,
directions,
the project
project has served as
as the focus for
for preparing
preparing three
three books on medical
medical computing
computing
research.
vVe
We believe that
that the productivity
productivity of basic computer
computer science research tends
tends to be
heightened
heightened by experiments
experiments that
that deal with
with significant
significant real world
world problem
problem domains.
domains.
Challenges
biology have introduced
Challenges drawn
drawn from
from chemistry,
chemistry, medicine,
medicine, and molecular
molecular biology
introduced
additional
but have simultaneously
addition4 complexity
complexity to
to expert
expert systems
systems work
work at
at Stanford,
Stanford, but
simultaneously forced
forced
system
developers
to
respond
to
pragmatic
constraints
and
user
demands
system developers to respond to pragmatic constraints
demands that
that have had a
significant
significant impact
impact on the basic AI
AI techniques
techniques selected or
or developed.
developed. Thus,
Thus, we believe that
that
creative
by working
creative investigation
investigat,ion into
into symbolic
symbolic reasoning
reasoning techniques
techniques is facilitated
facilitated by
working in real
world
Much
world settings
sett,ings where
where the application
application forces us to
to avoid
avoid oversimplification.
oversimplification.
Much of
of our
our
research effort
effort therefore
therefore deals with
with medical
medical domains
domains (viz.,
(viz., endocrinology
endocrinology and renal
renal
pathophysiology).
pathophysiology).

Relevance and
B. lvfediral
Medical Relevance
and Collaboration
Collaboration
Our
Our interest
interest, in explanation
explanation derives
derives from
from the insights
insights we gained
gained in developing
developing
explanatory ca.pabi!ities
capabilities for
for the
the MYCIN
MYCIN system.
system.
In the
the case
case of
of MYCIN
MYCIN and
and its
its
explana.tory
In
descendents, we have been able to
to generate
generate intelligible
intelligible explanations
explanations by
by taking
taking advant,age
advantage
descendents,
of its
its rule-based
rule-based representation
representation scheme. Rules can be translated
translated into
into English
English for
for display
display
of
to
a
user,
their
interactions
explicitly
demonstrated.
By
adding
and
their
interactions
can
also
be
explicitly
demonstrated.
to
By adding
mechanisms for
for understanding
understanding questions
questions expressed in simple
simple English,
English, we were
were able to
to
mechanisms
create an interactive
interactive system
system that
that allowed
allowed physicians
physicians to
to convince
convince themselves
themselves tha.t
that they
they
create
with t,he
the basis for
for the
the program’
program's
recommendations.
The limit,ations
limitations of
of the
the
agreed with
s recommendations.
The
explanations generated
generated in this
this way
way have become increasingly
increasingly obvious,
obvious, however,
however, and have
expIa.nations
to improved
improved characterization
characterization of
of the
the kinds
kinds of
of explanation
explanation capabilities
capabilities that
that must
must be
led t.o
developed ifif clinical
clinical consultation
consultation systems
systems are to
to be accepted
accepted by
by physicians.
physicians. The
The potential
potential
developed
of workstation
workstation graphics
graphics as a means of
of avoiding
avoiding natural
natural language
language issues
issues in
in the
the
use of
explanation process is also an area
area of
of great
great promise
promise with
with which
which we are currently
currently
exp!a.nat.ion
experimen ting.
experimenting.
\Vith these motivations
motivations in mind,
mind, we are involved
involved in a series of
of research
research projects
projects
With
that address medical
medical knowledge
knowledge representation
representation and
and explanation.
explanation. The
The individual
individual projects
projects
that
include
the
following:
include
following:

71

A. Feigenbaum
Feigenbaum
E. A.

EXPEX
EXPEX - Expert
Expert Explanation
Explanation Project
Project

5P41
5P4 1 RR00785-11
RROO785-11

1.
1. Mr.
hlr.

Greg
NESTOR program
program uses
Greg Cooper's
Cooper’s NESTOR
uses a detailed
detailed knowledge
knowledge base
regarding
pathophysiologic relationships
The
program is
regarding pathophysiologic
relationships in hypercalcemia.
hypercalcemia.
The program
designed to
physician's
proposed explanation
to critique
critique a physician’
s hypothesis
hypothesis regarding
regarding a proposed
expla.nation
for
patient manifestations
for a set of
of patient
manifestations when
when an elevated
elevated serum
serum calcium
calcium has been
observed.
particular interest
Of particular
interest is the techniques
techniques Cooper
Cooper has developed
developed for
for
observed. Of
using knowledge
knowledge of
of causality
causality to
to avoid
avoid the assumption
assumption of
of conditional
conditional
independence
independence commonly
commonly used in Bayesian
Bayesian diagnosis
diagnosis systems.

2. Mr.
pathophysiology,
Mr. John Kunz
Kunz has represented
represented the
the knowledge
knowledge of
of renal
renal pathophysiology,

including
including the quantitative
quantitative relationships
relationships that
that characterize
characterize the way
way in which
which the
body manages water
body
water and electrolytes,
electrolytes, to develop
develop a consultation
consultation and analysis
analysis
system
system (AI/MM)
(AI/MM) that
that melds mathematics
mathematics and AI
AI techniques.
techniques.
3. Building
Building on his earlier
earlier experience
experience with
with developing
developing an explanation
explanation capability
capability for
for

NEOM YCIN (in
project members
NEOMYCIN
(in collaboration
collaboration with
with the GUIDON
GUIDON project
members as
as outlined
outlined
elsewhere in this
this report),
report), Dr.
Dr. Glenn
Glenn Rennels has begun to
to work
work on a new
system
syst*em that
that uses
uses knowledge
knowledge of
of medicine
medicine to help
help formulate
formulate and resolve complex
complex
decision
better
decision analyses. Convinced
Convinced that
that decision
decision analytic
analytic techniques
techniques would
would be better
accepted
physician were to
accepted in medicine
medicine if
if the physician
to interact
interact with
with a knowledge-based
knowledge-based
interface
int)erface (rather
(rather than
t,han with
with the
the decision
decision trees themselves),
themselves), Dr.
Dr. Rennels has
made use of
"influence
diagrams"
as
a
central
method
central method for
for guiding
guiding the
the
of “influence diagrams” as
interaction.
The
explanation
issues
become
especially
evident
when
interaction. The explanation issues
especially evident when an analysis
analysis
is complete
complete and his system
system needs to
to generate
generate a defense for
for the recommendation
recommendation
it
it has made.
4. Mr.
Mr. Curt
Curt Langlotz
La.nglotz has continued
continued to
to work
work on a hypothesis
hypothesis assessment module
module

for
program uses
This program
uses a critiquing
critiquing model
model which
which
for the ONCOCIN
ONCOCIN system.
system. This
inherently
inherently involves
involves advanced
advanced explanation
explanation techniques.
techniques. The
The work
work uses
uses the
the Xerox
Xerox
1108 professional
professional workstation
workstation (Dandelion)
(Dandelion) and is further
further described
described in the
ONCOCIN
portion of
ONCOCIN Project
Project portion
of this
this annual
annual report.
report.
S.
project using the Xerox
.5. During
During 1983, Ms. Shako
Shoko Tsuji
Tsuji completed
completed a project
Xerox workstation
workstation
to experiment
experiment with
with graphical
graphical techniques
techniques for
for examining,
examining, manipulating,
manipulating,
expanding,
base. Also
expanding, and editing
editing a large medical
medical knowledge
knowledge base.
Also working
working in the
context
of
ONCOCIN,
her
code
was
designed
for
use
by
knowledge
context of ONCOCIN, her
for use by knowledge engineers.
The
building an interface
The work
work has inspired
inspired subsequent
subsequent work
work in building
interface for
for the nonprogrammer clinician
protocols in the
programmer
clinician who
who wishes to
to write
write and test
t,est new protocols
ONCOCIN
The
project is described
ONCOCIN environment.
environment.
The project
described in greater
greater detail
detail in the
ONCOCIN
portion of
ONCOCIN portion
of the annual
annual report.
report.
To
prepared two
booles on
To complement
complement, these basic research activities,
activities, we have
hat-e prepared
two boo!cs
Artifidal
beginning work
Artificial Intelligence
Intelligence in Medicine
Medicine and are beginning
work on a third
third (see
(see Section
Section C for
for
details).
details).

C.
Highlights of
Research Progress
Progress
G. Highlights
o j Research

C.l
NESTOR
CT.1The NI3’
TOR System
Sydem
NESTOR is intended
patient data
NESTOR
intended to allow
allow a user to
t’o input
input patient
data plus a hypothesis,
hypothesis, and
then
then have the system
system critique
critique that
that hypothesis
hypothesis in light
light of
of the data.
data. The
The system,
system, an evolving
evolving
thesis
project that
t.hesis project
that is largely
largely the work
work of
of Mr.
Mr. Greg
Greg Cooper,
Cooper, relies on basic associational
associational
information
part from
but supplemented
information drawn
drawn in part
from the INTERNIST-J
INTERNIST-I knowledge
knowledge base but
supplemented with
with
causal and temporal
t.emporal associations.
associations.
The
behind this
physicians want
The motivation
motivation behind
this research
research is the conviction
conviction that
that physicians
want active
active

E. A. Feigenbaum
Feigenbaum

72

EXPEX
EXPEX - Expert
Expert Explanation
Explanation Project
Project

5P41
5P41 RR00785-11
R.R00785-11

control
cont,rol of the diagnostic
diagnostic process and that
that they
they also want
want and need aa system
system that
that explains,
explains,
in aa user-tailored
user-tailored way, its evaluation
evaluation of the physician's
physician’s hypothesis.
hypothesis. There
There may
may be
be times
times
when the user wants
wants to give complete
complete control
control to NESTOR
NESTOR and just
just be
be in aa mode of
answering
answering questions, but
but we feel
feel that
that this
this should be
be an option
option and not
not aa requirement.
requirement. It
It
is
is observations
observations such as
as these that
that have also accounted
accounted for
for the hypothesis
hypothesis assessment
assessment work
work
underway
underway in the ONCOCIN
ONCOCIN research, briefly
briefly mentioned
mentioned above and further
further described
described in
the section
section of this
this report
report dealing
dealing with
with that
that project.
project.
The
The initial
initial NESTOR
NESTOR system
system is
is now largely
largely complete
complete and is undergoing
undergoing evaluation
evaluation
at this
time.
Of
particular
interest
is
the
adequacy
of
the
techniques
Of particular interest
techniques developed for
this time.
allowing
allowing NESTOR
NESTOR to avoid
avoid the traditional
traditional assumption
assumption of conditional
conditional independence used in
Bayesian systems.
syst,ems. Also, because
because NESTOR's
NESTOR’s probabilistic
probabilistic model is
is more formal
formal than
than the
ad hoc scheme used
used in, say, INTERNIST,
INTERNIST, the assumptions
assumptions made by our
our system
system are more
explicit.
explicit.
\Ve have also developed search techniques
techniques that
that allow
allow NESTOR
NESTOR to
to explore
explore
efficiently
efficiently a very
very large search space in order
order to find
find the most
most probable
probable (multiple
(multiple disease)
disease)
hypothesis.
hypothesis. This
This technique
technique is
is general and can be applied
applied to many
many nonmedical
nonmedical problems
problems
where
where the goal is to find
find the most
most probable
probable hypothesis
hypothesis among many
many possibilities.
possibilities.

C.2
C.2 Integrating
Integrating ~Mathematical
Mathematical Models
Models with
with AI
AI Afethods
Methods
This
This research project,
project, known
known as
as AI/MM,
AI/MM, is the dissertation
dissertation research of
of Mr.
Mr. John
Kunz.
The
The system
system int,egrates
integrates AI
AI and simple
simple mathematics
mathematics to analyze a physiological
physiological
Kunz.
model.
model. In a selected medical
medical domain
domain (renal
(renal physiology),
physiology), we have built
built a computer
comput,er
program
It
program based on these techniques.
techniques.
It analyzes physiological
physiological behavior,
behavior, diagnoses
abnormality,
program fits
abnormality, and explains
explains the rationale
rationale for
for its
its analyses. The
The program
fits data
data to the
model.
model, identifies
identifies whether
whether the data
data are abnormal,
abnormal, and identifies
identifies the possible causes
causes and
effects
The
physiological model
effects of
of any
any abnormalities.
abnormalities.
The physiological
model is based on knowledge
knowledge about
about
anatomy,
physiological system,
anatomy, the hehavior
heha,vior of
of the physiological
system, and the mechanism
mechanism of
of action
action of
of the
system.
problems discussed
system. It's
It’s validity
va,lidity has been tested
tested by
by having
having it
it analyze many
many of
of the problems
in Valtin's
Renal Function.
Function.
Va.ltin‘s text
text IZellal
The
project have been to:
The specific
specific aims of
of this
this project
1.
physiological model.
1. Dev('lop
Develop a vocabulary
vocabulary for
for a physiological
model.

The
The vocabulary
vocabulary represents
represents
the "basic
physiology" of
“ba,sic physiology”
of a biological
biological system
system and appears to be adequate to
the concepts
concepts included
introductory professional-level
professional-level physiology
physiology
express Lhe
included in an introductory
text.
text.

Develop a reasoning
reasoning system
system which
which can solve problems
problems expressed
expres;~ed in the
2. Develop
vocabulary.
vocabul:try.
Demonstrate the basic necessity, appropriateness
appropriateness and limitations
limitations of
of the
3. Demonstrate
vocabulary and reasoning
reasoning procedure.
procedure.
vocabulary
C.3
Knowledge-Based Explanations
Explanations in
in a Decision
Decision Analysis
Analysis Environment
Environment
Cr.3 Kno,ufledge-Based
This new project,
project, thesis research by
by Dr.
Dr. Glenn
Glenn Rennels, is motivat,ed
motivated by
by the
This
observation that
that AI
AI techniques
techniques could
could greatly
greatly facilitate
facilitate a user’
user's
effort to
to specify
specify the
the det.ails
details
observation
s effort
of a complex
complex clinical
clinical decision
decision task
task and
and to
to seek assistance with
with that
that task.
task. Although
Although
of
decision theoretic
theoretic notions
notions have been shown
shown to
to be relevant
relevant to
to such medical
medical problems,
problems, they
they
decision
largely been unused by
by clinicians,
clinicians, even when
when computer-based
computer-based solutions
solutions have been
have largely
offered. \Ve
\Ve believe
believe that
that an intelligent
intelligent system
system should
should be able to
to guide
guide the
the definition
definition of
of the
the
offered.
decision t.ask
task and explain
explain the
the results
results of
or the
the analysis
analysis without
without requiring
requiring that
that a user be
decision
familiar wit,h
with the
the underlying
underlying decision
decision analytic
analytic techniques
techniques being
being used to
to solve the
the problem.
problem.
familiar

73

A. Feigenbaum
Feigenbaum
E. A.

EXPEX
EXPEX - Expert
Expert Explanation
Explanation Project
Project

5P41 RR00785-11
RR00785-11

The
The basic notion
notion is to
to use directed
directed graphs,
graphs, termed
termed "influence
“influence diagrams"
diagrams” as
as a
language for
physician at
for communication
communication with
with a physician
at a graphical
graphical display
display terminal.
terminal.
Nodes in
these graphs
by the user who
graphs are defined
defined by
who is seeking advice, and their
their structure
structure and
meaning
meaning is largely
ia.rgely intuitive.
intuitive. The
The task
task of
of converting
converting influence
influence diagrams
diagrams to
to decision
decision trees is
a knowledge-based
problem that
potentially well-suited
knowledge-based problem
that is potentially
well-suited for
for a solution
solution that
that uses
uses AI
AI
methods.
Similarly, the
the results
results of
of a decision
decision analysis,
analysis, including
including the sensitivity
sensitivity analysis,
analysis,
methods. Similarly,
will
physician user in terms
will need to
to be explained
explained to
to the physician
terms of
of influence
influence diagrams
diagrams and
knowledge
The
being
The necessary knowledge
knowledge structures
struct)ures are currently
currently being
knowledge of
of the domain.
domain.
designed, and an early
prototype system
The
The research uses
uses a 9835
9836
early prototype
system is operational.
operational.
workstation
by Hewlettworkstation donated
donated to the Medical
Medical Information
Information Sciences
Sciences Training
Training Program
Program by
HewlettPackard
Packard Company
Company and soon to
to be networked
networked to
to the SUMEX
SUMEX 2060.

C.4 Books
Books on Medical
Afedical Arti
Artificial
Intelligence and
Afedical Computing
jicial Intelligence
and Medical
Computing
\Ve
both of
published
1Ve have completed
completed two
two books, both
of which
which are in press and due to be published
in mid-1984:
mid- 1984:
W.J.
E.H.
Readings in
Medical Artificial
Artificial
W.J. and Shortliffe,
Shortiiffe,
E.H. Readings
in Medical
Intelligence: The First
First Decade. Reading,
Reading, MA:
MA: Addison-Wesley,
Addison-Wesley, 1984.
Intelligence:

•0 Clancey.
Clancey,

Rule-Based Ezpert
Expert Systems:
Rule-Based
Systems: the
.MYCIN
Experiments of
Heuristic Program.ming
Programming Project.
Project.
h,fi%IN Experiments
o j the Stanford
Stan ford Heuristic
Reading.
Reading, MA:
MA: Addison-\Vesley,
Addison-Wesley. 1984.

•l Buchanan,
Buchanan, \V.J.
W.J. and Shortliffe,
Shortiiffe, E.H.
E.H.

In addition,
just begun work
beginning to
addition, we have just
work on a textbook
textbook for
for students
students beginning
t.o study
study
medical
This
medical computing
computing and artificial
artificial intelligence.
intelligence.
This multi-authored
multi-authored volume
volume should
should be
completed
by the end of
publication date is contemplated.
completed in draft
draft form
form by
of 1984. A
A 1985 publication
contemplated.
•l Shortliffe,
An Introduction
Introduction to
Short.iiffe, E.H.,
E.H., \Viederhold,
Wiederhold, G.C.M.,
G.C.M., and Fagan,
Fagan, L.M.
L.M. An
Afedical Computer
preparation).
Rfedical
Computer Science.
Science. Reading,
Reading, MA:
MA: Addison-Wesley
Addison-Wesley (in
(in preparation).

D. Publications
Publications Since
January 1989
D.
Since January
1983
1. Shortliffe,
Shortliffe, E.H.
E.H.

Medical
In
Medical consultation
consultation systems:
systems: designing
designing for
for doctors.
doctors.
In
Dcsigning
for
Human-Computer
Communication
(M.S.
Sime
and
M.J.
Designing for Human-Computer
Communication
(M.S.
M.J.
Coombs.
Coombs. eds.),
eds.). Chapter
Chapter 8, pp. 209-238, London:
London: Academic
Academic Press, 1983.

2. Shortliffe,
Shortliffe, E. H. Medical
Medical Cybernetics:
Cybernetics: The
The Challenges
Challenges of
of Clinical
Clinica. Computing.
Computing.
In
Tcchnology
International
Stability,
and
Growth,
S.
Basheer
Ahmed
In Tech,nology International
Stability, and Growth, S.
Ahmed and
Alice
Alice P. Ahmed,
Ahmed, editors;
editors; Chapter
Chapter 12, pp. 148-165; Associated
Associated Faculty
Faculty Press,
Inc.,
New York,
Inc., Port, \Vashington,
Washington, New
York, 1984.
3. (*)
(*) Shortliffe,
Shortliffe, E.H.
E.H. and Fagan,
Fagan, L.M.
L.M. Expert
Expert systems
systems research: modeling
modeling the
medical
An Integrated
Integrated Approach
Approach to Monitoring
Afonitoring
In An
medical decision
decision making
making process. In
(J.S. Gravenstein.
Newbower, A.K.
N.T. Smith,
Gravenstein, R.S. Newbower,
.4.K. Ream, and N.T.
Smith, eds.), pp.
183-200, Woburn,
Woburn, MA:
hIA: Butterworth's,
Butterworth’s, 1983.
1983.
'1.
4. Duda,
Duda,

R.O.
R.O. and Short-Iiffe,
Shortliffe,
220:251-268
220:261-368 (1983).

E.H.
E.H.

Expert
Expert systems
systems

research.

Science,
Science,

5. (*)
(*) Langlotz,
Langiotz. C.P.
C.P. and Shortliffe,
Shortliffe, E.H.
E.H. Adapting
Adapting a consultation
consultation system
system to
to
critique
plans.
International Journal
Journal of
Studies,
critique user plans.
International
o j lI.-fan-Afachine
Man-Muchine
Studies,
19>179-495
19:479-496 (1983)
(1983)

6. Shortliffe,
Hypothesis
Shortliffe, E.H.
E.H.
Hypothesis generation
generation in medical
medical consultation
consultation systems:
systems:
art·ificial
In lI.-fEDINFO
83 (J.H.
artificial intelligence
intelligence approaches.
approaches.
MEDINFO
(J.H. van
van Bemmel,
Bemmel,
M.
North Holland,
M. Ball,
Ball, and
a.nd O.
0. Wigertz,
Wigertz, eds.),
eds.). pp. 480-483, North
Holland, Amsterdam,
Amsterdam, 1983.
E. A. Feigenbaum
Feigenbaum

74

EXPEX
EXPEX -- Expert
Expert Explanation
Explanation Project
Project

5P41
5P4 1 RR00785-11
RR00785- 11

7.
Graphical access
access to
to the
the knowledge
knowledge base
base of
of aa
7. (*)
(*) Tsuji,
Tsuji, S.
S. and
and Shortliffe,
Shortliffe, E.H.
E.H. Graphical
medical
consultation
system.
Proceedings
of
AA
.
A'[SI
Congress
83,
Congrese 83, pp.
pp.
Proceedings of A4MSI
medical consultation system.
551-555,
San
Francisco,
Ca
.
,
May
1983.
551-555, Ssn Francisco, Ca., May 1983.
8.
The
In
The science
science of
of biomedical computing.
computing.
In A1eeting
hleetilzg the
the
8. Shortliffe,
Shortliffe, E.H.
E.H.
and Medical
Challenge:
Challenge: Informatics
In formatics
Medical Education
Education (J.C.
(J.C. Pages,
Pages, A.H. Levy,
Levy,
F.
F. Gremy,
Gremy, and
and J.
J. Anderson, eds.),
eds.), pp. 1-10,
l-10, Amsterdam:
Amsterdam: North-Holland,
North-Holland, 1983.
1983.
To
To be
be reprinted
reprinted ih
ih Medical
Medical Informatics,
Informatics, 1984.
1984.
9.
9. (*)
(*) Kllnz.
Kunz, J.C.,
J.C., Shortliffe,
Shortliffe, E.H.,
E.H., Buchanan,
Buchanan, B.G.,
B.G., and
and Feigenbaum,
Feigenbaum, E.A.
E.A.
Comparison
Comparison of techniques of computer-assisted decision
decision making
making in medicine.
In
111Pure and Applied
.4ppfied Biostructure
Biostructure (Claudio
(Claudio Niccolini,
Niccolini, Ed.), Singapore: World
World
Press.
Press, 1983.
1983.
10.
10. (*)
(*) Kunz, J.C., Shortliffe,
Shortliffe, E.H., Buchanan, B.G.,
B.G., Feigenbaum, E.A. ComputerComputerassisted
of Philosophy
Philosophy and A-fedicine,
Medicine,
assisted decision making
making in medicine. Journal
Journal of
Summer 1984
1984 (in press).
press).
11.
ll.‘(*)(*) HasJing,
Hssling, D. \V.,
W., Clancey, W.
W. J.,
a.a diagnostic
consultation
system.
diagnostic consultation
Studies,
Studies, Spring
Spring 1984
1984 (in press).
press).

and Rennels,
Rennels, G. Strategic
Strategic explanations
explanations for
International
Journal
of
International
Journal of Afan-Afachine
Man-Machine

12.
12. Shortliffe,
Shortliffe, E.H. Reasoning methods in medical consultation
consultation systems: artificial
artificial
intelligence
intelligence approaches (tutorial).
(t,utorial). Computer
Computer Programs
Programs in
in Biomedicine,
Biomedicine,
January
January 1984
1984 (in press).
press).

E. Funding
Funding Support
Support
Grant
Grant Title:
Title: "The
“The Development
Development of Representation
Representation Methods
Methods to Facilitate
Facilitate
Knowledge
Knowledge Acquisition
Acquisition and Exposition
Exposition in Expert
Expert Systems"
Systems”
Principal
Principa.1 Investigator:
Investigator: Edward
Edward H. Shortliffe
Shortliffe
Agency:
Naval Research; ID
Number: NR
NR 049-479
A4gency: Office
Office of
of Naval
ID Number:
049-479
Term: January
January 1981
to December 1983
Term:
1981 to
Total award:
award: $~456,622
$455.622
Total
Grant Title:
Title: “Research
"Research on Introspective
Introspective Systems”
Systems"
Grant
Principal Investiga.tor:
Investigator: M
Michael
Principal
ichael R. Genesereth
Agency: Office
Office of
of Naval
Naval Research; ID
ID Number:
Number: NR
NR 049-479
Agency:
Term: Jnnunry
January 1984
1984 to
to December 1986
Term:
Total award:
award: $312,070
$:312.070
Total
Title: “Information
"Information Structure
Structure and Use in Knowledge-Based
Knowledge-Based Expert
Expert
Grant. Title:
Systems"
Systems”
Principal Investigat.or:
Investigator: Bruce
Bruce G. Buchanan
Buchanan
Principa.1
Agency: Na.tionsl
National Science
Science Foundation;
Foundation; ID
ID Number:
Number: 83-12148
83-12148
Agency:
Term: h,larch
t-.larch 1984 to
to February
February 1987
Term:
Tota,l
award: $300,000
$300.000 (includes
(includes indirect
indirect costs)
Tot
al award:

ll.
II.

INTERACTIONS WITH
WITH THE
THE SUMIZX-AIM
SUMEX-AIM RESOURCE
RESOURCE
INTERACTIONS
A.. Medical
Afedical Collaborations
Collaborations and
and Program
Program Dissemination
Dissemination via
via SUMEX
SUAfEX
A.

None of
of t,hese
these new
new programs
programs is
is yet
yet ready
ready for
for disseminat,ion.
dissemination.
None

75
75

They are
are mostly
mostly
They

E. A.
A. Feigenbaum
Feigenbaum
E.

5P41 RR00785-11
RR00785- 11

EXPEX
EXPEX - Expert
Expert Explanation
Explanation Project
Project

fundamental
research experiments
experiments with
with limited
limited clinical
clinical usefulness
usefulness other
other than
than as
funda.mental
research
demonstration
demonstrat,ion projects.
projects. Our
Our past
past experience
experience has shown,
shown, however,
however, that
that SUMEX
SUMEX provides
provides
a superb
superb vehicle
vehicle for
for demonstrating
demonstrating systems,
systems, even at a dist.ance.
dist,ance.
The
The new book
book writing
writing eCfort
effort will
will in particular
particular be facilitated
facilitated by
by SUMEX,
SUMEX, much
much as
the AI
A1 Handbook
Handbook was in the
the past.
past. A
A multi-authored
multi-authored text
text of
of this
this type,
type, particularly
particularly one for
for
which
which the
the authors
authors are spread
spread at
at numerous
numerous different
different universities
universities around
around the country,
country,
would be a nightmare
nightmare to
to compile
compile if
if it
it were not
not for
for the
the SUMEX
SUMEX resource.
resource. Many
Many of
of the
the
would
of manuscript
manuscript
contributors to
to the
the book
book have been assigned SUMEX
SUMEX accounts
accounts for
for purposes
purposes of
contributors
preparation.
preparation. Online
Online manuscript
manuscript work
work through
through the
the shared
shared facility,
facility, coupled
coupled with
with messaging
messaging
capabi\it.ies,
capabilit,ies, will
will greatly
greatly enhance the
the efficiency
efficiency and
and accuracy
accuracy oC
of the developing
developing chapters
chapters
the editing
editing process.
and the

B.
Interaction with
UM Project8
Projects
B. Sharing
Sharing and
and Interaction
with Other
Other SUMEX-..
SUMEX-AIM

Although our
our EXPEX
EXPEX work
work is young,
young, we are already
already benefiting
benefiting Cram
from interactions
interactions
Although
with ot.her
other researchers
researchers who
who use
use the SUMEX-AIM
SUMEX-AIM
resource.
The NESTOR
work in
with
resource.
The
NESTOR work
particular
knowledge
particular has depended
depended on access
access to
to the
the INTERNIST-l
INTERNIST-l
knowledge base and on Crequent
frequent
exchange of
Similarly,
of messa.ges
messa.ges with
with the researchers
researchers at
at the
the University
University of
of Pittsburgh.
Pittsburgh.
Similarly, our
our
collaboration
collaboration with
with the
the GUIDON
GUIDON research
research team
team for
for the
the implementation
implementation of
of an explanation
explanation
capability
capability would
would not
not have been possible
possible without
without the
the facilitated
facilitated communication
communication and
and shared
shared
file access
access available
available via
via SUMEX.
SUMEX.
file
C. Critique
Resource Management
Atfanagement
Critique of
of Resource
SUf\IEX
Not
SUhIES continues
continues to
to provide
provide a superb
superb environment
environment for
for research
research of
of this
this kind.
kind. Not
only
only is the
the 201)0
2060 a well
well managed
managed resource
resource under
under Ed
Ed Pattermann's
Pattermann’s leadership,
leadership, but
but the
hypothesis
hypothesis assessment and graphical
graphical query
query systems
systems are dependent
dependent upon
upon access
access to
to high
high
performance
performance professional
professional workstations,
workstations, and
and we are delighted
delighted with
with the resources that
that
SUMEX
SUMEX has provided
provided us in this
this regard.
regard.
III.
RESEARCH
PLANS
ID.
RESEARCH
PLANS

A. Project
Project Goals
Plan8
A.
Goal.9 and
and Plans
\Ve
We anticipate
ant,icipate completion
completion of
of many
many of
of these basic research
research efforts
efforts during
during the
the coming
coming
year.
NESTOR work
year. Cooper's
Cooper’s NESTOR
work is largely
largely complete,
complete, and a thesis
thesis document
document is anticipated
anticipated
in June
June 1984. Similarly,
Similarly, Kunz
Kunz has completed
completed his work
work on AI/MM,
AI/MM, and
and his dissertation
dissertation is
approaching
approaching completion.
completion. Both
Both Cooper
Cooper and
and Kunz
Kunz have completed
completed their
their oral
oral examinations
examinations
this ",,"ork.
work.
on this
The
The project
project oC
of Tsuji
Tsuji is complete
complete and
and she has now
now lett
left StanCord.
Stanford. However,
However, the
the code
she developed
developed is being
being modified
modified Cor
for ongoing
ongoing use in the
the ONCOCIN
ONCOCIN environment.
environment.
The
The project
project oC
01 Langlotz
Langlotz continues
continues to
to be an active
active research
research effort
effort within
within the
the
ONCOCIN
His plan
ONCOC’IN project.
project.
plan for
for the
the coming
coming year
year is briefly
briefly outlined
outlined in the ONCOCIN
ONCOCIN
portion of
of this
this annual
annual report.
report.
portion
The
just getting
The work
work of
of Rennels,
Rennels, which
which is just
getting underway,
underway, will
will be better
better formulated
formulated by
by
next
next year
year at
at this
this time.
time. We
We expect
expect the
the project
project to
to last
last at
at least
least two
two more
more years,
years, however.
however.
The
The textbook
textbook preparation
preparation is scheduled
scheduled for
for completion
completion in approximately
approximately one year,
year,
with publication
publication anticipated
anticipated during
during H185.
1985.
with

E. A. Feigenbaum
Feigenbaum

76
76

EXPEX
EXPEX - Expert
Expert Explanation
Explanation Project
Project

5P41
5P4 1 RR00785-11
RROO785-11

B. Requirement8
Requirement.'} for
for Continued
Continued SUAfEX
SUMEX Use
All
All the work
work we are doing
doing is largely
largely dependent
dependent on the SUMEX
SUMEX re.,ource.
resource. The
The new
work
by the Medical
work of
of Rennels
Rcnnels is using Hewlett-Packard
Hewlett-Packard 9835
9836 workstations
workstations owned
owned by
Medical
Information
program, but
but Dr.
Information Sciences training
training program,
Dr. Rennels continues
continues to be dependent
dependent upon
upon
SUMEX
for
projects, only
SUhlEX
for communication
communication and collaboration.
collaboration. Of
Of the other
other projects,
only the
t,he hypothesis
hypothesis
assessment and graphical
projects are sufficiently
justify their
graphical query
query projects
sufficiently mature
mature to justify
bheir transfer
transfer to
to
one of
personal workstations,
of the SUMEX
SUMEX personal
workstations, so the new 2060 continues
continues to
to be a key
key element
element
in our
plan.
our research plan.
In addition.
benefits of
addition, we have long appreciated
appreciated the
the benefits
of GUEST
GUEST and network
network access
access to
the progrs.ms
programs we are developing.
SUMEX
greatly
enhances
our
ability
to
obtain
feedback
developing. SUMEX greatly
our ability
obtain feedback
from
physicians and computer
programs
from interested
interested physicians
computer scientists
scientists around
around the country.
country. As our
our programs
continue
cant inue to
t,o mature,
mat,ure, it
it will
will become increasingly
increasingly important
important that
that we be able to make them
them
available
by distant
ava.ilable for
for demonstration
demonstration and for
for access
access by
distant collaborators
collaborators via
via the SUrvfEX
SUhlEX
network.
ne twnrk.

c.
Requirements for
for Additional
Additional Computing
Resources
C. Requirement8
Computing Resources
The
provide a suitable
The mainframe
ma.inframe machine
machine should
should continue
continue to
to provide
suitable environment
environment for
for
most
plans to
NESTOR, or
We have no plans
to transfer
transfer NESTOR,
or
most of
of our
our work
work in the
the months
months ahead. We
AI/~'IM
AI/MM to other
other hardware
hardware soon.

D.
Recommwdation,'J for
for Future
Future Community
Resource Development
Development
D. Recommendations
Community and
and Resource
\Ve are very
provided since the
We
very satisfied
satisfied with
with the facilities
facilities SUMEX
SUMEX has provided
the upgrade
upgrade to
the
professional workstations
t,he DEC
DEC 2050.
2060. Other
Other than
than continued
continued acquisition
acquisition of
of professional
workstations that
that can be
shared
by some of
programs in this
projects, we have no requests
shared by
of the more
more mature
mature progra.ms
this set of
of projects,
for
for additional
addit,ional acquisitions
acquisitions or
or resource development
development at this
this time.
time.

77
77

E. A.
A. Feigenbaum
Feigenbaum

GUIDON/NEOMYCIN
Project
GUIDON/NEOhIYCIN
Project

5P41
5P41 RR00785-11

II.A.1.2.
GUIDON/NEOMYCIN
Project
II.A.1.2.
GUIDON/NEOMYCIN
Project
GUIDON/NEOMYCIN
Project
GUIDON/NEOMYCIN
Project
William
J.
Ph.D.
William
J. Clancey,
Clancey,
Ph.D.
Department
Computer
Science
Department
Computer
Science
Stanford
University
Stanford
University
Bruce
Ph.D.
Bruce G.
G. Buchanan,
Buchanan,
Ph.D.
Computer
Science
Computer
Science Department
Department
Stanford
University
Stanford
University

I.
I.

SUM?vlARY
OF
PROGRAM
SUMMARY
OF RESEARCH
RESEARCH
PROGRAM
A.
A. Project
Project Rationale
Rationale

The
Project
The GUIDON/NEOMYCIN
GUIDON/NEOMYCIN
Project is a research program
program devoted
devoted to the
development
development, of
of a knowledge-based
knowledge-based tutoring
tutoring system
system for
for application
application to medicine.
medicine. This
This work
work
derived
That research gave way
way to three
three
derived from
from our
our first
first system,
system, the MYCIN
MYCIN program.
program. That
sub-projects
sub-projects (EMYCIN,
(EMYCIN, GUIDON,
GUIDON, and ONCOCIN)
ONCOCIN) described
described in previous
previous annual
annual reports.
reports.
El\l)"CIN
EMYCIN has
haz been completed
completed and its resources reallocated
reallocated to other
other projects.
projects. GUIDON
GUIDON
and ONCOCIN
ONCOCIN have become projects
projects in their
their own
own right.
right.
The
project is to
program that
The key
key issue
issue for
for the GUIDON/NEOMYCIN
GUIDON/NEOMYCIN
project
to develop
develop a program
that
can provide
provide advice
by human
advice similar
similar in quality
quality to that
that given
given by
human experts,
experts, modeling
modeling how
how they
they
structure
problem solving
procedures. The
struct,ure their
their knowledge
knowledge as
as well
well as
as their
their problem
solving procedures.
The consultation
consultation
program
using
this
knowledge
is
called
NEOMYCIN.
NEOMYCIN's
knowledge
base,
program using this knowledge
called NEOMYCIN.
NEOMYCIN’s
knowledge base,
designed for
use
in
a
teaching
application,
will
become
the
subject
material
used
by a
for use
teaching application, will
the subject material
by
family
programs referred
The
problemfamily of
of instructional
instructional programs
referred to
to collectively
collectively as GUIDON2.
GUIDON2.
The problemsolving
procedures are developed
by running
NEOMYCIN and
solving procedures
developed by
running test
test cases
cases through
through NEOMYCIN
comparing
behavior. Also,
NEOMYCIN as
Also, we are using NEOMYCIN
as a test
test bed for
for the
the
compa.ring them
them to
t,o expert
expert behavior.
explanation
part of
programs.
explana.tion capabilities
capabilities that
that will
will eventually
eventually be part
of our
our instructional
instructional programs.
The purpose
purpose of
of the
the current
current contract,
contract, now
now in its
sixth of
of six
six years,
years, is to
to construct
construct an
The
its sixth
intelligent tut,oring
tutoring system
system t.hat
that teaches diagnostic
diagnostic strategies
strategies explicitSly.
explicitly. By
By strategy,
strategy, we
intelligent
plans for
for establishing
establishing a set of
of possible diagnoses, focusing
focusing on and
and confirming
confirming
mean plans
individual diagnoses, gat.hering
gathering data,
data, and
and processing
processing new data.
data. The
The tut,orial
tutorial program
program will
will
individual
capabilities to
to recognize
recognize these plans, as well
well as to
to articulate
articulate strategies
strategies in explanat,ions
explanations
have capabilities
about how
how to do diagnosis.
diagnosis. The
The strategies
strategies represented
represented in the program,
program, modeling
modeling
a.bout
techniqlles,
and
explanat.ion
techniques
are
wholly
separate
from
the
knowledge
base, so
techniques,
explanation techniques
wholly separate from the knowledge base,
can be used wit.h
with many
many medical
medical (and
(and non-medical)
non-medical) domains.
domains. That
That is, the target
target program
program
will be able t.o
to be tested
tested with
with other
other knowledge
knowledge bases,
bases, using
using system-building
system-building tools
tools that
that we
will
provide.
provide.

B. Medical
Jvledical Relevance
Relevance and
and Collaboration
Collaboration
B.
There is a growing
growing realization
realization that
that medical
medical knowledge,
knowledge, originally
originally codified
codified for
for the
the
There
purpose of
of computer-based
computer-based consultations,
consultations, may
may be utilized
utilized in additional
additional ways
ways that
that are
purpose
medically relevant.
relevant. Using
Using the
the knowledge
knowledge to
to teach
teach medical
medical students
students is perhaps
perhaps foremost
foremost
medically
among t,hese.
these, and NEOMYCIN
NEOMYCIN continues
continues to
to focus on methods
methods for
for augmenting
augmenting clinical
clinical
among
knowledge in order
order to
to facilitate
facilitate its
its use in
in a tutorial
tutorial setting.
setting. A
A particularly
particularly important
important
knowledge

A. Feigenba.um
Feigenbaum
E. A.

78

GUIDON/NEOMYCIN Project
Project
GUIDON/NEOMYCIN

5P41 RR00785-11
RR00785-11
5Pll

aspect of
of this
this work
work is t,he insight
insight that
that has been gained
gained regarding
regarding the
the need to
to structure
structure
aspect
knowledge differently,
differently, and
and in
in more
more detail,
detail, when
when itit is being
being used for
for different
different purposes
purposes
knowledge
(e.g., teaching
teaching as opposed
opposed to
to clinical
clinical decision
decision making).
making). ItIt was this
this aspect
aspect of
of the
the GUIDON
GUIDON
(e.g.3
resea.rch that
t.hat led to
to the
the development
development of
of NEOMYCIN,
NEOMYCIN, which
which is an evolving
evolving computational
compu~ational
research
model of
of medical
medical diagnostic
diagnostic reasoning
reasoning that
that we hope will
will enable us to
to better
better understand
understand
model
teach diagnosis
diagnosis to
to students.
students. An
An important
important additional
additional realization
realization is that
that these
and teach
st.ructuring methods
methods are beneficial
beneficial for
for improving
improving the
the problem-solving
problem-solving performance
performance of
of
st,ructuring
consultation progra,ms,
progra.ms, providing
providing more
more detailed
detailed and
and abstract
abstract explanations
explanations to
to consultation
consultation
consultation
users, and
and making
making knowledge
knowledge bases easier to
to maintain.
maintain.
As we move
move from
from technological
technological development
development of
of explanation
explanation and
and student
student modeling
modeling
As
the next
next year
year begin
begin to
to collaborate
collaborate more
more closely
closely with
with the
the medical
medical
capabilities, we will
will in the
capabilities,
communit.y to
to design an effective,
effective, useful
useful tutoring
tutoring program.
program. Stanford
Stanford Medical
Medical School
School
community
Dr. Maffly,
Maffly, have shown
shown considerable
considerable interest
interest in
in this
this project.
project. A
A research
research
fa.culty, such as Dr.
fa.culty,
fellow associated with
with Maffly,
Maffly, Curt
Curt Kapsner,
Kapsner, MD,
MD, joined
joined the
the project
project last
last year
year to
to serve as
fellow
expert and liaison
liaison with
with medical
medical students
students at
at Stanford.
Stanford.
medical expert
medica.

C. High.lights
High.lights of
of Research
Research Progress
Progress
C.

C.l
Accomplish.ments This
This Pad
Past Ear
rear
C.l Accomplishments
C.1.i
Th.e NEOMYCIN
NEOlwYCIN Consultation
Consultation Program
Program
C.l.1 Th,e

NEOMYCIN is distinguished
distinguished from
from other
other AI
AI consultation
consultation programs
programs by
by its
its uses
uses of
of an
NEOMYCIN
explicit set of
of domain-independent
domain-independent meta-rules
meta-rules for
for controlling
controIling all
all reasoning.
reasoning. These rules
rules
explicit
constitute
the
diagnostic
procedure
that
we
want
to
teach
to
students:
the
stages
of
constitute the diagnostic procedure that
want to teach to students: the
of
diagnosis.
how
to
focus
on
new
hypotheses,
and
how
to
evaluate
hypotheses.
It
has
been
a
diagnosis, how
hypotheses, and how to evaluate hypotheses. It
major
undertaking, separa.te
problem of
major undertaking,
separate from
from the
the problem
of representing
representing disease
disease knowledge,
knowledge, to
to
design and test
procedure. Such modifications
test this
t,his diagnostic
diagnostic procedure.
modifications require
require changing
changing our
our
conception
partitioned
For example,
example, this
this year
year we partitioned
conception of
of how disease
disease knowledge
knowledge is organized.
organized. For
disease
and "red
disease findings
findings into
into "non-specific"
“non-specific”
“red flag"
flag” (those requiring
requiring explanation),
explanation),
augmenting
procedure to
augmenting the diagnostic
diagnostic procedure
to use this
this information
information for
for focusing
focusing on hypotheses.
hypotheses.
A
program reason about
A second cha.nge
change is to
to have the program
about the disease
disease process more
more generally.
generally.
By
by organ
NEOMYCIN now
primitive means to
By associating
associating symptoms
symptoms by
organ system,
system, NEOMYCIN
now has prirnitive
to infer
infer
when a disease
disease process began. It
It also makes more
more complete
complete use
use of
of severity,
severity, location,
location, and
progression information
progression
information to
to discriminate
discriminate among
among hypotheses.
hypotheses.
During
During this
this past
past year,
year, we completely
completely reworked
reworked the program's
program’s knowledge
knowledge of
of nonmeningitis
cases.
This
is
important
if
we
wish
to
teach
students
to
consider
meningitis cases. This
important if
wish
teach students
consider the
competitors
of
meningitis
and
how
to
discriminate
among
them.
The
goal
is
competitors
meningitis
discriminate among them. The
is to prepare
prepare
the program
order to test
test the
program for
for presenting
presenting these (or
(or similar)
similar) cases
cases to students.
students. In order
modeling
modeling component.
component, it
it is necessary to
to ensure that
that the program
program has sufficient
sufficient expertise
expertise to
recognize good student
student beha.vior.
behavior. All
All data
data that
that might
might be relevant
relevant to solving
solving a given
given
problem
problem must
must be
be known
known to the program.
program. The
The key
key problem
problem here is establishing
establishing a base
base of
synonyms
synonyms and knowledge
knowledge about
about classes
classes of data.
data. To
To do this,
this, we have been collecting
collect,ing
protocols
protocols of student.s
students solving
solving problems,
problems, requiring
requiring them
them to request
request all by simple
simple initial
initial case
case
information.
information. Student
Student behavior
behavior also suggests disease
disease knowledge
knowledge that
that must
must be
be added to the
knowledge
knowledge base
base that,
that an expert
expert might
might not
not consider,
consider, but
but which
which the modeling
modeling program
program must
must
recognize in a student.
In general, we find
find that
that students
students carry
carry out
out a much
much broader,
broader,
student.
inefficient
inefficient search, requesting
requesting much
much more information
information than
than an expert
expert and
a.nd drawing
drawing fewer
from
the
information
that
they
receive.
conclusions
conclusions from t,he information that they
The
Teaching diagnosis involves
involves recoglllzmg
recognizing
The Image
Ima.ge Student
Student Modeling
Modeling Program
Program -- Teaching
the intent
int,ent behind
behind a student's
student’s behavior,
behavior, so that
that missing
missing knowledge
knowledge can be distinguished
distinguished
from
from inappropriate
inappropriat,e strategies.
strategies. The
The teacher
teacher interprets
interprets behavior,
behavior, critiques
critiques it,
it, and provides
provides

79

E. A. Feigenbaum
Feigenbaum

5P41
5P41 RR00785-11
RR00785-11

GUIDON/NEOMYCIN
Project
GUIDON/NEOMYCIN
Project

advice
To do
do this
this successfully
successfully and
and efficiently
efficiently in
in aa complex
complex
advice about
about other
other approaches.
approaches. To
domain,
the
teacher
benefits
from
multiple,
complementary
modeling
strategies.
domain, the teacher benefit,s from multiple, complementary modeling strategies. IMAGE
IMAGE
isis aa. student
student modeling
modeling program
program that
that uses
uses NEOMYCIN's
NEOMYCIN’s meta-rules
meta-rules and
and disease
disease knowledge
knowledge
to
to understand
understand student
student diagnostic
diagnostic plans.
plans.
A student
student isis presented
presented with
with aa problem
problem to diagnose.
diagnose. As the student
student requests more
problem
problem data
data (Le.,
(i.e., takes aa history
history and
and physical
physical of
of the patient),
patient), IMAGE
IMAGE looks
looks for
for
regularities
regularities in
in sequences
sequences of
of his data
data requests. IMAGE
IMAGE contains
contains aa body of
of knowledge
knowledge about
about
how to map such
such sequences
sequences of
of behavior
behavior onto
onto aa strategical
strategical interpretation
interpretation of
of what
what the
student
student isis doing. The process isis heuristic
heuristic in
in nature
nature because
because the program
program will
will sometimes
sometimes
lose
lose track
track of what
what the student
student is
is doing, because
because he
he is
is being inconsistent
inconsistent or using
unexpected
unexpected strategies.
strategies.
The I~!AGE
IMAGE uses
uses aa dual search strategy.
strategy. The
The program
program first
first produces
produces multiple
multiple
predictions
Focused,
predictions of student
student behavior
behavior by aa model-driven
model-driven simulation
simulation of NEOMYCIN.
NEOMYCIN.
data-driven
By
By supplementing
supplementing each other,
other, these
dst)a-driven searches then
then explain
explain incongruities.
incongruities.
met.hods
lead
to
an
efficient
and
robust
plan
understander.
methods
efficient
robust plan understander.
A model of student
student strategies
strategies in medical
medical diagnosis must
must disambiguate
disambiguate the possible
purposes and knowledge
knowledge underlying
underlying the student's
student’s actions.
actions. The
The approaches
approaches followed
followed by
other
other plan
p1a.nrecognizers
recognizers and student
student modelers
modelers are not
not sufficient
sufficient here because:
because:
1.
1. t.he
t,he complex
complex domain
domain makes thorough
thorough searches impractical,
impractical, whether
whether top-down
top-down
or
bott.om-up;
or bottom-up;
2.
2. we are not
not modeling
modeling only
only facts and rules used in isolation,
isolation, but
but also the
procedures
for
applying
them;
procedures for applying them;
3. everyone
every one of
of the student's
student’s actions
actions must
must be monitored
monitored in case
case the teaching
teaching
module
module decides to
to interrupt;
interrupt;
4. his behavior
behavior must
just explained;
must be evaluated
evaluated and not
not just
explained; and
might not
not have any
any explicit
explicit goal statements
statements from
from the student,
student, so we expect
expect
5. we might
to rely
rely only
only on his queries for
for problem
problem data
data as evidence for
for his thinking.
thinking.
t,o
The IMAGE
IMAGE program
program is a prototype
prototype system
system which
which is now
now being
being extended.
extended. Specifically,
Specifically, a
The
more useful
useful system
system would
would examine
examine its
its own
own interpretations
interpretations and
and strive
strive for
for coherence. We
\Ve
more
designing such a such a system
system now,
now, using
using the
the “blackboard
"blackboard model”
model" for
for postding
posting
are designing
interpretations that
that. may
may change over
over time.
time. The
The levels of
of this
this blackboard
blackboard are: 1) the
the
interpretations
stUdent's
data requests,
requests, 2) a classification
classification of
of question
question type
type (e.g., triggered,
triggered, follow-up,
follow-up,
student’
s data
hypothesis-directed, general),
general), 3) a strategic
strategic interpretation
interpretation in terms
terms of
of NEOMYCIN’
NEOMYCIN's
hypothesis-directed,
s
diagnostic procedure
procedure (ta.sks
(tasks and
and meta-rules).
meta-rules).
By incorporating
incorporating a strategic
strategic level
level of
of
diagnostic
By
interpretation, this
this program
program can be expected
expected to
to make
make significant
significant contributions
contributions to
to our
our
interpretation,
understanding and
and use of
of the
the blackboard
blackboard model
model of
of interpretation.
interpretation. The
The first
first version
version of
of this
this
understanding
program will
will seek t,o
to explain
explain student
student behavior
behavior in
in terms
terms of
of deletion
deletion and
and reordering
reordering of
of
progra,m
simple
variations
of
disease
knowledge
(e.g.,
false
proC'edural
knowledge,
plus
procedural
knowledge, plus simple variations
of
knowledge (e.g.,
data/hypothesis relations).
relations). Study
Study of
of student
student protocols
protocols is now
now suggesting
suggesting what
what kinds
kinds of
of
dats.jhypothesis
variations are common
common that
that we might
might easily
easily identify
identify automatically.
automatically.
variations

C.l.2 The
The NEOMl%IN
NEOMYCIN Ezplanation
Explanation System
System
C.1.2
The initial
initial explanation
explanation system
system of
of NEOMYCIN,
NEOMYCIN, now
now completed,
completed, enables the
the user
user t,o
to
The
answer WHY
\VHY and
and HOW
HOW questions
questions during
during a consultation.
consultation. That
That is, when
when the
the program
program
answer
prompts the
the user
user for
for new
new data,
data, the
the user
user may
may ask
ask WHY
\VHY the
the data
data is
is being
being requested
requested or
or
prompts
HO\V some st,rategic
strategic task
task will
will be
be (or
(or was)
was) accomplished.
accomplished. Unlike
Unlike MYC’
MYCIN's
explanation
HO\V
IN’s explanation

E. A.
A. Feigenbnum
Feigenbaum
E.

80
80

GUIDON/NEOMYCIN
Project
GUIDON/NEOMYCIN
Project

5P41 RR00785-11
RR00785-11

system,
NEOMYCIN are
system, upon
upon which
which this
this kind
kind of
of capability
capability is patterned,
patterned, explanations
explanations in NEOMYCIN
in terms
plan, not
just specific
terms of
of the diagnostic
diagnostic plan,
not just
specific associations
associations between
between data
data and diagnoses.
The
program can provide
provide abstract
The program
abstract and concrete
concrete paraphrases
paraphrases of
of strategy
strategy rules (based on
canned text).
We have begun the next
next phase, which
which is to
to answer
answer WHY
WHY questions
questions by
by
text,). \Ve
condensing
condensing the entire
entire line of
of reasoning.
reasoning. The
The program
program will
will use
use models of
of the user's
user’s disease
disease
and strategic
strategic knowledge,
knowledge, plus general
general explanation
explanation heuristics,
heuristics, to select the task
task and focus
information
Prototypic
information that
that is most
most likely
likely to
to be of
of interest.
interest.
Prototypic user models are now
now
implemented.
Heuristics
Heuristics have been designed and include:
include: 1) mentioning
mentioning the last
last task
task
implemented.
whose focus (or
(or argument)
argument) changed
changed in kind
kind (e.g., from
from a disease
disease hypothesis
hypothesis to
to a finding
finding
request):
request): 2) never
never mentioning
mentioning tasks
tasks that
that are merely
merely iterating
iterating over
over a list
list of
of rules, findings,
findings,
or
or hypotheses;
hypot,heses; and 3) only
only mentioning
mentioning tasks
tasks with
with a rule
rule as
as an argument
argument to
to programmers.
programmers.
Related
Related to
t,o our
our explanation
explanation condensations
condensations is an effort
effort to
to teach the strategic
strategic
language of
For
For example,
example, we will
will have students
students annotate
annotate a
of tasks
tasks to
to students.
students.
NEO!"IYCIN typescript
NEOh,lYCIN
typescript in terms
terms of
of tasks
tasks and foci, to
to help them
them recognize good strategic
strategic
behavior.
behavior. This
This requires
requires a common
common language of
of what
what the tasks are, e.g., "grouping"
“grouping” and
"asking
Rather
just marking
Rather than
than just
marking annotating
annotating tasks, we seek the
“asking general
general question."
question.”
principles by
principles
by which
which the tasks
tasks could
could be consistently
consistently structured
structured into
into primitives
primitives and
auxiliary.
These same principles
auxiliary.
principles could
could be used by
by the
the explanation
explanation system
system for
for choosing
choosing
tasks to mention.
mention. Our
Our current
current theory
theory is that
that these primitive
primitive or
or "interesting"
“interesting” operations
operations
correspond
correspond to meta-rules
meta-rules that
that establish
establish a new focus.

C.LI
for Teaching
C.l.4 Graphics
Graphics for
Teaching
vVe
We are continuing
continuing make
make extensive
extensive use
use of
of graphics
graphics in our
our programs.
programs. For
For example,
example,
we are implementing
program that
implementing a program
that will
will mostly
mostly automatize
automatize the protocol
protocol collection
collection
process (though
(though we are cautious
cautious about
about how
how menus will
will bias student
student behavior,
behavior, even when
when
lists
lists are very
very long and full
full of
of irrelevant
irrelevant findings).
findings). As part
part of
of our
our series of
of instructional
instructional
programs, GUIDON-WATCH
is now being
progra.ms,
GUIDON-WATCH
being implemented
implemented as
as a graphic
graphic system
system for
for watching
watching
NEOMYCIN's
reasoning.
For
NEOMYCIN’
s
reasoning.
For example,
example, we can highlight
highlight the
the hypotheses
hypotheses under
under
consideration
consideration and show graphically
graphically how
how the program
program "looks
“looks up"
up” its
its hierarchies
hierarchies before
before
refining
is also mostly
refining hypotheses.
hypotheses. The
The design or
of GUIDON-ANNOTATE
GUIDON-ANNOTATE
mostly complete.
complete. It
It
will
NEOMYCIN's
will allow
allow a student
student to
to mark
mark up a typescript
typescript of
of NEOMYCIN’
s behavior
behavior using
using the same
language of
of tasks
tssks the
t.he program
program uses
uses when
when explaining
explaining its own
own behavior;
behavior; iconic
iconic menus are
very
very usef1ll
useful to
to avoid
avoid natural
natural language difficulties
difficulties (though
(though it
it is clear
clear that
that the student
student will
will
sometimes
somet,imes need to
to "talk
“talk back").
back”).

C.2
Research in
Progress
C.2 Research
in Progress
The
The following
following projects
projects are active
active as
as of
of June 1983 (see
(see also near-term
near-term plans
plans listed
listed in
Section
Section III.A):
111.‘~):
1.
NEOMYCIN's
disease
1. augmenting
augmenting NEOhlYCIN’
s
disease knowledge
knowledge so we can fairly
fairly evaluate
evaluate the

program's
program’s focussing strategies
strategies and evaluate
evaluate IMAGE;
IMAGE;
2. developing
produce
developing capability
capability to
to automatically
automatically
produce summary
summary explanations
explanations of
of
NEOMYCIN's
NEOMYCIN’
s reasoning.
reasoning.
3. development
and GUIDON-ANNOTATE
for
development of
of GUIDON-WATCH
GUIDON-WATCH
GUIDON-ANNOTATE
for teaching
teaching
NEOMYCIN's
NEOMYCIN’
s knowledge
knowledge to
to students.
students.
4. developing
developing new student
student modeling
modeling program
program based on the blackboard
blackboard model.

81

E. A.
A. Feigenbaum
Feigenbaum

5P41
5P41 RR00785-11

GUIDON/NEOMYCIN
Project
GUIDON/NEOMYCIN
Project

D.
D. Publications
Publications Since January
January 1989
198.3
D., Clancey,
Clancey, W.J.,
W.J., Rennels, G.: Strategic
Strategic
Consultation.
C’onsultation. Int
Int JJ Man-Machine
Man-Machine Studies, in press.

1.
1. Hasting,
Hasling,

explanations
explanations

an
in

2.
2. Clancey,
Clancey, W.J.:
W.J.: The advantages
advantages of
of abstract
abstract control
control knowledge
knowledge in
in expert
expert
8y8tem
system de8ign.
design. Proceedings
Proceedings of AAAI-83,
AAAI-83, pages
pages 74-78.
3. Clancey,
Clancey, \V.J.:
W.J.: Acquiring,
Acquiring, representing,
representing, and
and evaluating
evaluating a competence
competence model
model
of
OF
of diagno.'li8.
diagnosis. In Chi,
Chi, Glaser,
Glaser, and Fan
Farr (Eds.),
(Eds.), THE
THE NATURE
NATURE
OF
EXPERTISE.
EXPERTISE. In preparation.
preparation. HPP-84-2.
HPP-84-2.
4.
IN
4. Clancey,
Clancey, W.J.
W.J. and E. H. Shortliffe.:READINGS
Shortliffe.:RELaINGS
IN !vIEDICAL
MEDICAL ARTIFICIAL
ARTIFICIAL
Il'VTELLIGENCE:
THE
Reading:
IKTELLIGENCE:
THE FIRST
FIRST DECADE.
DECADE.
Reading: Addison-Wesley,
Addison-Wesley, in
press.
5. Clancey,
Clancey, W.J.:
W.J.:
AAAI-84.
AAAI-84.

Classification
Problem
Classification
Problem Solving.
Soluing. HPP-84-7.
HPP-84-7.

Submitted
Submitted to

E. Funding
Funding Support
Support
Cont,ract.
Contract Tit.le:
Title: "Explorat.ion
“Exploration of
of Tutoring
Tutoring and Problem-Solving
Problem-Solving Strategies"
Strategies”
Principal
Principal Investigator:
Investigator: Bruce
Bruce G. Buchanan,
Buchanan, Adjunct
Adjunct Prof.
Prof. Computer
Computer Science
Associate
Associate Investigator:
Investigator: William
William J. Clancey,
Clancey, Research Assoc. Computer
Computer Science
Agency:
Agency: Office
Office of Naval
Naval Research and
Army
Army Research Institute
Institute (joint)
(joint)
ID
ID number:
number: NOOOI4-79-C-0302
N00014-79-C-0302
Term:
Term: March
March 1979 to
to March
March 1985
Tot,al
Tot(a.1award:
a.ward: $583,892
$083,892

II.
WITH
THE
RESOURCE
II. INTERACTIONS
INTERACTIONS
WITH
THE SUMEX-AIM
SUMEX-AIM
RESOURCE
A. Medical
Medical Collaborations
Program Dissemination
Dissemination via
A.
Collaborations and
and Program
uia SUAfEX
SUMEX
A
NEOMYCIN has been shown
by the
‘4 great
great deal of
of iuterest
interest in GUIDON
GUIDON and NEOMYCIN
shown by
medical
mcdica.1 and
a.nd computer
computer science communities.
communit,ies. We
We are frequently
frequently asked to
to demonstrak
demonstrate
these programs
programs to
t,o Stanford
St,anford visitors
visitors or
or at meetings
meetings in this
this cOllDtry
country or
or abroad.
abroad. GUIDON
GUIDON is
available on the
the SUMEX
SUMEX 2020. Physicians
Physicians have generally
generally been enthusiastic
enthusiastic about
about these
ava.ila.ble
programs' potential
potential and what
what they
they reveal
reveal about
about current
current approaches
approaches to
to computer-based
computer-based
programs
medical decision
decision making.
making.
medical
Perhaps our
our most
most significant
significant project
project to
to disseminate
disseminate our
our research
research via
via SUMEX
SUMEX in l.he
the
Perhaps
past year
year has been the completion
completion of
of a book,
book, “Readings
"Readings in Medical
il,ledical Artificial
Artificial Intelligence:
Intelligence:
past
The First
First Decade,”
Dec:1de," edited
edited by
by Dr.
Dr. Clancey
Clancey and Dr.
Dr. Shortliffe.
Shortliffe. All
All of
of the significant
significant
The
SUMEX-AI~vl products
products of
of the
the past
past decade are described
described in this
this collection.
collection. Each
Each cha.pter
chapter is
SUMES-,41&I
preceded by
by a.
a one-age historical
historical introduction.
introduction. In addition,
addition, opening
opening and closing
closing chapters
chapters
preceded
by t)he
the editors
editors survey
survey issues
issues in the
the field
field and the
the promise
promise of
of the
the future.
future. A
A complete
complete index
index
by
should make
make t,he
t.he book
book of
of considerable
considerable educational
educational value.
value. Preparation
Preparation of
of this
this volume
volume has
should
greatly aided
aided by
by use of
of editing
editing and
and formatting
formatting programs
programs available
available on
on SUMEX-AIM.
SUMEX-AIM.
been greatly
Royalties for
for the
the book,
book, beyond
beyond production
production costs, will
will be used to
to sponsor
sponsor an invited
invited lecture
lecture
Royalties
at a major
major AI
AI nationa.
national conference,
conference, such as AAAI.
AAAI.
at
As mentioned
mentioned
disease knowledge
knowledge of
of
disease
July 1983). This
This
in July

A. Feigenbaum
Feigenbaum
E. A.

earlier, a physician
physician joined
joined our
our group
group this
this year
year to
to help
help us develop
develop the
the
earlier,
the program
program (our
(our first
first collaborator,
collaborator, Tim
Tim Beckett,
Beckett, MD,
MD. died
died of
of cancer
cancer
the
physician has found
found the
the convenience
convenience of
of accessing SUMEX
SUMEX from
from his
physician

82

RR00785-11
5P41 RR00785-11

GUIDON/NEOMYCIN
Project
GUIDON/NEOMYCIN
Project

laboratory or
or at
at home to be extremely
extremely important
important for
for finding
finding time
time to
to test
test NEOMYCIN
hboratory
NEOMYCIN and
to communicate
communicate with
with us by
by electronic
electronic mail.
mail.

83

Feigenbaum
E. A. Feigenbaum

5P41
5P41 RR00785-11
RR00785- 1 I

GUIDON/NEOMYCIN
Project
GUIDON/NEOMYCIN
Project

B.
Projects
B. Sharing
Sharing and
and Interaction
Interaction with
with Other
Other SUA/EX-AlA!
SUMEX-AIM
Projects
GUIDON/NEOMYCIN
retains
GUIDON/NEOMYCIN
retains strong
strong contact
contact with
with the
the ONCOCIN
ONCOCIN project,
project, as
as both
both
are
siblings
of
the
tvIYCIN
parent.
These
projects
regularly
share
programming
expertise
are siblings of the MYCIN parent. These projects regularly share programming expertise
and
and cont.inue
continue to
to jointly
jointly maintain
maintain large
large utility
utility modules
modules developed
developed for MYCIN.
MYCIN. In addition,
addition,
the
the centra.l
centra,l SUMEX
SUMEX development
development group
group acts
acts as
as an
an important
important clearing
clearing house
house for solving
solving
problems
problems and
and distributing
distributing new
new methods.
methods.

C.
C. Critique
Critique of
of Resource
Resource Management
Management
In the winter
winter of 1984,
1984, the SUMEX
SUMEX staff
staff efficiently
efficiently and effectively
effectively shifted
shifted our
our
operation
operation away from
from the center
center of campus to aa professional
professional office
office building
building adjoining
adjoining the
medical
medica~l center.
center. The placement
placement and installation
installation of LISP
LISP workstations
workstations proceeded smoothly.
smoothly.
After
After aa.year with
with Ed Pattermann
Pattermann as
as director
director of SUMEX,
SUMEX, we can report
report that
that the stability
stability
and
a.nd excellence of the resource we have come to expect has
has been completely
completely maintained.
maintained.
Very
Very import.ant
important to us,
us, the RAVEN
RAVEN laser printer
printer installed
installed at our
our new site not
not only
only provides
provides
excellent-quality
output,
but
as
a
machine
devoted
to
the
Heuristic
Programming
excellent,-quality out,put, but as a machine devoted
Heuristic Programming Project
Project
has
has eliminated
eliminated the dela.ys
dela.ys we were experiencing
experiencing aa year
year a.go.
a.go.
\Vith
continuing to experience a few
With t,he
the shift
shift to personal
personal machines, we are continuing
The
difficulties.
The greatest.
greatest, problem
problem appears to be inadequately
inadequately debugged software
software from
from
difficulties.
XEROX.
In particular,
SEROX.
particular, Interlisp-D
Interlisp-D relies heavily
heavily on network
network capabilities
capabilities and must
must be
compatible
compatible with
with several operating
operating systems. This
This transition
transition to new kinds
kinds of hardware
hardware and
software
softwa.re can be
be expected
expected to continue
continue for
for several years. Therefore,
Therefore, we are extremely
extremely
reliant
reliant upon
upon the availability
avai1abilit.y of experienced
experienced systems support.
support. We
We believe
believe that
that additional
additional
SU~IEX
SUhlEX staff
staff is necessary to accommodate
accommodate growing
growing community
community needs.
needs

ill.
RESEARCH
PLANS
III.
RESEARCH
PLANS

A. Project
A.
Project Goals
Goals and
and Plans
Plans
over the next
next year
year will
will continue
continue on several
fronts, leading
leading to
to several
several
Research over
several fronts,
prototype instruct,ional
instructional programs
programs by
by early
early 1985.
protot,ype
1. Continue
Continue to
to develop
develop the knowledge
knowledge base so the program
program can understand
understand and
and

anticipate any
any reasonable
reasonable approach
approach to
to the cases
cases chosen for
for teaching.
teaching.
ant.icipate
Test st,udent
student modeling
modeling program
program on these cases,
cases, collecting
collecting data
data for
for further
further
2. Test
development of
of the program,
program, as well
well as exploring
exploring about
about the
the range
range of
of student
student
development
approaches to
to diagnosis.
diagnosis.
approaches
Extend the
t.he explanation
explanation system
system to
to do full
full summaries.
summaries. Incorporate
Incorporate modeling
modeling
3. Extend
capabilities that
t.hat relate
relate inquiries
inquiries to
to a user model.
model. Provide
Provide explanations
explanations t.ailored
tailored
capabilit,ies
to this
this interpretation
interpretation of
of t,he
the motivation
motivation behind
behind the
the user’
user's
inquiry.
to
s inquiry.
4. Int,egra.te
Integrate current
current display
display capabilities
capabilities into
into running
running NEOMYCIN
NEOMYCIN consultation
consultation to
to
show
how
the
space
of
diagnoses
is
explored
and
how
diagnostic
tasks
are
show how the
of
explored and how diagnostic tasks
generated. Develop
Develop these capabilities
capabilities to
to explore
explore forms
forms of
of graphic
graphic explanation
explanation
generated.
useful
in
tutoring.
(GUIDON-WATCH)
useful
tutoring. (GUIDON-WATCH)
S. Extend
Extend student
student modeling
modeling system
system to
to include
include heuristics
heuristics for
for generating
generating tests
tests that
that
S.
will confirm
confirm and
and extend
extend the
the model.
model. Improve
Improve the
the model
model to
to include
include analysis
analYSis of
of
will
pat,tans
in
model
interpretations,
including
dependency-directed
patterns
including
in
model
interpretat*ions,
dependency-directed
backt.racking in
in the
the belief
belief system
system and
and some capability
capability to
to critique
critique the
the
” backtra.cking”
mode lint:?; rules.
rules. Relate
Relate this
this to
to knowledge
knowledge acquisition
acquisition research.
research.
rnodrling
II

E. A.
A, Feigenbaum
Feigenbaum
E.

II

84
84

GUIDON/NEOMYCIN
Project
GUIDON/NEOMYCIN
Project

5P41
5P-11 RR00785-11

6.
6. Work
Work closely with
with medical
medical students
students to package NEOMYCIN
NEOMYCIN capabilities
capabilities in aa
"workstation"
“workstation” for
for learning
learning medical
medical diagnosis, determining
determining what
what mix
mix of student
student
and program
program initiative
initiative isis desirable.
desirable.

B. Long
Programs
Long term
term plans:
plans: the GUIDON2
GUIDON2 Family
Family of
of Instructional
Instructional
Programs
"Ve
We sketch
sketch here our
our general conception
conception of the research we plan
plan for
for 1984-87,
specifically
Our
specifically the construction
construction of instructional
instructional systems that
that use
use NEOMYCIN.
NEOMYCIN.
Our ideas are
strongly
strongly based on recent
recent proposals
proposals by
by JS
JS Brown,
Brown, particularly
particularly his paper
paper "Process
“Process versus
Product
--A perspective
perspective on tools for
for communal
communal and informal
informal electronic
electronic learning"
learning” and
Product, --A
some related
related papers that
that he wrote
wrote in 1983.
1983. The
The plan
plan is
is to implement
implement at least three
three of
these programs
GUIDON-MANAGE,
and GUIDONprograms (here called GUIDON-WATCH,
GUIDON-WATCH,
GUIDON-MANAGE,
GUIDONANNOTATE).
ANNOTATE).
The key idea is that
that NEOMYCIN
NEOMYCIN provides
provides aa language
language by
by which
which aa program
program can
converse wit.h a student
student about
about strategies
strategies and knowledge
knowledge organization
organization for
for diagnosis.
NEOtvIYCIN's
NEOMYCIN’s tasks and structural
structural terms
terms provide
provide the vocabulary
vocabulary or
or parts
parts of speech; the
meta-rules
We will
will construct
construct different
different
meta-rules are the grammar
grammar of the diagnostic
diagnostic process. We
graphic,
graphic, reactive
reactive environments
environments in which
which the student
student can observe, describe, compare,
compare, and
improve
improve diagnost,ic behavior
behavior of
of himself
himself and others.
others. There
There are many
many shared, underlying
underlying
ca.pabilities
ca.pa.bi!it.ies that
that wil!
will be constructed
constructed in parallel
parallel and improved
improved over
over time.
time.
Our
Our approach
approach is
is to delineate
delineate clearly
clearly different
different kinds
kinds of
of interactions
interactions that
that a student
st.udent
might.
might, have with
wit,h a program
pr0gra.m concerning
concerning diagnostic
diagnostic strategies.
strategies. Thus,
Thus, each instructional
instructional
system
activity>,
where
syst.em (but
(but one) has a name of the form
form GUIDON-,
where the name
specifies what
what the student
student is doing
doing (e.g., watching,
watching, telling).
telling). The
The programs
programs can be made
arbitrarily
by integrating
arbitrarily complex
complex by
integrating coaches,
coaches, student
student models, and explanation
explanation systems. "Ve
We
try
try here to
to separate
separate out
out these capabilities,
capabilities, trying
trying to
to get at
at the
the minimum
minimum interesting
interesting
activities
provide for
activities we might
might provide
for a student.
student.
GUIDON-WATCH
The
NEOMYCIN
GUIDON-VVATCH
The simplest
simplest system
system allows
allows a student
student to watch
watch NEOMYCIN
solve a.a. problem,
problem. perhaps
perhaps one supplied
by the student.
supplied by
student. Graphics
Graphics display
display the evolving
evolving
search space, that
that is, how
how tasks, as
as operators,
operators, affect
affect the differential
differential (Differential
(Differential
---(Question
The
---(Quest ion X)--S)--- > Differential').
Differential’).
The student
student can step through
through slowly
slowly and replay
replay the
interaction.
prosaic explanations
program is
intcract.ion. He can ask for
for prosaic
explanations and summaries
summaries of
of what
what the program
doing.
program will
The program
will also indicate
indicate its
its task
task and focus for
for each data
data request.
request. This
This
doing. The
introduces
introduces the
t.he student
student to the idea that
that the diagnostic
diagnostic process has structllre
structure and follows
follows a
certain kind
kind of
of logic.
certain
GUIDON-MANAGE In
In this
this system
system the
the student
student solves a problem
problem by
by telling
telling
GUIDON-MANAGE
NEO!\IYCIN
what task
task to
to do at each step. Essentially,
Essentially, the
the student
student provides
provides the strategy
strategy
NEOXIY’
C’IN what
and the program
program supplies
supplies the
the tactics
tactics (meta-rules)
(meta-rules) and
and domain
domain knowledge
knowledge to carry
carry out
out the
a.nd
strategy. The
The program
program will
will in genera!
general carry
carry through
through tasks
tasks in
in a logical
logical way,
way, for
for example,
example,
strst.egy.
proCt'cdiug to
to t,est
tcst a hypothesis
hypothesis completely,
completely, and not
not “breaking”
"breaking" on FINDOUT
FINDOUT or
or
proceeding
APPLYHULES
(t.wo low-level
low-level tasks
tasks that
that mainly
mainly test
test domain
domain knowledge
knowledge and not
not strategy).
strategy).
APPLI’
RIJLES (two
ThE' program
program will
will not
not pursue
pursue new hypotheses
hypotheses automatically.
automatically. However,
However, the
the student
student will
will
The
alwa.ys see
see wha.t
what questions
questions a task
task caused the
the program
program to
to request,
request, as well
well as how
how the
alwa,ys
differential changes.
This activity
activity leads the
the student
student to
to observe the entailments
entailments of
of
differential
This
strategies, helping
helping him
him become a better
better observer
observer of
of his own
own behavior.
behavior. Here he shows that
that
strategies,
knows the structural
structural vocabulary
vocabulary that
that makes
makes a strategy
strategy appropriate.
appropriate.
he knows
GUIDON-ANNOTATE This
This system
system allows
allows the
the student
student to
to annotate
annotate a NEOMYCIN
NEOMYCIN
GUIDON-ANNOTATE
typescript, indicating
indicating the
the task
task and
and focus associated with
'with each data
data request.
request. The
The program
program
typescript,
will indicate,
indicate, upon
upon request,
request, where
where the
the student
student is incorrect
incorrect and
and which
which annotations
annotations are
will
different from
from NEOMYCIN’
NEOMYCIN's,
but still
still reasonable
reasonable interpretations.
interpretations. The
The student
student will
will be
different
s, but

85

A. Feigenbaum
Feigenbaum
E. A.

5P41 RR00785-11
RR00785- 11

GUIDON/NEOMYCIN
Project
GUIDON/NEOMYCIN
Project

able to
to- choose these tasks from
from a menu
menu of
of icons, either
either linearly
linearly or
or hierarchically
hierarchically displayed,
displayed,
as
NEOMYCIN will
as he prefers.
prefers. (Again,
(Again, NEOMYCIN
will annotate
annotate its
its own
own solutions
solutions upon
upon request
request and allow
allow
replaying.)
This
replaying.)
This activity
activity gets the
the student
student to
to think
think strategically
strategically by
by recognizing
recognizing a good
st.rategy.
st,rategy. In
In this
t,his way,
way, he learns to
to recognize how
how strategies
strategies affect
affect the
the problem
problem space.
GUIDON-APPRENTICE
This
NEOMYCIN in which
GUIDON-APPRENTICE
This is a variant
variant of
of NEOMYCIN
which the program
program
stops during
during a consultation
consultation and asks the student
student to propose the
the next
next data
data request(s).
request(s). The
The
student
student is asked to
to indicate
indicate the task
task and focus he has in mind,
mind, plus
plus the differential
differential he is
operating
NEOMYCIN would
operating upon. The
The program
program compares
compares this
this proposal
proposal to
to what
what NEOMYCIN
would do. In
In
this
this activit.y
activit,y we descend to
to the domain
domain level and require
require the student
student to instantiate
instant,iate a
st.rategy
st,rategy appropriately.
a.ppropriately.
GUIDON-DEBUG
Here the student
GUIDON-DEBUG
student is presented
presented with
with a buggy
buggy version
version of
of
NEOM'(CIN
goes through
through the steps of
of annotating
annotating the buggy
buggy
NEOMY’
CIN and must
must debug it.
it. He goes
consultation
consultation session,
session, indicating
indicating what
what questions
questions are out
out of
of order
order or
or unnecessary,
unnecessary, indicating
indicating
what
properly, and then
what tasks are not
not being
being invoked
invoked properly,
then trying
trying out
out his hypothesis
hypothesis on a
"rep:J.ired"
predict what
what will
will be different,
d.ifferent, then
then allowed
allowed to
to observe
“repaired” system.
system. He is asked to predict
what
what happens. This
This activity
activity teaches the student
student to recognize how
how a diagnostic
diagnostic solution
solut,ion
can be non-optimal,
provides him
non-optimal, further
further emphasizing
emphasizing the value
value of
of good strategy.
strategy. It
It also provides
him
with
practice for
problem behavior.
with key
key meta-cognitive
meta-cognitive practice
for criticizing
criticizing and debugging
debugging problem
behavior.
GUIDON-SOLVE
This
The
The student
student carries
carries
GUIDON-SOLVE
This is the complete
complete tutorial
tutorial system.
system.
through
plan recognizer
through diagnosis
diagnosis completely,
completely, while
while a plan
recognizer attempts
attempts to
to track
track what
what he is doing
doing
and a coach interrupts
interrupts to offer
offer advice. Here annotation,
annotation, comparison,
comparison, debugging,
debugging, and
('xplana.tion
expiana.tion are all integrated
int)egrated to illustrate
illustrate to
to the student
student how his solution
solution is non-optimal.
non-optimal.
For
For example,
example, the student
student might
might be asked to annotate
annotate his solution
solution after
after he is done; this
this
will
point out
will point,
out strategic
strategic gaps in his awareness and provide
provide a basis for
for critique
critique and
improvement.
A "curriculum"
“curriculum” based on rrequent
frequent student
student faults
faults and important
important things
things to
improvement. A
learn
In
learn will
will drive
drive the interaction.
interaction.
In this
this activity,
activity, the student
student is on his own.
own. Faced with
with
the
must exercise his diagnostic
diagnostic procedure
procedure from
from start
start to
to
t.he proverbial
proverbial "blank
“blank screen,"
screen,” he must
finish.
finish.
GUIDON-GAME
Two
GUIDON-GAME
Two or
or more
more students
students play
play this
this together
together on a single machine.
machine.
They
All
They are given
given a case
case to solve together,
together, and each student
student requests data
data in turn.
turn.
All
st.lldents
When
students receive the requested
requested information.
information.
When a student
student is ready,
ready, he makes a
di3gnosis,
diagnosis, indicated
indicat)ed secretly
secretly to
to the program
program while
while the others
others are not
not watching.
watching. He then
then
drops
drops out
out of
of the questioning
questioning sequence.
sequence. However,
However, he can re-enter
re-enter later,
later, but
but of
of course will
will
be penalized.
Afterwards, score is based on the number
number of
of questions
questions asked and use
use of
of
penalized. Afterwards,
good strategy.
strategy. The
The coach will
will indicate
indicate to
to weak
weak players
players what
what they
they could
could learn
learn from
from strong
strong
players,
one
players. encouraging
encouraging them
them to
to discuss certain
certain issues
issues among
among themselves.
themselves. Variation:
Variation:
person solves while
while one or
or more
more competing
competing students
students annotate
annotate the solution
solution and show where
where
it
Variation:
NEOMYCIN (and
Variation: one team
team introduces
introduces a bug
bug into
into NEOMYCIN
(and
it, could
could be improved.
improved.
predicts
This
predicts the effect)
effect) and the
the other
other team
team finds
finds it
it (as in SOPHIE).
SOPHIE).
This activity
activity will
will
encourage students
students to share their
their experiences and talk
talk to and learn
learn from
from each other
other about
about
the diagnostic
disgnostic process.

C. Requirements
Reqw:rements for
for Continued
Continued SUMEX
SUMEX Use
Although
NEOMYCIN work
Although most
most of
of the
the GUIDON
GUIDON and NEOMYCIN
work is shifting
shifting to
to Xerox
Xerox
Dolphins
Dolphins and Dandelions
Dandelions (D-machines),
(D-machines), the DEC
DEC 2050
2060 and 2020 continue
continue to
to be key
key
elements
elements in our
our research plan.
plan. Our
Our primary
primary use
use of
of the 2050
2060 will
will be to develop
develop the
NEOMYCIN conSUltation
access.
NEOMYCIN
consultation system,
system, possibly
possibly by
by remote
remote ARPANET
ARPANET
access. Because of
of
address
a.ddress space limitations,
limitations, the consultation
consultation program
program can be combined
combined with
with explanation
explanation or
or
student
student modeling
modeling facilities,
facilities, but
but not
not both,
both, as is required
required for
for GUIDON2
GUIDON2 programs.
programs. We
We
continue
continue to use
use the 2020 for
for demonstrating
demonstrating the original
original GUIDON
GUIDON program.
program. As always,
always,
the 2050
2060 will
will be essential
essential for
for work
work at home, writing,
writing, and electronic
electronic mail.
mail.
E. A. Feigenbaum
Feigenbaum

85
86

5P41
5P4 1 RROOi85-11
RR00785- 11

GUIDON/NEOMYCIN
Project
GUIDON/NEOMYCIN
Project

D. Requirements
Requirements for
for Additional
Additional Computing
Resources
D.
Computing Resources
The D-machine's
D-machine’s large address space is permitting
permitting development
development of
of the large
The
program that
that complex
complex computer-aided
computer-aided instruction
instruction requires.
requires. Graphics
Graphics will
will enable us to
program
methods for
for presenting
presenting material
material to naive
naive users. We
We also plan
plan to use
use the Ddevelop new methods
mn.chine as
as a reliable,
reliable, constant
constant "load-average"
“load-average” machine,
machine
machine, for
for running
running experiments
experiments with
with
physicians and students.
students.
The development
development of
of GUIDON2
GUIDON2 on the
the D-machine
D-machine will
will
physicians
The
demonstrate the
the feasibility
feasibility of
of running
running intelligent
demonstrate
intelligent consultation
consultation or
or tutoring
tutoring systems on
sma.11,affordable
affordable machines
machines in physicians'
physicians’ offices,
offices, schools and other
other remote
remot,e sites.
small,
We cnrrently
currentSly have access
access to
to 1 1/2
l/2 DOLPHINs.
DOLPHINS.
We
We
We expect that
that 3 full
full time
time
programmers will
will need access
access to
to two
two full
full machines.
machines. vVe
We are keeping
keeping logs so we can
ca.n begin
begin
programmers
understand patterns
patterns of
of ::t.ctivity
activity and how
how these "personal"
“personal” machines
machines can be effectively
effectively
to understand
shared.
shared.

87

E. A. Feigenbaum
Feigenbaum

GUIDON/NEOMYCIN
Project
GUIDON/NEOMYCIN
Project

RR00785-11
5P41 RR00785-11

E.
Community and
and Resource
E. Recommendations
Recommendations for
for Future
Future Community
Resource Development
Development
shift our
our development
development of
of systems
systems to
to personal
personal LISP
LISP machines,
machines, such as
as the
As we shift
DOLPHIN, it
it becomes more
more difficult
difficult to
to access
access these programs
programs remotely
remotely for
for access
access from
from
DOLPHIN,
our
our homes (so that
that we may
ma,y work
work conveniently
conveniently during
during the evenings and weekends)
weekends) and
from remote
remote sites for
for collaboration
collaboration and demonstration.
demonstration.
from
This
This problem
problem will
will be partly
partly
ameliorated by
by "dial-up"
“dial-up” (modem)
(modem) access
access to these machines,
machines, but
but the use
use of
of bit-mapped
bit-mapped
ameliorated
displays requiring
requiring a high-bandwidth
high-bandwidth makes the
the phone lines inadequate
inadequate for
for our
our purposes.
displays
Further
Further technological
technological development
development of
of networks,
networks, probably
probably involving
involving access
access over
over cables, will
will
be necessary.
computer resources become more distributed,
distributed, the need for
for a central
central machine
machine
As computer
does not
not diminish.
diminish. Programs
Programs and knowledge
knowledge bases
bases continue
continue to be shared,
shared, requiring
requiring highhighdoes
speed network
network connections
connections among
among computers
computers and file servers. SUMEX-AIM's
speed
SUMEX-AIM’s role will
will
shiftly slightly
slightly over
over the
the next
next few years
years to accommodate
accommodate these needs,
needs, but
but its
its identity
identity as
as a
shiftly
central
will only
only change in kind,
kind, not
not importance.
importance.
cent
ral resource will
Moreover,
Moreover, sophisticated
sophisticated
printing devices, such as
as the
the Xerox
Xerox RAVEN,
RAVEN, must
must necessarily
necessarily be shared, again using a
printing
network. Maintenance
Maint,enance of
of this
this network
network and its
its shared
shared devices will
will become a key
key activity
activity
network.
for
for the SUMEX
SUMEX staff.
staff. Thus,
Thus, while
while computing
computing resources will
will be provided
provided by
by the
the "outboard
“outboard
engines” of
of personal
personal machines,
machines, the community
community will
will remain
remain intricately
intricately
linked and
engines"
linked
dependen
dependent,t. on common,
common, but
but peripheral,
peripheral, resources.
From
From t.his
t,his perspective,
perspective, future
future resource development
development should
should focus on improving
improving the
capabilities of
of networks,
networks, file servers, and attached
attached devices to respond
respond to individua,l
individual
ca,pabilities
For example,
example, it
it is now
now common
common for
for 10%
10% of
of a user's
user’s time
time at a personal
personal machine
machine
requests. For
spent waiting
waiting for
for a file server
server or
or printer
printer to
to process a request. Multi-processing
to be spent
Multi-processing
necessity in such an environment,
environment, so a request
request can be honored,
honored, while
while the user
becomes a necessity
returns
returns to continue
continue his programming
programming or
or editing.
editing.

E. A.
i4. Feigenbaum
Feigenba,um

88

HPP
HPP Core
Core AI
AI Research
Research

RR00785-11
5P41 RR00785-11

II.A.1.3. HPP
HPP Core
Core AI
AI Research
Research
II.A.1.3.

Heuristic
Heuristic Programming
Programming Project
Project
Principal Investigator:
Investigator: Edward
Edward A.
A. Feigenbaum
Feigenbaum
Principal
C-Principal
Co-Principal Investigator:
Investigator: Bruce
Bruce G.
G. Buchanan
Buchanan
Department
Department of
of Computer
Computer Science
Science
Stanford
Stanford University
University

SUMMARY OF
OF RESEARCH
RESEARCH PROGRAM
PROGRAM
I.I. SUMMARY

A.IB. Rationale
Rationale and
and Medical
Afedical Relevance
Relevance
A./B.
Medicine and the
the biological
biological sciences are knowledge-intensive
knowledge-intensive with
with an exponential
exponential
Medicine
rate of
of growth
growth in relevant
relevant knowledge.
knowledge. This
This means that
that problem
problem solving
solving of
of all
all sorts
sorts is
rate
becoming increasingly
increasingly complex
complex in these disciplines.
disciplines. Further,
Further, most
most problems
problems are symbolic
symbolic
becoming
nature rather
rather than
than amenable
amenable to
to mathematical
mathematical formulation
formulation and numerical
numerical solution.
solution.
in nature
Artificia.l Intelligence
Intelligence (AI)
(AI) methods
methods have been focused on medical
medical and biological
biological problems
problems
Artificia.1
for over
over a decade with
with considerable
considerable success.
success. This
This is because, of
of all
all the
the computing
computing methods
methods
for
known, AI
AI met,hods
met.hods are the
the only
only ones that
that deal explicitly
explicitly with
with symbolic
symbolic information
information and
known,
problem solving
solving and with
with knowledge
knowledge that
that is heurist,ic
heuristic (experiential)
(experiential) as well
well as factual.
factual.
problem
particularly fast-moving
fast-moving area of
of AI
AI is expert.
expert systems. An
An expert
expert system
system is one
One particularly
performance level rivals
rivals that
that of
of an human
human expert
expert because itit has extensive
extensive domain
domain
whose performance
knowledge (current,ly
(currently usual!y
usually derived
derived from
from an human
human expert);
expert); itit can reason
rea..<;on about
about its
its
knowledge
knowledge
knowledge to solve difficult
difficult problems
problems in the domain;
domain; it
it can explain
explain its line of
of reasoning
reasoning
much
much as
as an human
human expert
expert can; and it
it is flexible
flexible enough to incorporate
incorporate new knowledge
knowledge
without
Expert
witholit reprogramming.
reprogramming.
Expert Systems
Systems draw
draw on the current
current stock
stock of
of ideas in AI,
AI, for
for
They are adequate
adequate for
for capturing
capturing
example,
They
example, about
about representing
representing and using
using knowledge.
knowledge.
problem-solving
Numerous high-performance,
problem-solving expertise
expertise for
for many
many bounded
bounded problem
problem areas. Numerous
high-performance,
expert
expert, systems
systems have resulted
resulted from
from this
this work
work in such diverse
diverse fields
fields as
as analytical
analytical chemistry,
chemistry,
medlcal
cancer chemotherapy
chemotherapy management,
management, VLSI
VLSI design, machine
machine fault
fault
medical diagnosis,
dia,gnosis, cancer
of these programs
programs rival
rival human
human experts
experts in solving
solving
dia.gnosis.
diagnosis, and molecular
molecular biology.
biology. Some of
problems
problems in particular
particular domains
domains and some are being
being adapted
adapted for
for commercial
commercial use.
use. Other
Other
tools for
for representing
representing and utilizing
utilizing
projects
projects have developed
developed generalized
generalized software
software tools
knowledge
UNITS,
EMYCIN,
UNITS, AGE,
AGE, MRS,
MRS, GLISP)
GLISP) as
as well
well as
as comprehensive
comprehensive
knowledge (e.g., EMYCIN,
publications
Intelligence
publications such as
as the three-volume
three-volume Handbook
Handbook of
of Artificial
Artificial
Intelligence and books
summa.rizing
summarizing lessons
lessons learned
learned in the DENDRAL
DENDRAL and MYCIN
MYCIN research projects.
projects.
But
But the current
current ideas fall
fall short
short in many
many ways, necessitating
necessitating extensive
extensive further
further basic
research dforts.
efforts. Our
Our core research goals, as
as outlined
outlined in the next
next section,
section, are to analyze
the limitations
limitations of current
current techniques
techniques and to investigate
investigate the nature
nature of methods
methods for
for
overcoming
overcoming them.
them. Long-term
Long-term success
success of
of computer-based
computer-based aids in medicine
medicine and biology
biology
methods available
available for
for representing
representing and using
depend on improving
improving the programming
programming methods
domain
That
That knowledge
lcnowledge is inherently
inherently complex
complex .... it contains
contains mixtures
mixtures of
domain knowledge.
knowledge.
symbolic
symbolic and numeric
numeric facts
facts and relations,
relations, many
many of them
them uncertain;
uncertain; it
it contains
contains knowledge
knowledge
at different
different levels of
of abstraction
abstraction and in seemingly
seemingly inconsistent
inconsistent frameworks;
frameworks; and it
it links
links
examples and exception
exception clauses
clauses with
with rules of thumb
thumb as
as well
well as
as with
with theoretical
theoretical principles.
principles.
Current
Current techniques
t,echniques have been successful only
only insofar
insofar as
as they
they severely limit
limit t.his
t#his
complexity.
comp1exit.y. As the applications
applications become more far-reaching,
far-reaching, computer
computer programs
programs will
will have
to deal more effectively
effectively with
with richer
richer expressions and much
much more voluminous
voluminous amounts
amounts of
knowledge.
knowledge.
This
This report
report documents
documents progress on the basic or core research activities
activities within
within the

89

Feigenbaum
E. A. Feigenbaum

HPP
HPP Core AI
AI Research

5P
41 RR00785-11
5P41

Hruristic
Heurist,ic Programming
Programming Project
Project (HPP),
(HPP), funded
funded in part
part under
under the SUMEX
SUMEX resource as
as well
as
This work
work explores aa broad
broad range of basic
as by other
other federal and industrial
industrial sources. This
research
resea,rch ideas
ideas in many
many application
application settings,
settings, all of which
which contributes
contributes in the long term
term to
improved
improved knowledge
knowledge based
based systems in biomedicine.
biomedicine.
C. Highlights
Highlights of
of Research
Research Progress
Progress
In the last year,
year, we made progress on several major
major topics
topics of research. The
The style
style of
research t.hat
that we believe is
is most
most productive
productive at this
this stage of development
development of AI
AI is the
experimental
experimental style.
style. Thus,
Thus, within
within the HPP
HPP we build
build systems that
that implement
implement our
our idea.s
idea.s for
for
answering
answering (or
(or shedding
shedding some light
light on) fundamental
fundamental questions; we experiment
experiment with
with those
systems to drtermine
determine the strengths
strengths and limits
limits of the
ihe ideas;
ideas: we redesign and test more; we
attempt
attempt to generalize
generalize the ideas from
from the domain
domain of
of implementation
implementation to other
other domains;
domains; and
we publish
order to carry
carry out
out this
this style
style of research, then,
then,
publish details
details of the experiments.
experiments. In order
we select.
select specific
specific problems
problems to help focus the general questions. Many
Many of these specific
specific
problem
problem domains
domains are medical
medical or
or biological.
biological. In this
this way
way we believe the HPP
HPP has made
substantial
just to the AIM
substantial contributions
contributions to core research problems
problems of
of interest
interest not
not just
AIM
community
community but
but. to AI
AI in general.
general.
Progress
Progress is reported
reported below
below under
under each of
of the major
major topics
topics of
of our
our work.
work.
are to HPP
HPP technical
technical reports
reports listed
listed in the publications
publications section.
section.

Citations
Citations

How
How can the knowledge
knowledge necessary for
for complex
complex
problem
problem solving
solving be represented
represented for
for its
its most
most effective
effective use
use in automatic
automatic inference
inference
processes?
processes? Often,
Often, the knowledge
knowledge obtained
obtained from
from experts
experts is heuristic
heuristic knowledge,
knowledge,
gained
How can this
this knowledge,
knowledge, with
with its
its
gained from
from many
many years
years of
of experience.
experience. How
inherent
inherent vagueness and uncertainty,
uncertainty, be represented
represented and applied?
applied?

1.
n01lliedge representation:
1. 1-(
Knowledge
representation:

\Vork
NEOMYCIN systems
VVork on the logic-based
logic-ba.sed MRS
MRS and the rule-based
rule-based NEOMYCIN
systems
continues,
Numerous copies
cont.inues, attracting
attracting wide
wide interest
interest within
within the AI
AI community.
community. Numerous
of
of MRS
MRS have been sent
sent to collaborators
collaborators elsewhere who
who are experimenting
experimenting with
with
it
book on rule-based
by Buchanan
it on the own
own machines.
machines. The
The book
rule-based expert
expert systems by
Buchanan
8:
& Shortliffe
Short,liffe was completed
completed in this
this year.
year.
[Se('
[See HPP
HPP technical
t,echnical memos HPP-83-26,
HPP-83-26, HPP-83-28,
HPP-83-28, HPP-83-29,
HPP-83-29, HPP-83-34,
HPP-83-34,
HPP-84-1]
HPP-84- 1]
What
What kinds
kinds of
of software
software tools
tools and
system architectures
architectures can be constructed
constructed to
to make
make itit easier to
to implement
implement expert
expert
syst,em
programs with
with increasing
increasing complexity
complexity and high
high performance?
performance? How
How can we
programs
flexible control
control structures
structures for
for powerful
powerful problem
problem solving
solving programs?
programs?
design flexible

2. Advanced
Adt1anced architectures
architectures and
and Control:
Control:

A major
major effort,
effort in exploring
exploring and understanding
understanding the
the Blackboard
Blackboard a.rchitecture
architecture ha.s
has
A
undrrtaken. A
A new pilot
pilot project
project using
using this
this architecture
architecture was st,arted
started in
been undcrbnken.
domain of
of protein
protein chemistry
chemistry (see
(see description
description of
of Jardetzky
Jardetzky &
8[ Buchanan
Buchanan
the domain
pilot project).
project). We
\Ve have also begun
begun investigating
investigating Blackboard
Blackboard systems
systems as a way
way
pilot
of organizing
organizing expert
expert systems
systems to
to exploit
exploit concurrency.
concurrency. Initial
Initial work
work has begun
begun
of
using the HASP/AGE
HASP/AGE systems
systems as an application
application example.
example.
using
[See
HPP t,echnical
technical memos
memos HPP-83-30,
HPP-83-30, HPP-83-33,
HPP-83-33, HPP-83-38,
HPP-83-38, HPP-83-43,
HPP-83-43,
[See HPP
HPP-83-H, HPP-84-4,
HPP-84-4, HPP-84-61
HPP-84-6]
HPP-83-44,

Knowledge acquisition:
acquisition: How
How is knowledge
knowledge acquired
acquired most
most efficiently”from
efficiently"from
3. Knowledge
human expert,s,
experts, from
from observed
observed data,
data, from
from experience,
experience, and
and from
from discovery?
discovery?
human
How can a program
program discover
discover inconsistencies
inconsistencies and
and incompleteness
incompleteness in its
its
How
knowledge base?
base?
How can the
the knowledge
knowledge base be augmented
augmented without
without
knowledge
How
perturbing t,he
the established
established knowledge
knowledge base?
base?
perturbing

A. Feigenba.um
Feigenbaum
E. A.

gO
90

HPP
HPP Core
Core AI
AI Research

5P41 RR00785-11
RR00785-11

\Ve
We have continued
continued to
to make
make progress on two
two on-going
on-going projects
projects for
for learning
learning by
by
experience
experience and learning
learning by
by analogy,
analogy, and have initiated
initiated work
work on three
three new
systems
Those
Those three
three are learning
learning by
by watching,
watching,
systems for
for acquiring
acquiring knowledge.
knowledge.
learning
All
learning from
from text,
text, and learning
learning rules
rules & meta-rules
meta-rules inductively.
inductively.
All three
three of
of
the
t,he new systems
systems use
use medical
medical problems
problems as their
their test-domains.
test-domains.
[Preliminary
[Preliminary results
results have been published
published in HPP-83-27,
HPP-83-27, HPP-83-36,
HPP-83-36, HPP-84-2,
HPP-84-2,
HPP-84-8.]
HPP-84-S.]
4.

Knowledge utilization:
By
By what
what inference
inference methods
methods can many
many sources of
of
hr7owledge
utilization:
knowledge
jointly and effiCiently
knowledge of
of diverse
diverse types
types be made to
to contribute
contribute jointly
efficiently
toward
How can knowledge
knowledge be used intelligently,
intelligently, especially
especially in
toward solutions?
solutions? How
systems with
with large knowledge
knowledge bases,
bases, so that
that it
it is applied
applied in an appropriate
appropriate
manner
manner at the appropriate
appropriate time?
time?
These issues
issues are being
being explored
explored in the
the development
development of
of MRS
MRS (Meta(MetaRepresentation
Representation System)
System) where
where one of
of the roles of
of meta-knowledge
meta-knowledge is to guide
the effective
They are also central
central in the studies
studies
effective use
use of
of lower
lower level knowledge.
knowledge. They
of
of Blackboard
Blackboard control
control systems and their
their use
use in concurrent
concurrent expert
expert systems.
[See
[See HPP
HPP technical
technical memos
membs HPP-83-26,
HPP-83-26, HPP-83-28,
HPP-83-28, HPP-83-30,
HPP-83-30, HPP-83-33,
HPP-83-33,
HPP-83-38,
HPP-83-38, HPP-84-1,
HPP-84-1, HPP-84-2,
HPP-842, HPP-84-6]
HPP-84-61

5. Software
Software Tools:
Tools: How
How can specific
specific programs
programs that
that solve specific
specific problems
problems be
generalized
generalized to more
more widely
widely useful
useful tools
tools to
to aid
a.id in the development
development of
of other
other
programs
programs of
of the same class?
class?
We
We have continued
continued the development
development of
of new software
software tools
tools for
for expert
expert, system
system
construction
construction and the distribution
distribution of
of packages that
that are reliable
reliable enough and
documented
documented so that
that other
other laboratories
laboratories can use
use them.
them. These include
include the old
old
rule-based
rule-based EMYCIN
EMYCIN system,
system, MRS,
MRS, and AGE.
AGE.
[See
[See HPP
HPP technical
technical memos HPP-83-26,
HPP-83-26, HPP-83-28,
HPP-83-28, HPP-83-29,
HPP-83-29, HPP-83-33]
HPP-83-331
6.

Explanation and
How
Explanation
and Tutoring:
Tutoring:
How can the knowledge
knowledge base
base and the line of
of
reasoning
reasoning used in solving
solving a particular
particular problem
problem be explained
explained to
to users?
users? What
What
constitutes
constitutes a sufficient
sufficient or
or an acceptable
acceptable explanation
explanation for
for different
different classes
classes of
of
users?
How can
ca.n knowledge
knowledge in a system
system be transferred
transferred effectively
effectively to students
students
users? How
and trainees?
tra.inees?
The
NEOMYCIN program
comparison
The NEOMYCIN
program has undergone
undergone preliminary
preliminary
comparison with
with
medical
medical students'
students’ protocols
protocols to
to understand
understand the extent
extent to
to which
which its medical
medical
concepts
concepts match
match those of
of the students.
students. Analysis
Analysis of
of experts'
experts’ problem
problem solving
solving has
also been done. NEOMYCIN’
NEOMYCIN's
s explanation
explanation capabilities
capabilities have been improved.
improved.
NEOMYCIN in
New work
work on student
student modelling
modelling has started
started in order
order to
to test
test NEOMYCIN
the cont.ext
context of
of tutoring.
tutoring.
[See
[See HPP
HPP technical
technical memos HPP-83-41,
HPP-83-41, HPP-83-42,
HPP-83-42, HPP-84-2,
HPP-84-2, HPP-84-7]
HPP-84-71

7. Planning
Planning and
Design:
and Design:

planning
planning and design?
design?
const.raints?
const.raint,s? How
How are

What
What are reasonable and effective
effective methods
methods for
for
How
How can symbolic
symbolic knowledge
knowledge be coupled
coupled with
with numerical
numerical
constraints
const,raints propagated
propagated in design problems?
problems?

The
The Palladio
Palladia system
system for
for assisting
assisting in the design of
of VLSI
VLSI circuits
circuits has been
demonstrated
demonstrated and results
results presented
presented in major
major publications
publications and conferences.
[See
[See HPP
HPP technical
t,echnical memos HPP-83-31,
HPP-83-31, HPP-83-39,
HPP-83-39, HPP-83-45,
HPP-83-45, HPP-83-46.
HPP-83-46,
HPP-83-47,
HPP-83-47, HPP-84-3,
HPP-84-3, HPP-84-5]
HPP-84-51
8.

Diagnosis: How
Diagnosis:
How can we build
build a diagnostic
diagnostic system
system that
that reflects
reflects any
any of
of several

91

E. A. Feigenbaum
Feigenbaum

5P41 RR00785-11
RR00785-11

HPP
HPP Core
Core AI
AI Research

diagnostic
How
use knowledge
How can we use
knowledge at different
different levels of
of
diagnostic strategies?
strategies?
abstraction
process?
abstraction in the diagnostic
diagnostic process?
Research on using
using causal models in a medical
medical decision
decision support
support system
system
(NESTOR)
was
largely
completed
and
will
be
published
in
the
coming
(NESTOR)
largely completed
will
published
coming year.
year. A
A
~econd
program that
physiology
second medical
medical diagnosis
diagnosis program
that uses
uses causal models of
of renal
renal physiology
(AIjMM)
published soon. We
(AI/MM) was also substantially
substantially completed
completed and will
will be published
We are
investigating
investigating the process of
of diagnosis
diagnosis in electronics
electronics as
as well
well as
as in medicine.
medicine. The
The
major
major thrust
thrust of
of this
this work
work has been integrating
integrating causal models about,
about, and the
the
structure
body.
structure of, a computer
computer system
system or
or systems
systems of
of the
the human
human body.
[See
[See HPP
HPP technical
technica. reports:
reports: HPP-83-32,
HPP-83-32, HPP-83-37,
HPP-83-37, HPP-83-40,
HPP-83-40, 84-7]
84-71

D. Relevant
Relevant Publications
Publications
D.
HPP-83-28
HIT-83-20

Miehael
“MRS Casebook",
Casebook “, May
May 1983.
Michael R. Genesereth,
Genesereth, ".MRS

HPP-83-27
EDT’-83-27

Thomas
Thomas

HPP-83-28
HIT-83-28

Michael
Meta-level Representation
Representation System",
Michael R. Genesereth,"A
Genesereth, “A Meta-level
System “, May
Ma.y
1983.

HPP-83-29
HIT-83-29

M. Grinberg,
Installation Instructions
Instruction8",“, May
This
Grinberg, "MRS
“MRS Installation
May 1983.
This
report
purchased the software
report available
available only
only to
t,o those who
who have purchased
software system
system
MRS.
MRS.

HPP-83-30
HIT-83-30

Barbara
Barbara

HPP-83-31
HPP-83-31

Harold
Tong,
An
Harold Brown,
Brown, Christopher
Christopher
Tong, Gordon
Gordon Foyster,
Foyster, "Palladio:
“Palladia: An
Exploratory
Exploratory
Environment for
for IC
IC Design
Design",“, June 1983.
Environment

HPP-83-32
HFP-83-32

John Kunz,
Kunz, E.A.Feigenbaum,
E.A.Feigenbaum, Bruce
Bruce G. Buchanan,
Buchanan, E.H.
E.H. Shortliffe,
Shortliffe,
"Comparison
of
Decision hfaking
Afuking
“Comparison
of Techniques
Techniques of
of Computer-Assisted
Computer--4ssisted Decision
in
publication in the Pure
and
in Aledicine".
Medicine “. Submitted
Submitted for
for publication
Pure
and Applied
Applied
Biostructure.
World
Biostructure.
World Press, Singapore
Singapore (1983).

HPP-83-33
HIT-83-33

Nelleke Aiello,
for Expert
Expert
Nelleke
Aiello, "A
“A Comparative
Comparative Study
Study of
of Control
Control Strategies
Strategies for
SY8tems:
AGE Implementation
Implementation of
PUFF",“, June
Systems: AGE
of Three Yariations
Variations of
of PUFF
1983.

HPP-83-34
HIT-83-34

Jock Mackinlay,
Presentation: The Generation
Problem for
for
Mackinlay, "Intelligent
“Intelligent Presentation:
Generation Problem
User Interfaces
Inter faces ",“, March
March 1983.

HPP-83-36
I-ET-83-30

Russell Greiner
New'? A Semantic
Greiner and ~tichael
Michael R. Genesereth,
Genesereth, "H-1wt's
“What’s New?
Semantic
Definition of
Novelty",“, June 1983.
Definition
of Novelty

HPP-83-37
HIT-83-37

Robert
About Time-dependent
Behavior in
Robert Joyce, "Reasoning
“Reasoning About
Time-dependent Behavior
in a System
System
for Diagnosing
Diagnosing Digital
Digital Hardware
Hardware Faults
Faults",“, August
for
August 1983.
1983.

HPP-83-38
Hpp-83-38

Barbara
Blackboard Model
A/odel of
Barbara Hayes-Roth,
Hayes-Roth, "The
“The Blackboard
of Control
Control ",“, June 1983.

HPP-83-39
mP-83-39

Jerry
Expert System
for
.Jerry Yan,
Yan, Gordon
Gordon Foyster,
Foyster, Harold
Harold Brown,
Brown, "An
“An Expert
Syste,m for
Integrated Circuits",
As.'Jigning
A[ask Levels
Levels to Interconnect
Interconnect in
Assigning
Mask
in Integrated
Circuits “,
October
October 1983.

E. A. Feigenbaum
Feigcnbaum

D. Dietterich
and Ryszard
Dietterich
Ryszard S.
S. Michalski,
Michalski,
Patterns in
Patterns
in Sequences of
of Objects
Objects If,
“, May
May 1983.

"Discovering
“Discovering

Hayes-Roth,
"The
Blackboard Architecture:
Architecture: A
Hayes-Roth,
“The Blackboard
Framework for
for Problem
Problem Solvi1'g'?"
Framework
Solving?” May
May 1983.

92
92

General
General

HPP
HPP Core
Core AI
AI Research
Research

5P411 RR00785RR00785-11
5P4
11

HPP-83-40
HPP-83-40

Benoit Mulsa.nt
Mulsant and
and
Benoit
A
Daily
Activity
on
A Daily Activity on

HPP-83-41
HPP-83-41

(working paper)
paper) Diane
Diane Warner
Warner Hasling,
Hasling, “Strategic
"Strategic Explanations
Explanations for
for aa
(working
Diagnostic Consultation
Consultation System
System “,", in
in MA1
AAAI Proceedings
Proceedings 1983
1983 pp.
pp.
Diagnostic

David Servan-Schreiber,
Servan-Schreiber, “Is been improved
improved with
with a region
region based window
window system
system which
which increases both
both speed and
flrxibility.
The
The region
region based window
window system,
system, the register
register input
input devices, and the
flexibility.
formatting
formatt,ing language
la.ngua.ge interface
interface (that
(that describes how
how forms
forms should
should be presented
presented on the
display)
display) have been generalized
generalized to
to be usable by
by other
other portions
portions of
of the ONCOCIN
ONCOCIN project
project
(notably
(notably the OPAL
OPAL knowledge
knowledge acquisition
acquisition interface
interface described
described below).
below).

C.2.2
Development of
program to
physician inter
face to
C.2.2 Development
of new
new program
to connect
connect the
the physician
interface
to the
the
reasoning
portion of
program
reasoning portion
of .,the
the program
The
The ONCOCIN
ONCOCIN system
system uses
uses a special design that
that allows
allows the Interviewer
Interviewer program
program
a.nd
program to operate
and the reasoning
reasoning section
section of
of the program
operate independently.
independently. In order
order to coordinate
coordinate
the activities
programs, a special communication
a,ctivities of
of these two
two programs,
communication program,
program, called
called the
t,he
Interactor
1nteract)or was drsigned
designed and built.
built.
The
t,eractor program
program provides
provides a message
The In
Interactor
message passing facility
facility between
between two
two or
or more
more
Interlisp-D
processes (sub-programs
Interlisp-D processes
(sub-programs that
that can run
run at the same time).
time). The
The form
form of
of the
messages
The
messa.ges are specified
specified by
by the programmer.
programmer.
The system
system further
further allows
allows messages
messages to
to
processes
machines
processes running
running on different
different
machines via
via the computer
computer network
network called
called the
ETHERNET.
This
ETHERNET.
This will
will allow
allow moving
moving components
components of
of a large program
program from
from one to
to several
machines
The Interactor
Interactor also has the
machines in a way
way invisible
invisible to the programs
programs themselves. The
ability
ability to rind
find other
other Int.eractors
Interactors on the local communication
communication network.
network.

C.2.3
Detleiopment of
programs to
C.2.9 Development
of new
new programs
to improve
improve the
the efficiency
efficiency and
and capabilities
capabilities
of
NCO CIN
0f o
olvc~oc~Iiv

109

E. A.
A. Feigenbaum
Feigenbaum

ONCOCIN
ONCOCIN Project
Project

5P41 RR00785-11

In order
order to speed
speed up both
both versions
versions of the ONCOCIN
ONCOCIN system, we have written
written aa
simple
simple rule and control
control block
block compiler
compiler for
for ONCOCIN
ONCOCIN that
that converts
converts rules and control
control
blocks into
into Interlisp
Interlisp programs,
programs, and then
then into
into compiled
compiled Interlisp.
Interlisp. This
This helps to alleviate
alleviate aa
memory
version
memory space
space problem
problem we have had in the Interlisp-lO
Interlispversion of the system
system as
as well
well as
as
the
workstation
version
of
the
program.
give us increased speed
in
speed
workstation version
program.
Another
Another systems level 3.'>pect
aspect of our
our work
work is in the creation
creation and access
access of efficient
efficient
patient
patient record
record data
data files. To
To this
this end, we have implemented
implemented a machine
machine independent
independent hash
file system
system (special data
data record
record format)
format) that
that allows access
access to the data
data base
base via
via memory
memory
from
and Interlisp-D
The system
system is
is compatible
compatible with
with both
both Interlisp-lO
InterlispInterlisp-D and
from disk files. The
allows sharing
sharing of files between
between the two
two systems. Its
Its format
format is
is also machine
machine independent
independent
enough to allow
allow access
access from
from other
other lisps on other
other computers.
computers. It
It is currently
currently accepted by
XEROX
XEROX as
as a standard
standa.rd for
for the D machines
machines and has been used by them
them to bring
bring up
programs
programs of use
use by all D machine
machine users.
users. Along
Along the same lines, we have experimented
experimented with
wit,h
solutions
solutions to the problems
problems of having
having portions
portions of text
text easily
easily accessible
accessible by key
key from
from a file in
a machine
machine independent
independent way.
way.

C.2.4
Reorganization
and
programs for
for improved
C.2.4
Reorganization
and recoding
recoding of
of existing
ezisting programs
improved efficiency
efficiency
The
The reasoning
reasoning portion
portion of the
t.he ONCOCIN
ONCOCIN program
program is being reprogrammed
reprogrammed to
increase speed and to benefit
from
the
special
capabilities
of
the
Interlisp
benefit from
capabilities
Interlisp workstation.
workstation.
We
are
also
re-writing
parts
of
the
program
that
were
borrowed
We
re-writing parts of
program that
borrowed from
from other
other expert
expert
systems developed
developed by our
our group.
group.
\Ve
We have reorganized
reorganized the system
syst,em into
into logical
logical subsystems
subsystems that
that are of a manageable
manageable
size. This
This consisted
consisted of cat.egorizing
categorizing all the system
system functions
functions (portions
(portions of
of the program)
program) that
that
are necessary for
putting each in an appropriate
The
for the Reasoner to
to run
run and putting
appropriate file.
file. The
Reasoner now runs
runs in stand
stand alone mode independently
independently of
of which
which system
system it
it is on.
programming part
part for
\Ve
We are now
now in the
the process of
of cleaning
cleaning up the specific
specific programming
for each of
of
the subsystems.
both style
subsystems. This
This entails
entails making
making various
various enhancements
enhancements for
for both
style and efficiency,
efficiency,
adding
and further
breaking down
breaking
down functionally
functionally
adding comments
comments and documentation,
documentation,
further
independent
parts
of
the
system.
independent parts of
system.
We
portions of
We have transferred
transferred portions
of our
our EMYCIN
EMYCIN utilities
utilities (based on the MYCIN
MYCIN
expert
both Interlisps.
expert syst.em) and rewritten
rewritten those utilities
utilities to
to make
make them
them work
work in both
Interlisps. We
We
have removed
program that
removed from
from the stand-alone
stand-alone Reasoner sections
sections of
of the
the program
that depended on
specific hardware
hardware of
of the
the DEC-2060
DEC-2050 mainframe
mainframe computer
computer and now
now have versions
versions of
of
the specific
ONCOCIN on the 2060
2050 and D-machines
D-machines that
that are identical,
identical, being
being generated
generated from
from the same
ONCOCIN
program text,.
t.ext. This
This step also included
included the
the use of
of the
the new
new hash file syst,em
system (described
(described
program
D-machines.
above) on the D-machines.

C.2.5 System
Sy.'1tem support
support for
for the
the reorganization
reorganization
C.2.5
implemented a program
program called
called Graphcalls
Graphcalls which
which allows
allows programmers
programmers on
\Ve have implemented
the D-ma.chines
D-machines to
to visually
visually graph
graph the structure
structure of
of the
the programs
programs they
they have written.
written. One
t,he
examine the use of
of each of
of the
the functions
functions on the graph
graph as well
well as examine
examine and
can also examine
variables they
they access.
access. It
provides visual
visual tracing
tracing and
and dynamic
dynamic control
control of
of a
change the variables
It also provides
program in
in execution.
execution. ItIt has been used daily
daily since its
its creation
creation by
by both
both our
our project
project and
pr0gra.m
members of
of the
the SUMEX
SUMEX community.
community.
members

C.B Goal:
Goal: To
To modify
modify the
the 2020 Clinic
Clinic Version
Version of
of ONCOCIN
ONCOCIN in
in response
response to
to user
user
C.9
feedback
feedback
During the
the last
last year,
year, we have added
added a number
number of
of new
new options
options to
to ONCOCIN
ONCOCIN for
for use
During
by the
the fellows
fellows in the
the clinic.
clinic. These include:
include: a special
special option
option to
to request
request that
that a test
test be
by

A. Feigenbaum
Fdgenbaum
E. A.

110

ONCOCIN
ONCOCIN Project
Project

5P41
5P41 RR00785-11
RR00785- 11

ordered
for entering
entering reasons
reasons for
for treatment
treatment
ordered STAT
STAT (immediately),
(immediately), special menus for
modifications
modifications (these are used
used when there is a disagreement between the ONCOCIN
ONCOCIN
recommended therapy
physician's
therapy and the physician’
s treatment
treatment plan, in order
order to
to gather
gather data
data about
about
why
why the physician
physician has decided to override
override the system), and the option
option to
to request a copy of
a patient’
patient's
s flowsheet
flowsheet be printed
printed out
out on the clinic
clinic line-printer.
line-printer. We
We have also streamlined
streamlined
the met.hods
met,hods that
that the various
various forms
forms are created
created by
by ONCOCIN.
ONCOCIN.

C.4
for use
the
used
C.4 Goal:
Goal: To encode
encode and
and implement
implement
for
use by
by ONCOCIN
ONCOCIN
the COfnmonly
commonly
used
chemotherapy
protocols from
from our
chemotherapy
protocols,
our oncology
oncology clinic
clinic
We have pursued two
two approaches to
to increasing the number
number of
of prot.ocols
protocols known
known by
by
our
The first
first approach is to
to use
use the existing
existing software
software to
to implement
implement active
active
our system. The
protocols not
protocols
not encoded
encoded at the time
time of
of our
our last report.
report. The
The second
second approach has been
been to
develop new software
software that
that is able to
to dramatically
dramatically speed
speed up the entry
entry of
of protocols
protocols by
by
providing graphically-oriented
providing
graphically-oriented forms
forms to
to be filled
filled out
out on the computer
computer that
that follow
follow the basic
outline
protocol documents.
outline of the protocol
In t.he
protocol to
t,he past, adding
adding a new protocol
to the ONCOCIN
ONCOCIN knowledge base
base has been
been a
tedious process
process in which
programmer sit
which an oncologist
oncologist and a programmer
sit down and translate
translate the
oncologist's
All
oncologist’s knowledge about
about the protocol
protocol into
into rules accessible
accessible to ONCOCIN.
ONCOCIN.
All the
rules pertaining
pertaining to the new protocol
protocol are written
process must
written at
at that
that time,
time, and this
this process
must be
repeated for
protocol that
This
for every new protocol
that is added to ONCOCIN.
ONCOCIN.
This method
method is rather
rather
inefficient
inefficient since many
many of
of the rules are similar
similar between protocols,
protocols, differing
differing only
only in their
their
data
To
To speed
speed the process
process of knowledge acquisition,
acquisition, a program
program is being
data content.
content.
developed whereby
whereby a doctor
doctor could sit
sit down at a terminal
terminal and fill
fill in a series
series of
of forms
forms
containing
containing appropriate
appropriate questions about
about a new protocol.
protocol. The
The information
information entered would
would
take care of the large number
number of general rules pertaining
pertaining to the protocol
protocol and allow
allow the
doctor
doctor and programmer
programmer to concentrate
concentrate on the special cases.
cases.
The
The program
program will
will have two
two levels, the first
first of
of which
which is the program
program that
that will
will
interact
interact directly
directly with
with the doctor.
doctor. This
This program
program runs on Xerox
Xerox D machines which
which have
extensive graphics
Sections of the display
graphics capabilities.
capabilities.
display screen
screen (called windows)
windows) are
organized in a way
way that
that emulates the physician's
physician’s patterns
patterns of thought
thought when thinking
thinking about
about
the prot.ocoi.
protocol. Other
Other graphical
gra.phical entry
entry devises
devises have been
been used
used to
to encourage
encourage pointing
pointing at the
answer rather
rat.her than
than text
text entry.
entry. These methods are able to display
display all of the possible
choices
choices in a compact
compact and comprehensible way.
way. The
The first
first phase
phase of
of this
this program
program has been
been
complrted
complet.ed and h;JS
has been
been examined and approved
approved by our
our oncology collaborators.
collaborators.
InFormat.ion
Informat.ion entered in the top
top level program
program will
will be
be converted
converted to
t,o an intermediate
intermediat,e
data
data. structure
structure which
which will
will be
be used
used by the second
second level of the program
program to make new rules
for the ONCOCIN
ONCOCIN knowledge base.
base. Eventually,
Eventually, this
this process
process will
will also work
work in the opposite
opposit,e
direction
direclion so that
that information
informa,tion about
about a previously
previously entered protocol
protocol can be copied or
or
modified
modified by the
t,he physician
physician for the new protocol.
protocol. This
This "similar
“similar to"
to” option
option will
will also extend
t·o
t.o chemotherapies
chemot,herapies and drugs, so that
that when the doctor
doctor enters a chemotherapy
chemotherapy or
or drug
drug that
that
the system
systt=rn knows about,
about, pertinent
pertinent information
information will
will be filled
filled in for the doctor
doctor to copy or
or
modify.
modify. The
The "similar
“similar to"
to” capably
capably along with
with the use
use of
of graphical
graphical input
input devices to speed
speed
the process
process of entering
entering a new protocol
protocol and will
will also reduce errors
errors and duplications.
duplications. When
When
this
this project
project is completed
completed the total
total time
time needed
needed to
to enter
enter a new protocol
protocol should be greatly
greatly
reduced and more effort
effort will
will be concentrated
concentrated on fine tuning
tuning the rules to handle special
situations.
situations.

C.5
Evaiuat'ions of
Performance
(2.5 Eualuations
of ONCOCIN's
ONCOCIN'S
Performance
Data
Data collection
collection and analysis for all three ONCOCIN
ONCOCIN evaluations
evaluations are now complete,
complete,
results
results were presented at the annual
annual meeting
meeting of
of the Society
Society for Medical
Medical Decision Making,
Making,
and we expect to have formal
formal reports
reports published
published during
during the next
next year.
111
111

E. A. Feigenbaum
Feigenbaum

ONCOCIN
ONCOCIN Project
Project

5P41 RR00785-11
RR00785- 11

Study
Study 1, overseen by
by Dr.
Dr. Robert
Robert Carlson
Carlson of
of the Division
Division of
of Oncology,
Oncology, is an
evalua,tion of
of the program's
program’s impact
impact on the attitude
attitude of
of the
the oncology
oncology fellows
fellows towards
t,owards
evaluation
computers
All
All physicians
physicians were administered
administered
computers in general
genera.1 and ONCOCIN
ONCOCIN in particular.
particular.
questionnaires and structured
structured interviews
interviews in the Spring
Spring of
of 1981
1981 before
before ONCOCIN
ONCOCIN was
questionnaires
introduced.
intQoduced. The
The same questionnaires
questionnaires were distribut.ed
distributed to them
them again after
after they
they had used
system for
for over
over a year.
year. Follow-up
Follow-up interviews
interviews were also undertaken.
undertaken. This
This study
study was
the system
repeated again
a.ga.in during
during 1983 to
to determine
determine the trends
trends over
over time.
time. The
The results
results of
of this
this study
study
repeated
are presently
presently being
being prepared
prepared as a formal
formal report.
report.
\Ve
We are also revising
revising this
this study
study in preparation
preparation for
for the integration
integration of
of the
workst.ation version
version of
of ONCOCIN
ONCOCIN into
into the clinic.
clinic. To
To maintain
maintain some consistency
consistency in the
workstation
evaluation process, the original
original questions
questions from
from Study
Study 1 will
will be given
given and analyzed
analyzed as
as
evaluation
before along with
with new questions.
questions. Several
Several of
of these new survey
survey and interview
interview questions
questions will
will
before
serve as
as a "baseline"
“baseline” for
for evaluating
evaluating any
any perceived
perceived improvements
improvements that
that will
will come with
with the
introduction
intZroduction of
of the professional
professional workstations
workstations in the clinic.
clinic.

Study 2, overseen by
by Dr.
Dr. Daniel
Daniel Kent
Kent of
of the Division
Division of
of General
General lilternal
Internal Medicine,
Medicine,
Study
is an evaluation
evaluat.ion of
of the program's
program’s impact
impact on the
the completeness
completeness and accuracy
accuracy of
of flowsheet
flowsheet
data
Research programmers
data recorded
recorded with
with and without
without ONCOCIN.
ONCOCIN.
programmers wrote
wrote routines
routines to
to
formally analyze
a.naiyze on-line
on-line flow
flow sheets for
for completeness
completeness and accuracy.
accuracy. Pre-ONCOCIN
Pre-ONCOCIN flow
flow
formally
then entered
entered into
into the system
system exactly
esactly as
as they
they were
were originally
originally
recorded by
by the
the
sheets were then
recorded
physicinn.
The same analytic
analytic routines
routines were used to
to analyze
analyze these pre-ONCOCIN
pre-ONCOCIN flow
flow
physician. The
sheet,s. The
The pre- and postpost- ONCOCIN
ONCOCIN data
data were compared.
compared.
sheet.s.
Results
Results indicat.e that
that
ONCOCIN has had a statistically
statistically significant
significant beneficial
beneficial impact
impact on the completeness
completeness of
of data
data
ONCOCIN
recording, the ordering
ordering of
of required
required tests, and the accuracy
accuracy of
of the data
data recorded.
recorded. A
A formal
formal
recording,
report
report of
of the results
results is in preparation.
preparation.
Finally, Study
Study 3 is examining
examining the comparison
comparison between
between ONCOCIN's
ONCOCIN’s therapeutic
therapeutic
Finally,
treatment decisions made by
by oncology
oncology fellows
fellows in the same setting.
setting. The
The
advice and the treatment
study was coordinated
coordinated by
by Dr.
Dr. David
David Hickam,
Hickam, formerly
formerly of
of our
our Division
Division of
of General
General Internal
Internal
study
hledicine and now
now on the
the faculty
faculty at the University
University of
of Oregon
Oregon in Portland.
Portland.
Medicine
Expert
Expert
evaliuators rated
rated treatment
treatment plans
plans without
without knowing
knowing whether
whether the recommendation
recommendation was that
tha.t
evaluators
of ONCOCIN
ONCOCIN or
or one of
of the clinic
clinic physicians.
physicians. Over
of
Over 200 flow
flow sheets were evaluated
evaluated by
by
Stanford lymphoma
lymphoma experts,
experts, and the resulting
resulting data
data have been fully
fully analyzed
analyzed by
by Dr.
Dr.
Stanford
Hick-am. The
The results
results indicate
indicate that
that the experts
experts were unable
unable to
to fault
fault the recommendations
recommendations
Hickam.
by ONCOCIN
ONCOCIN relative
relative to those of
of experienced
experienced oncology
oncology fellows
fellows treating
treating patients
patients
made by
with
wit,h lymphoma.
lymphoma.. A
A paper
pa,per describing
describing the results
results is in preparation.
preparation.
A stlldy
study was made
ma.de of
of all
all of
of the cases
cases run
run by
by phYSicians
physicians in the clinic
clinic to determine
determine
A
st.atist.ics
when they
they chose
chose to
to override
override ONCOCIN's
ONCOC’I~N’s therapy
therapy recommendation.
recommendation. The
The
stat
istks about. when
result#s showed that
that approximately
approximately 75%
75% of
of the time
time they
they agreed completely.
completely. \Vhen
When there
there
resl1lts
disagreements, 15%
15% were about
about individual
individual drug
drug doses.
doses. This
were disagreements,
This study
study pointed
pointed out
out a
number of
of situations
situations where
where ONCOCIN
ONCOCIN needs more
more knowledge,
knowledge, and
snd where
where our
our expert
expert
nnmber
clarificntion from
from the Principal
Principal Investigators
Investigators of
of the particular
particular protocol.
protocol.
needed clarificar.ion
As a
result.. a meeting
meet,ing was
wa.s held (7/12/83)
(7/12/83) with
with some of
of the Faculty
Faculty in charge of
of the Hodgkin's
Hodgkin’s
result.
prot,ocols to discllsS
discllss issues
issues arising
arising from
from this
this study.
study.
protocols
C.6 Documentation
C.6
Documentation

An extensive
extensive effort
effort to
to document
document the ONCOCIN
ONCOCIN system
system was completed
completed during
during this
this
An
year. Many
Many aspects of
of the ONCOCIN
ONCOCIN program
program and its
its programming
programming environment
environment are
last year.
now \vritten
writ,ten and available
available for
for project
project members'
members’ use.
use. The
now
The increase in documentation
documentation has
significantly reduced
reduced the start-up
start-up time
time for
for new researchers
researchers working
working with
with the project.
project. In
significantly
addition, we have published
published several papers
papers and prepared
prepared several
several technical
technical reports
reports
addition,
describing the system.
syst,em.
describing

E. A. Feigenbaum
Feigenbaum

112
112

ONCOCIN
ONCOCIN Project
Project

5P41 RR00785-11
RR00785-11

C.7
Hypothesis Assessment
Assessment
C.7 Hypothesis
As mentioned
mentioned above, largely
largely through
through the
the efforts
efforts of
of Curtis
Curtis Langlotz,
Langlotz, we have
continued
continued to
to develop modifications
modifications to ONCOCIN
ONCOCIN that
that will
will permit
permit it
it to
to function
function as
as an
"observer"
physician's
“observer” of
of the physician’
s own
own decisions rather
rather than
than as
as a primary
primary source of
of advice. By
By
permitting the physician
physician to
plan on the flowsheet,
permitting
to enter
enter his or
or her
her own
own therapy
therapy plan
flowsheet, we can
acknowledge
acknowledge the oncologist's
oncologist’s ability
ability to
to reach
reach appropriate
appropriate therapeutic
therapeutic decisions for
for most
most
patients. ONCOCIN
physician's
plan with
patients.
ONCOCIN will
will simply
simply compare
compare the physician’
s plan
with what
what it
it believes is the
proper therapy.
If
physician, or
If the system
system agrees with
with the
the physician,
or determines
determines that
that small
small
proper
therapy.
differences
differences are clinically
clinically insignificant,
insignificant, no advice from
from the computer
computer will
will be necessary. If
If
significant
significant disagreements
disagreements occur,
occur, on the other
other hand,
hand, ONCOCIN
ONCOCIN will
will need to respond
respond with
with
warnings
plan may
warnings and explanations
explanations for
for why
why it
it feels that
that an alternate
alternate therapy
therapy plan
may be
preferable. Our
preferable.
Our experience with
with ONCOCIN
ONCOCIN since its
its clinic
clinic implementation
implementation suggests that
that
tllis
preferred by
physicians. It
this mode of
of interaction
interaction will
will be preferred
by the clinic
clinic physicians.
It will
will require
require minimal
minimal
changes
decision
chnnges to ONCOCIN's
ONCOCIN’s
decision making
making approach,
approach, but
but the
the determination
determination of
of what
what
differences
differences are clinically
clinically significant,
significant, and the optimal
opt,imal method
method for
for explaining
explaining their
their
importance
physician, are exciting
problems.
importance to the physician,
exciting challenges and important
important theoretical
theoretical problems.
An
International Journal
Journal
An initial
initial report
report, describing
describing this
this work
work appeared
appeared during
during 1983 in the International
of
Al11tl-Afachine Studies,
plan to continue
of Man-i\lachine
Studies, and we plan
continue enhancing
enhancing the
the system's
system’s critiquing
critiquing and
explanation
Mr.
Mr. Langlotz
Langlotz presented
presented this
this work
work in the 1983 Society
Society for
for
explanation capabilities.
capabilities.
Computer
Computer Applications
Applications in Medical
Medical Care
Care Conference
Conference Student
Student Paper
Paper Competition,
Competition, and was
a finalist
finalist in the competition.
competition. The
The approach
approach will
will not
not be used in the clinic,
clinic, however,
however, llDtiI
until
ONCOCIN
professional workstations,
ONCOCIN has been transferred
transferred to
to professional
workstations, hopefully
hopefully in about
about two
two
years.

C.7
Rule Analysis
Analysis
C.? Query
Query System
System and
and Rule
Shako
Shoko Tsuji
Tsuji has completed
completed her work
work on the development
development of
of a query
query system
system to
permit
permit easy access
access to the large ONCOCIN
ONCOCIN knowledge
knowledge base. Once we had encoded several
several
hundred
hundred rules.
rules, it
it became unwieldy
unwieldy for
for system
system builders
builders to work
work from
from large hard-copy
hard-copy
listings
listings of
of the
t,he knowledge
knowledge base,
base, and we anticipate
anticipate that
that physicians
physicians will
will also require
require direct
direct
access
access to the program's
progra.m’s knowledge.
knowledge. The
The query
query system
system permits
permits this
this kind
kind of
of access.
access. Rather
Rather
than
than dealing
dea.ling with
with natural
natural language understanding
understanding by
by computer,
computer, we are designing
designing ways
ways
that
that menu
menu selection
select,ion and the high-speed
high-speed interface
interface can be used to
to permit
permit access
access to the
information
information that
that is needed by
by a physician
physician or
or system
system builder.
builder. A
A paper
paper describing
describing the early
early
work
A,A.MSI Congress
work was presented
present.ed last year
year (May
(May 1983) at the A4MSI
Congress 83
8.Y in San Francisco.
Francisco.
In previous
previous reports
reports we also described
described the work
work of
of Dr.
Dr. Motoi
Motoi Suwa
Suwa who
who developed
developed
programs to 3:"sist
programs
a.?sist. in determining
determining knowledge
knowledge base
base consistency
consistency and completeness. His
paper on this
AI Magazine.
Afagazine. However,
paper
t,his subject
subject appeared
appeared in late 1982 in the AI
However, the programs
programs
that.
t,hnt, he wrote
wrote were never
never formally
formally linked
linked to
to our
our system
system for
for writing
writing rules and modifying
modifying
otller
parts of
Noble spent
ot)her p;Lrts
of tlle
t,he knowledge
knowledge base.
base. As a result,
result, Mr.
Mr. Robert
Robert Noble
spent time
time during
during the
last few years modifying
modifying Suwa's
Suwa’s code so that
that it
it would
would operate
operat,e as
3s an integral
integral part
part of
of
ONCOCIN.
These changes
ONCOCIN.
cha.nges have now
now been implemented
implemented so that
that a new rule
rule can be
dynamically
dynnmicslly cOlllpared
compared to the rest of
of the knowledge
knowledge base
base during
during the process of
of knowledge
knowledge
entry.
Noble is currently
entry. Mr.
Mr. Noble
currently considering
considering how
how such a program
program might
might be implemented
implemented on
a workstation
workstation in order
order to take
take advantage
advantage of
of the newly
newly available
available graphical
graphical capabilities
capabilities of
of
these macllines.
machines.

C.g
Encoding of
Additional Fkotocols
Protocols
C.9 Encoding
of Additional
As was indicated
to a
indicated above, we have emphasized
emphasized transfer
transfer of
of ONCOCIN
ONCOCIN
professional workstation
rather
of
professional
workstation
rather than
than the implementation
implementation
of additional
additional protocols.
protocols.
However,
However, the oncologist
oncologist in charge of
of breast
breast cancer treatment
treatment at Stanford
Stanford had expressed
great
great interest
interest in adding
adding those treatment
treatment protocols
protocols to
to the system
system as
as soon as
as possible. We
We

113

E. A. Feigenbaum
Feigenbaum

5P41 RR00785-11
RR00785-11

ONCOCIN Project
Project
ONCOCIN

accordingly encoded and
and thoroughly
thoroughly tested
tested the
the treatment
treatment plans
plans for
for adjuvant
adjuvant therapy
therapy
have accordingly
of breast
breast carcinoma
carcinoma (CMF
(CMF and
and CMFVP
CMFVP treatment
treatment plans)
plans) and
and released them
them for
for regular
regular
of
the spring
spring of
of this
this year.
year. Encoding
Encoding of
of the
the CMF
CMF treatment
treatment plan
plan required
required encoding
encoding of
of
use in the
special rule
rule types.
types. In
In order
order to
to represent
represent these treatment
treatment plans
plans special
special methods
methods were
were
special
created for
for looking
looking back
back to
to previous
previous cycles to
to compa.re current
current laboratory
laboratory results
results to
to
created
previous va.lues.
values. This
This allows
allows the
the development
development of
of treatment
treatment recommendations
recommendations based upon
upon
previous
past experience
experience with
with the
the patient.
patient. A
A number
number of
of other
other protocols
protocols were
were added
added to
to the
the
past
ONCOCIN system
system in order
order to
to keep the
the system’
system's
knowledge about
about Hodgkin’
Hodgkin's
and
ONCOCIN
s knowledge
s and
These included
Lymphoma protocols
protocols current.
current.
included new Lymphoma
Lymphoma protocols
protocols with
with very
very
Lymphoma
complex alternating
alternating chemotherapies
chemotherapies (M-HOP/B-Cepp/HD-MTX
(M-HOP jB-CeppjHD-MTX and M-BACOD/HDM-BACODjHDcomplex
MTX), and new Hodgkin’
Hodgkin's
protocols (alternating
(alternating MOPP/ABVD).
MOPP j ABVD).
hlTX),
s protocols

C.l0 Strategic
Strategic Therapy
Therapy Planning
Planning
C.10
As mentioned
mentioned above, we have begun
begun a new
new research
research project
project to
to study
study the
the therapy
therapy
As
planning process, and how
how strategies
strategies which
which are used to
to plan
plan therapy
therapy in difficult
difficult cases
cases
pla.nning
might be represented
represented on a computer.
computer. This
This project,
project, which
which we call
call the
the ONYX
ONYX project,
project, has
might
as it,s
its goals: to
to conduct
conduct basic research
research into
into the
the possible representations
representations of
of the
the therapy
therapy
planning process; to
to develop
develop a computer
computer program
program to
to represent
represent. this
this process:
process; and eventually
eventually
planning
t.o interface
interface the
the planning
planning program
program with
with ONCOCIN.
ONCOCIN. The
The project
project members
members (Fagan,
(Fagan,
to
Bischoff, Williams,
Williams, Langlotz,
Langlotz, and Rennels)
Rennels) have spent
spent many
many hours
hours meeting
meeting with
with Dr.
Dr. Sikic
Sikic
Bischoff,
trying to
to understand
underst.and how
how he plans
plans therapy
therapy for
for patients
patients whose special
special clinical
clinical situation
situation
trying
precludes following
following the
the standard
standard therapeutic
therapeutic plan
plan described
described in the
the prot.ocol
protocol document.
document. In
In
precludes
March of
of t)his
this year,
year, the
the group
group spent
spent two
two days at
at Xerox
Xerox Palo
Palo Alto
Alto Research Center
Center
binrch
(PAHe), working
working with
with Mark
Mark Stefik,
Stefik, Daniel
Daniel Bobrow
Bobrow and Sanjay
MiHal of
of PARC
PARC on
(PARC),
Sanjay Mittal
possible representations
representations for
for the knowledge
knowledge structures
structures and how
how such a program
program might
might run
run
We hope to
to have a prototype
prototype of
of this
this
using the LOOPS
LOOPS knowledge
knowledge programming
programming system.
system. We
system
system running
running this
this year.
year.

D. Publications
Publications Since
January 1989
D.
Since January
1983
Expert systems
systems research:
research: modeling
modeling the
the
Expert
In: An
to Monitoring
medical
An Integrated
Integrated Approach
Approach to
Afonitoring
medical decision
decision malcing
making process. In:
(J.S. Gravenstein,
Newbower, A.K.
N.T. Smith,
Gravenstein, R.S. Newbower,
A.K. Ream, and N.T.
Smith, eds.), pp.
183-:!OO.
183-200, \Voburn,
Woburn, MA:
MA: Butterworth's,
Butterworth’s, 1983.

1. (*)
(*) Shortiiffe,
Shortiiffe, E.H.
E.H. and Fagan,
Fagan, L.M.
L.M.

2.
2. Duda,
Dudn,

R.O.
R.O.

and Shortliffe,
Shortliffe,

E.H.
E.H.

Expert systems research.
Expert

Science,
Science,

220:251-258
220:261-268 (1983).

Adapting a consultation
consultation system
system to
3. (*)
(*) Langlotz,
La,nglot,z, C.P.
C.P. and Shortliffe,
Shortliffe, E.H.
E.H. Adapting
International
Journal
of Man-Machine
Studies,
critique
International
Journal
of
]t.lan-Afachine Studies,
critique user plans.
19:479-495
19:479-496 (1983).

4.
Graphical access
access to the knowledge
knowledge base
base of
of a
4. (*)
(*) Tsuji,
Tsuji, S.
S. and Shortliffe,
Shortliffe, E.H.
E.H. Graphical
Proceedings
of
Congress
89,
medical
medical consultation
consultation system.
Proceedings
of AAAfSI
AAMSI
Congress
83, pp
551-555. San Francisco,
Francisco, CA,
CA, May
May 1983.
1983.
5. (*)
(*) Bischoff,
Bischoff, M.B.,
M.B., Shortliffe,
Shortliffe, E.H.,
E.H., Scott,
Scott, A.C.,
A.C., Carlson,
Cnrlson, R.W.
R.W. and Jacobs,
C.D.
C.D. Integratio:l
Integration of a computer-based
computer-based consultant
consultant into
into the clinical
clinical setting.
setting.
Proceedings
7th
Symposium
on
Applications
in
Proceedings
7th Annual
Annual
Symposium
on Computer
Computer
Applications
in Afedical
Medical
Care,
October 1983,
1983, Baltimore,
Baltimore, Maryland.
Maryland.
Care. pp. 149-152. October
6. Mulsant,
Rlulsant, B. and Servan-Schreiber,
Servan-Schreiber, D.: Knowledge
Knowledge engineering:
engineering: a daily
daily activity
activity
on a hospital
and
Re.ion of confounding
confounding time
time intervals,
intervals, or
or statistical
statistical control.
control. The
The Study
Study Module
Module then
then
determines
detrrmines an appropriate
appropriate statistical
statistical method,
method, using knowledge
knowledge stored
stored as
as production
production rules.
\10st.
1lost studies
studies have used a longitudinal
longitudinal design involving
involving a multiple
multiple regression model
model applied
applied
to individual
individual pat.ient
pat,ient records.
records. Results
Results across patients
patients are combined
combined using weights
weights based
on the
t,he precision
precision of the estimat.ed
estimat,ed regression coefficient
coefficient for
for each patient.
patient.

B
\/edical Relevance
B..•,Weciical
Relevance and
and Collaboration
Collaboration
As a t.est
t,est bed for
for system
syst,em development
development our
our focus of
of attention
attention has been on the
records
records of patients
patients with
with systemic
systemic lupus
lupus erythematosus
erythematosus (SLE)
(SLE) contained
contained in the Stanford
Stanford
port.ion
port,ion of the ARAMIS
ARAMIS Da.t.a
Da,t,a Bank.
Bank. SLE
SLE is a chronic
chronic rheumatologic
rheumatologic disease
disease with
with a broad
broad
spectrum
spectrum of manifestations.
manifest.ations. Occasionally
Occasionally the disease
disease can cause
cause profound
profound renal
renal failure
failure and
lead to an early
early dt'ath.
death. \Nith
With many
many perplexing
perplexing diagnostic
diagnostic and therapeutic
therapeutic dilemmas,
dilemmas, it
it is a
disea.se
of
considerable
medical
interest.
disease of considerable medical interest.
In the fut.lIre
project users of
future we anticipate
anticipate possible collaborations
collaborations with
with other
other project
of the
TOO
National Stroke
Northern California
TOD Syst.em
System SllCh
such 3,S
as the National
Stroke Data
Data Bank,
Bank, the Northern
California Oncology
Oncology
Gro1lp,
Group, and the
t,he St,anford
St,a.nford Divisions
Divisions of
of Oncology
Oncology and of
of Radiation
Radiation Therapy.
Therapy.
\Ve
project is broadly
broadly applicable
We believe that
tha.t this
t,his research
resea.rch project
applicable to the entire
entire gamut
gamut of
of
chronic
bulk of
chronic diseases
diseases that
that constitute
constit,ute the bulk
of morbidity
m0rbidit.y and mortality
mortality in the United
United States.
States.
Consider
Consider flY\"
five major
major diagnost.ic
diagnost,ic categories
categories responsible
responsible for
for approximately
approximately two
two thirds
thirds of
of the
two
two million
million deaths
deat,hs per year
year in the United
United States: myocardial
myocardial infarction,
infarction, stroke,
stroke, cancer,
hypertension,
Therapy
hypertension. and diabetes.
diabetes.
Therapy for
for each of
of these diagnoses is fraught
fraught with
with
controversy concerning
concerning the balance
balance of
of benefits
benefits versus costs.
controversy
l. Myocardial
Myocardial Infarction:
Infarction: Indications
Indications for
for and
and efficacy
efficacy of
of coronary
coronary artery
artery bypass
1.
graft vs. medical
medical management
management a,lone.
alone.
Indications for
for long-term
long-term
grsft
Indications
antiarrhythmics ..... . long-term
long-term anticoagulants.
anticoagulants. Benefits
Benefits of
of cholesterol-lowering
cholesterol-lowering
antia.rrhythmics
diet.s, exercise, et,c.
etc.
diets,

Strokr: Efficacy
Ef'ficacy of
of long-t,erm
long-term anti-platelet
anti-platelet agents, long-term
long-term anticoa.gulation.
anticoagulation.
2. Stroke:
Indications for
for revascularization.
revascularization.
Indications
Cancer: Relative
Helative efficacy
efficacy of
of radiation
radiation therapy,
therapy, chemotherapy,
chemotherapy, surgical
surgical excision
excision
3. Cancer:
singly or
or in combination.
combination. Optimal
Optimal frequency
frequency of
of screening
screening procedures.
procedures.
- singly
Prophylactic thera.py.
therapy.
Prophylactic
4. Hypertension:
Hypertension: Indications
Indications for
for
chronic
an
tihypertensive
drugs.
chronic antihypertensive drugs.
arteriography in
in work-up.
work-up.
arteriography

therapy. Efficacy
Efficacy versus
versus
therapy.
Role of
of various
various diagnostic
diagnostic

adverse effects
effects of
of
tests
such
as
renal
tests
renal

Diabetes: Influence
Influence of
of insulin
insulin administration
administration on microvascular
microvascular complications.
complications.
5. Diabetes:
of ora.
or::!,l hypoglycemics.
hypoglycemics.
Role of
Feigenbaum
E. ‘A.
4. Feigenbsum

-..

1'>'>
132

RADIX
RADIX Project
Project

5P4l
5PA 1 RR00785-11
RR00785- 11

Despite the expenditure
expendit,ure of billions
billions of dollars
dollars over recent
recent years for randomized
randomized
controlled
controlled trials
trials (RCT's)
(RCT’s) designed to answer these and other
other questions, answers have been
been
slow in coming.
coming. ReT's
RCT’s are expensive of funds and personnel. The therapeutic
therapeutic questions in
clinical
clinical medicine
medicine are too numerous
numerous for each
each to be
be addressed by its own series
series of RCT's.
RCT’s.
On t.he
t,he other
ot,her hand.
hand, the data
data regularly
regularly gathered
gathered in patient
patient records
records in the course of
the
t.he normal
normal performance
performance of health
health care delivery
delivery are a rich
rich and largely
largely underutilized
underutilized
resource. The
of these data
The ease
ease of accessibility
accessibility and manipulation
manipulation
data afforded
afforded by
computerized
computerized clinical
clinical databases holds out
out the possibility
possibility of a major
major new resource for
for
acquiring
acquiring knowledge
knowledge on the evolution
evolution and therapy
therapy of chronic
chronic diseases.
diseases.
The
The goal of the research that
that we are pursuing
pursuing on SUMEX
SUMEX is to increase the
reliability
reliability of knowledge
knowledge derived
derived from
from clinical
clinical data
data banks with
with the hope of providing
providing a new
tool
tool for augmenting
augmenting knowledge
knowledge of diseases
diseases and therapies
therapies as
as a supplement
supplement to knowledge
knowledge
derived
the incorporation
derived from
from formal
formal prospective
prospective clinical
clinical trials.
trials. Furthermore,
Furthermore,
incorporation of
knowledge
knowledge from
from both
both clinical
clinical data
data banks
banks and other
other sources into
into a uniform
uniform knowledge
knowledge base
base
should
should iflC'rease
increase the ease
ease of
of access
access by individual
individual clinicians
clinicians to this
this knowledge
knowledge and thereby
thereby
facilitate
facilitate both
both the
t,he practice
pract.ice of
of medicine
medicine as
as well
well as
as the investigation
investigation of
of human
human disease
disease
processes.
processes.

C.
of
C. Highlights
Highlights
of Research
Research Progress
Progress
C.l
C.1 1 .May
May 1988
198.9 to
to 1 May
May 1984
198;5
Our
Our primary
primary accomplishments
accomplishments in this
this period
period have been the following:
following:
1) complete
complet,e modifications
modifications to RADIX
RADIX to accommodate
accommodate the one hundred-fold
hundred-fold increase
patients,
in the size of
of our
our database to
to 1700 patients,

2) carry
prednisone on serum
carry out
out the study
study of
of the effect
effect of
of prednisone
serum cholesterol
cholesterol on the new
database.
database.
3) publish
publish results
patient prednisonejcholesterol
prednisone/cholesterol study,
results of
of the 1700 patient
study,

4) publish
publish the description
by us to this
description of
of a two-stage
two-stage regression method
method adapted
adapted by
this

stlldy.
study,
5) complete
preparation for
complete System
System Programmer's
Programmer’s !v1:1nuals
Manuals and User's
User’s Manual
Manual in preparation
fo1
transfer
tra.u?fer to
to outside
olltside sites, and
5) begin transfer
transfer of
of RADIX
RADIX to
to Xerox
Xerox D-Machine
D-Machine personal
personal work
work stations.
stations.
6)

C.l.l Modifications
j'IIodifications to
to RADIlr’
RADIX
the enlarged
enlarged database
database
C.l.l
for for the
Extensive modifications
modifications to
to RADIX
RADIX were
were required
required to
to deal with
with the
the 100-foid
Extensive
lOO-fold
in the
the size of
of the database.
database. The
The modifications
modifications necessary to
to run
run the study
study
automatically on the prednisone/cholesterol
prednisone/cholesterol study
study were completed
completed this
this year.
year.
automatically

increase
module
module

C.i.E Prednisone/chlouterol
Prednisone/chlosterol study
study on
on enlarged
enlarged database
database
C.1.2
\Ve have carried
carried out
out the
t,he automated
automated study
study of
of the
the effect
effect of
of prednisone
prednisone on serum
serum
We
cholesterol using
using the
the new
new 1700 patient
patient database.
database. ItIt has strongly
strongly confirmed
confirmed the
the effect
effect
cholest,erol
previously observed
obst'rved in the
the 50-patient
50-patient SLE
SLE database.
da.tabase. In
In addition,
addition, we are examining
examining the
the
previously
effect in non-SLE
non-SLE patients
patients and
and in
in other
other patient
patient subsets. We
We are also examining
examining alternative
alternative
effect
pharmacokinetic models
models for
for the
the prednione
prednione effect
effect using
using the
the newly
newly available
available data.
data.
pharmscokinetic

C.1.3 Publish
Publish results
results of
of predn.isone/cholesterol
prednisone/cholesterol study
study
C.1.9

123

A. Feigenbaum
Feigenbaum
E. A.

5P41 RR00785-11
RR00785- 11

RADIX
RADIX Project
Project

The
paper reporting
The paper
reporting these results
results is in draft
draft form.
form. It
It will
will be submitted
submitted for
for
publicat.ion shortly.
publication
shortly.

C.LI
Publish description
of
method
C.1.4 Publish
description
of 2-stage
!&stage regression
regression
method
A
publicat.ion.
A description
description of
of the 2-stage
Z-sta.ge regression method
method has been submitted
submitted for
for publicat,ion.

C.l.5
Documentation
C.l.5 Document&ion
A
A two-volume
two-volume System
System Programmer's
Programmer’s Manual
Manual and a User's
User’s Manual
Manual describing
describing
implementation,
implementation, maintenance
maintenance and use of
of the
the system
system at
at Stanford
Stanford has been completed.
completed. In
addit.ion,
addit,ion, a complete
complete set of
of the files needed for
for on-line
on-line demonstrations
demonstrations has been created,
created,
separating
separating them
them from
from the working
working versions.
versions.

C.1.6
RADIX to
D-Machines
CJ.6 Tran.ser
Transer of
of RADIX
to D-Machines
Prl.'liminary
Preliminary work
work on implementing
implementing RADIX
RADIX on D-Machines
D-Machines has begun. This
This will
will
cont.inue
continue in coming
coming years.
years.

C.l.7
C. 2.7 Other
Other accomplishments
accomplishments
\Ve have present.ed the results
results of
of our
our research at
at several
several conferences during
during the
year.
publications for
publications.
year. Additional
Additional publications
for the year
year are noted
noted in the section
section on publications.
C.2
Research in
Progress
C.2 Research
in Progress
\Ve
patient
We are currently
currently completing
completing additional
additional studies
studies on subsets of
of the
the 1700 patient
database.
These
include
automated
analysis
of
the
prednisone/
cholesterol
effect
in
nondatabase.
include automated analysis of
prednisonel cholesterol effect
SLE
patients and subsets of
patients, and fitting
pharmacokinetic models
SLE pa.tients
of SLE
SLE patients,
fitting alternative
alternative pharmacokinetic
of
prednisone/cholesterol effect.
of the prednisone/cholesterol
effect. This
This work
work should
should be completed
completed shortly.
shortly. We
We will
will then
then
return
below in the
ret,urn to
to the more
more AI-oriented
AI-oriented aspects of
of RADIX,
RADIX, as
as described
described below
the section
section on
Resea.rch
Resea.rch Plans.
Plans.

D. Publication8
Publications
D.
1. Blum.
Regression: Application
Application to
Clinical
Blum. R.L.:
R.L.: Two
Two Stage
Stage Regression:
to a Time-Oriented
Time-Oriented
Clinical

Database. (Submitted
Dcltabnue.
(Submitted
Epidemiology.)
Epidemiology.)

for
for

publication
publication

to
to

the

American
American

Journal
Journal

of
of

R.L.:
Prednisone Elevates
Elevates Cholesterol:
An
Automated Study
R.L.: Prednisone
Cholesterol:
An Automated
Study of
of
Longitudinal Clinical
Data. (Manuscript
preparation.)
Longitudinal
Clinical
Data.
(Manuscript in preparation.)

2.
9. Blum,
Blum,

3.
Minimycin: AA !vfiniature
Rule-Based System
nnd Walker,
Walker, M.G.:
M.G.: Minimycin:
Miniature
Rule-Ba8ed
System
3 . Blllm,
Bliim, R.L
R.L.... and

(Submitted
publication to
(Submitted for
for publication
to M.D.Computing)
M.D.Computing)
R.L.:
A/odding and
encoding
clinical
causal
R.L.:
Modeling
and
encoding
dinid
CCJu8al relationships.
relationships.
Proceedings
Proceedings of
of SCAMC,
SCAMC, Baltimore,
Baltimore, MD,
MD, October,
October, 1983.

4. Blum,
Blum,

5. Blum,
Representation of
derived
Blum, R.L.:
R.L.: Representation
of empirically
empirically
deriued causal
causal relationships.
relationships.
IJCAI,
IJCAI, Karlsruhe,
Ksrlsruhe, \Vest
West Germany,
Germany, August,
August, 1983 .

R.L.:
representation
of
Machine
representation
0 / clinical
clinical
R.L.: !v/achine
MEDINFO
MEDINFO 83, Amsterdam,
Amsterdam, August,
August, 1983.

causal
relationships.
causal
relationships.

R.L.:
decision
making
aboard
Clinical
decision
making
aboard the
the
R.L.: Clinical
Chairman's
paper,
Session
on
Artificial
Intelligence
Chairman’s paper, Session
Artificial
Intelligence
Making,
hlaking, AAMSI,
AAMSI, San Francisco,
Francisco, May,
May, 1983.

Starship
Enterprise.
Starship
Enterprise.
and Clinical
Clinical Decision
Decision

5.
G. Blum.
Blum.
7. Blum,
Blum,

8. Blum.
hypotheses
on
Studying
hypotheses
on a time-oriented
time-oriented
Blum, R.L.
R.L. and Wiederhold,
Wiederhold, G.: Studying

E. A. Feigenbaum
Feigenbaum

124

RADIX
RADIX Project
Project

5P41 RR00785-11
RR00785-11

database:
.An
An overvie'w
of
RX project.
project. Proc.
overview
of the
the Rx
Proc. Sixth
Sixth SCAMC,
SCAMC, IEEE,
IEEE,
database:
\Vashington
D.C.,
October,
1982.
\Vashington D.C., October,
9. Blum,
Induction of
from a time-oriented
clinical
Blurn, R.L.:
R.L.: Induction
of causal
causal relationships
relationship8
from
time-oriented
clinical
database:
An overview
project. Proc.
database: An
overview of
of the
the R:'<:
RY project.
Proc. AAAI,
AAAI, Pittsburgh,
Pittsburgh, August,
August,
1982.
1982.

10. Blum,
Automated induction
of
from a timeinduction
of causal
causal relationships
relationships
from
timeBlum, R.L.:
R.L.: Automated
oriwted
RX project. Proc.
oriented clinical
clinical database:
database: The
The RXproject.
Proc. AMIA
AMIA San Francisco,
Francisco, 1982.

Discovery and
Representation of
Relationships from
from a
11.
11. Blum,
Blum, R.L.:
R.L.: Discovery
and Representation
of Causal
Causal Relationships
Clinical
Database: The
RX Project.
Project. IN
Large Time-oriented
IN D.A.B.
D.A.B. Lindberg
Lindberg
Large
Time-oriented
Clinical
Database:
The RX
and P.L.
NOTES IN
P.L. Reichert.z
Reichertz (Eds.),
(Eds.), LECTURE
LECTURE NOTES
IN MEDICAL
MEDICAL INFORMATICS,
INFORMATICS,
SpringerVerlag, 1982.
Springer-Verlag,
1982.
Discovery, confirmation,
and
incorporation
12. Blum,
Blum, R.L.:
R.L.: Discovery,
confirmation,
and
incorporation
relationships
from
a
large
time-oriented
clinical
database:
The
relationships
from
large time-oriented
clinical
database:
The
Computers
April,
Computers and Blomed.
Biomed. Res.
Res. 15(2):164-187,
15(2):161-187,
April, 1982.
1982.

of
causal
of
causal
RX
project.
RX project,

13.
Discovery and
of
from a
13. Blum,
Blum, R.L.:
R.L.: Discovery
and representation
representation
of causal
causal relationships
relationships
from
large
time-oriented
clinical
database:
The
RX project
project (Ph.D.
large
time-oriented
clinical
database:
The RX
(Ph.D. thesis).
thesis).
Computer
Computer Science and Biostatistics,
Biostatistics, St.anford
Stanford University,
University, 1982.
1982.
R.L.:
Displaying clinical
data
from a time-oriented
clinical
data
from
time-oriented
R..L.: Di8playing
Computers
1981.
Computers in BioI.
Biol. and Med. 11(4):197-210,
11(4):197-210,
1981.

14.
14. Blum.
Blum.

database.
database.

15.
Automating the
study of
hypothe,'3es
15. Blum,
Blum, R.L.:
R.L.: Automating
the study
of clinical
clinical
hypotheses on
on a time-oriented
time-oriented
databa,'le:
R.Y project.
project. Proc.
database: The
The RY
Proc. MEDINFO
MEDINFO 80, Tokyo,
Tokyo, October,
October, 1980, pp.
4,56-4130.
456-460. (Also
(‘41~0 STAN-CS-79-816)
STAN-CS-79-816)
16.
Inferring
knowledge
from clinical
data
jerring
knowledge
from
clinical
data
16. Blum,
Blum, R.L.
R.L. and Wiederhold,
Wiederhold, G.: In
utilizing
techniques
from artificial
intelligence.
Proc.
banks
technique8
from
artificial
intelligence.
Proc. Second
banks
utilizing
SCMvlC,
November, 1978.
SCAh3C:. IEEE,
IEEE, Washington,
Washington, D.C.,
D.C., November,
1978.
Blum. R.L.:
R.Y
17. .Blum.
R.L.: The
The RY
clinical
data
clinical
data banking
banking
University
University Ph.D.
Ph.D. thesis
thesis

project: AA medical
consultation
system integrating
project:
medical
consultation
system
integrating
and
intelligence
methodologies,
Stanford
and artificial
artificial
intelligence
methodologies,
Stanford
proposal, August,
proposal,
August, 1978.
1978.

18.
18. Kuhn,
Kuhn, Ingeborg,
Ingeborg, Gio
Gio Wiederhold,
Wiederhold, Jonathan
Jonathan E. Rodnick,
Rodnick, Diane
Diane M.
M. RamseyRamseyKlee,
Automated Ambulatory
Ambulatory Medical
Medical
Klee, Sanford
Sanford Brnctt,
Benet,t, and Donald
Donald D. Beck:
Beck: Automated
published by
by Springer-Verlag,
Record Sy.5tems
Springer-Verlag, 1983, in
Record
System8 in
in the
the U.S.,
U.S., to be published
Information
Inforrna.tion Systems
Systems for
for Patient
Patient Care,
Care, B. Blum
Blum (ed.), Section
Section III,
III, Chapter
Chapter 14.
19. Walker,
\Valker, M.G.,
Lisp Tutorial.
(Submitted
publication to
M.G., and Blum,
Blum, R.L.:
R.L.: AA Lisp
Tutorial.
(Submitted for
for publication
to
M.D.Computing.)
M.D.Computing.)
20. Wiederhold,
Knowledge and
Database Management,
A1anagement, IEEE
W’iederhold, Gio:
Gio: Knowledge
and Database
IEEE Software
Software
Premier
pp.63--73.
Premier IsslJe,
Issue, Ja11.1984,
Jan.1984, pp.63--73.
21. Wiederhold,Gio:
\Viederhold,Gio: Networking
Networking of
Data Information,
Information,
of Data
\Vorkshop
on
the
Role
of
Computers
Workshop
of Computers in Cancer
Cancer
Institu
tes
of
Health,
June
1983,
pp.113-11 9.
Institutes of Health,
1983, pp.L13-119.

National
National
Clinical
Clinical

Cancer
Cancer
Trials,
Trials,

Institute
Institute
National
National

22. \Viederhold,
Database Design
Design (in
Wiedrrhold, Gio:
Gio: Database
(in the Computer
Computer Science Series) McGrawMcGrawHill
NY, May
Hill Book
Book Company,
Company, New York,
York, NY,
May 1977, 678 pp. Second edition,
edition, Jan.
1983,
1983, 758
768 pp.

125
125

E. A.
A. Feigenbaum
Feigenbaum

RADIX Froject
Project
RADIX

5P41 RR00785-11
RR00785-11
5P41

23. Wiederhold,
Wiederhold, G.:
G.: IN
IN D.A.B.
D.A.B. Lindberg
Lindberg and
and P.L.
P.L. Reichertz
Reichertz (Eds.).
(Eds.), Databases
Databases lor
for
23.
Health
Cure,
Lecture
Notes
in
Medical
Informatics,
Springer-Verlag,
1981.
Health Care, Lecture Notes in Medical Informatics, Springer-Verlag, 1981.
Wiederhold, G.:
G.: Database
Database technology
technology in,
in health
health care.
care. J.
J. Medical
Medical Systems
Systems
24. Wiederhold,
5(3):175-Hlt), 1981.
5(3):175-196,

E. Funding
Funding Support
Support Statue
Status
E.
Representation and
and Use of
of Causal
Causal Knowledge
Knowledge for
for Inference
Inference from
from
1) Representation
Databases
DatabEes
Robert L.
L. Blum,
Blum, M.D.,
M.D., Ph.D.:
Ph.D.: Principal
Principal Investigator
Investigator
Robert
Total award:
award: $89,597
$89,597 (direct
(direct ++ indirect)
indirect)
Total
Term: hIarch
l\.'larch 15, 1984 through
through March
March 14, 1986
Term:
DeriVing Knowledge
Knowledge from
from Clinical
Clinical Databases
Databases
2) Deriving
Gio C.
C. M.
M. Wiederhold,
Wiederhold, Ph.D.:
Ph.D.: Principal
Principal Investigator
Investigator
Gio
National Library
Library of
of Medicine
Medicine
National
Total award:
award: $291,192
$291,192 (direct)
(direct)
Total
Term: May
May 1,
I, 1984 through
through November
November 30,
3D, 1986
Term:

II.
II.

INTERACTIONS WITH
WITH THE
THE SUMEXA.IM
SUMEX-AIM: RESOURCE
RESOURCE
INTERACTIONS
A.. Collaborations
-4.
Collaborations

During
During the
the last
last year
year we have completed
completed System
System Programmer's
Programmer’s Manuals
Manuals and
and a
User's
User’s Manual
hlsnual as
as steps
steps towards
towards making
making the
the system
system available
available to
to outside
outside collaborators.
collaborators. We
We
have had preliminary
preliminary discussions
Edward Shortliffe
Shortliffe and
and Lawrence
Lawrence Fagan
Fagan
discussions with
with Drs.
Drs. Edward
concerning
project. Once
Once the
the RADIX
RADIX
concerning use of
of components
components of
of RADIX
RADIX in the
the ONCOCIN
ONCOCIN project.
program
progrs.m is developed,
developed, we would
would anticipate
anticipate collaboration
collaboration with
with some of
of the
the ARAMIS
ARAMIS
project
project sites in the further
further development
development of
of a knowledge
knowledge base pertaining
pertaining to
to the chronic
chronic
arthritides.
arthritides. The
The ARAMIS
ARAlMIS Project
Project at
at the Stanford
Stanford Center
Center for
for Information
Information Technology
Technology is
used by
by a number
number of
of institutions
institutions around
around the
the country
country via
via commercial
commercial leased lines to store
store
and process their
their data.
data. These institutions
institutions include
include the University
University of
of California
California School
School of
of
Medicine,
hledicine, San Francisco
Francisco and Los Angeles;
Angeles; The
The Phoenix
Phoenix Arthritis
Arthritis Center,
Center, Phoenix;
Phoenix: The
The
University
University of Cincinnati
Cincinnati School of Medicine;
Medicine; The
The University
University of
of Pittsburgh
Pittsburgh School of
Medicine;
Kansas
University;
and
The
University
of
Saskatchewan.
All
All of
of the
The University
Saskatchewan.
hledicine; Kans,as University:
rheumatologists
at
these
sites
have
closely
collaborated
with
the
development
of
ARAMIS,
rheumatologists
closely collaborated with
development of ARAMIS,
and their
their interest
interest in and use
use of the RADIX
RADIX project
project is
is anticipated.
anticipated. We
We hasten
hasten to mention
mention
thaI,
we
do
not
expect
SUMEX
to
support
the
active
use
of
RADIX
as
an
on-going
that
not expect SUMEX
support
active use
RADIX as
on-going service
to this
this extensive
extensive network
network of arthritis
arthritis centers,
centers, but
but we would
would like to be able to allow
allow the
national
nationa. centers
centers to participate
participate in the development
development of the arthritis
arthritis knowledge
knowledge base
base and to
test that
that knowledge
knowledge base
base on their
their own clinical
clinical data
data banks.
banks.

B. Interactions
EX-AIM Projects
Interactions with
with Other
Other SUM
SUA4ELX-AIM
Projects
Several
Several of
of the conce!}ts
concepts incorporated
incorporated into
into the design
design of
of the RADIX
RADIX Project
Project have
been
by other
The RADIX
RADIX knowledge
knowledge base
base is
is similar
similar to
been inspired'
inspired .by
other SUMEX-AIM
SUMEX-AIM Projects.
Projects. The
the Units
The
The production
production rule inference
inference
Units Package of
of the MOLGEN
MOLGEN PROJECT.
PROJECT.
mechanism
mechanism used
used by us
us is
is similar
similar to that
that in
in the MYCIN
MYCIN Project.
Project.
Several
Several programs
programs developed
developed by
by the
the MYCIN
MYCIN group
group are
are regularly
regularly used
used by
by RADIX.
RADIX.
These
These include
include disk
disk hash
hash file
file facilities,
facilities, text
text editing
editing facilities,
facilities, and
and miscellaneous
miscellaneous LISP
LISP
functions.
functions.

E.
E. A.
,A. Feigenbaum
Feigenbaum

126
126

RADIX
RADIX Project
Project

5P41
5P4 1 RR00785-11
RR00785- 11

Regular
Regular communication
communication on programming
programming details
details isis facilitated
facilitated by the on-line
on-line mail
mail
system.

C. Critique
Critique of
of Resource
Resource Alanagement
Management
The
The DEC
DEC System
System 20 continues
continues to provide
provide accepta.ble
acceptable performance,
performance, but
but it
it is
is
frequently
frequently heavily
heavily loaded at peek hours.
The
The SUMEX
SUMEX resource management
management continues
continues to be accessible
accessible and cooperative.
cooperative.

ID.
III. RESEARCH
RESEARCH PLANS
PLANS
A. Project
Project Goals and
and Plans
The
The overall
overall goal of the RADIX
RADIX Project
Project is to develop a computerized
computerized medical
medical
information
system
capa.ble
of
accurately
extracting
medical
knowledge
information system cspa.ble
accurately extracting medical knowledge pertaining
pertaining to the
therapy
therapy and evolution
evolution of
of chronic
chronic diseases
diseases from
from a database consisting
consisting of
of a collection
collection of
of
stored
stored patient
patient records.
records.
SHORT-TERM
SHORT-TERM GOALS
GOALS -For
For the last two
two years we have concentrated
concentrated more heavily
heavily on publishing
publishing and
presen
tat ion of our
presentation
our earlier
earlier AI
AI results,
results, on acquisition
acquisition of a 1700 patient
patient database, on
medical
medical studies
studies based on the enlarged
enlarged database, and on reporting
reporting the medical
medical results
results and
statistical
statist.ica.1 techniques
techniques arising
arising from
from our
our research. This
This is
is in concert
concert with
with the long-term
long-term goal
of ensuring
ensuring that
that the work
work of
of the SUMEX
SUMEX / Artificial
Artificial Intelligence
Intelligence in Medicine
Medicine community
community
be disseminated
he
disseminated and applied
applied in the general
general medical
medical community.
community.
During
coming two
two years
years we will
will concentrate
concentrate much
much more
more on the artificial
artificial
During the coming
intelligence
intelligence aspects of
of RADIX.
RADIX. We
We were successful this
this year
year in obtaining
obtaining funding
funding from
from the
National Library
National Science Foundation
pursue this
National
Library of
of Medicine
Medicine and the National
Foundation to pursue
this work.
work. In
In
particular, we will
particular,
will be deeply
deeply concerned
concerned with
with the representation
representation of
of causal, temporal,
temporal, and
11antitative medical
qqusnt!it,st,ive
medical knowledge.
knowledge. It
It has become clear
clear that
that t,hese
these types
types of
of knowledge
knowledge are
crurial
crucia.1 for
for the RADIX
RADIX tasks of
of automated
automated discovery
discovery of
of medical
medical knowledge
knowledge and the
provision of
provision
of intelligent
intelligent, automated
aut.omsted assistance to
to clinical
clinical researchers, in addition
addition to their
t,heir
generally
perceived value
generally perceived
value in other
other medical
medical expert
expert systems
systems applications.
applications.
LONG-RANGE GOALS
GOALS -- There
There are two
two inter-related
inter-related long-range
long-range goals of
of t,he
the
LONG-RANGE
RADIX Project:
Project: 1) automatic
automatic discovery
discovery of
of knowledge
knowledge in a large time-oriented
time-oriented database
database
RAD1.X
and 2) provision
provision of
of assistance to
to a clinician
clinician who
who is interested
interested in testing
testing a specific
specific
a.nd
hypothe:ois. These t.asks
tasks overla,p
overla.p to
to the extent
extent that
that some of
of the algorithms
algorithms used for
for
hypothesis.
discovery are also used in the process of
of testing
testing an hypot,hesis.
hypothesis.
discovery
\Ve hope to
to make
make these
broad
range
of
hypotheses
broad
of hypotheses and
records.
records.

algorithms sufficiently
SUfficiently
algorithms
over
a
broad
spectrum
over broad spectrum

robust that
that they
they will
will work
work
robust
of
data
distributions
in
the
of data distributions
the

over a
over
patient
patient

B. Justification
Justification and
and Requirements
Requirements for
for Continued
Continued Use of
of SUMEX
SUMEX
B.
Computerized clinical
clinical data
data banks
banks possess
possess great
great potential
potential as tools
tools for
for assessing
assessing the
the
Computerized
effica.cy of
of new diagnostic
diagnostic and
and therapeutic
therapeutic modalities,
modalities, for
for monitoring
monitoring the
the quality
quality of
of health
health
efficacy
delivery, and for
for support
support of
of basic medical
medical research.
research. Because of
of this
this potential,
potential, many
many
care delivery.
clinical da.ta
data banks
banks have recent,ly
recently been developed
developed throughout
throughout the
the United
United Stat,es.
States. However,
However,
clinical
the init)ial
initial problems
problems of
of data
data acquisition,
acquisition, storage,
storage, and retrieval
retrieval have been dealt
dealt with,
with,
once the
there remains
remains a.
a. set of
of complex
complex problems
problems inherent
inherent in the
the task
task of
of accurately
accurately inferring
inferring
t.here

127

A. Feigenbaum
Feigenbaum
E. A.

RADIX
RADIX Project
F’roject

5P41
5P41 RR00785-11
RR00785- 11

medical
medics.1 knowledge
knowledge from
from aa collection
collection of
of observations
observations in
in patient
patient records.
records. These
These problems
problems
concern
concern the
the complexity
complexity of
of disease
disease and
and outcome
outcome definitions,
definitions, the
the complexity
complexity of
of time
time
relationships,
relationships, potential
potential biases
biases in
in compared
compared subsets,
subsets, and
and missing
missing and
and outlying
outlying data.
data. The
The
major
major problem
problem of
of medical
medical data
data banking
banking isis in
in the
the reliable
reliable inference
inference of
of medical
medical knowledge
knowledge
from
from primary
primary observational
observational data.
data.
\Ve see
see in
in the
the RADIX
RADIX Project
Project aa method
method of
of solution
solution to
to this
this problem
problem through
through the
the
utilization
of
knowledge
engineering
techniques
from
artificial
intelligence.
The
utilization of knowledge engineering techniques from artificial intelligence. The RADIX
RADIX
Project,
Project, in
in providing
providing this
this solution,
solution, will
will provide
provide an
an important
important conceptual
conceptual and
and technological
technological
link
link to aa large community
community of medical
medical research groups
groups involved
involved in the treatment
treatment and study
study
of the chronic
chronic arthritides
arthritides throughout
throughout the United
United States and Canada,
Canada, who
who are presently
presently
using the ARArvns
Data Bank
Ba.nk through
through the CIT
CIT facility
facility via
via TELENET.
TELENET.
ARAMIS Data
Beyond
Beyond the arthritis
arthritis centers which
which we have mentioned
mentioned in this
this report,
report, the TOD
TOD
(Time-Oriented
Data
(Time-Oriented
Data Base) User Group
Group involves
involves aa broad
broad range of university
university and
community
community medical
medical institutions
institutions involved
involved in the treatment
treatment of cancer, stroke,
stroke, cardiovascular
cardiovascular
disease,
disease, nephrologic
nephrologic disease,
disease, and others.
others. Through
Through the RADIX
RADIX Project,
Project, the opportunity
opportunity
will
will be
be provided
provided to foster
foster national
national collaborations
collaborations with
with these research groups
groups and to
provide
provide aa major
major arena in which
which to demonstrate
demonstrate the utility
utility of artificial
artificial intelligence
intelligence to
clinical
clinical medicine.
medicine.

C. Recommendation8
for Re80urce
Recommendations for
Resource Development
Development
The
The on-going
on-going acquisition
acquisition oi
oi personal
personal work-station
work-station Lisp
Lisp processors is a very
very positive
positive
step.
step, as
a.s these provide
provide an
a.n excellent
excellent environment
environment for
for program
program development,
development, and can serve
as
as a vehicle
vehicle for
for providing
providing programs
programs to
to collaborators
collaborators at other
other sites. Continued
Continued acquisitions
acquisitions
are very
very desirable.
desirable.
Another
Another resource that
that would
would be highly
highly desirable
desirable is a faster
faster and more
more reliable
reliable means
for
programs interactively
between SUMEX
for transferring
tra.nsferring data
data and programs
interactively between
SUMEX and the CIT
CIT IBM
IBM 370.
The
The addition
addition of
of a reliable
reliable local network
network facility
facility would
would greatly
greatly facilitate
facilitate our
our ability
ability to
to
transfer
patient files from
transfer patient
from CIT
CIT to
to SUMEX.
SUMEX.

E. A.
A. Feigenbaum
Feigenbaum
E.

128

National
National AIM
AIM Projects
Projects

5P41
51’41 RROOi85-11
RR00785-11

ll.A.2.
AIM
II.A.2. National
National
AIM Projects
Projects
The following
following group
group of projects
projects isis formally
formally approved
approved for access
access to the AIM
AIM aliquot
aliquot
of the SUMEX-AIM
Their access
access isis based
based on review
review by the AIM
AIM Advisory
Advisory
SUMKX-AIM resource. Their
Group
Group and approval
approval by the AIM
AIM Executive
Executive Committee.
Committee.
In addition
addition to the progress reports
reports presented
presented here, abstracts
abstracts
its individual
users
are
submitted
on
a
separate
Scientific
individual users
submitted
Scientific Subproject
Subproject

129

for each project
project and
Form.
Form.

A. Feigenbaum
Feigenbaum
E. A.

CADUCEUS
CADUCEUS Project
Project

5P41
5P41 RR00785-11
RR00785- 11

ll.A.2.1.
CADUCEUS
Project
II.A.2.1.
CADUCEUS
Project

CADUCEUS
CADUCEUS Project
Project
J. D.
D. Myers,
Myers, M.D.
M.D. and
and Harry
Harry E.
E. Pople,
Pople, Jr.,
Jr., Ph.D.
Ph.D.
University
University of
of Pittsburgh
Pittsburgh
Decision
Decision Systems
Systems Laboratory
Laboratory
Pittsburgh,
Pittsburgh, Pa.,
Pa., 15261
15261

I. Su.M:MARY
SUMMARY OF
OF RESEARCH
RESEARCH PROGRAM
PROGRAM
A.
A. Projcct
Project rationalc
rationale
The
The principal
principal objective
objective of this
this project
project is the development
development of a high-level
high-level computer
computer
diagnostic
diagnostic program
program in the broad
broad field
field of internal
internal medicine
medicine as
as an aid in the solution
solution of
complex
To be effective,
effective, the program
program must
must be
complex and complicated
complicated diagnostic
diagnostic problems.
problems. To
~apable
capa.ble of multiple
multiple diagnoses (related
(related or independent)
independent) in aa given patient.
patient.
A
A major
major achievement
achievement of this
this research undertaking
undertaking has been the design of
of a
program
along with
progra.m called INTERNIST-I,
INTERNIST-I,
with an extensive
extensive medical
medical knowledge
knowledge base.
base. This
This
program
program has been
been used
used over
over the past
past decade to analyze
analyze many
many hundreds
hundreds of
of difficult
difficult
diagnost.ic
diagnost,ic problems
problems in t.he
t,he field
field of internal
internal medicine.
medicine. These problem
problem cases
cases have included
included
cases
journals (particularly
Case Records
of the Massachusetts
Massachusetts
cases publisht"cl
published in medical
medical journals
(particularly
Records of
General
CPCs, and unusual
unusual problems
problems
General Hospital,
Hospital, in the New England
England Journal
Journal of Medicine),
Medicine), CPCs,
of
patients
in
our
Medical
Center.
In
most
instances,
but
by
no
means
all,
INTERNIST-I
of patient,s
our Medical Center. In most
but by
all, INTERNIST-I
has performed
high-lighted
performed at the level of the skilled
skilled internist,
internist, but
but the
the experience has high-lighted
several areas (or
for improvement.
improvement.

B. Medical
Afcdical Releoance
Relevancc and
and Collaboration
B.
Collaboration
The progra.m
program inherently
inherently has direct
direct and substantial
medical relevance.
relevance.
The
substantial medical
The institution
institution of
of collaborative
collaborative studies
with other
other institutions
institutions has been deferred
deferred
The
studies with
pending completion
completion of
of t,he
the programs
programs and knowledge
knowledge base enhancements
enhancements required
required for
for
pending
CADUCEUS. The
The insta.llation
installation of
of our
our own,
own, dedicated
dedicated VAX
VAX computer
computer can be expected
expected to
to
CADUCEUS.
considerably any
any future
future collaboration.
collaboration.
aid considerably
HI~ghlights of
of Research
Rcsearch Progress
Progrcss
C. Highlights

----Accomplishments this
this past
past year
year
----Accomplishments
In a previous
previous progress
progress report
report the
the concept
concept of
of “facets”
"facets" of
of diseases
diseases was
was introduced
introduced
In
and the
the need of
of CADUCEUS
CADUCEUS to
to proceed
proceed from
from broad
broad pathophysiopathophysio- logical
logical and
and pa.thopathoand
biochemical
concepts
t,o
specific
disease
processes
was
emphasized.
The
need
for
better
biochemical concepts t,o specific
emphasized. The
for better
represent,ation of
of a.natomical
anatomical information
information and
and for
for better
better time
time representation
representation were
were pointed
pointed
represent,ation
out.
out.
Drs. hliller
rvliller and
and Myers
Myers have
have continued
continued in
in the
the development
development of
of a new
new format
format for
for the
the
Drs.
CADUCEUS knowledge
knowledge base.
major goal
goal in
in making
making the
the transit,ion
transition from
from the
the
CADLiCEUS
AA major
INTEH0:IST-I knowledge
knowledge base to
to that
that of
of CADUCEUS
CADUCEUS ha.s
has been
been to
to insure
insure that
that there
there is
INTERNIST-I
continuity between
between the
the two:
two: The
The CADUCEUS
CADUCEUS knowledge
knowledge base
base will
will be
be derived
derived from
from the
the
cont.inuity
information in
in the
the INTERNIST-I
INTERNIST-I knowledge
knowledge base,
base, with
with significant
significant additions
additions made
made as
as
information
necessary.
necessary.
A. Feigenbaum
Feigenbaum
E. A.
E.

130

CADUCEUS
CADUCEUS Project
Project

5P41
5P41 RR00785-11
RRO0785-11

A screen-oriented
screen-oriented editor
editor program
program for entering
entering and manipulating
manipulating the knowledge
knowledge
base was written
writt.en in Franz
Franz Lisp. Using the editor,
editor, aa total
total of 52
52 diagnosis nodes have been
been
base
crea.ted and aa total
total of 282
282 findings
findings have been defined. Due to the more complex
complex nature
nature of
created
finding, the 282
282 findings
findings represent
represent over
over 600 old INTERNIST-I
INTERNIST-I style
style manifestations.
manifestations.
aa. finding.
In the CADUCEUS
CADUCEUS knowledge
knowledge base,
base, the basic unit
unit of observational
observational in- formation
formation is
is
Unlike an INTERNIST-I
INTERNIST-I
manifestation, aa finding
finding can be
be assigned aa
called aa. finding.
Unlike
manifestation,
finding.
status within
within aa given
given patient,
patient, either
either "normal"
“normal” or any number
number of forms
forms of
of abnormal.
abnormal. For
For
status
example,
example, the status
status of the finding
finding "heart
“heart murmur"
murmur” can be either
either absent (normal)
(normal) or
Various qualifiers
qualifiers are allowed
allowed to modify
modify aa finding.
finding.
For the finding
finding "heart
“heart
present.
For
present. Various
patient, aa user might
might specify
specify that
that it is
is heard at the second left
left
murmur",
murmur”, in aa specific patient,
interspace.
systole only,
only, that
that it
it is blowing,
blowing, that
that
interspace. that
that it
it is
is systolic,
systolic, that
that it
it is heard
heard in early
early systole
is crescendo- decrescendo. For
For findings
findings whose values vary
vary
it
it is
is grade
gra,de 2 of 6, and its shape is
numerically,
numerically, (e.g. SGOT-blood)
SGOT-blood) the units
units of measurement
measurement and the normal
normal range are
specified so
so that
that aa. user may
may simply
simply enter
enter aa number
number as
as the result
result of the test.
test,
specified
The
The concept
concept of a disease
disease profile
profile has been carried
carried over
over from
from the INTERNIST-I
INTERNIST-I
knowledge
knowledge base.
bsse. However,
However, there
there are three
three separate
separate diagnostic
diagnostic node types
types represented
represented in
the CADUCEUS
CADUCEUS knowledge
knowledge base:
base: the disease,
disease, the facet, and the subdivision.
subdivision. A
A disease
disease is
ent,ity whose presence should
should be reported
reported if
if detected
detected in a patient,
patient, and conceptually
conceptually
an entity
corresponds
correeponds to the diseases
diseases mentioned
mentioned in the separate chapters
chapters of standard
standard medical
medical
text.bnoks. A
A subdivision
subdivision is either
either a specific
specific subtype
subtype of
of a disease
disease (e.g. hepatitis
hepatitis B is a
textbooks.
subt.ype
subtype of acute viral
viral hepatitis,
hepatitis, a disease)
disease) or
or a major
major specific
specific organ
organ system
system involvement
involvement
by a multisystem
multisystem disease
disease (e.g. lupus
lupus nephritis
nephritis and lupus
lupus cerebritis
cerebritis are subdivisions
subdivisions of
of the
disease system
system lupus
lupus erythematosus).
erythematosus).
disease
The
The internal
internal organization
organization of
of disease,
disease, facet
facet and subdivision
subdivision profiles
profiles is identical.
identical.
Apart
Apart from
from links
links to other
other nodes, there
there are nine essential components
components to
to each profile:
profile:
disea.:5e
disea.se para.meters
para.meters (e.g. prevalence
prevalence of
of disease,
disease, specific
specific sites it
it effect,s);
effects); demographic
demographic
information about
about patients
patients with
with the disease;
disease; general
general predisposing
predisposing factors
factors (which
(which are
information
interciepencient.
interdependent, only
only one of
of which
which is likely
likely to
to be present);
present); independent
independent risk
risk factors
factors (which
(which
often
often co-exist
co-exist, synergist.ically);
synergistically); general
general findings
findings caused by
by the illness; specific
specific findings
findings which
which
relatively unique
unique to the disease
disease process; characteristic
characteristic findings
findings (e.g. a positive
positive throat
throat
are relatively
culture
culture for
for beta
beta hemolytic
hemolytic streptococcus
streptococcus in streptococcal
streptococcal pharyngitis);
pharyngitis); academically
aca.demically known
known
but
but clinically
clinically contraindicont,raindi- cated findings
findings (e.g. one should
should not
not do a renal
renal biopsy
biopsy in patients
patients
wit,h renal
renal leptospirosis,
lept,ospirosis, but
but we know
know what
what the biopsy
biopsy will
will show if
if it
it is done anyway);
anyway); and
with
manifestations whose presence make
make the
the diagnosis
diagnosis untenable
untenable (e.g. male sex makes
makes
manifest,ations
pregnancy
pregnancy an invalid
invalid consideration).
consideration). In
In addition
addition to
to the aforementioned
aforementioned work
work in internal
internal
medicine.
medicine. Drs.
Drs. Gordon
Gordon Banks
Banks and
a.nd John
John Vries
Vries have beeu
been working
working on the development
development of
of a
neurological diagnostic
diagnostic component
component for
for CADUCEUS.
CADUCEUS.
Dr. Banks
Banks has developed
developed a
neurological
Dr.
neurnanatomic
1,000 neuroanatomic
neuroanat.omic database
database which
which contains
contains spatial
spatial descriptors
descriptors for
for nearly
nearly 1,000
neuroanatomic
st,ructures
structures and contains
contains information
information as to
to their
their blood
blood supply,
supply, and function.
function. This
This database
database
will allow
allow a.na.tomic
anatomic loca.lization
localization of
of neurologic
neurologic lesions. Some of
of this
this work
work for
for the
the peripheral
peripheral
will
nervous
nervous system
system has been done previously
previously by
by students
students in our
our laboratory.
laboratory. The
The approach
approach to
to
central nervous
nervous syst.em
system hss
has been to
to design a set of
of “symbolic
"symbolic coordinates”.
coordinates". In
the cent.ral
const0uct
ing the neuroanat,omic
constructing
neuroanatomic database,
database, the human
human body,
body, including
including the
the nervous
nervous system,
system,
conceptually partitioned
partitioned into
into a set of
of cubes (boxes). The
The largest
largest cube, containing
containing the
the
is conceptually
ent,ire body.
body. is 2.187m
2.187m on a side. This
This cube is divided
divided into
into 27 smaller
smaller cubes, each 729mm
729mm
on a side. Each
Each of
of the
the smaller
smaller cubes is likewise
likewise subdivided
subdivided until
until finaliy
finally cubes that
that are
each lmm
1mm on a side are reached.
reached. Thus
Thus any
any cube has neighbors
neighbors (of
(of equal size) rostra],
rostral,
caudal,
caudal. vent,ral,
ventral. dorsal,
dorsal, left,
left, and right
right of
of it,
it, as well
well as a “pa,rent”
"parent" cube which
which contains
contains it,
it,
and “dsught,er”
"dallghter" cubes which
which itit contains.
contains. Each
Each of
of these cubes has the
the potential
potential for
for being
being
represented inside
inside the
the computer
computer program
program with
with a unique
unique name (known
(known as an atom
atom in LISP.
LISP.
represrnted
which the
the dat,abase
database is programmed).
programmed). Attached
Attached to
to each cube LISP
LISP atom
atom
the language in which

131

A. Feigenbaum
Feigenbaum
E. A.

5P41
5P4 1 RR00785-11
RR00785- 1 I

CADUCEUS
CADUCEUS Project
Project

a.re
are lists of all of the anatomic
anatomic structures
structures that
that are completely
completely and partially
partially contained
contained
within
within the cube, as
as well
well as
as the blood supply
supply to the region. This
This structure
structure facilitates
facilitates rapid
rapid
retrieval
retrieval of the location
location of aa given anatomic
anatomic structure
structure as
as well
well as
as rapid
rapid localization
localization of
possible areas
areas of involvement
involvement when there isis evidence of dysfunction
dysfunction of one or more neural
neural
systems.
The
The hierarchical
hierarchical arrangement
arrangement of
of the nested cubes ensures rapid
rapid convergence during
during
searches,
searches, because
because if the sought
sought object
object is
is not
not found
found in a parent
parent cube, there
there is no need to
search for
for it
it in any of the patient's
patient’s children
children cubes.
cubes. The
The addition
addition of anatomic
anatomic reasoning
reasoning
may
may allow
allow parsimonious
parsimonious explanation
explanation of multiple
multiple manifestations
manifestations arising
arising from
from a single
lesion.
lesion, or allow
3,110~the program
program to query
query the user regarding
regarding the presence of manifestations
manifestations of
involvement
involvement of areas that
that might
might be
be expected to be affected
affected by whatever
whatever clinical
clinical state
st.ate the
program
pr0gra.m has under
under current
current consideration.
consideration.
Dr. Vries
Vries has developed an imaging
imaging system
system using "octree
“octree encoding"
encoding” to reconstruct
reconstruct
n-dimensional
n-dimensional images of
of the database as
as well
well and images of
of patients
patients acquired
acquired by
by CT,
CT,
NMR,
NMR, and
a.nd other
other neuroimaging
neuroimaging techniques.
techniques. Combining
Combining the database with
with the imaging
ima.ging
system
correlation
system may
may open new areas
a,rea.s of
of research, including
including clinical-pathological
clinical-pathological
correlation of
of
imaged lesions with
with symptoms,
symptoms, signs, and affected
affected structures,
structures, automated
automated reading
reading of
of
images, etc.
Dr.
Dr. r-.mler
Miller in the last
last year
year completed
completed work
work on a sub-project
sub-project of
of CADUCEUS,
CADUCEUS, called
called
CPCS.
CPCS. He received
received support
support for
for this
this work
work from
from the National
National Library
Library of Medicine
Medicine New
Investigator
lnvest.igator Program.
Program. The
The original
original objective
objective of the project
project was to create a program,
program,
CPCS
CPCS (for
(for Computer-based
Comput,er-based Patient
Patient Case Simulator),
Simulator), to aid in the teaching
teaching of diagnosis to
medical
knowledge
medical students.
st,udents. The
The INTERNIST-I/CADUCEUS
INTERNIST-I/CADUCEUS
knowledge base
base was to be used as
as the
source of
program's
of the program’
s medical
medical expertise.
expertise. This
This overall
overall goal has been accomplished,
accomplished, and
the
program CPCS
AX-II/780 using Franz
t.he program
CPCS exists and runs
runs on our
our V
VAX-h/780
Franz Lisp.
Lisp. The
The CPCS
CPCS
project was a feasibility
project
feasibility study
study to demonstrate
demonstrate that
that it
it is possible to
to construct
construct a general
general
case
The
project has been successful in that
program has been
The project
that the CPCS
CPCS program
case simulator.
simulator.
written,
and
runs
quite
well
in
its
small
test
domain.
But
there
is
room
writt)en,
runs quite well
its small test domain. But there
room for
for the
the future
future
development
development. of
of CPCS.
CPCS. Further
Further construction
construction of
of the CADUCEUS
CADUCEUS knowledge
knowledge base, in areas
beyond the current
beyond
current set of
of liver
liver diseases,
diseases, will
will significantly
significantly improve
improve the utility
utility of
of the CPCS
CPCS
program. As additional
a.ddit.ional capabilities
capabilities are added to CADUCEUS,
CADUCEUS, the corresponding
corresponding changes
program.
will
will be made in CPCS.
CPCS.
The
medical knowledge
continued to
grow both
both in the incorporation
of
The medical
knowledge base has continued
to grow
incorporation of
modification of
of diseases
diseases already
already profiled
profiled so as to
to include
include recent
recent
diseases and the modification
new diseases
medical knowledge.
knowledge. The
The knowledge
knowledge base of
of 3/l/84
3/1/84 includes
includes 591
591 individual
individual
advances in medical
disease profiles,
profiles, 4,040 manifestations
manifestations of
of disease,
disease, and
and about
about 3,500 ”"links"
or
disease
links” or
interrelationships among
among diseases
diseases as well
well as a myriad
myriad of
of miscellaneous
miscellaneous pieces of
of
int,errelnt,ionships
informat.ion which
which are essential
essential for
for the
the correct
correct operation
operation of
of the
the system.
system. Twenty
Twenty new
information
profiled during
during the
the past
past year
year and the
the pediatrics
pediatrics knowledge
knowledge base
base has
diseases have been profiled
diseases
continued to
to grow.
grow.
continued
Recently the
the medical
medical knowledge
knowledge base (but
(but not
not yet
yet the
the diagnost.ic
diagnostic program)
program) has been
Recently
made available
ava.ilable on line for
for use of
of the medical
medical house staff
staff at
at Presbyterian-University
Presbyterian-University
ma.de
Hospit.al, our
our main
ma.in t,eaching
t.eaching hospital
hospital in internal
internal medicine,
medicine, and
and at
at an affiliated
affiliated community
community
Hospital,
hospit.al in Pit.tsburgh,
Pit.tsburgh, the
t.he Shadyside
Shadyside Hospital
Hospital which
which operates
operates a residency
residency program
program in
hospit,al
internal medicine.
medicine. Preliminary
Preliminary reports
reports indicate
indicate that
that the
the residents
residents find
find the
the knowledge
knowledge base
internal
useful.
useful.
----Research in progress
progress
----Research
There are five
five major
major components
components to
to the
the continuation
continuation of
of this
this research
research project:
project:
There

A. Feigenbaum
Feigenbaum
E. A.

132

CADUCEUS Project
Project
CADUCEUS

5P41
5Pdl RR00785-11
RR00785-11

1.
refinement and testing
testing of
of the
the extensive
extensive
1. The
The enlargement,
enlargement, continued
continued updating,
updating, refinement
medical
medical knowledge
knowledge base
base required
required for
for the operation
operation of
of INTERNIST-I.
INTERNIST-I.
2.
?. The
The completion
completion and implementation
implementation of
of the improved
improved diagnostic
diagnostic consulting
consulting
program,
which
CADUCEUS,
which has been designed to
to overcome
overcome certain
certain
program, CADUCEUS,
performance problems
performance
problems identified
identified during
during the
the past
past years
years of
of experience with
with the
the
origina.l
original INTERNIST-I
INTERNIST-I program.
program.
3. Institution
Institution of
of field
field trials
trials of
of CADUCEUS
CADUCEUS on the clinical
clinical services in internal
internal
medicine
medicine at the Health
Health Center
Center of
of the
the University
University of
of Pittsburgh.
Pittsburgh.
4. Expansion
Expansion of
of the clinical
clinical field
field trials
trials to other
other university
university health
health centers
centers which
which
ha.ve
have expressed interest
interest in working
working wit,h
with the system.
system.
5. Ad:tptation
program and data
Adaptation of
of the diagnostic
diagnostic program
data base
base of
of CADUCEUS
CADUCEUS to
SI\
bserve educational
pl1rposes and the evaluation
of clinical
clinical performance
performance and
subserve
educational purposes
evaluation of
competence.
compet,ence.
Current
Current, activity
activity is devoted
devoted mainly
mainly to
to
development
of
the
medical
knowledge
development of
medical knowledge
diagnostic
program.
diagnostic consulting
consulting program.

the first
first
base,
base, and

two
two
the

of
of these, namely,
namely,
implementation
implementation of
of

the
the

continued
continued
improved
improved

D. List
List of
publications
D.
of relevant
relevant publicationa
Harry
E.:Knowledge-based Expert
Expert Systems:
The Buy
Buy or
Build
Harry E.:Knowfedge-based
Systems:
or Build
Derision IN
Reitman
INTELLIGENCE
Derision
IN Walter
Walter
Reitman (Ed.),
(Ed.), ARTIFICIAL
ARTIFICIAL
INTELLIGENCE
APPLICATIONS
FOR
Proceeding
NYU Symposium.
APPLICATIONS
FOR BUSINESS.
BUSINESS.
Proceeding of
of the NYU
Symposium.
Ablex
Ablex Pub.
Pub. Corp.,
Corp., May
May 1983,
1983, pp. 23-40.

1.
I. Pople,

2.
Artificial Intelligence
Intelligence and
Medical Education
Education The
3. ~1yers,
Myers. J.D.:
J.D.: Artificial
and MedicaI
The Medical
Medical
Journal,
Journal, St. Joseph Hospital,
Hospital, Houston.
Houston. Vol.
Vol. 18, December
December 1983, pp. 193-202.

E. Fwnding
Funding support
support
E.

1. Clinical
Decision
Clinical
Decision Systems Research
Research Resource
Resource
Harry
Harry E. Pople,
Pople, Jr.,
Jr., Ph.D.
Associate
Associate Professor
Professor of
of Business
Business
Jack D. Myers,
Myers, M.D.
University
Professor
University
Professor (MediCine)
(Medicine)
University
Uni versi ty of
of Pittsburgh
Pittsburgh
Division
Division of
of Research
Research Resources
Resources
National Institutes
National
Institutes of
of Health
Health
5 R24
R24 RROllOI-07
RROllOl-07
07/01/80 - 06/30/85 - $1,607,717
$369,484
07/01/83 - 06/30/84 - $369,484

2.

CADUCEUS:
CADUCEUS: A
A Computer-Based
Computer-Based Diagnostic
Diagnostic Consultant
Consultant
Harry
Harry E. Pople,
Pople, Jr.,
Jr., Ph.D.
Associate
Associate Professor
Professor of
of Business
Business
Jack D. Myers,
Myers, M.D.
H.D.
University
Professor
University
Professor (Medicine)
(Medicine)
University
of
University
of Pittsburgh
Pittsburgh
National Library
National
Library of
of Medicine
Medicine
National Institutes
of
National
Institutes
of Health
Health

133

E. A. Feigenbaum
Feigenbaum

5P41 RR00785-11
RR00785- 11

CADUCEUS
CADUCEUS Project
Project

R01 LM03710-04
5 ROI
o;r/ol/ao
06/30/85
$817,884
07/01/80
-- 06/30/85
-- $817,884
07/01/83
C6/30/84 -- $196,710
$196,710
07/01/83
-- 06/30/84

3.

Neurologic Consultation
Computer Program
Neurologic
Consultation
Gordon E. Banks,
Banks, M.D.
Assistant
Assistant Professor
Professor of
of Medicine
Medicine
National
Library of
of Medicine
Medicine - New
New Investigator
Investigator
National Library
National
Institutes
of Health
Health
of
National Institutes
5 R23 LM03889-02
Lb+03889-02
04/01/82 - 03/31/85
03/31/85 - $107,675
$107,675
04/01/82
04/oi/a3
03/31/84
$35,975
04/01/83
-- 03/31/84
-- $35,975
04/oi/a4
03/31/85
$35,975
04/01/84
-- 03/31/85
-- $35,975

ll.
WITH
II. INTERACTIONS
INTERACTIONS
WITH THE
THE SUMEX-AIM
SUMElX-AIM RESOURCE
RESOURCE
A,B.
Collaborations and
and Program
Ka SUMEX
SUMEX
A,B. Medical
Afedical Collaborations
Program Dissemination
Dissemination Via
CADUCEUS
CADUCEUS remains
remains in a stage of
of research and development.
development. As noted
noted above, we
are continuing
cont,inuing to
to develop
develop better
better computer
computer programs
programs to operate
operate the diagnostic
diagnostic system,
system, and
a.nd
t.he
t,he knowledge
knowledge base
base cannot
cannot be used very
very effectively
effectively for
for collaborative
collaborative purposes
purposes until
until it
it has
critical stage of
of completion.
completion.
factors have stifled
stifled collaboration
collaboration via
via
reached a critical
These factors
SUI\!EX
SUMEX up to this
this point
point and will
will continue
continue to
to do so for
for the next
next year
year or
or two.
two. In
In the
the
meanwhile, through
through the SUMEX
SUMEX community
community there
there continues
continues to
to be an exchange of
of
meanwhile.
information and states
states of
of progress. Such interactions
interactions particularly
particularly take
take place at
at the
information
annual AIM
AIhI Workshop.
Workshop.
annual

Critique of
of Resource
C. Critique
Resource Management
Jvfanagement

sur-.mx
Our
SUhlEX has been an excellent
excellent resource for
for the development
development of
of CADUCEUS.
CRDUCEUS.
Our
large program
program is handled
handled efficiently,
efficiently, effectively
effectively and accurately.
accurately. The
The staff
staff at SUMEX
SUMEX have
uniformly supportive,
supportive, cooperative,
cooperative, and innovative
innovative in connection
connection with
with our
our project's
projectA’s
been uniformly
needs.
needs.
III. RESEARCH
RESEARCH PLANS
PLANS
ill.
A.
Goals and
and Plans
A. Project
Project Goals
Plan8
Continued
Continued effort
effort to complete
complete the medical
medical knowledge
knowledge bases
bases in internal
internal medicine
medicine and
pediatrics
pedintrics will
will be pursued
pursued including
including the incorporation
incorporation of
of newly
newly described
described diseases
diseases and new
or
or altt'n'd
alt,ered medical
medical information
information on "old"
“old” diseases.
diseases. The
The latter
latter two
two activities
activities have proven
proven to
to
Profiles of
of added diseases
diseases plus other
other
be more formidable
formidable than
t,ha.n originally
originally conceived.
conceived. Profiles
informat,ion
information is first
first incorporated
incorporated into
into the medical
medical knowledge
knowledge base
base at SUMEX
SUMEX before
before being
being
transferred
into
This
tratlferred
into our
our newer
newer information
information structures
structures for
for CADUCEUS
CADUCEUS on the
t,he VAX.
VAX.
This
sequence retains
retains the operative
operative capability
capability of
of INTERNIST-I
INTERNIST-I as
as a computerized
computerized "textbook
“textbook of
of
sequence
medicine"
medicine” for educational
educstionsl purposes.

Justification and
Requirements for
for Continued
B. Justification
and Requirements
Continued SUMEX
SUME. Use
Our
A..X.
Our use
use of
of SUMEX
SUMEX will
will obviously
obviously decline
decline with
with the installation
installation of
of our
our V
VAX
Nevertheless. the excellent
Nevertheless,
excellent facilities
facilities of
of SUMEX
SUMEX are expected to
to be used for
for certain
certain
developmental
at SUMEX
development,al work.
work. It
It is intended
intended for
for the present
present to keep INTERNIST-I
INTERNIST-I
SUMEX for
for
Our team
team hopes to remain
remain as
as a
compara.tive
compsra,tive use
use as
as CADUCEUS
CADUCEUS is developed here. Our
component, of
of the SUMEX
SUhlEX community
community and to
to share experiences and developments.
developments.
component.
E. A. Feigenbaum
Feigenbaum

134

CADUCEUS
CADUCEUS Project
Project

5P41 RR00785-11
RR00785-11

C. Need.~
Need.~ and
Plans for
for Other
Re80urces Beyond
Beyond SUAfEX-ALM
and Plans
Other Computing
Computing Resources
SUMEX-AIM
Our
predictable needs in this
by the dedicated
~'C computer
Our predictable
this area will
will be met
met by
dedicated V
VAX
computer
recently
recently installed.
inst,alled.

D. Recommendations
Recommendations for
for Future
Future Community
Resource Development
Development
D.
Community and
and Resource
\\llether a program
program like
better operated
Whether
like CADUCEUS,
CADUCEUS, when
when mature,
mature, will
will be better
operated from
from
centralized,
personal computers
centra.lized, larger
larger computers
computers or
or from
from the developing
developing self contained
contained personal
computers is
difficult
predict. For
difficult to
t,o predict.
For the foreseeable future
future it
it would
would seem that
that centralized,
centralized, advanced
advanced
fa,cilit·ies
like
SUtvlEX
will
be
important
in
further
program
development
fscilit,ies like SIJMEX will
important
further prograrn development and refinement.
refinement.

135
135

E. A.
A. Feigenbaum
Feigenbaum

CLIPR
CLIPR -- Hierarchical
Hierarchical Models
Models of
of Human
Human Cognition
Cognition

5P41
5P41 RR00785-11
RR00785- I I

II.A.2.2.
CLIPR
Models
II.A.2.2.
CLIPR -- Hierarchical
Hierarchical
Models of
of Human
Human Cognition
Cognition

Hierarchical
Hierarchical Models
Models or
of Human
Human Cognition
Cognition (CLIPR
(CIJPR Project)
Project)
Walter
Kintsch
and
Walter
Kintsch
and Peter
Peter G.
G. Polson
Polson
University
University or
of Colorado
Colorado
Boulder,
Colorado
Boulder,
Colorado

I.
OF
PROGRAM
I. SUM:MARY
SUMMARY
OF RESEARCH
RESEARCH
PROGRAM
A. Project
Project Rationale
Rationale
The
The two
two CLIPR
CLIPR projects
projects have made progress during
during the last year.
year. The
The prose
comprehension
project
has
completed
one
major
project,
and
is
designing
comprehension project has completed
major project,
designing aa prose
comprehension
comprehension model that
that reflects
reflects state-of-the-art
state-of-the-art knowledge
knowledge from
from psychology
psychology (van
(van Dijk
Dijk
&.
& Kintsch,
Kintsch, 1983) and artificial
artificial intelligence.
intelligence. During
During the last year, Polson,
Polson, in collaboration
collaboration
with
with Dr.
Dr. David
David Kieras
Kieras of the University
University of Arizona
Arizona have h'ave
have continued
continued work
work on a project
project
studying
st,udying the psychological
psychological factors
factors underlying
underlying device complexity
complexity and the difficulties
difficulties that
that
nontechnically
nontechnically trained
trained individuals
individuals have in learning
learning to use
use devices like
like word
word processors.
They
They have develop formal
formal representations
representations of
of aa user's
user’s knowledge
knowledge of
of how
how of
of operate
operate a
device and of
of the user-device interface
interface (Kieras
(Kieras &
& Polson,
Poison, in Press) and have completed
completed
several experiment.s
experiment.s evaluating
evaluating their
their theory.
theory.

B. Tech.nical
Goals
Tech.nical Goals
The
first, the
the text
text comprehension
comprehension
The CLIPR
CLIPR project
project consists of
of two
two subprojects.
subprojects. The
The first,
project,
is
headed
by
\Valter
Kintsch
and
is
a
continuation
of
work
on
understanding
of
project,
by Walter Kintsch
continuation of work
understanding of
connected
laboratory for
for several
years.
connected discourse that
that has been underway
underway in Kintsch's
Kintsch’s laboratory
several years.
The second, the device complexity
complexity project
project is headed by
by Peter
Peter Polson
Polson in collaboration
collaboration with
with
The
Dayid Kierss
Kieras of
of the University
University of
of Arizona,
Arizona, Tucson.
Tucson. They
They are studying
studying the
the learning
learning and
and
David
problem solving
processes involved
involved in the utilization
utilization of
of devices like
like word
word processors or
or
problem
solving processes
complex computer
computer controlled
controlled medical
medical instruments
instruments (Kieras
(Kieras &
&. Poison,
Polson, in Press)
Press)
complex
The goal of
of the prose comprehension
comprehension project,
project is to
to develop
develop a computer
computer system
system
The
capable of
of the meaningful
meaningful processing
processing of
of prose. This
This work
work has been generally
generally guided
guided by
by the
the
ca,pa,ble
comprehension model
model discussed by
by Kintsch
Kintsch and van
van Dijk
Dijk (1978)
(1978), although
although our
our
prose comprehension
programming efforts
efforts have identified
identified necessary clarifications
clarifications and
and modifications
modifications in that
that
programming
model
(Miller
&.
Kintsch,
1980,
1981;
Kintsch
&
Miller,
1981;
Miller,
1982).
In
general,
model (Miller S: Kintsch,
Kintsch
Miller,
Miller, 1983). In general,
this research
research has emphasized
emphasized the
the importance
importance of
of knowledge
knowledge and
and knowledge-based
knowledge-based processes
this
in comprehension,
compn'hension, and
and we are accordingly
accordingly working
working with
with the
the AGE
AGE and
and UNITS
UNITS groups
groups at
at
SUMEX
toward
the
development
of
a
knowledge-based,
blackboard
model
of
prose
SU~IEX toward the development of
knowledge-based, blackboard model of
comprehension. \\‘
\Ve
to be able to
to merge the
the substantial
substantial artificial
artificial intelligence
intelligence research
research
comprehension.
e hope to
systems with
with psychological
psychological interpretations
interpretations of
of prose comprehension,
comprehension, resulting
resulting in
in a
on these systems
computational model
model that
that is also psychologically
psychologically respectable.
respectable.
computational
The goal
goal of
of the
the device
device complexity
complexity project
project is to
to develop
develop explicit
explicit models
models of
of the
the useruserThe
device interactsion.
interaction. They
They model
model the
the device
device as a nested
nested automata
automata and
and the
the user
user as a
device
production system.
system. These
These models
models make
make explicit
explicit kinds
kinds of
of knowledge
knowledge that
that are
are required
required to
to
production
operate different
different kinds
kinds of
of devices
devices and
and the
the processing
processing loads
loads imposed
imposed by
by different
different
operate
implementations of
of a device.
device. We
\Ve feel that
that tools
tools being
being developed
developed at
at SUMEX--in
SUMEX--in particular
particular
implementations
AGE and
and the
the UNIT
UNIT package--will
package--will dramatically
dramatically facilitate
facilitate our
our abilities
abilities to
to generate
generate such
such
AGE
models
of
the
user-device
interface.
models of the user-device interface.
E. A.
A. Feigenbaum
Feigenbaum
E.

136

SP41
5P41 RR00785-11
RROO785-11

CLIPR
CLIPR - Hierarchical
Hierarchical Models
Models of
of Human
Human Cognition
Cognition

C. Medical
Afedical Relevance
Relevance and
and Collaboration
Collaboration
The
The text
text comprehension
comprehension project
project impacts
impacts indirectly
indirectly on medicine,
medicine, as
as the medical
medical
profession is no stranger
problems of
profession
stranger to
to the problems
of the information
information glut.
glut. By
By adding
adding to
to the
research on how
how computer
computer systems
systems might
might understand
understand and summarize
summarize texts,
texts, and
determining
deterrnining ways
ways by
by which
which the readability
readability of
of texts
texts can be improved,
improved, medicine
medicine can only
only be
helped by research on how people understand
understand prose. Development
Development of
of a more
more thorough
thorough
understanding
processes responsible
underst,anding of
of the various
various processes
responsible for
for different
different types
types of
of learning
learning problems
problems
in children
children and the
the corresponding
corresponding development
development of
of a successful remediation
remediation strategy
strategy would
would
also be facilitated
facilitat,ed by
by an explicit
explicit theory
theory of
of the normal
normal comprehension
comprehension process.
Note that
particularly relevant
Not,e
that, our
our goal of
of a blackboard
blackboard model
model is particularly
relevant to
to the
understanding
understa.nding of
of learning
learning difficulties.
difficulties. One important
important aspect of
of a blackboard
blackboard model
model is the
separation
sepa.ra.tion of
of cognitive
cognitive processes
processes into
into a set of
of interacting
interacting subprocesses. Once such
subprocesses have been identified
identified and constructed,
constructed, it
it would
would be instructive
instructive to
to observe the
model's
performance when
processes are facilitated
model’s performance
when certain
certain of
of these processes
facilitated or
or inhibited.
inhibited. Many
Many
researchers have shown
shown that
that there
there are a variety
variety of
of cognitive
cognitive deficits
deficits (insufficient
(insufficient shortshortterm
poor long-term
can lead to
that
to
term memory
memory capacity,
capacit,y, poor
long-term memory
memory retrieval,
retrieval, and such) that
reading
problems. Having
power of
resding problems.
Having a blackboard
blackboard model
model in which
which the power
of individual
individual
components
components could
could be ma.nipulated
ma,nipulated would
would be a significant
significant step in determining
determining the
the nature
nature of
of
such ren.ding
problems.
reading problems.
The
project has two
The device complexity
complexity project
two primary
primary goals: the development
development of
of a
cognitive
cognitive theory
theory of
of user-device
user-device interaction
interaction in including
including learning
learning and performance
performance models,
and the
t,he development
development of
of a theoretically
theoretically driven
driven design process that
that will
will optimize
optimize the
relationships
relationships between
between device functionality
functionality and ease
ease of
of learning
learning and other
other performance
performance
factors
factors (Polson
(Polson & Kieras,
Kieras, 1983). The
The results
results of
of this
this project
project should
should be directly
directly relevant
relevant to
to
the
t.he design of
of complex,
complex, computer
computer controlled
controlled medical
medical equipment.
equipment. We
We are currently
currently using
using
word
word processors to
t,o study
study user-device
user-device interactions,
interactions, but
but principles
principles underlying
underlying use
use of
of such
devices should
should generalize
generalize to
to medical
medical equipment.
equipment.
Both
Both the text
text comprehension
comprehension project
project and the device complexity
complexity project
project involve
involve the
t#he
development
processes; cognitive
development of
of explicit
explicit models
models of
of complex
complex cognitive
cognitive processes:
cognitive modelling
modelling is a
stated
NIMH.
stated goal of
of both
both SUMEX
SUMEX and research supported
supported by
by NIMH.
The
The on-going
on-going development
development of
of the prose comprehension
comprehension model
model would
would not
not be possible
without
without, our
our colla.boration
colla.borat*ion with
with the AGE
AGE and UNITS
UNITS research groups.
groups. We
We look
look forward
forward to
a continued
Several
continued collaboration,
colla.boratlion, with,
with, we hope, mutually
mutually beneficial
beneficial ,results.
results.
Several other
other
psyC'hologists h3\'e
psychologists
ha,ve either
eit,her used or
or shown
shown an interest
interest in using an early
early version
version of
of the prose
romprehension
comprehension model.
model, including
including Alan
Alan Lesgold
Lesgold of
of SUMEX's
SUMEX’s SCP project,
project, who
who is exporting
exporting
the system
system to the LRDC
LRDC Vax.
Vax. \Ve
We have also worked
worked with
with James Greeno
Green0 -- another
another
member
project -- on a project
project that
rnember of
of the SCP
SCP project
that will
will integrate
integrate this
this model
model with
with models of
of
problem solving
problem
solving developed
developed by
by Greeno
Green0 and others
others at the University
University of
of Pittsburgh.
Pittsburgh.
:\'eedless
Needless to sa.y,
say, all of
of this
this interaction
interaction has been greatly
greatly facilitated
facilitated by
by the local and
network-wide
network-wide communication
communication systems supported
supported by
by SUMEX.
SUMEX. There
There has
ha.s been considerable
considerable
commnnication
communica.t.ion bet.ween
bet,ween members
members of
of the prose comprehension
comprehension and AGE/UNITS
AGE/UNITS groups
groups as
as
program bugs have been discovered
program
discovered and corrected;
corrected; the presence of
of a mail
mail system
system has made
this
this process infinitely
infinitely easier than
than if
if telephone
telephone or
or surface
surface mail
mail messages
messages were required.
required. The
The
mail
professional contacts
mail system,
system, of
of course, has also enabled us to
to maintain
maintain professional
contacts established
established at
conferences
conferences and other
other meetings,
meetings, and to
to share and discuss ideas with
with these contacts.
contacts.

137
137

E. A.
A. Feigenbaum
Feigenbaum

CLIPR
CLIPR - Hierarchical
Hierarchical Models of Human
Human Cognition
Cognition

5P41 RR00785-11

D.
D. Progress
Progress Summary
Summary
The prose comprehension
comprehension project
project has compieted
compieted an initial
initial version
version of aa model of
prose comprehension
1980).
comprehension (Miller
(Miller 8l
& Kintsch,
Kin&h,
1980). This
This model has been applied
applied to aa large
number
of
texts,
and
has
yielded
quite
reasonable
predictions
of
reca.ll
number
yielded quite
predictions
recall and readability.
readability.
Psychologists
from
other
universities
have
used
this
system
to
derive
Psychologists from other universities
this system
derive reading
reading time
time and
recall
recall predictions
predictions for
for their
their own experimental
experimental materials.
materials. We
We are currently
currently using the AGE
AGE
and UNITS
UNITS packages to extend
extend this
this model
model toward
toward one that
that can make use
use of world
world
knowledge
knowledge in its analyses; this
this model
model is discussed in Miller
Miller and Kintsch
Kintsch (1981) and Miller
Miller
(1982). It
It is further
further developed in van Dijk
Dijk and Kintsch
Kintsch (1983) has been applied
applied to the
domain
domain of word
word arithmetic
arithmetic problems
problems in our
our most
most recent
recent work
work (Kintsch
(Kintsch and Greeno, in
Press).
The
The device complexity
complexity project
project is in it's
it’s third
third year.
year. "We
We have developed an explicit
explicit
model for
for the knowledge
knowledge structures
structures involved
involved in the user-device
user-device interaction,
interaction, and we are
developing
Our
Our preliminary
preliminary theoretical
theoretical results
results are described
described in
developing simulation
simulation programs.
programs.
Kieras
Kieras &,
& Polson
Polson (in
(in Press). We
We have also completed
completed several experiments
experiments evaluating
evaluating the
theory.
theory.

E. List
List of
of Relevant
Relevant Publications
Publication8
1.
1.

Kieras,
Kieras, D.E.
D.E. and Polson, P.G.:
P.G.: An
An outline
outline of
of a theory
theory of
of the
the user
u8er complexity
complezity
of
of devices and sy.'Jtems.
sy.stem.9. Working
Working Paper
Paper No.1,
No. 1, Device Complexity
Complexity Project,
Project,
Universities
Iiniversities of
of Arizona
Arizona and Colorado,
Colorado, May,
May, 1982.

2. Kieras,
formal analysis
Kieras, D.E.
D.E. and Polson,
Poison, P.G.:
P.G.: The formal
analyclis of
of user
u8er complexity.
complexity. Int.
ht.
J. Man-f\.lachine
Studies,
In
Press.
Man-Machine Studies, In
3. Kintsch,
tiint,sch, "V.
W. and
a.nd van
van Dijk,
Dijk, T.A.:
T.A.: Toward
Toward a model
model of
of text
tezt comprehension
comprehension and
and

production. Psychological
production.
Psychological Rev. 85:363-394, 1978.
4. Kintsch,
J.G.:Understanding and
solving word
word arithmetic
arithmetic
Kintsch, W.
W. and Greeno,
Greeno, J.G.:Underatanding
and solving
problems. Psychological
problems.
Psychological Review,
Review, In
In Press.
S.
Readability and
short prose
prose passages:
passages:
and recall
recall of
of short
5. l'vliller,
Miller, J.R.
J.R. and Kintsch,
Kintsch, W.:
W.: Readability
.4.
Human
-4 theoretical
theoretical analysis.
analysis. J. Experimental
Experimental Psychology:
Psychology:
Human Learning
Learning and
f\.lemory
hiemory 5:33.5-354,
61335-354, 1980.
Miller, J.R.
J.R. and
and Kint,sch,
Kintsch, W.:
"V.: Readability
Readability and
and recall
recall of
of short
short prose
prose passages.
passages.
6. h?iller,
Text 1:21S-232,
Text
1:215-232. 1981.
J.R.:
A Knowledge-based
Knowledge-based Model
Model of
of Prose
Prose Comprehension:
Comprehension:
Miller, J.R.:
7. hiiller,
A
Applications toto Expository
Expository Text.
Text. IN
IN B.K.
B.K. Britton
Britton and J.B.
J.B. Black
Black (Eds.),
(Eds.),
Appficationu
UNDERSTANDING
EXPOSITORY
TEXT.
Erlbaum,
Hillsdale,
NJ,
1982.
UNDERSTANDING
EXPOSITORY TEXT. Erlbaum, Hillsdale, NJ,
8. Polson,
Polson, P.G.
P.G. and Kieras,
Kieras, D.E.:
D.E.: Theoretical
Theoretical foundations
foundations of
of a design
design process
process
for
the
minimization
of
user
complexity.
\Vorking
Paper
No.3,
guide
guide for the minimization
of user complezity. \Vorking Paper No.
3,
Project on
ou User
User Complexity,
Complexity, Universities
Universities of
of Arizona
Arizona and
and Colorado,
Colorado, June, 1983.
Project

Polson, P.G.
P.G. and
and Kieras,
Kieras, D.E.:
D.E.: A
A formal
formal description
description of
of users’
users' knowledge
knowledge ofof
9. Poison,
how to
to operate
operate a device
device and
and user
user complexity.
complexity. Behavior
Behavior Research Methods
Methods
how
Instrumentation.
and Instrument,ation.
van Dijk,
Dijk, T.A.
T.A. and
and Kintsch,
Kintsch, W.
\v.:STRATEGIES
10. van
: STRATEG IES
COiHPREHENSION. Academic
Academic Press, New
New York,
York, 1983.
198:3.
CO~\IPREIIENSIO.V

A. Feigenbaum
Feigenbaum
E. A.

138

0OF
F

DISCOURSE
DISCOURSE

5P41 RR00785-11
RR007851 1

CLIPR
CLIPR - Hierarchical
Hierarchical Models
Models of
of Human
Human Cognition
Cognition

Funding Support
F. Funding
Support Status
Status
1. Text
Text Comprehension
Comprehension and Memory
Memory

\Valter
Walter Kintsch,
Kintsch, Professor,
Professor, University
University of
of Colorado
Colorado
National
Institute
of
Mental
Health
5
Rol
Nat,ional Institute of Mental Health
Rol MHI5872-14-16
MH15872-14-16
7/1/81
7/l/81 - 6/30/84:
6/30/84: $281,085
7/1/83
7/l/83 - 6/30/84:
6/30/84: $69,878
$69,878
2. Understand
problems
Understand and solving
solving word
word arithmetic
arithmetic problems
\Valter
Walter Kintsch,
Kintsch, Professor,
Professor, University
University of
of Colorado
Colorado
National Science Foundation
National
Foundation
8/1/83
8/l/83 - 7/31/86:
7/31/86: $200,000
$200,000
3. User Complexity
Complexity of
of Devices and Systems
David
David Kieras,
Kieras, Associate
Associate Professor,
Professor, University
University of
of Arizona
Arizona
Peter
Pet.er G. Polson,
Polson, Professor,
Professor, University
University of
of Colorado
Colorado
International
International Business Machines
Machines Corporation
Corporation
1/1/82
]
2/31/84:
$364,000
l/1/82
12/31/84: $364,000
1/1/84
l/l,‘84 - 12/31/84:
12/31/84: $145,000

IT.
WITH
RESOURCE
II. INTERACTIONS
INTERACTIONS
WITH THE
THE SUMEX-AIM
SUMEX-AIM
RESOURCE
A. Sharing
Interactions with
Projects
.Q.
Sharing and
and Interactions
with Other
Other SUAfEX-AIAf
SUMEX-AIM
Projects
Our
primary interaction
Our primary
interaction with
with the SUMEX
SUMEX community
community has been the work
work of
of the
projects at
prose comprehension
comprehension group
group with
with the AGE
AGE and UNITS
UNITS projects
at SUMEX.
SUMEX. Feigenbaum
Feigenbaum
and Nii
Nii have visited
visited Colorado,
Colorado, and one of
of us (Miller)
(Miller) attended
attended the AGE
AGE workshop
workshop at
at
sur"mx.
Both
of
these
meetings
have
been
very
valuable
in
increasing
our
understanding
SUMG.X. Both of
meetings
very valuable
increasing our understanding
of
problems might
by the various
of ilow
how our
our problems
might best be solved
solved by
various systems
systems available
available at
at SUMEX.
SUMEX.
\Ve also hope that
We
that our
our experiments
experiments with
with the AGE
AGE and UNITS
UNITS packages have been helpful
helpful
to the development
projects.
development of
of those projects.
\Ve
\Ve should
should also mention
mention theoretical
theoretical and experimental
experimental insights
insights that
that we have received
received
from
project. The
from Alan
Alan Lesgold and other
other members
members of
of the SUMEX
SUMEX SCP project.
The initial
initial
comprehension
by Dr.
comprehension model
model (Miller
(Miller &
& Kintsch,
Kintsch, 1980) has been used by
Dr. Lesgold and other
ot,her
researchers at
a.t tht'
the University
University of
of Pittsburgh,
Pittsburgh, as
as well
well as
as rest'archers
researchers at
at Carnegie-Mellon
Carnegie-Mellon
Univt'rsity,
Universit’y, the University
University of
of Manitoba,
Manitoba, Rockefeller
Rockefeller University,
University, and the University
University- of
of
Victoria.
Victoria.

Re.'3ource Management
Afanagement
B. Critique
C’ritique of
of Resource
The
The SU!\IEX-AIM
SIJ’llEX-AIM resource is clearly
clearly suitable
suitable for
for the current
current and future
future needs of
of
our
project. \Ve
our project.
We have found
found the staff
staff of
of SUMEX
SUMEX to be cooperative
cooperative and effective
effective in dea.ling
dealing
wit.h
The
wit,h special requirements
requirements and in responding
responding to
to our
our questions.
The facilities
facilities for
for
communicat.ion
on
the
ARPANET
have
also
facilitated
collaborative
communicat.ion
ARPANET
fa.cilitated collaborat.ive work
work with
with
investigators
investigators throughout
throughout the country.
country.

ill.
III. RESEARCH
RESEARCH PLANS
PLANS

A.
Long Range
Range Projects
Projects Goals
Plans
A. Long
Goals and
and Plans
The
by the prose comprehension
The use
use of
of SUMEX
SUMEX by
comprehension group
group was greatly
greatly reduced
reduced in the

139

E. A.
A. Feigenbaum
Feigenbaum

CLIPR
CLIPR - Hierarchical
Hierarchical Models
Models of
of Human
Human Cognition
Cognition

5P41 RR00785-11
RR00785-11

two years, because
because the
the focus of
of the work
work during
during that
that period
period was on experimental
experimental work
work
two
book writing,
writ.ing, rather
rather than
than computer
computer simulation.
simulation. This
This will
will change in the
the fall
fall of
of 1984,
and book
when a new research associate will
will join
project whose primary
primary responsibility
responsibility will
will be in
when
join the project
cont,inuing the modelling
modelling work
work started
started in previous
previous years
years with
with J. Miller
Miller (who
(who is no longer
longer
continuing
with us). Thus,
Thus, we expect
expect a level of
of activity
activity comparable
comparable to
to previous
previous years
years next
next
associated with
fall.
fall.

E. A. Feigrnbaum
Feigpnbaum

140

5P41
5P41 RR00785-11
RR00785-11

CLIPR
CLIPR -- Hierarchical
Hierarchical Models
Models of
of Human
Human Cognition
Cognition

The
The primary
primary goal
goal of
of the
the device
device complexity
complexity project
project isis the
the development
development of
of aa theory
theory
of
of the
the processes
processes and
and knowledge
knowledge structures
structures that
that are
are involved
involved in
in the
the performance
performance of
of routine
routine
cognitive
cognitive skills
skills making
making use
use of
of devices
devices like
like word
word processors.
processors. We
We plan
plan to
to model
model the
the useruserdevice
interaction
by
representing
the
users
processes
and
knowledge
as
a
production
device int,eraction by representing the users processes and knowledge as a production
system
system and
and the
t.he device
device as
as aa nested automata.
automata. We
We are
are also
also studying
studying the role of
of mental
mental
models in
in learning
learning how to use
use them.

B. Justification
and Requirements
Justification
Requirementa for
for Continued
Continued SUMEX
SUMEX Use
Use
The
The research of
of the prose comprehension
comprehension project
project isis clearly
clearly tied to continued
cont.inued access
access
to t.he
the AGE
AGE and UNITS
UNITS packages, which
which are simply
simply not available
available elsewhere. We hope that
that
our
our continued
continued use
use of these systems will
will be
be offset
offset by the input
input we have been
been and will
will
continue
continue to provide
provide to those projects:
projects: our
our relationship
relationship has
has been
been symbiotic,
symbiotic, and we look
forward
forward to its continuation.
continuation.

C. Needs
Keeds and
and Plans
Plans for
for Other
Other Computational
Computational Resources
Resources
\Ve currently
currently use
use two
two other
other computing
computing systems located at the University
Universit,y
Colorado.
Colorado. One is
is the Department
Department of
of Psychology's
Psychology’s VAX
VAX 11/780,
11/780, which
which is used
used primarily
primarily
run
run real-time
rea.l-t,ime experiments
experiments to be modeled on SUMEX.
SUMEX. The
The second is
is the University
University
Colorado's
Colorado’s CDC
CDC 6400,
6400. which
which is
is used
used for
for various
various types
types or
of statistical
statistical analysis.

of
to
of

When
When the ARPA-sponsored
ARPA-sponsored Vax/Interlisp
Vax/Interlisp project
project is completed,
completed, we would
would be most
most
int,erested in experimenting
experimenting with
with becoming
becoming a remote
remote AGE/UNITS
AGE/UNITS site. It
It would
would seem that
that
this
this sort
sort of dev('lopment
devciopment is the ultimate
ultimate goal of
of the package projects,
projects, and this
this type
type of
int.eract.ion,
intera,ct.ion, once it
it becomes feasible, would
would be a logical
logical extension
extension of
of our
our association
association wit.h
with
the SUMEX
SUhIEX fa.cility.
fa.cilit,y.

D. Recowlwzendations
Recommendations for
for Future
Future Community
Resource Development
Development
and Resource
D.
Community and
Our
prima.ry recommendation
Oiur primary
recommendation for
for ruture
future development
development within
within SUMEX
SUMEX involves
involves (a)
t.he
t,he cont,inued
cont,inurd support
support of
of INTERLISP,
INTERLISP, which
which is needed for
for AGE
AGE and for
for other
other work
work we
have underway
underway on SUMEX
SUMEX and (b)
(b) the continued
continued development
development of
of the
the AGE
AGE and UNITS
UNITS
projects. In
In particular,
particular, we would
would like
like to
to see
see an extension
extension of
of AGE
AGE to
to include
include a wider
wider
projects.
variety
of
cont.rol
structures
so
that
our
psychological
models
would
not
be
confined
to
variety of control structures
that our psychological
would not
confined to
one particular
part iclliar view
view of
of knowledge-based
knowledge-based processing. The
The limited
limited physical
physical capacity
capacity of
of
SUr-,fEX, both
both in terms
terms of
of address space and overloading,
overloading, is, as before,
before, a major
major problem.
problem.
SUhlES,
The prose comprehension
comprehension group
group can no longer
longer use the
the publicly
publicly released AGE/UNITS
AGE/UNITS
The
system due to
to its
its severely
severely limited
limited address space, and
and has had
had to
to build
build a personal
personal AGE
AGE
system
system from
from a stripped-down
stripped-down version
version of
of Interlisp
Interlisp and
and a selected subset
subset of
of AGE
AGE and
system
UNITS. We
We heartily
heartily endorse the
the plans
plans underway
underway to
to obtain
obtain more
more computing
computing capacity
capacity for
for
UNITS.
the SUMEX
SUl'vlEX project.
project.
the
Given our
our acquisition
acquisition of
of a VAX,
VA.X,
. we particularly
particularly support
support the
the ongoing
ongoing and
and continued
continued
Given
development of
of INTERLISP
INTERLISP for
for the
the VAX,
VAX, so that
that local
local use of
of AGE
AGE and
and UNITS
UNITS would
would be
development
possihle. Since we, as well
well as other
other psychologists,
psychologists, need the
the real-time
real-time capability
capability of
of
possible.
VAX/VMS to
to run
run on-line
on-line experiments,
experiments, we hope that
that the
the INTERLISP
INTERLISP system
system to
to be
VAX/VhIS
developed will
will be compatible
compatible with
with VMS.
VMS. Note
Note that
that this
this need for
for real-time
real-time work
work coincides
coincides
developed
with
real-world
applications
of
SUMEX
programs,
in
which
a
VAX
might
be
devoted
to
with real-world applications of SUMEX programs, in which
VAX might
devoted to
both real-t.ime
real-time patient
patient monitoring
monitoring and
and diagnostic
diagnostic systems
systems such
such as PUFF
PUFF or
or MYCIN.
MYCIN.
both

141

E. A.
A. Feigenba.um
Feigenbaum
E.

5P41
5P41 RR00785-11
RR00785- 11

Rutgers
Rut,gers Research
Research Resource
Resource

II.A.2.3.
Rutgers
II.A.2d.
Rutgers Research
Research Resource
Resource

Rutgers
in Biomedicine
Rutgers Research
Research Resource--Computers
Resource--Computers
Biomedicine
Principal
Saul
Principal Investig.ators:
Investigators:
Saul Amarel
Amarel [1982-83],
[1982-831,
Casimir
Casimir Kulikowski,
Kulikowski, Sholom
Sholom Weiss
Weiss [1983-84].
[1983-841.
Rutgers
Rutgers University,
University, New
New Brunswick,
Brunswick, New
New Jersey
Jersey

I. SUMMARY
SUMMARY OF
OF RESEARCH
RESEARCH PROGRAM
PROGRAM
A. Goals
Goals and
and Approach
Approach
The
The fundamental
fundamental objective
objective of the Rutgers
Rutgers Resource is to develop a computer
computer based
framework
framework for significant
significant research in the biomedical
biomedical sciences
sciences and for
for the application
application of
resea,rch
resea.rch result,s
results t.o
t.o the solution
solution of important
important problems
problems in health
health care. The
The central
central concept
concept
is
in artificial
is t.o
to introduce
int,roduce advanced
advanced methods
methods of computer
computer science
science - particularly
particularly
artificial
intelligence
int,elligence into
into specific
specific areas of biomedical
biomedical inquiry.
inquiry. The
The computer
computer is used as
as an integral
integral
part
part of the inquiry
inquiry process, both
both for
for the development
development and organization
organization of knowledge
knowledge in a
domain
domain and for its utilization
utilization in problem
problem solving
solving and in processes
processes of
of experimentation
experimentation and
theory
theory formation.
forma.tion.
At
At, present,
present., the total
total number
number of investigators
investigators who
who participate
participate in scientific
scientific activities
activities
of t.he
outside investigators
investigators
t.he Resource is 83, of
of these, 20 have Rutgers
Rutgers appointments,
appointments, 21 are outside
who
mainly located
located at
at Rutgers,
Rutgers, and
who participate
particips.te in collaborative
colla,borative research projects
projects that
that are mainly
42 are investigators
from
colla.borative
national
AIM
projects
that
are
located
different
investigators from collaborative national AIM projects that
located in different
parts
members in Administrative,
Administrative,
parts of
of the country.
country. In addition,
addition, the
the Resource has 12 other
other members
Computer
functions. Thus,
Thus,
Computer Systems/Operations
Systems/Operations and general
general programming
programming and secretarial
secretarial functions.
Rutgers Resource community
community numbers
numbers at
at present
present a total
total of
of 95 participants.
participants.
the Rutgers
activities include
include research projects
projects (collaborative
(collaborative research
research and
and core
Resource activities
res(~arch) training/dissemination
training/dissemination projects,projects, and computing
computing services in
in support
support of
of user
rese;r.rch)
projects.
projects.

B. Medical
Afedical Relevance
Reiwanre and
and Collaborations
Collaborations
B.
In 1883-84
1983-84 we continued
con tinned the
the development
development of
of several
several versatile
versatile systems
systems for
for building
building
In
and t,esting
testing consult,ation
consult.ation models
models in biomedicine.
biomedicine. The
The EXPERT
EXPERT system
system ha.s
has had
had many
many of
of
a.nd
its ca.psbilities
capabilities enhanced in the course of
of collaborative
collaborative research
research in
in the
the areas of
of
its
rheumatology, ophthalmology,
ophthalmologs, and clinical
clinical pathology.
pathology.
rheumatology,
In ophthnlmology
opht.halmology we have developed
developed a knowledge
knowledge representation
representation scheme for
for
In
treatment
pb,nning
which
is
both
natural
and
efficient
for
encoding
the
strategies
for
treatment planning which
both natural and efficient for encoding the strategies for
choo:3ing
among
competing
and
cooperating
treatment
plans.
This
involves
a
ranking
of
choosing a.mong competing
cooperating treatment plans. This involves
ranking of
treatments according
according to
to their
their characteristics
characteristics and
and desired
desired effects
effects as well
well as
treatments
contra indications. Kastner
Kastner has generalized
generalized the
the scheme so that
that itit is now
now being
being used
used for
for a
cont,raindications.
number of
of reasoning
reasoning models:
models: infectious
infectious eye disease, primary
primary eye care,
care, and
and rheumatology
rheumatology
number
m;1nagement. Our
Our main
main collaboration
collaboration continues
continues to
to be with
with Dr.
Dr. Chandler
Chandler Dawson
Dawson of
of the
the
management.
Proct.or Foucdation,
Foundation, UCSF.
UCSF.
Proctor
In rheumatology,
rheumatology, our
our collaboration
collaboration with
with Drs.
Drs. Donald
Donald Lindberg
Lindberg and
and Gordon
Gordon Sharp
Sharp
In
at the
the Universit’
University
of Missouri-Columbia
Missouri-Columbia has
has continued
continued at
at a very
very active
active level.
level. The
The model
model
at
y of
which now
now includes
includes detailed
detailed diagnostic
diagnostic criteria
criteria for
for 26
26 major
major
for rheumstologicsl
ri1ellmatological diseases which
for

E. A.
A. Feigenbaum
Feigenbaum
E.

142

Rutgers
Rutgers Research
Research Resource
Resource

5P41
5P11 RH00785-11
RROOi85- 11

diseases,
diseases, had
had the
the management
mana.gement advice
advice and
and treatment
treatment planning
planning developed
developed further.
further. Dr.
Dr.
Sha.rp's
Sha.rp’s group
group continues
continues to
to develop
develop the
the knowledge
knowledge base
base in
in this
this area,
area, with
with formalization
formalization of
of
the
the knowledge
knowledge carried
carried out
out in
in conjunction
conjunction with
with Dr.
Dr. Lindberg's
Lindberg’s group
group and
and the
the Medical
Medical
Expert
The
Expert Systems
Systems Group
Group at
at Rutgers.
Rutgers.
The Resource
Resource researchers
researchers have
have developed
developed new
new
representational
representational elements
elements for
for EXPERT
EXPERT in
in response
response to the needs
needs of
of the rheumatology
rheumatology
research, and
and Politakis
Politakis has
has developed
developed aa coordinated
coordinated system
system called
called SEEK
SEEK (System
(System for
for
Empirical
Empirical Experimentation
Experimentation with
with Expert
Expert Knowledge.)
Knowledge.) which
which provides
provides interactive
interactive assistance
assistance
to the human
human expert
expert in
in testing,
t,esting, refining
refining and
and updating
updating aa knowledge
knowledge base
haze against
against aa data
dat,a.
base
bsse of
of trial
trial cases.
ca.ses.SEEK
SEEK has
has been
been tested and
and extended
extended during
during the past year.
In clinical
Robert Galen
clinical pathology
pathology our
our main
main collaboration
collaboration has
has been with
with Dr. Robert
(Cleveland
Clinic
Foundation),
with
whom
we
have
developed
the
serum
serum protein
protein
(Cleveland Clinic Foundation), with whom
electrophoresis
electrophoresis model which
which isis incorporated
incorporated into
into an instrument
instrument - the scanning
scanning
densitometer
This
densitometer manufactured
manufactured by Helena Laboratories.
Laboratories.
This instrument
instrument with
with interpretive
interpretive
reporting
reporting capabilities
capabilities has now been
been on the market
market for over aa year, isis located at over 100
100
clinical
clinical sites, and represents
represents the first
first known
known spin-off
spin-off of AI
AI expert
expert systems research in the
t.he
field of laboratory
instrumentation.
'vVe
continue
laboratory
instrumentation.
We
continue to refine
refine the representational
representational
mecha.nisms
model.
mecha.nisms used
used for
for this
this kind
kind of model.
In biomedical
biomedical modeling
modeling applications
applications we are experimenting
experimenting with
with several prototype
prototype
models for
for giving
giving advice on the interpretation
interpretation of
of experimental
experimental results
results in the field
field of
enzyme kinetics.
kinetics. in conjunction
conjunction with
with Dr.
Dr. David
David Garfinkel.
Garfinkel. His PENNZYME
PENNZYME program
program has
been linked
linked to a model in EXPERT,
EXPERT, which
which allows
allows the user to interpret
interpret the progress of the
model analysis.

C. Highlights
of Research
Research Progress
Progress
Highlights of
Expert
Expert Medical
Medical Systems
Systems (C.
(C. Kulikowski,
Kulikowski, S. Weiss)
Weiss)
Research has continued
problems of
continued on problems
of representation,
representation, inference
inference and control
control in
expert
systems.
Emphasis
has
been
placed
this
year
on
problems
of
knowledge
expert
Emphasis
placed this year
problems of knowledge base
acquisition,
acquisition, empirical
empirical testing
testing and refinement
refinement of
of reasoning
reasoning (the
(the SEEK
SEEK system),
system), and
treat.ment planning
planning st,ra.tegies
strategies over
over time.
time. From
From a technological
technological point
point of
of view
view the
the market,
market
trea.t.ment
availability of
of the int.erpretive
interpretive reporting
reporting version
version of
of a scanning
scanning densitometer,
densitometer, and the
the
availability
development of
of modeis
models for
for eye care consultation
consultation that
that run
run on microprocessor
microprocessor systems
systems
development
(Apple IIc,
lIe, IBM-PC)
IBM-PC) represents
represents an important
important achievement
achievement for
for AIM
AIM research in showing
showing its
its
(Apple
practical impact
imp:1ct in medical
medical applications.
applications. This
This was recognized
recognized by
by the award
award of
of a scientific
scientific
practical
exhibit prize
prize at.
at, the Academy
Academy of
of Ophthalmology
Ophthalmology Annual
Annual Meeting
Meeting in November
November 1983.
exhibit
SEEK: A
A System
System for
for Empirical
Empirical Experimentation
Experimentation with
with Expert
Expert Knowiedge
Knowledge
1.1) SEEK:
SEEK is a~t system
syst.em which
which has been developed
developed t.o
to give
give interactive
interactive advice
advice about
about rule
rule
SEEK
refinement
daring
the
design
of
an
expert
system.
The
advice
takes
the
form
of
refinement during the
of
expert system.
The advice takes the form of
suggestions
for
possible
experiments
in
generalizing
and
specializing
rules
in
an
expert
suggestions for possible experiments in generalizing and specializing rules in
expert
model that
that has been specified
specified based on
on reasoning
reasoning rules
rules cited
cited by
by aa. human
human expert.
expert. Case
model
('xperience, in the
the form
form of
of stored
stored cases
cases with
with known
known conclusions,
conclusions, is used to
to interactively
interactively
experience.
guide the
the expert
expert in refining
refining the
the rules
rules of
of a model.
model. The
The clesign
design framework
framework of
of SEEK
SEEK consists
consists
gl.;idtx
of a tabular
tabular model
model for
for expressing
expressing expert-modeled
expert-modeled rules
rules and
and a general
general consultatiou
consultation system
system
of
for a.pplying
applying a model
model to
to specific
specific cases.
cases. This
This approach
approach has proven
proven particularly
particularly valuable
valuable in
in
for
assisting the
the expert
expert in
in domains
domains where
where the
the logic
logic for
for discriminating
discriminating two
two diagnoses is difficult
difficult
assisting
to specify;
specify; and
and we have
have benefited
benefited primarily
primarily from
from experience
experience in
in building
building the
the consultation
consultation
to
system in
in rheumatology.
rheumatology.
system
Treatment Planning
Planning
1.2) Trea.tment
The ranking
ranking and
and selection
selection strategies
strategies developed
developed as
as aa stand-alone
stand-alone system
system last
last year
year
The

143

E. A.
A. Feigenbaum
Feigenbaum
E.

Rutgers
Rut,gers Research Resource

5P41 RR00785-11

have been
Capabilities
Capabilities for expressing
been incorporated
incorporated into
into the EXPERT
EXPERT framework.
framework.
reasoning over time
time have been
been added, so stored
stored chart
chart reviews can be
be carried
carried out
out
automatically,
a.ut~omat.ically. summarizing
summarizing various
various patterns
patterns of findings
findings over
over time,
time, and abstracting
abstracting the
major
major features
features of interest
interest for prognostic
prognostic advice or treatment
treatment recommendations.
recommendations.
Applications
Applications have been in infectious
infectious eye
eye disease
disease modeling,
modeling, rheumatology
rheumatology treatment,
treatment, and
sequential
sequential advice in interpretation
interpretation and sequencing of cardiac
cardiac enzyme tests (e.g. CPK/LDH
CPK/LDH
isoenzymes).
1.3) Technology
Technology Transfer
Transfer
Important
Import.ant technology
technology transfer
transfer milestones
milestones have also been achieved this
this year:
year: the
instrument
EXPERT
inst.rument interpretation
interpretation
EXPERT program
program for
for serum
serum protein
protein has been widely
widely
disseminated
disseminated after
after being made available
available by Helena Laboratories,
Laboratories, based on the prototype
prototype
program
program developed by us;
us; and we have succeeded
succeeded in transferring
transferring a large knowledge
knowledge base
base
in rhel1matology
hypotheses and 1000 rules)
rules) onto
onto a
rheuma.tology (about
(about 1000 findings,
findings, 400 hypotheses
mi<'roprocessor
microprocessor (rvtotorola
(Motorola 68000) based system
system - the WICAT
WICAT - which
which is well
well within
within the
means of clinical
clinical researchers and practitioners.
practitioners. This
This system
system has been on site at the
University
University of r-.lissouri
Missouri during
during the last
last year
year for
for testing
testing and refining
refining of
of the knowledge
knowledge base.
base.
1.4)
1.4j Learning
Learning with
with Prior
Prior Structural
Structural Knowledge
Knowledge
This
This approach
approach to knowledge
knowledge acquisition
acquisition and representation
representation has as
as its goal to allow
allow
the expert
just the elements
expert to specify
specify just
elements that
that are to enter
enter into
into the reasoning
reasoning model,
model, with
with a
few causal and taxonomic
taxonomic relations.
relations. These should
should then
then be
be sufficient
sufficient to guide a learning
learning
program
which
operat,es
on
a
data
base
of
cases
with
known
program which operates
data base of cases with known end-points.
end-points. Such an approach
approach
wonld
woiiid he
be w3eful
ii,seful in situations
sit:uations where
where the expert
expert either
either has little
little time
time to explicitly
explicitly formulate
formulate
decision rules, or
program [Drastal
or finds
finds it
it difficult
difficult to do so.
so. Our
Our program
[Drastal and Kulikowski,
Kulikowski, 1982]
19821
uses
uses aa. blackboard
blackboard representation,
representation, with
with multiple
multiple knowledge
knowledge sources to
to handle
handle the different
different
conclusions,
pertain to
conclusions, and the formation
formatsion of
of rules from
from the data
data that
that pertain
to them.
them. We
We have
tested
tested this
this scheme in the areas of
of glaucoma
glaucoma and rheumatology,
rheumatology, and shown
shown that
that there
there are
some int.eresting
provided by
interesting tradeoffs
tradeoffs between
between the degree of
of a-priori
a-priori structure
structure provided
by the expert,
expert,
and
snd the
t.he complexity
complexity of
of rule
rule generation.
generation.
In relat
ion to
relation
to
alternative
alternn.tive means
mea.ns of
of
of
of additional
a.dditional medical
medical

a system
preprocessing or
system like
like SEEK,
SEEK, this
this approach
approach represents
represents a preprocessing
or
developing
prototype model. We
developing the prot.otype
We are now
now investigating
investigating the role
semantic
semantic constraints
constraints on the strategies
strategies of
of rule
rule generation.
generation.

2) Artificial
Artificial 1ntelligence:Expertise
Intelligence:Expertise Acquisition
Acquisition and Problem
Problem Reformulation
Reformulation (S.
Amarel)
Amarei)
The main
main research
research activity
activity in this
this area is concerned
concerned with
with improvements
improvements in problem
problem
The
solving expertise
expertise via
via shifts
shifts in problem
problem representation,
representation, i.e., via
via reformulation.
reformulation.
solving
this research, we have concentrated
concentrated on the developmental
developmental processes
processes that
that lead to
to
In this
formation of
of specialized
specialized high
high performance
performance procedures
procedures in sub-domains
SUb-domains of
of a problem
problem
the forma.tion
ci::lss. Theory
Theory formation
formation is a,
a key
key task
task in these processes;
processes; and we are now
now studying
studying several
several
CIXS.
approaches to this
this task
task - both
both top-down,
top-down, model
model guided,
guided, approaches
approaches and ‘'bottom-up'
approaches
bottom-up’
methods that
that are based on detailed
detailed analysis
analysis of
of individual
individual cases.
cases.
methods

D. lip-to-Date
Up-to-Date List
List ofof Publications
Publications
D.
The following
following is an update
update of
of publications
publications in the
the Rutgers
Rutgers Resource for
for the
the period
period
The
198·1 (only
(only publications
publications not
not listed
listed in previous
previous SUMEX
SUMEX annual
annual reports
reports are
1983 and 198.1
presented here).
presented
Wei5s, S.M.
S.M. and
and Kulikowski,
Kulikowski, C.A.
C.A. A
A Practbcal
Practical Guide
Guide to Designing
Designing Ezpert
Expert
1. Weiss,
Systems, Rowman
Rowman a.nd
and Allanheld,
Allanheld, 1984.
Syatemu.
Feigenbaum
A. Feigenbaum
E. A.

144

Rutgers
Rutgers Research
Research Resource
Resource

5P41
5P41 RR00785-11
RR00785- 11

2.
2. Kulikowski,
Kulikowski, C.A.
C.A. contributor
contributor to
to the
the Knowledge
Knowledge Acquisition
Acquisition chapter
chapter edited
edited by
by
B.
B. Buchanan
Buchanan in
in the
the book
book Building
Building Expert
Expert Systems
Systems (F.
(F. HayesHayes- Roth,
Roth, et
et aI.,
al.,
cds)
eds) Addison-\o\Tesley,
Addison-Wesley, 1983
1983 (in
(in press).
press).

Strategies
of
3.
3. 1'30,
l’a.0, Y.
Y. and
and Kulikowski,
Kulikowski, C.A.,
C.A., "lYfultiple
’ Multiple
Strategies
of Reasoning
Reasoning for
for
Expert
Expert Sy.'3tems",
Systems “, Proc.
Proc. Sixteenth
Sixteenth Hawaii
Hawaii International
International Conference
Conference on Systems
Sciences,
Sciences, pp.
pp. 510-514
510-514 ,, 1983.
1983.**
4.
4. Kulikowski,
Kulikowski, C.A.
C.A. "Progress
“Progress in
in Expert
Expert AI
Al Medical
Medical Consultation
Consultation Systems:
Systems:
1980
1980 -- 1983
1983 If,“, Proc.
Proc. MEDINFO
MEDINFO '83
‘83 , pp.
pp. 499-502,
499-502, Amsterdam,
Amsterdam, August
August 1983.*
1983.t
5.
5. Kastner,
Kastner, J.K.,
J.K., \Veiss,
Weiss, S.M., and Kulikowski,
Kulikowski, C.A.,
C.A., "An
“An Efficient
Efficient
Consultation
Systems",
Proc.
MEDINFO
Time-Dependent
Time-Dependent Consultation
Systems I,
MEDINFO '83,
‘83,
1983.
*
1983.’

Scheme for
for
pp.619-622,

6.
6. K1Jlikowski,
Kulikowski, C.A.
C.A. "Expert
“Ezpert Medical
Medical Consultation
Consultation Systems",
Systems “, Journal
Journal of Medical
Medical
Systems, v.7. pp. 229-234, 1983.*
7. Weiss, S.M., Kulikowski,
Kulikowski, C.A.,
C.A., and Galen, R.S., "Representing
“Representing Expertise
Expertise in
in a
Computer
Computer Program:
Program: The Serum
Serum Protein
Fbotein Diagnostic
Diagnostic Program",
Program”, Journal
Journal of
Clinical
Clinical Laboratory
Laboratory Automation,
Automation, v.3, pp. 383-387, 1983.*
8. Kastner,
for
Kastner, J.K.,
J.K., Weiss, S.M., and Kulikowski,
Kulikowski, C,A.,
C.A., "” lin
An Expert
Expert System
System for
Front-line
Front-line Health
Health Workers
Workers in
in Primary
Primary Eye Care",
Care “, Proc.
Proc. Seventeenth
Seventeenth Hawaii
Hawaii
International
1nt.ernationn.i Conference
Conference on Systems Sciences,
Sciences, pp. 162-166, 1984.
1984.**
9. Kulikowski,
Kuiikowski. C.A.
C.A. "Knowledge
“K,nowledge Acquisition
Acquisition and
and Learning
Learning in
in EXPERT",
EXPERT”, Proc.
Proc.
1983 'Workshop
IIIinois,Champaign-Urbana
Workshop on Machine
Machine Learning,
Learning, Univ.
Univ. of
of Illinois,Champaign-Urbana
1983.
credit.
Indicat.e
Indicate by
by an
a,n asterisk
asterisk (*)
(*) that
that the resource was given
given credit.

E. Funding
Funding Support
Support
Since December
December 1983, the Rutgers
Rutgers Research Resource on Artificial
Artificial Intelligence
Intelligence in
funded under
under grant
grant RR
RR 02230-01 from
from the Division
Division of
of Research Resources,
Medicine is funded
Medicine
Biotechnology Resources Program.
Program. Principal
Principal Investigators
Investigators are Casimir
Casimir A.
A. Kulikowski,
Kulikowski,
Biotechnology
Proffssor of
of Comput,er
Computer Science and Chairman
Chairman of
of the Department
Department of
of Computer
Computer Science
Profrssor
Dr. Shoiom
Sholom M.
M. Weiss,
\Veiss, Associate
Associate Research Professor
Professor of
of Computer
Computer Science.
[1984-87]' and Dr.
[1984-871,
The tota.
total direct
direct costs for
for the
the period
period 1983-87 is
The
current
period
(December
I,
1983
November
current period (December 1,
November 30, 1984)

$3,198,075, with
with the
the total
total for
for the
the
$3,198,075,
being
$
989,276.
being

The Rutgers
Rutgers Resource was funded
funded until
until December
December 1983 through
through an NIH
NIH grant
grant
The
entitled"“Rut.gers
Rut,gers Research Resource on
on Computers
Computers in
in Biomedicine”
Biomedicine" -- number
number P41RR643.
P41RR643.
entitled
The Co-Principa.1
Co-Principal Investigators
Investigators were
were Dr.
Dr. Saul
Saul Amarel,
Amarel, Professor,
Professor, Chairman
Chairman of
of the
the
The
Department of
of Computer
Computer Science, and Director
Director of
of the
the Laboratory
Laboratory for
for Computer
Computer Science
Department
Research, and Dr.
Dr. Casimir
Casimir Kuiikowski,
Kulikowski, Professor
Professor of
of Computer
Computer Science at
at R.utgers.
Rutgers.
Kesezrch,

II. INTER4CTIONS
INTERA.CTIONS WITH
WITH THE
THE SUMEX-AIM
SUMEX-AIM RESOURCE
RESOURCE
II.

A. hledical
Aledical Collaborations
Collaborations and
and Dissemination
Dissemination
A.
The SUMEX-hIM
SUMEX-AIM facility
facility provides
provides a backup
backup node
node where
where some of
of our
our medical
medical
The
collaborators
can
access
programs
developed
at
Rutgers.
The
bulk
of
the
medical
collaborators ca.n access programs developed at Rutgers.
The bulk of the medical
collaborative work
work outlined
outlined in
in LB.
LB. above
above is centered
centered at
at the
the Rutgers
Rutgers facility
facility (the
(the RutgersRutgerscollaborative
AIM node).
node).
AIM

145

A. Feigenbaum
Feigenbaum
E. A.
E.

RR00785-11
5P41 RR00785-11

Rutgers
Rutgers Research Resource

Dissemination
Dissemination activities
activities continue
continue to
to be an important
important responsibility
responsibility of
of the Rutgers
Rutgers
Resource within
wit)hin t,he
t,he AIM
AIM community.
community. The
The following
following activities
activities took
took place in the last year:
year:
1.
Ninth AIM
1. Ninth
,4IM \Vorkshop
Workshop (1983):

Organized
Organized by
by Dr.
Dr. Casimir
Casimir Kulikowski,
Kulikowski, it
it was held in Baltimore,
Baltimore, in conjunction
conjunction
with
It
It consisted
consisted of
of a series of
of working
working group
group
with the SCAMC
SCAMC 83 meeting.
meeting.
discussions followed
followed by
by summary
summary presentations
presentations by
by members
members of
of the AIM
AIM
community
community on their
their conclusions.
conclusions.
2. Hawaii
Hawaii International
International Conference
Conference On
On Systems Sciences:
Sciences:
presented a paper
paper on the expert
Dr.
Dr. \Veiss
Weiss presented
expert system
system for
for front-line
front-line health
health
workers,
workers, and Dr.
Dr. Kulikowski
Kulikowski chaired
chaired a session
session on knowledge
knowledge based medical
medical
systems.
3. VII-pn,n-American
VII-Pan-America,n Congress on Rheumatology:
Rheumatology:
Dr.
presented the
Dr. Sharp
Sharp present,ed
the rheumatology
rheumatology knowledge
knowledge base and consultation
consultation
program at this
program
this meeting.
meeting.
4. At
Amarel was elected member
member of
of the Executive
Executive
At the AAAI-82
AAAI-82 meeting,
meeting, S. Amarel
Council
Council of
of AAAI.
AAAI. He is also General
General Chairman
Chairman of
of IJCAI-83
IJCAI-83 which
which was held
held in
Karlsruhe,
Karlsruhe, W.
W. Germany
Germany in August
August 1983. Dr.
Dr. Kulikowski
Kulikowski was the organizer
organizer for
for
an expert
expert medical
medical systems session
session at
at MEDINFO
MEDINFO 83.

B.
National .11.\1
Projects at Rutgers
Rutgers
B. National
AIM Projects
The
projects, approved·
The national
national AIM
AI&l projects,
approved. by
by the AIM
AIM Executive
Executive Committee,
Committee, that
that are
associated with
with the RutgerS-AIM
Rutgers-AIM node are the
the following:
following:
1. INTERNIST
jCADUCEUS project,
project, headed by
INTERNISTjCADUCEUS
by Dr.
Dr. Myers
Myers and Dr.
Dr. Pople
Pople from
from

the University
University of
of Pittsburgh,
Pittsburgh, has been using
using the Rutgers
Rutgers Resource as a backup
backup
system
system for
for development
development, and experimentation.
experimentation.
2. Medical
project, headed by
Medica. Knowledge
Knowledge Representation
Representation project,
by Dr.
Dr. Chandrasekaran
Chandrasekaran

from
from Ohio
Ohio State
State University,
University, is doing
doing most
most of
of its
its research on the Rutgers
Rutgers
system.
system.
3. PURSUIT
project, directed
PURSUIT project,
directed by
by Dr.
Dr. Greenes from
from Harvard
Harvard University,
University, is doing
doing

most
most of
of its
its research on a Goal-Directed
Goal-Directed Model
Model of
of Clinical
Clinical Decision-Making
Decision-Making at
Rutgers.
Rutgers.
4. Biomedical
Biomedical Modeling,
Modeling, by
by Dr.
Dr. Garfinkel
Garfinkel from
from the
the University
University of
of Pennsylvania.
Pennsylvania.
5. At.tending
At,tending Project,
Project, directed
directed by
by Dr.
Dr. Perry
Perry Miller
Miller of
of the Yale
Yale Medical
Medical Center,
Center, is
doing
physician's
plan of
doing much
much of
of the
the research on critiquing
critiquing a physician’
s plan
of management
management at
Rutgers.
Rutgers.
6. MEDSIM
project:
MEDSIM project:

This
pilot project
project designed to provide
provide resource-sharing
This is a pilot
resource-sharing
a.nd
and community
community building
building facilities
facilities for
for about
about 25 researchers
researchers in bio-mathematical
bio-mathematical
modeling
modeling and simulation.
simulation.

C. Critique
Resource Management
Management
Critique of
of SUMEX-AIAf
SUMEX-AIM
Resource
Rutgers
using
DEC-20
primarily for
SUMEX
DEC-PO system
system primarily
for
Rut,gers is currently
currently
using the SUMEX
communication
communication with
with other
other researchers in the AIM
AIM community
community and with
with SUMEX
SUMEX staff,
staff, and

E. A. Feigenbaum
Feigenbaum

146

Rutgers
Rutgers Research
Research Resource
Resource

SP41
5P41 RR0078S-11
RROO785-11

also
also for
for backup
backup computing
computing in
in demonstrations,
demonstrations, conferences
conferences and
and site
site visits.
visits. Our
Our usage
usage isis
currently
currently running
running at
at less
less than
than 50
50 connect
connect hours
hours per
per year
year at
at SUMEX,
SUMEX, with
with an
an overall
overall
connect/CPU
connect/CPU ratio
ratio of
of about
about 30.
30.
Rutgers
Rutgers isis beginning
beginning to
to place
place more
more emphasis
emphasis on
on the
the use
use of personal
personal computers,
computers, and
on
on network
network support
support needed
needed to
to make
make these
these effective.
effective. Sumex
Sumex has
has been
been help in the following
following
wa.ys:
ways :
•l The
The AIM
AIM Executive
Executive Committee
Committee allocated
allocated to
to the
the Rutgers-AIM
Rutgers-AIM node
node one
one of the
Xerox
Xerox Dolphins
Dolphins acquired
acquired by
by SUMEX,
SUMEX, to
to help
help us
us develop experience in
supporting
This machine was used
used almost
almost entirely
entirely to help
support,ing personal machines. This
us
us develop and test network
network support(We
support(We are using Ethernet
Ethernet with
with the Xerox
Xerox
PUP
PUP networking
networking protocols),
protocols), and subsequently
subsequently returned
returned to SUMEX.
SUMEX.
•l Most
Most of network
network software
software that
that we use
use
Having
this
software
available
Having t,his software availa.ble has saved

was originally
originally developed at SUMEX.
SUMEX.
us an enormous
enormous amount
amount of time.
time.

•l Initia.lly
Initinlly SUMEX
SUMEX was very
very helpful
helpful in giving
giving us advice about
about setting
setting up our
our
Ethernet
Ethernet and the Dolphins.
Dolphins.

m.
III. RESEARCH
RESEARCH PLANS
PLANS
A.
A. Project
Pro;lect Goals
Goals and
a,nd Plans
Plans
We
We are pla.nning
planning to
t,o continue
continue along the main
main lines of
of research that
that we have
est.ablished
est.n.blished in the
t,he Resource to date. Our
Our medical
medical collaborations
collaborations will
will continue
continue with
with
emphasis on development
development of expert
expert consultation
consultation systems in rheumatology,
rheumatology, ophthalmology
ophthalmology
and clinical
inference and planning
planning will
will
clinical pathology.
pathology. The
The basic AI
AI issues
issues of
of representation,
representation, inference
continue
Our
with emphasis on further
further
continue to receive attention.
attention.
Our core work
work will
will continue
continue with
development
representations and
development of
of the
t,he EXPERT
EXPERT framework
framework and also on AI
AI stUdies
studies in representations
problems
of
knowledge
and
expertise
acquisition.
We
propose
to
work
number of
of
to work on a number
problems
knowledge
expertise acquisition. We
terhnology
will be affordable
affordable by
by our
our
technology transfer
transfer experiments
experiments to micro
micro processing
processing that
that will
biomedka.l research snd
and clinical
clinical collaborators.
collaborators. We
We also plan
plan to
to continue
continue our
our participation
participation
biomedica.1
AIM dissemination
dissemination and training
training activities
activities as well
well as our
our contribution
contribution -- via
via the
the
in :\IM
RUTGERS/LCSR computer
computer -- to
to the shared
computing facilities
facilities of
of the national
national AIM
AIM
RUTGERS/LCSR
shared computing
network.
network.

B. Justification
Justification and
and Requirements
Requirements for
for Continued
SUMEX Use
B.
Continued SUMEX
Continued access
access to
to SUMEX
SUMEX is needed for:
for:
Continued
Ba.ckllP for
for demos, et,c.
et.c.
1. Rack~lp
Programs developed
developed to
to serve the National
National AIM
AIM Community
Community should
should be runnable
runnable
2. Programs
both facilit.ies.
facilities.
on both
There should
should be joint
joint development
development activities
activities between
between the
the staffs
staffs at
at Rutgers
Rutgers and
and
3. There
SUMEX in order
order to
to ensure portability,
portability, share the
the load,
load, and
and provide
provide a wider
wider
SUMEX
variety of
of inputs
inputs for
for developments.
developments.
variety

C. Needs
Needs and
and Plans
Plans for
for Other
Other Computing
Computing Resources
Resources Beyond
Beyond SUMEX-AIM
SUAIEX-AL\1
C.
Our comput,ing
computing is going
going to
to move
move in
in the
the direction
direction of
of personal
personal computers.
computers. We
vVe will
will
Our
continue to
to use Sumes
Sumex for
for backup
backup purposes,
purposes, however.
however.
continue

D. Recommendations
Recommendations for
for Future
Future Community
Community and
and Resource
Resource Development
Development
D.

147

E. A.
A. Feigenbaum
Feigenbaum
E.

Rutgers Research Resource
Rutgers

5P41 RR00785-11
RR00785- 11

Use of
of personal
personal comput,ers
compubers and minicomputers
minicomputers is continuing
continuing to
to grow
grow in the
the AIM
AIM
community.
We find
find that
that the
the biggest
biggest challenge
challenge is supporting
supporting these systems. Although
Although
community.
vVe
central computing
computing will
will continue
continue to
to be needed for
for communication
communication and coordination,
coordination,
some central
that over
over the next
next few years
years all AIM
AIM research projects
projects and even individual
individual
we believe that
collaborators
collaborators will
will come to
to have their
their own
own hardware.
hardware. However
However many
many of
of these community
community
members (particularly
(particularly the
the collaborators)
collaborators) will
will not
not be in a position
position to
to support
support hardware
hardware or
or
members
software
software on their
their own.
own. We
We would
would certainly
certainly expect
expect SUMEX
SUMEX to
t,o continue
continue to
to provide
provide expert
expert.
this area. However
However we believe it
it would
would be helpful
helpful for
for SUMEX
SUMEX to
to have a formal
formal
advice in this
program to support
support smaller
smaller computers
computers in the field.
field. We
We envision
envision this
this as including
including at least
program
following items:
it,ems:
the following

A central
central source of
of information
information on hardware
hardware and software
software that
that is likely
likely to
to be
•l A
of interest
interest to the AIM
AIM community.
community.
SUMEX might
might want
want to become a
of
SUMEX
distribution
dist,ribution point
point for
for certain
certain of
of this
this software,
software, and even help coordinate
coordinate
quantity
quantity purchase
purchase of
of hardware
hardware if
if this
this proves
proves useful.
useful.
•l Assistance
Assistance in support
support of
of hardware
hardware and software
software in the
the field.
field. Depending
Depending upon
upon
the hardware
hardware involved,
involved, this
this might
might involve
involve advice over
over the telephone
telephone or
or actual
actual
board-swapping
board-swapping by
by mail.
mail. \Vith
With our
our Dolphins
Dolphins we have found
found that
that there
there are a
number
number of
of problems
problems that
that can be resolved
resolved over
over the telephone
telephone if
if we can find
find
someone with
with appropriate
appropriate expertise.
expertise.

E. A. Feigenbaum
Feigenbaum

148

Rutgers
Rutgers Research Resource

5P41
5P4 1 RR0078S-11
RR00785- 11

ll.A.2.4.
SECS:
&
of
II.A.2.4.
SECS: Simulation
Simulation
St Evaluation
Evaluation
of Chemical
Chemical Synthesis
Synthesis
SECS
SECS - Simulation
Simulation and
and Evaluation
Evaluation or
of Chemical
Chemical Synthesis
Synthesis Project
Project
Principal
W.
Principal Investigator:
Investigator:
W. Todd
Todd Wipke
Wipke
Board
Board or
of Studies
Studies in
in Chemistry
Chemistry
University
University or
of California
California
Santa
Santa Cruz,
Cruz, CA.
CA. 95064
95064
Coworkers:
Coworkers:
l.
I. Kim
Kim
D. Rogers
J. Chou
Chou
M.
hl. Hahn
Hahn
M.
hf. Yanaka
>‘anska
l.
I. Iwataki
Iwataki

1.
I.

(Grad
(Grad student)
student)
(Grad
(Grad Student)
Student)
(Postdoctoral)
(Postdoctoral)
(Grad
(Grad Student)
Student)
(Postdoctoral)
(Postdoctoral)
(Postdoctoral)
(Postdoctoral)

SUMMARY
OF
SUMMARY
OF RESEARCH
RESEARCH PROGRAM
PROGRAM
A. Project
Project Rationale
Rationale
-4.

\Vith the SECS project
project our
principles of
With
our long range goal is to
to develop the logical
logical principles
of
molecula.r
practical computer
programs to
molecu1a.r construction
construction and to
to use
use these in developing
developing practical
computer programs
to
assist
bio-organic molecules.
a.ssist investigators
investigators in designing
designing stereospecific
stereospecific syntheses of
of complex
complex bio-organic
Our
project, is aimed
Our second area
a.rea of
of research, the XENO
XENO project,
aimed at
at improving
improving methods
methods for
for
predicting potential
potential biologica.
biological activity
plausibility of
predictsing
activity of
of metabolites
metabolites and plausibility
of incorporation
incorporation and
excretion
excretion of metabolites.
metabolites.

B. Medical
Afedical Relevance
Relevance and
and Collaboration
Collaboration
The
biological activity
The development
development of
of new drugs
drugs and the study
study of
of drug
drug structure
structure biological
activit,y
relationships
depends
upon
the
chemist's
ability
to
synthesize
new
molecules
relstionships
chemist’s ability to synthesize
molecules as
as well
well as
as his
to
isotopic
a.hility
incorporating
isotopic labels or
or other
ot,her
a.bility
to modify
nlodify existing
existing structures,
structures, e.g., incorporating
substituents
bio-molecular substrates.
The Simulation
Simulation and Evaluation
Evaluat,ion of
of Chemical
Chemical
subst ituents into
into bio-molecular
substrates. The
Synthesis
project aims at assisting
Synt,hesis (SEeS)
(SECS) project
assisting the synthetic
synthetic chemist
chemist in designing
designing stereospecific
stereospecific
synt,heses
biologically important
The advantages
advantages of
of this
this computer
computer approach
approach
syntheses of
of biologically
important molecules. The
over
over normal
normal manual
manual approaches
a.pproa.ches are many:
many: 1) greater
greater speed in designing
designing a synthesis;
synthesis; 2)
freedom
past experience and past solutions;
freedom from
from bias of
of past
solutions; 3) thorough
thorough consideration
consideration of all
possibl(' syntheses llsing
possible
using a more extensive
extensive library
library of
of chemical
chemical reactions
reactions than
than any
any individual
individual
person can remember;
remember; 4) greater
greater capability
capability of
of the computer
computer to
to deal with
with the many
many
struct
st.rucl ilHS
ures which
which result;
result,: and 5) capability
capability of
of computer
computer to
to see
see molecules
molecules in a graph
graph
theoretical
projection.
theoretical S(,llSC.
sense. free from
from the bias of
of 2-D projection.
The
using XENO
predict the plausible
plausible
The objective
objective of
of using
XENO in metabolism
metabolism studies
studies is to
to predict
metabolites
of
a
given
xenobiotic
in
order
that
they
may
be
analyzed
for
metabolites of
given xenobiotic
order that they may
analyzed for possible
carcinogenicity.
Metabolism
carcinogenicity.
Metabolism research may
may also find
find this
this useful in the identification
identification of
of
metabolites
met,a.bolitcs in that
t,lmt it
it suggests what
what to
to look
look for.
for. Finally,
Finally, one may
may envision
envision applications
applications of
of
this
problem domains
t,his technology
technology in problem
domains where
where one wishes to
to alter
alter molecules
molecules in order
order to
to inhibit
inhibit
certain
certain types
types of
of metabolism.
metabolism.

C. Highlights
Highlights of
Research Progress
Progress
o j Research

149

E. A.
A. Feigenbaum
Feigenbaum

SECS: Simulation
Simulation 8l
62 Evaluation
Evaluation of
of Chemical
Chemical Synthesis
Synthesis

RR00785-11
5P41 RR00785-11

C.1 SECS
SECS Project
C.l
Project Developmenta
Developments
The
The majorit.y
majority of
of our
our research has been aimed
aimed at
at strategic
strategic planning
planning in chemical
chemical
synthesis.
Specific work
work has included
included the SST
SST project
project for
for recognizing
recognizing potential
potential starting
starting
synthesis. Specific
materials from
from a target,
target, the MCS
MCS project
project for
for maximal
maximal common
common subgraph
subgraph searching,
searching, and a
mat.erials
project for
for rapid
rapid substructure
substructure search using
using parallelism.
parallelism.
project

Starting Material
Strategies. The
C.l.a
Material Strategies.
C.1.a SST
SST -- Starting
The importance
importance of
of selecting
selecting good
starting materials
materials for
for a synthesis
synthesis has been known
known for
for a long time,
time, but
but only
only recently
recently has
starting
work
work st.arted
started on applying
applying computer
computer techniques
techniques to
to the
the selection
selection process. The
The selection
selection of
of
st,arting material
material for
for a synthesis
synthesis is frequently
frequent,ly the major
major discovery
discovery in a synthesis
synthesis and the
st.arting
of converting
converting the starting
starting material
material to the target
target is minor
minor by
by comparison.
comparison. Last
Last
process of
year
year we reported
reported development
development of
of the SST program
program for
for selecting
selecting starting
starting materials
materials that
that
are appropriate
appropriate for
for a given
given synthetic
synthetic target
target using a library
library of
of available
available chemicals,
chemicals, but
but
without
without reference
reference to
to reactions.
reactions. SST
SST handles problems
problems of
of classes
classes I-III
I-III given
given below:
below:

I)
I)

Target
Target =
= SM

Identical
Identical match
match
Superstructure match
match
Superstructure
Substructure match
match
Substructure
Similarity
Similarity match
match

II) Target
Target >
> SM
II)
III) Target
Target <
< SM
III)
IV)
IV) None of
of these

For a search over
over our
our abstracted
abstracted file,
file, the identical
identical match
match means that
that the target
target
For
starting materials
materials are identical
identical except
except for
for functionalization.
functionalization. The
The superstructure
superstructure match
match
and sta.rt.ing
t.he case
case where
where we must
must make
make carbon-carbon
carbon-carbon bonds during
during a synthesis.
synthesis. The
The
is the
substracture match
match is the case
case where
where the
the starting
starting material
material is larger
larger than
than the
the target,
target, so
substructure
carbon-ca,rbon bonds have to
to be broken.
broken. Finally,
Finally, the
the similarity
similarity match
match is where
where carboncarboncarbon-carbon
carbon
carbon bonds have to be both
both made
made and
and broken
broken during
during the synthesis.
synthesis.
Our
Our research
research in efficient
efficient starting
starting material
material strategies
strategies has continued
continued this
this past
past year
year in
two
two different
different areas. In the first,
first, we have explored
explored the prospect
prospect of
of llsing
using a.a parallel
parallel
computer in the graph
graph matching
matching process described
described in the following
following section
section and in the
computer
solution to
to the class IV
IV problem
problem (see
(see above) which
which is
second we have developed a solution
described in a subsequent
subsequent section.
section.
described
C.1.c Subgraph
Subgraph Search
Search Using
Using Parallelism.
C.Lc
Parallelism.

Subgraph
Subgraph matching
matching is an important
important method
method llsed
used in many
many different
different computer
computer
applications in organic
organic chemistry.
chemistry, including
including the
the recognition
recognition of
of functional
functional groups,
groups, synthesis
synthesis
applications
planning,
planning, constraint
const.raint testing
testing in structure
structure generation,
generation, selection
selection of
of starting
starting materials
materials for
for
synt,hesis, and structure
structure oriented
oriented retrieval.
retrieval. The
synthesis.
The fundamental
fundamental problem
problem is, given
given a query
query
.d.~trutttwe
(QS) and a5 candidate
candidate
superstructure
(CS), determine
determine if
if there
there exists a
superstructure
(CS),
substructure (QS)
mapping of
of the atoms
stems (nodes) of
of the substructure
substructure onto
onto the candidate
candidate superstructure
superstructure such
mapping
that the connected
connected atom
atom pairs
pairs in the query
query substructure
substructure are also connected
connected in the
that
superst,ructure, and that
that the atom
atom and bond
bond types
types also correspond.
correspond.
superstructure.

Although substructure
substructure search is a non-numerical
non-numerical problem.
problem, it
it is computationally
computationally
Although
demanding
demanding because
because ultimately
ult!imately it
it involves
involves establishing
establishing an atom
atom by
by atom
atom correspondence
correspondence
belween the QS and
a.nd the CS, and this
this problem
problem is a member
member of
of the class of
of NP-complete
between
NP-complete
problems. In a worst
worst case
case for
for N
N atoms
atoms in the QS and M
M atoms
atoms in the CS (M>N).
(M>N),
problems.
one
may have to
t,o consider
consider N!/(M-N)!
N!/(M-N)!
mappings for
for each CS. The
may
mappings
The objective
objective of
of our
our research
wa,s to
t]o explore
explore the feasibility
feasibility of
of applying
applying parallel
parallel processing
processing to this
this problem.
problem.
'was
Although the node ma.tching
ma.tching process is an NP-complete
problem, if
if we eliminate
eliminate all
a.11
Although
NP-complete problem.
backtracking, the order
order of
of the algorithm
algorithm reduces to
t,o O(N),
O(N), where
where N
number of
of
backtracking,
N is the number

Feigenba.um
E. A. Feigenbaum

150

RR00785-11
5P41 RR00785-11

SECS: Simulation
Simulation & Evaluation
Evaluation of
of Chemical
Chemical Synthesis
Synthesis

atoms
atoms in t.he
the subgraph.
suhgraph. This
This would
would represent
represent a major
major improvement.
improvement. Unfortunately,
Unfortunately, the
algorithm
NP-complete with
a.lgorithm is now NP-complete
with respect
respect to
to sequential
sequential processors.

configuration architecture;
architecture; a central
central processor with
with a
We
We proposed
proposed a "star"
“star” configuration
communication
communication line to
to a number
number of
of lower
lower processors, with
with no direct
direct communication
communication
Each processor
processor has a small
small amount
amount of
of memory
memory as
as a
allowed
a,llowed between
between the lower
lower processors. Each
which avoids
avoids the problems
problems inherent
inherent in shared
shared memory.
memory.
",vorking
space",
The
“working
space”, which
The
communication
communication packets
packets are compact
compact to reduce the storage
storage and communication
communication burden
burden on
the central
cent’ra.1processor.
The simulation
simulation algorithm
algorithm called
called MOLSIM
MOLSIM was implemented
implemented using the SIMULA
SIMULA
The
Isngua.ge. We
We studied
studied this
this algorithm
algorithm as a function
function of
of the number
number of processors and
and the
language.
nature
nature of
of the particular
particular graphs
graphs being
being matched
matched (6-31 non-H
non-H atoms).
atoms). We
We found
found an average
utiliza.tion
utiliza.tion of
of 84%
84% on a ~5
25 cpu machine
machine (figured
(figured as
as total
total processor time/real
time/real time),
time), bnt
but
only
only 50%
SO% on a 50 cpu machine
machine although
although for
for some structure
structure matching
matching questions,
questions, the
The average speed enhancement
enhancement using this
this size machine
machine (50
efficiency
efficiency rearhed
reached 97%.
97%. The
factor of
of 30. A
A real machine
machine of
of the architecture
architecture needed to
to run
run this
this
processors) was a factor
algorithm
algorithm exists
exists at Purdue
Purdue University,
University, and time
time is being requested
requested to test
test the algorithm
algorithm in
real time.
time. This
This algorithm
algorithm is a unique
unique approach
approach to
to the problem
problem of
of graph
graph matching
matching and will
will
likely become practical
pract.ical when
when parallel
pa.rallel processors are commonly
commonly available
available and inexpensive.
inexpensive.
likely
(This
.. )
C.1.b
(This work
work is submitted
submitted for
for publication
publication in J. Chern.
Chem. Inf.
Inf. Comput.
Comput. Sci
Sci..)
Alaximal Common
AICS-- Maximal
Search.
AICS-Common Subgraph
Subgraph Search.
The second area of
of starting
starting material
material strategy
strategy work
work this
this year
year has been in solving
solving the
The
Our solution
solution to
to this
this problem
problem involves
involves development
development of
of a
class IV
IV problem
problem given
given above. Our
efficient maximal
maximal common
common subgraph
subgraph matching
matching algorithm.
algorithm.
chemists represent
represent
new efficient
Since chemists
organic
organic molecules as
as graphs,
graphs, computational
computational chemists
chemists need graph
graph theoretical
t,heoretical techniques
techniques
such as
as graph
graph isomorphism,
isomorphism, subgraph
subgraph search, and maximal
maximal common
common subgraph
subgraph search. Of
Of
these three
three important
import,a.nt procedures.
procedures, maximal
maximal common
common subgraph
subgraph search (MCSS)
(MCSS) remains
remains the
rnost difficult
difficult and least utilized.
utilized.
most
The ext.ensive
ext,ensive comput.a.tional
computational demands
demands of
of MCSS
MCSS has restricted
restricted its
its possible uses.
uses. \Ve
We
The
have previollsly
previously not.ee!
not,ed that.
tha.t maximal
maximal common
common subgraph
subgraph search
sea.rch could
could be useful in our
our
starting mat.erial
ma,terial select.ioll
selection program,
program, SST, but
but that
that the computational
computational demands were too
too
st.arting
rigoro1ls:
Cone et al. who
interpretive
rigorous.*
who has used MCSS
MCSS in their
their "self-training
“self-training
interpretive and
retrieval
retrieval system"
system” (STIRS).
(STIRS), has noted
noted that
that other
other potential
potential uses
uses of
of for
for MCSS
MCSS include
include
comput.er-assisted
computer-assisted organic
organic synt.hesis
synthesis and structure
structure activity
activity studies."
studies.**
Given two
two gra.phs.
gra.phs, finding
finding a common
common subgraph
involves discovering
discovering the assignment
assignment
Given
subgraph involves
of
of some of
of the nodes ane!
and edges
edges of
of one graph
graph onto
onto the other
other graph
graph while
while preserving
preserving the
adjacency
adjacency relationships
relat,ionships of
of the nodes. The
The size of
of the common
common subgraph
subgraph is the number
number of
of
graph of
edges
edges preserved
preserved in the assignment.
assignment. If
If there
t,here exists no common
common sub
subgraph
of larger
larger size, the
common
common subgraph
subgrsph is called
called maximal.
maximal.
Our
Our approach
approach to MCSS
MCSS was to reduce redundant
redundant searching
searching and try
try to
to shrink
shrink the
size of
of the
t,he search space. We
size
We observe that
that most
most libraries
libraries of
of chemicals
chemicals have compounds
compounds
which have similarities;
simila.rities; by
by capitalizing
capitalizing on these similarities
similarities we might
might be able to
t,o reduce
which
the
t,he search space. The
The essentia.l
essentia>lprinciple
principle is that
that if
if we know
know a relationship
relationship between
between library
library
graph
graph A
A and B we can relate
relate query
query graph
graph C to
to A.
A, then
then we may
may therefore
therefore already
already know
know
somet.hing about
about) the
t.he relationship
relationship of
of C to
to B. We
We establish
establish the relationship
relationship between
between A
A and
a.nd
B in a one-time-only
one-time-only preprocessing
preprocessing of
of the library
library. .

•W.T.
Inf. Comput.
Sci.,
lC’.T. Wipke.
Wipke, D. Rogers,
Rogers, I.I. Chern.
Chem. Inf.
Corn&.
Sci., 10S4,
1984, ,(in
,(in press)
press)
•• M.M
MAI Cone.
Cone,

Venkataraghavan,
F.W. Mclafferty,
Melafferty,
I.I. Am.
Chem. Soc.,
Sot., 00,
QQ, 766S,
7688, (1077)
(1977)
R. Venkataraghavan,
F.W.
Am. Chern.

lSI
151

E. A. Feigenbaum
Feigenbaum

SEes:
SECS: Simulation
Simulation &
& Evaluation
Evaluation of
of Chemical
Chemical Synthesis
Synthesis

5P41
5P41 RR00785-11
RR00785-11

The
The objective
objective of
of our
our study
study was
was to
to demonstrate
demonstrate the
the feasibility
feasibility and
and study
study properties
properties
of
of such
such an
an algorithm.
algorithm. The
The following
following design
design features
features were
were deemed
deemed important.
important.
•l The
The processed
processed library
library should
should be
be storage
storage efficient.
efficient.
•l It
It should
should be
be possible
possible to
to add
add compounds
compounds incrementally
increlnentally to the file, to avoid the

cost
cost of
of reprocessing
reprocessing the
the file
file whenever
whenever changed.
changed.
•l The
The time
time required
required to
to store
store aa new
new compound
compound in
in the
the processed
processed library
library should be
be
minimal;
minimal: preferably,
preferably, an
an upper
upper limit
limit on
on this
this time
time requirement
requirement, should
should be
be known.
known.
•l It
It should be
be possible to create the compound
compound from
from its processed form,
form, so
so that
that

both
both processed
processed and non-processed libraries
libraries do not have to be
be kept.
kept.
\Ve
We wished the system to be
be useful for aa range of non-preselected
non-preselected queries, therefore
therefore
we ruled out
out simply
simply "training"
“training” the system
system for aa small
small class
class of query
query choices or depending
depending
on det.ailed
detailed Knowledge
knowledge of the allowed
allowed queries.
We had noticed
noticed that
that identical
identical common
common subgraph
subgraph candidates
candidates are often
often generated
generated
during
during the search of different
different compounds
compounds against
ag?inst aa query.
query. \Ve
We envisioned
envisioned collecting
collecting these
common
allow the search to be conducted
common segments into
into an intermediate
intermediate graph,
graph, to.
to.allow
conducted once
over
over the common
common featllre.
feature. If
If our
our search procedure
procedure can take
take advantage
advantage of
of the abstraction
abstraction
of the common
common graph
graph from
from two
two or more test compounds,
compounds, then
then the number
number of attempted
attempted
subgraph
suhgraph matches
mat)ches will
will be
be reduced.
reduced.
A
A non-recursive
non-recursive FORTRAN
FORTRAN algorithm
algorithm was implemented
implemented for
for the MCS-1
MCS-1 program
program
using aa, tree-st.ructured
st.orage
file.
The
tree
storage
search
algorithm
reductions
tree-struct,ured storage
The
storage
algorithm gives reductions
from
from 70 to
to 90%
90% in the search relative
relative to conventional
conventional unstructured
unstructured sequential
sequential storage
storage
systems.
Reductions
are
especially
good
for
the
case
where
library
contains smaller
Reductions
especially
for
case
where
the
library
contains
smaller
syst,ems.
graphs
The
of the search tree is
gra.phs or
or a series of
of similar
similar graphs.
graphs.
The general
general structure
structure of
det.ermined
tree are established,
established, addition
addition
det.ernlined early
en.rly in its creation;
creation; once the major
major nodes of the
the tree
of
Sorting
that the tree
tree can
of compounds
compounds rarely
rarely alters
alters it
it, significantly.
significantly.
Sorting experiments
experiments showed
showed that
that improved
improved results
results can be obtained
obtained when
when searching
searching over
over the
the
be "seeded"
“seeded” in such a wa.y that
seeded
library relative
relative to
to the unseeded library.
library.
seeded library
\Ve have a.pplied
applied this
this MCS-1
MCS-1 algorithm
algorithm to
to the Class IV
IV starting
starting material
material
We
recognition problem.
problem. The
The init.ial
initial abstraction
abstraction of
of the starting
material library
library resulted
resulted in an
recognition
starting mat,erial
abstracted library
library of
of significantly
significantly reduced
reduced size. Organizing
Organizing this
this abstracted
abstracted library
library in
in a
abstract.ed
tree-structured form
form allows
allows the
the discovery
discovery of
of starting
starting materials
materials which
which do not,
not have a
tree-structured
subgraph or
or super
super graph
graph relationship
relationship with
with the
the target.
target. A
A trial
trial run
run with
with morphine
morphine was
was
subgraph
successful in point,ing
point.ing out
out an interesting
interesting starting
starting material
material candidate
candidate not
not found
found by
by the
the SST
SST
progra.m. This
This work
work is being
being submitted
submitted to
to the
the J. Chem.
Chem. Inf.
Inr. Comput.
Comput. Sci..
program.

C.2 _YiTiVO
TEND Program
Program Developments
Developments
~7.2
The metabolic
metabolic fate
fate of
of various
various compounds
compounds in
in the
the human
human body
body is extremely
extremely complex,
complex,
The
extremely important
important for
for itit is known
known that
that through
through metabolism
metabolism certain
certain otherwise
otherwise
yet. extremely
harmless compounds
compounds are converted
converted into
into toxic
toxic and possibly
possibly carcinogenic
carcinogenic agents.
agents. Because of
of
harmless
this complexity
complexity itit is difficult,
difficult, looking
looking at
at a given
given compound,
compound, to
to forecast
forecast potential
potential biological
biological
this
activity of
of t,hat
that given
given compound.
compound. The
The objective
objective of
of this
this proposal
proposal is to
to develop
develop a practical
practical
activit,y
computer program
program by
by which
which a biochemist
biochemist or
or metabolism
metabolism expert
expert can
can explore
explore the
the
computer
metabolites of
of a given
given compound
compound and
and be alerted
alerted to
to the
the plausible
plausible biological
biological activity
activity of
of each
metabolites
rnet,abolite.
met,a.bolit,e.
This research
research aims
aims to
to explore
explore the
the degree to
to which
which current
current knowledge
knowledge of
of metabolism
metabolism
This
can be
be used
used by
by a computer
computer program
program to
to make
make reasonable
rea.sonable projections
projections of
of what
what metabolites
metabolites
can

E. A.
A. Feigenbaum
Feigenbaum
E.

152

SPH
SP41 RR00785-11
RR00785-11

SECS:
SECS: Simulat.ion
Simulation 8l
& Evaluation
Evaluation of
of Chemical
Chemical Synthesis
Synthesis

might
might result
result. from
from exposure
exposure of
of aa compound
compound to
to aa biological
biological system.
system. The
The project
project involves
involves
representing
in
a
computer
the
metabolic
processes
with
all
known
specificities
representing in a computer the metabolic processes with all known specificities and
and
applying
applying these
these processes
processes to
to aa xenobiotic
xenobiotic compound
compound to
to generate
generate aa set
set of
of plausible
plausible
metabolites
metabolites which
which may
may t.hemselves
themselves be
be further
further metabolized.
metabolized. \Ve
We also
also plan
plan an
an evaluation
evaluation
module
module to
to appraise
appraise the
the plausible
plausible biological
biological activities
activities of
of each
each metabolite
metabolite using aa rule base
base
to
Thus
Thus the attention
attention of the
to rebte
relate chemical
chemical structure
structure to
to biological
biological activity.
activity.
experimentalist
experiment,alist will
will be
be attracted
attracted to
to those
those metabolites
metabolites that
that are likely
likely to be
be biologically
biologically
active.
active.

C.2.a Atomic
Atomic Charges.
Chargeu.
During
During the past year, we implemented
implemented rapid
rapid atomic
atomic charge calculation
calculation algorithms
algorithms
to be
be llsed
used wit.h
wit,h the XENO
XENO program
program in order
order to better
better predict
predict the biological
biological activity
activity of
metabolite~.
metnbolites. The algorithms
algorithms were based
based on Gasteiger's
Gasteiger’s PEOE
PEOE (part,ial
(partrial equalization
equalization of
orbital
orbital electronegativity)·
electronegativity)* and SO-POE
SD-POE (sigma
(sigma dependent
dependent pi orbital
orbital electronegativity)··
electronegativity)”
mooels.
models. The PEOE
PEOE model has
has been
been used
used for sigma charge calculations
calculations for sigma
sigma bonded
bonded
and non-conjugated
non-conjugated pi systems. The
The SO-POE
SD-POE model has been used for
for conjugated
conjugated
aliphatic
aliphat,ic and single ring
ring aromatic
a.romatic molecules. For
For polyaromatic
polyaromatic systems, the pi charge
cha.rge
calculations
calculations are being implemented
implemented using the SO-POE
SD-POE model
model and the Longuet-Higgins
Longuet-Higgins
approximat.ion.
In the PEOE
PEOE model, the SO-POE
SD-POE model,
model, and polyaromatic
polyaromatic
approximation.***•••
hydrocarbon
pi charge calculation,
atoms
by
hydrocarbon
calculation,
atoms are characterized
characterized
by their
their orbital
orbital
e1ectronega,ti
vi ties.
electronegativities.
\Ve
We have shown that
that the charges so calculated
calculated are reasonable when
when compared
compared to
the work
Our
work of others
others in the literature.
literature.
Our purpose is then
then to
to correlate
correlate the biological
biological
centers in the
the metabolites.
metabolites.
a,ctivit,y
activit,y of metabolites
meta.bolites with
wit,h the atomic
atomic charge on electrophilic
electrophilic centers
\Ve
extent by
by atomic
atomic charges
We also think
think that
that metabolism
metabolism itself
itself can be controlled
controlled to
to some extent
so the metabolic
metabolic transforms
transforms may
may make
make use
use of this
this data
data eventually.
eventually.

C.2.b ph’
pKa
a Calculations.
Calculations.
\Ve have continued
continued work
work on the problem
problem of
of the
the estimation
estimation of
of the
the dissociation
dissociation
\Ve
constant.s for
for organic
organic acid and bases
bases to be used to
to increase the expertise
expertise of
of the
the XENO
XENO
constant,s
program. \Ve have been investigating
investigating two
two approaches
approaches to
to do this
this type
type of
of estimation:
estimation:
pr0gra.m.
LFER (linear
(linear free energy
energy relations),
relations), and theoretical
theoretical or
or quantum
quantum chemical
chemical approaches.****
approaches .. • ..
LFER
The LFER
LFER computation
computation is performed
performed automatically
automatically by
by first
first selecting
selecting appropriate
appropriate
The
skeleton st#ructures
structures from
from a library,
library, then
then recognizing
recognizing attached
attached groups
groups and
and finally
finally
skelet.on
calculating the
the pKa
pKa from
from the relevant
relevant equations
equations and group
group substituent
substituent constants.
constants. This
This is
ca.lculat,ing
first automatic
automatic pKa
pKa estimation
estimation algorithm
algorithm ever developed
developed and
and promises
promises wide
wide utility
utility on
on
the first
its own
own outside
outside of
of the XENO
XENO program.
program.
its
To determine
determine the
the most
most representative
representative pKa
pKa ifif more
more than
than one acid
acid or
or base center
center is
To
present, several
several empirica.
empirical rules
rules are followed:
followed:
present,

• acids
acids -- use the
the ionized
ionized form
form of
of the
the acid
acid center
center with
with the
the lowest
lowest pKa
pKa as a
substituent for
for the
the pKa
pKa calculation
calculation of
of the
the next
next lowest
lowest acid,
acid, and
and so on.
on .
substituent

l

GasteigE'r and
and M.
M. Marsili,
Marsili, Tetrahedron
Tetrahedron Iett.,
lett., 34,
84,3181,
(1981)
l•J.J. Gasteiger
3181, (1081)
Gasteiger and
and hl.
11. Ma&i,
Marsili, Tetrahedron,
Tetrahedron, 38,
88, 3219,
3210, (1080)
(1980)
l••*J.J. Gasteiger

•••H,H, C.C. Longuet-Higgins,
Longuet-Higgins, 1.
I. Chem.
Chern. Phys.,
Phy,., f3,
18, 275,
275, (1050)
(1950)

l *.

•••• D.O.

Perrin, B.
B. Dempsey,
Dempsey, and
and E.P.
E.P. Serjeant,
Serjeant, ‘·pKa
Prediction for
for Organic
Organic Acids
Acids and
and Bases.,
Bases·, Chapman
Chapman and
and
D.D. Perrin,
pKa Prediction
Hall, New
New York,
York, 1081.
}081.
Hall,

153

E. A.
A. Feigenbaum
Feigenbaum
E.

SECS: Simulation
Simulation 8[
& Evaluation
Evaluation of
of Chemical
Chemical Synthesis
Synthesis

5P41
5P41 RR00785-11
RR00785- 11

•l bases
bases - use
use the
protonated form
pKa as a
the protonated
form of
of the
the base center
center with
with the highest
highest pKa
substituent
for
the
pKa
calculation
of
the
next
highest'
base,
and
so
on.
substituent for
pKa calculation of the next highest’base,
•l strong
acids
bases - compute
pKa's
strong
acids and
and bases
compute pKa’
s for
for the acids first,
first, then
then use
use ionized
ionized
form
pKa's
bases.
form of
of the acids as substituents
subst,ituents to
to compute
compute pKa’
s of
of the
the bases.
•l strong
acids
basis - compute
pKa's
bases first,
compute pKa’
s for
for bases
first, then
then use
use
strong
acids and
and weak
weak
basis
protonated form
bases as
pKa's
protonsted
form of
of the bases
as substituents
substituents to
to compute
compute pKa’
s of
of the acids.
•l weak
basis - compute
pKa's
bases first,
compute pKa’
s for
for weak
weak bases
first, then
then use
use
weak acids
acids and
and weak
weak basis
deprot.onated
bases as substituents
pKa's
deprot,onated form
form of
of bases
substituents to compute
compute pKa’
s for
for the acids.
C.2.g Collaborative
Efforts. In
past year
program
Collaborative Efforts.
In the past
year most
most work
work was aimed
aimed at program
development
problems, however
development rather
rather than
than application
application to
to laboratory
laboratory problems,
however in the next
next year
year we
do expect
program modifications
perform several
expect after
aft*er completion
completion of
of the current
current program
modifications to
to perform
practical analyses in conjunction
NIH. and other
practical
conjunction with
with our
our collaborators
collaborators at lCl
ICI of
of UK,
UK, NIH,
other
parties that
parties
that have indicated
indicated interest.
interest.
The
project continues
pharmaceutical
The SECS project
continues to
to have collaborations
collaborations with
with the pharmaceutical
indust,ry
joint program
program development.
indust,ry who
who are adding
adding chemical
chemical transforms
transforms and doing
doing some joint
development,
for
for example.
example, Dr.
Dr. Yanaka
Yanaka continued
continued work
work started
started at Santa
Santa Cruz
Cruz after
after he returned
returned to
to
Kureha
paper has been prepared
prepared on that
Kureha. Chemical
Chemical in Japan
Japan and a paper
that work.
work.

D. List
List of
Project Publications
Publications
D.
of Current
Current Project
1.
Rapid Subgraph
Parallelism
Using Parallelism
Subgraph Search
Search Using
1. Wipke.
Wipke, W.T
W.T.,.. and Rogers, D.:
D.: Rapid

J. Chen.
Chen. Inr.
Inf. Comput.
Comput. Sci (submitted
(submitted 24 April
April 84)

2. Wipke.
\-V.T.: "An
A-Z: A
Wipke, W.T.:
“An Integrated
Integrated System
System for
for Drug
Drug Design"
Design” in Computers
Computers A-Z:

AluTwfacturer's
Hardward and
for the Pharmaceutical
Pharmaceutical
~l~untrfacturer
‘s Guide
Guide to Hardward
and Software
Software for
Industry Aster
Industry
Aster Publishing
Publishing Co., Sprinfield.
Sprinfield, Oregon.
Oregon, (in
(in press)
3. Wipke.
synthetic design.
Symmetry and
and organic
organic synthetic
design.
Wipke, W.T.
W.T. and Huber.
Huber, M.:
M.: Symmetry
Accepted
Accepted in Tetrahedron.
Tetrahedron.
4. Wipke,
Prediction of
Wipke, W.T.,
W.T., Ouchi,
Ouchi, G.I.
G.I. and Chou,
Chou, J.T.:
J.T.: Computer-Assisted
Computer-Assisted
Prediction
of
Afetabolism.
IN
L. Goldberg
(Ed.).
STRUCTURE-ACTIVITY
Goldberg
STRUCTURE-ACTIVITY
&fetnbolism.
IN
(Ed.),
CORRELATIONS
AS A
TOOL
IN
CORRELATIONS
A PREDICTIVE
PREDICTIVE
TOOL
IN TOXICOLOGY.
TOXICOLOGY.
Hemisphere
New York,
Hemisphere Publishing
Publishing Corp
Corp.,.. New
York, 1983.
1983.
5. Johnson.
A.,
Johnson, C.K..
C.K., Thiessen.
Thiessen, W.E.,
W.E., Burnett,
Burnett, M.N.,
M.N., Condran.
Condran, P. Ronlan,
Ronlan, A.,
Yanaka.
\Vipke. W.T.:
procedure.'}
Ysnaka. M. and Wipke,
W.T.: Systematic
Systematic derivation
derivation of
of chemical
chemical procedures
products, J. of
for tra7I·'Jforming
.surplus
for
transforming
surplus hazardous
hazardous chemicals
chemical8 to useful
useful products,
of
Hazardous Materials.
Alaterials. (In
Hazardous
(In press)

o.
Automated Inference
Inference in
Planning Organic
in Planning
Organic Synthesis
Synthesis
6. Dolata.
Do1a.t.a. D.P.:
D.P.: QED:
QED: Automated
(Ph.D.
(Ph.D. dissertation).
dissertation). University
University of
of California,
California, Santa
Santa Cruz,
Cruz, 1984.
1984.
7. Rogers.
Artificial Intelligence
Intelligence in
SST:
SST: Starting
Starting
Rogers, D.: Artificial
in Organic
Organic Chemistry.
Chemistry.
Alaterial Selection
(Ph.
Material
Selection Strategies
Strategies
(Ph. D. dissertation).
dissertation). University
University of
of California.
California,
Santa
Sant,a Cruz.
Cruz. 1984.
Wipke.
Artificial Intelligence
Intelligence in
SST:
in Organic
Organic Synthesis.
Synthesis.
SST:
Wipke, W.T
W.T.,.. and Rogers.
Rogers, D.: Artificial
Starting
Afaterial Selection
An Application
Application of
Search.
Starting Material
Selection Strategies.
Strategies.
An
of Superstructure
Superstructure
Search.
J. Chern.
Chem. lnf.
Inf. Cornput.
Comput. Sci..
Sci., 24:0000.
24:0000, 1984.
1984.

E. Funding
Funding Status
Status

E. A.
A. F'eigenbaurn
F‘eigenbaum

154
154

5P41
5Pfl RR00785-11
RR00785-11

SECS:
SECS: Simulation
Simulation &
& Evaluation
Evaluation of
of Chemical
Chemical Synthesis
Synthesis

1.
1. Computer-Assisted
Computer-Assisted Prediction
Prediction of
of Xenobiotic
Xenobiotic Metabolism
Metabolism

Principal
Principal Investigator:
Investigator: \V.
W. Todd
Todd Wipke,
Wipke, Professor,
Professor, UCSC
UCSC
Agency:
NIH,
Environmental
Health
Sciences;
Agency: NIH, Environmental Health Sciences; No:
No: ES02845-02
ES02845-02
4/1/82-3/31/85
$257,801
4/1,‘82-3/31/85
$257,801 TDC
TDC
4/1/84-3/31/85
$$ 89,140
4/l/84-3/31/85
89,140 TDC
TDC
2.
2. Graphical
Graphical Display
Display of
of Chemical
Chemical Inferences
Inferences and
and Molecular
Molecular
Relationships
Relationships
Principal
Principal Invest,igator:
Investigator: W. Todd
Todd Wipke,
Wipke, Professor,
Professor, UCSC
Agency: Evans and
a,nd Sutherland
Sutherland Corporation
Corporation
Gift
Gift of PS300
PS300 B/W
B/W High Performance
Performance Graphical
Graphical Display
Display System
Permanent,
Permanent, value $95,000 TDC
TDC
3.
3. Compnter
Computer Synthesis
Principal
Principal Investigator:
Investigator: W.
W. Todd
Todd Wipke,
Wipke, Professor, UCSC
Agency: Stauffer
St,auffer Chemical
Chemical Company
Company
Permanent,
Permanent, $5.000
$6,000 TDC
TDC

F.
I;. Research
Research Env£ronment
Environment
At
At the University
University of California,
California, Santa Cruz,
Cruz, we are connected to the SUMEX-AIM
SUMEX-AIM
resource by
a
4800
baud
multiplexed
leased
line.
Our
by
4800
multiplexed leased
Our video terminals
terminals consist of a Z-29,
‘Z-29.
DM-3025,
1620,
We have a PS300 graphic
DM-3025, TIi45.
TI745, CDI-I030.
CDI-1030, DIABLO
DIABLO
1620, and an ADM-3A.
ADM-3A.
graphic
display
display which
which is driven
driven by SUMEX.
SUMEX. UCSC
UCSC has only
only a small IBM
IBM 370/145,
370/145, a PDP-ll/45,
PDP-11/45,
11/70
jobs for student
11/70 and a V1\..'(
VAX 11/780,
llj780, (the l1's
11’s are restricted
restricted to running
running small jobs
student timetimesharing)
sharing) all of which
which are unsuitable
unsuitable for
for our
our current
current research. The
The SECS laboratory
laboratory is
located
locst#ed in 125
125 Thimann
Thimann Laboratories,
Laboratories, adjacent
adjacent to
to the synthetic
synthetic organic
organic laboratories
laboratories at
at
Santa
Sa.nta Cruz.
Cruz.

n.
II.

INTERACTIONS WITH
WITH THE
THE SUhdEX-AIM
SUMEX-AIM RESOURCE
RESOURCE
INTERACTIONS
A.. Medical
AJedicaJ Collaborations
Collaborations and
and Program
Program Dissemination
Dissemination via
via SUMEX
SUAJEX
A.

available in the GUEST
GUEST area of
of SUMEX
SUMEX for
for casual users, and in the SECS
SECS is available
DE~'fO area for
for serious collaborators
collaborators who
who plan
plan to
to use
use a significant
significant amount
amount of
of time
time and
DEMO
need to
to save the synthesis
synthesis tree generated. Much
Much of
of the
the access
access by
by others
others has been through
through
graphic terminal
terminal equipment
equipment at
at Santa
Santa Cruz,
Cruz, so much
much more
more convenient
convenient for
for structure
structure
the gra.phic
input and output.
output. Demonstrations
Demonstrations and sample synthetic
synthetic analyses were generated for
for
input
numerOllS visitors
visitors from
from t,he
the US and abroad.
abroad. Demonstrations
Demonstrations of
of SECS in Sweden were
numerous
performed by
by Dr.
Dr. R. E. Carter,
Carter, University
University of
of Lund,
Lund, Sweden, at
at many
many universities
universities and
performed
companies.
Professor Wipke
vVipke has also used several SUMEX
SUMEX programs
programs such as CONGEN
CONGEN in
in his
Professor
Computers and Information
Information Processing in Chemistry.
Chemistry. Communication
Communication between
on Computers
course on
collaborators is facilitated
facilitated by
by using SUMEX
SUMEX message
message drops, especially
especially when
when time
time
SECS collaborators
differences between
between the U.S. and Europe
Europe and Australia
Australia makes normal
normal telephone
telephone
differences
communication difficult.
difficult. Testing
Testing and
and collaboration
collaboration on
on the
the XENO
XENO and
and FSECS
FSECS project
project
communication
with researchers
rese8-rchers at
at the
the NC1
NCI depend on
on having
having access
access through
through SUMEX
SUMEX and
and TYMNET.
TYMNET.
with
Collaboration with
with Lund
Lund University.
University. The
The introduction
introduction of
of SECS to
to organic
organic chemists
chemists
Collaboration
in
Sweden
was
one
of
the
seeds
that
led
to
the
establishment
of
a
computer
graphics
in
of the seeds that
to the establishment of
computer graphics
laboratory for
for organic
organic chemistry
chemistry at
at the
the University
University of
of Lund,
Lund, with
with strong
strong support
support from
from aa
laboratory
government agency,
agency, the
the National
National Swedish
Swedish Board
Board for
for Technical
Technical Development
Development (STU).
(STU).
government

155
155

E. A.
A. Feigenbaum
Feigenbaum
E.

SEes:
SECS: Simulation
Simulation &
& Evaluation
Evaluation of
of Chemical
Chemical Synthesis
Synthesis

5P41
5P41 RR00785-11
RR00785- 11

Interest
Interest in the
the applications
applications of
of computers
computers and computer
computer graphics
graphics in organic
organic chemistry
chemistry has
spread
very
rapidly
throughout
the
country,
and
chemist.s
at
all
of
the
major
spread very rapidly throughout the country,
chemists at all of
major Swedish
Swedish
universities,
universit)ies, as
as well
well as in the pharmaceutical
pharmaceutical industry,
industry, have taken
taken steps to participate
participate in
the exciting
exciting developments
developments in this
this field.
field.
Interest
Interest in the pedagogical
pedagogical value
value of
of SECS at the graduate
graduate level has led to
to its
its use to
to
illustrate
illustrate the concepts
concepts of
of retrosynthetic
retrosynthetic planning
planning and analysis
analysis in conjunction
conjunction with
with a course
"Synthetic
given
“Synthetic Organic
Organic Chemistry--Chemistry--given by
by Prof.
Prof. Paul
Paul Helquist
Helquist (SUNY,
(SUNY, Stony
Stony Brook):
Brook):
Modern
The same course was given
given at the Royal
Royal Institute
Institute of
of
Modern Methods
h4ethods and Strategy
Strategy”.II . The
Technology
Technology in Stockholm,
Stockholm, and at the
the Chemical
Chemical Center
Center in Lund,
Lund, using
using SECS as an integral
integral
part
part of
of the course.
A
A Workshop
Workshop on Computers
Computers in Organic
Organic Chemistry
Chemistry was sponsored
sponsored by
by STU
STU on May
May
17-18,1983, in Gothenburg
Gothenburg to
to help organic
organic chemists
chemists in Sweden enter
enter this
this area of
of work.
work.
Daniel
Daniel P. Dolata
Dolata from
from Prof
Prof Wipke's
Wipke’s group
group in SC was an invited
invited speaker.
speaker.
A
A chemist
chemist from
from Lund,
Lund, Dr.
Dr. Alvin
Alvin Ronlan,
Roman, spent
spent a sabbatical
sabbatical leave with
with the Wipke
Wipke
group.
group, and a graduate
graduate of
of the Wipke
Wipke group,
group, Dr.
Dr. Dolata,
Dolata, is spending
spending a postdoctoral
postdoctoral stay
stay in
Lund.
Lund.
In
In collaboration
collaboration with
with the SECS group
group at UCSC,
UCSC, Dolata
Dolata will
will inst,all
install the SECS
program
11/780 for
for the use of
of the chemists
chemists in
program and the QED
QED program
program on a new VAX
VAX 11/780
Lund.
Lund, and will
will continue
continue research with
with QED.
QED. For
For example,
example, it
it would
would be of
of interest
interest to
develop rule
rule bases
bases to
to assist the
the chemist
chemist in structure
structure elucidation,
elucidation, and structure-activity
snructure-activity
relationships.
re1a.tionships.
Another
Anot,her area of
of collaboration
collaboration involves
involves compilation
compilation of
of chemical
chemical transforms
transforms by
by the
chemists
chemists in Lund.
Lund. Some of
of the chemists
chemists in Lund
Lund work
work with
with natural
natural products
products (isolation
(isolation
and synthesis),
synthesis), with
with a view
view toward
toward the discovery
discovery and characterization
characterization of
of physiologically
physiologically
active
active substances. For
For example
example a strongly
strongly mutagenic
mutagenic compound
compound has been isolated
isolated from
from a
Swedish mushroom
(Lactarius
vellereus),
its
structure
determined,
and
a
total
synthesis
mushroom (Lact,arius vellereus), its structure determined,
total synthesis
elaborated.
other work,
work, a traditional
traditional abortifacient
abortifacient from
from Bangladesh
Bangladesh is being
being isolated
isolated
elaborat.ed. In other
from
from plant
plant material,
material, and a psychoactive
psychoactive substance
substance is being
being extracted
extracted from
from the leaves of
of a
Nigerian plant.
A collaboration
collaboration with
with a university
university in Holland
Holland is now
now developing
developing along
Nigerian
p!ant. A
similar
similar lines, and Cornell
Cornell University
University is planning
planning a similar
similar center
center for
for computer
computer applications
applications
in chemistry.
chemist,ry.

B. Examples
Examples of
with
Projects
R.
o j Cross-fertilization
Cross- fertilization
with other
other SUAfEX-A1Af
SUhfEX-,4Ihf
Projects
The
The AILIST
AILIST bulletin
bulletin board
board has been used extensively
extensively for
for interacting
interacting with
with many
many
projects
projects and locating
locating references
references for
for further
further information
information related
related to
to program
program design and AI
AI
technology.
t.echnology. There
There are no longer
longer any
any other
other chemical
chemical or
or biochemical
biochemical projects
projects on SUMEX
SUkIEX so
our
our interaction
int.eraction with
with the community
community is limited
limited to AI
AI technology
technology interchange,
interchange, attending
attending
seminars
at
Stanford,
etc.
seminars
Stanford,

C. Critique
Resource Services
Clritique of
o j Resource
Services
SUMEX-AIM
SUMES-AIM gives us at UCSC,
UCSC, a small
sma.11university,
university, the advantages
advantages of
of a larger
larger
group
group of
of colleagues, and interaction
interaction with
with scientists
scientist,s all
ail over
over the
the country.
country. Previollsly
Previously we
were provided
provided very
very good service
service by
by SUMEX-AIM,
SUMEX-AIM, but
but since 1 April
April 1984, the
the computer
computer
service has been very
National AIM
very poor.
poor. Although
Although the
the National
AIM usage
usage of
of SUMEX
SUMEX has been small,
small,
our
This is a very
very
our project.
project, has been put
put in a separate
separate class with
with a 3%
3% cpu limitation.
limit,ation. This
severe restriction
restriction which
which prevents
prevents short
short usage
usage peaks from
from being
being averaged
averaged with
with other
other users.
Our
Our project
project is the only
only project
project subjected
subjected to
to such limitations.
limitations. The
The poor
poor response time
time we
are observing
observing (load
(load averages of
of 25-50!) is significantly
significantly hindering
hindering our
our ability
abilit,y to
to perform
perform
the research NIH
NIH funded
funded to be done on SUMEX.
SUMEX. This
This is worsened
worsened by
by the fact
fact we are in
ect.
th
thee Ia.st
la,st year
year of
of the proj
project.
E. A. Feigenbaum
Feigcnbaum

156
156

RR00785-11
5P41 RR00785-11

SECS: Simulation
Simulation 8l
62 Evaluation
Evaluation of
of Chemical
Chemical Synthesis
Synthesis

D. Collaborations
Atfedical Use of
Programs
D.
Collaborations and
and Medical
o j Fkograms
via
via Computers
Computer8 other
other than
than SUMEX
SUMEX
SECS 2.9 has been installed
installed on the CompuServe
CompuServe computer
computer networks
networks for
for the past
past
three
three years
years so anyone can access
access it
it without
without having
having to convert
convert code for
for their
their machine.
machine.
This
This has proved
proved very
very useful
useful as
as a method
method of
of getting
getting people to
to experiment
experiment with
with this
this new
t.echnology.
t,echnology. Dr.
Dr. George Purvis
Purvis of
of Battelle
Battelle has accessed
accessed SECS via
via CompuServe,
CompuServe, as
as has
SECS also resides on the
Gene Dougherty
Dougherty of
of Rohm
Rohm and Haas and many
many others.
others.
Medicindat
Medicindat machine
machine at the University
University of
of Gothenborg,
Gothenborg, Sweden, and is available
available all over
over
Sweden by
Similarly in Australia,
Australia, SECS resides at the University
University of
of Western
Western
by phone. Similarly
Australia
A
Australia and is available
available throughout
throughout Australia
Australia over
over CSIRONET.
CSIRONET.
A lecture
lecture series was
given
given on SECS in Tokyo
Tokyo and
a.nd SECS has been installed
installed at
at two
two locations
locations in Japan. FSECS
FSECS
has been installed
National Laboratory
installed on a DEC-IO
DEC-10 at
at Oak
Oak Ridge
Ridge National
Laboratory and serves for
for
collaborative
collaborat.ive development
development of
of that
that approach
approach with
with Carroll
Carroll Johnson. PRXBLD
PRXBLD has been
disseminated
disseminated to over
over 30 sites on various
various types
types of
of computers
computers including
including DEC-lO,
DEC-10, DEC-20,
DEC-20,
IBf'.I.
IBRI, VAX,
VAX, PRIt-.m
PRIME and Honeywell.
Honeywell.

III.
RESEARCH
PLANS
(4/84-4/85)
ill.
RESEARCH
PLANS
(4/84-4/85)

A. Near-Term
Near-Term ProJkct
Project Goals
Plans Our
A.
Goals and
and Plans
Our research projects
projects will
will move off
off
SUMEX-AIM
SUMEX-AIM by
by 31
31 March
March 1985
1985 to some other
other as
as yet
yet unspecified
unspecified computer
computer system.
system.
Therefore
Therefore our
our research object,iYes
objectives on SUMEX-AIM
SUMEX-AIM are to complete
complete research in progress,
consolidate
consolidate programs
programs and files for
for moving
moving to
to another
another system.
system. The
The QED
QED and SST
SST projects
projects
have been completed
completed and the first
first phase
phme of
of the RXAN
R,YAN project
project outlined
outlined last year
year has been
completed. On
On the SECS
SEC’S project,
project, the reaction
reaction library
library is being extended
extended by
by Dr.
Dr. Iwataki.
Iwataki.
completed.
\Ve
\Ve will
will continue
continue to collaborate
collaborate with
with coworkers
coworkers in SECS research on other
other machines
machines but
but
on SUMEX
We
SLIMES will
will primarily
primarily be preparing
preparing SECS for
for removal
removal from
from SUMEX.
SUMEX.
We are
exploring
exploring PROLOG
PROLOG as
as a replacement
replacement for
for the QED
QED system
system and plan
plan some preliminary
preliminary
sample PROLOG
PROLOG programs
programs to compare
compare the
the capabilities
capabilities of
of PROLOG
PROLOG and QED.
QED. But
But the
majority
majority of
of our
our activities
activities will
will be aimed
aimed at completion
completion of
of the XENO
XENO program.
program.
XENO Goals
Our
dY&\rO
God3
Our objectives
objectives for
for this
this year
year follow
follow our
our pla.n
plan in the original
original proposal.
proposal.
Basically in this
this next
next year
year we plan
plan to complete
complete the implementation
implementation of
of algorithms
algorithms not
not yet
yet
Basically
completed
completed and focus on testing
testing with
with applications
applications to demonstrate
demonstrate the current
current power
power of
XENO
XENO on typical
typical laboratory
Iaborat.ory metabolism
metabolism problems.
problems. In
In this
this last
last year
year of
of this
this project
project our
our
goal is to bring
bring XENO
XENO to a relatively
relatively stable
stable finished
finished point
point which
which will
will be useful to
to other
other
will complete
complete the
the algorithms
algorithms in progress, document
document them
them and
researchers. \Ve believe we will
submit
submit, publications
publications on all of
of the work
work within
within the year.
year. The
The major
major areas of
of focus are
listed below.
below.
listed
A.1
Charge Calculations.
Calculations.
A.1 ,&ornic
Atomic Charge
We
We also plan
plan to complete
complete our
our correlations
correlations
between
between atomic
atomic charges and sites of
of metabolism,
metabolism, as
as we have already
already done with
with bond
bond
reactivity.
react,ivity. \Vhen
CZ’hen such correlations
correla.tions are established,
established, then
then they
they can be used to
to guide XENO
XENO
to
to apply
apply metabolic
metabolic transforms
transforms more
more selectively
selectively to
to the most
most active
active parts
parts of the molecules.

A.2 pKa
pJ(a Calculations.
A.2
Calculatione. We
We plan
plan to complete
complete testing
testing of
of the pKa
pKa algorithm
algorithm on
different
different. groups
groups on metabolites
metsbolites so that
that information
information may
may be used by
by XENO
XENO for
for activity
activity
evaluation,
evaluation, select,ion
selection of
of further
further possible metabolism,
metabolism, and estimation
estimat#ion of
of excretion
excretion and
t.ra.nsport.
transport.
A.S Three-Dimensional
Criteria.
We
-4.3
Three-Dimensional
Criteria.
We plan
plan to
to complete
complete our
our work
work on threethreedimensional
dimensional constraints
constraints that
tha.t apply
apply to metabolism
metabolism which
which have been obtained
obtained through
through
study
This
study of many
many metabolic
metabolic studies
studies in the literature.
literature.
This will
will require
require extensions
extensions of
of the

157

Feigenbaum
E. A. Feigenbaum

SECS: Simulation
Simulation 86
& Evaluation
Evaluation of
of Chemical
Chemical Synthesis
Synthesis

5P41 RR00785-11
RR00785-11

ALCHEM
.4LCHEM language t,o
to accommodate
accommodate new types
types of
of three-dimensional
three-dimensional relationships
relationships such as
as
overall
overall molecular
molecular size, width,
width, thickness,
thickness, ratio
ratio of
of length
length to
to width,
width, etc
etc..

.1.4
-4.4 Log P Calculation.
Calculation. In
In order
order to
to more
more accurately
accurately estimate
estimate the possibility
possibility of
of
excretion
excret,ion and
a.nd binding
binding for
for met.abolism,
met,abolism, we plan
plan to
to incorporate
incorporate a log P calculation
calculation module.
module.
This
This will
will provide
provide the partition
partition coefficient
coefficient between
between octanol
octanol and water.
water. There
There are already
already
programs
Our
programs to
to calculate
calculate log P, and these have been shown
shown to
to be very
very accurate.
accurate.
our
objective,
objective, time
time permitting,
permitting, will
will be to
to include
include such a module
module in the XENO
XENO program
program and
correlate
correlate log P with
with metabolism
metabolism transform
transform application
application and with
with excretion.
excretion.
B. Justification
Justification and
Requirements for
for Continued
B.
and Requirements
Continued Use of
of SUAIEX
SUMEX
The
The XENO
XENO project
project which
which resides on SUMEX
SUMEX is in its
its last year
year of
of support,
support,
consequently
consequently we need to complete
complete that
that research on the SUMEX
SUhlEX machine.
machine. By
By 31 March
March
1985,
plan to move XENO
1985, we
we plan
XENO and all our
our research off
off SUMEX
SUMEX onto
onto some other
other computer.
computer.
\Ve are currently
currently exploring
exploring what
what machine
machine may
may be suitable
suitable and available.
available. After
After 1 April
April
198,5
1085 we will
will not
not need SUMEX
SUMEX for
for computational
computational support,
support, but
but will
will need access
a.ccessto be able
to
respond to
to electronic
electronic mail,
mail, and continue
continue to
to
t,o retrieve
retrieve certain
cert.ain files from
from archive,
archive, respond
participate
participate in the AIM
AIM scientific
scient,ific interchange
interchange through
through electronic
electronic mail
mail and bulletin
bulletin boards.
boards.
It.
practical to
It, is not
not, pract,ical
to retrieve
retrieve every
every file
file we have ever archived,
archived, it
it would
would use
use too
too much
much
SUMEX
SUMEX operator
operator time,
time, and it
it is unnecessary
unnecessary as
as long as
as we can access
access them
them if
if we need
them
them in the future.
future. That
That access
access would
would not
not require
require significant
significant resources.
prior to 31 March
However
1985 we have obligations
obligations to
to complete
complete the research on
However prior
March 1985
t,he
NIH and need sufficient
the XENO
SENO project
project supported
supported by
by NIH
sufficient SUMEX
SUMEX cpu
CPU time
time to
to accomplish
accomplish
t.his
this goal. This
This means normal
normal editing,
editing, compile,
compile, load, and test
test executions
executions plus
plus some
runs
to
some
metabolic
problems.
It
appears
the
current
removal
applicat.ion
application runs to
metabolic problems. It
of our
our
current removal of
project from
National AIM
project,
from the
t,he National
AIM portion
portion of
of the
the SUMEX-AIM
SUMEX-AIM resource and placement
pla.cement in a
of
class restricted
restricted to 3%
3% peak utilization
utilization is hindering
hindering the research productivity
productivity
of this
this
project.. \Ve are experiencing
project,.
experiencing load averages of
of 25-50 a high
high percentage
percentage of
of the
the time.
time. We
We
request
National AIM
request to
to have om
our project
project placed back
back in the National
AIM portion
portion of
of the SUMEX-AIM
SUMEX-AIM
resource as
as we were allocated,
allocated, and we will
will carefully
carefully monitor
monitor to
to see
see that
that our
our resource
utilization
utQlizat.ion does
does not
not exceed our
our quota
quota of
of time.
time. We
We feel this
this is a reasonable request
request in light
light
of
National community
of the mission
mission of
of SUMEX-AIM
SUMEX-AIM to the National
community of
of which
which this
this project
project is a part.
part.
C. Needs
Needs Beyond
Beyond SUA-lEX-AIM
SUMEX--,;1IM As mentioned
mentioned above our
our project
project needs additional
additional
computing
computing resources and we are exploring
exploring acquiring
acquiring a computer
computer for
for installation
installation at
at UCSC
UCSC
and obt.aining
obt.aining the necessary resources to
to support
support it.
it. \Ve
We are seeking information
information about
about
comparisons
comparisons between
between machines
machines and cost effectiveness
effectiveness of
of different
different hardware
hardware combinations.
combinations.

D.
Recommendations for
for Community
and
Resource Development
Development
L?. Reconmendations
Community
and Resource
It
It, appears the
the’ SUMEX-AIM
SUMEX-AIM resource is increasingly
increasingly becoming
becoming basically
basically a Stanford
Stanford
reSOllrce
resource and
a.nd that
t,hst. there
there is a difference
difference between
between the
the portion
portion of
of the resource allocated
allocated to the
National community
National community.
Our
Nationa.
communit,y and the portion
portion actually
actually used by
by the National
community.
Our
project
National community
project, is part
pa.rt of
of the National
community and in need of
of better
better service,
service, we hope that
that can
be improved.
improved.

An
An important
important part
part of
of medicine
medicine is treatment
treatment of
of disea:-es
diseases with
with drugs,
drugs, chemicals,
chemicals,
chemicals
chemicals that
t,hat were designed and synthesized
synthesized by
by chemists.
chemists. Since the termination
termination of
of the
DENDRAL
project, there
seems to be declining
declining support
support for
for artificial
artificial intelligence
intelligence
DENDRAL
project.
there seems
applications
We
applications in chemistry.
chemistry.
We feel that
that support
support of
of this
this area is essential to the
advancement
in this
The
NIH Research
advancement of
of medicine
medicine
this country.
country.
The lack
lack of
of chemists
chemists on NIH
Resources computing
computing peer review
review 'is
.is contributing
contributing to the problem.
problem. In general
general the AIM
AIM
communit.y
community would
would benefit
benefit by
by involving
involving disciplines
disciplines other
other than
than computer
computer science.

E. A. Feigenbaum
Feigenbaum

158

SOLVER
SOLVER Project
Project

5P41
5P41 RR00785-11

II.A.2.5.
SOLVER
Project
II.A.2.5.
SOLVER
Project

SOLVER:
Problem
Solving
Expertise
SOLVER:
Problem
Solving
Expertise
Dr.
Dr. P.
P. E.
E. Johnson
Johnson
Center
for
in
Learning
Center
for Research
Research
in Human
Human
Learning
University
of
University
of Minnesota
Minnesota
Dr.
Dr. W.
W. B.
B. Thompson
Thompson
Department
of
Science
Department
of Computer
Computer
Science
University
of
University
of Minnesota
Minnesota

I.
I.

SUMMARY
OF
PROGRAM
SUMMARY
OF RESEARCH
RESEARCH
PROGRAM
A.
A. Project
Project Rationale
Rationale

This
This project
project focuses
focuses upon the development
development of strategies
strategies for
for discovering
discovering and
documenting
documenting the
t,he knowledge
knowledge and skill
skill of
of expert
expert problem
problem solvers. In the last
last fifteen
fifteen years,
considerable
considerable progress has been made in synthesizing
synthesizing the expertise
expertise required
required for
for solving
solving
extremely
Computer programs
programs exist
exist with
with competency
competency comparable
comparable to
ext’remely complex
complex problems.
problems. Computer
human
human experts
exprrt,s in diverse areas ranging
ranging from
from the analysis
analysis of mass spectrograms
spectrograms and
nuclear
nuc!ear magnetic
rnagnet,ic resonance (Dendral)
(Dendral) to the diagnosis of
of certain
certain infectious
infectious diseases
diseases
(Mycin).
(Mycin).
Design of
particular task
of an expert
expert system
system for
for a particular
task domain
domain usually
usually involves
involves the
interaction
of
two
distinct
groups
of
individuals,
"knowledge
engineers,"
interaction of two distinct groups of individuals, “knowledge engineers,” who
who are
primarily concerned
problem solving
primarily
concerned with
with the specification
specification and implementation
implementation of
of formal
formal problem
solving
t,echniques,
problem area) who
provide factual
(in the relevant
relevant problem
who provide
factual and
techniques, and "experts"
“experts” (in
heuristic
problem solving
hcurist,ic information
information of
of use
use for
for the problem
solving task
t,ask under
under consideration.
consideration. Typically
Typically
the knowledge
particular
knowledge engineer
engineer consults
consults with
with one or
or more
more experts
experts and decides on a particular
represent3tional
Next, "units"
“units” of
of factual
factual information
information
representational structure
structure and inference
inference strategy.
strategy. Next,
are specified.
properties of
problem domain
specified. That
That is, properties
of the problem
domain are decomposed into
into a set of
of
manageable
processing by
by the inference
Once this
this
msnagesble elements
elements suitable
suitable for
for processing
inference operations.
operations.
organization ha.s
has been established,
established, major
major efforts
efforts are required
required to
to refine
refine representations
representations and
organization
acquire fact.ual
factual knowledge
knowledge organized
organized in an appropriate
appropriate form.
form.
Substantial research
research
a,cquire
Substantial
prohlems exist
exist in developing
developing more
more effective
effective representations,
representations, improving
improving the
the inference
inference
problems
process, and in finding
finding better
better means of
of acquiring
acquiring information
information from
from either
either experts
experts or
or the
the
problem area itself.
itself.
problem
Programs currently
currently exist
exist for
for empirical
empirical investigation
investigation of
of some of
of these questions
questions for
for a
Programs
particular problem
problem domain
domain (e.g. AGE,
AGE, UNITS,
UNITS, RLL).
RLL). These tools
tools allow
allow the investigation
investigation
particular
of a.lt,ernnt,e
alternate organizations,
organizations, inference
inference strategies,
strategies, and rule
rule bases
bases in an efficient
efficient manner.
manner.
of
\Vhat is still
still lacking,
lacking, however,
however, is a theoretical
theoretical framework
framework capable of
of reducing
reducing dependence
\Vhst
the expert’
expert's
intuition or
or on near
near exhaustive
exhaustive testing
testing of
of possible organizations.
organizations. Despite
Despite
on the
s intuition
their successes,
successes, there
there seems to
to be a consensus that
that expert
expert systems
systems could
could be better
better than
than
their
they are.
Most expert
expert systems
systems embody
embody only
only the
the limited
limited amount
amount of
of expertise
expertise that
that
Most
they
individuals are able to
to report
report in
in a particular,
particular, constrained
constrained language (e.g. production
production rules).
rules).
individuals
current systems
systems are approximately
approximately as good
good as human
human experts,
experts, given
given that
that they
they represent
represent
IfIf current
only
a
port,ion
of
what
individual
human
experts
know,
then
improvement
in the
the
only
portsion of what individual human experts know, then improvement in
"knowledge capturing”
capturing" process should
should lead to
to systems
systems with
with considerably
considerably better
better
“knowledge
performance.
performance.

159

A. Feigenbaum
Feigenbaum
E. A.

RR00785-11
5P41 RR00785-11

SOLVER
SOLVER Project
Project

and Collaboration
Collaboration
B. Medical
AIedical Relevance
Relevance and
Collaboration
Collaboration with
with Dr.
Dr. James Moller
Moller MD
MD in the Department
Department of
of Pediatrics,
Pediatrics, Dr.
Dr.
Donald
Donald Connelly
Connelly MD
MD in the Department
Department of
of Laboratory
Laboratory Medicine,
Medicine, at the University
University of
of
r-.finnesota.
hfinnesota. Collaboration
Collaborat,ion with
with Dr.
Dr. Eugene Rich
Rich MD
MD and Dr.
Dr. Terry
Terry Crowson
Crowson MD
MD at St.
Paul
Paul Ramsey
Ramsey Medical
Medical Center.
Center.

of Research
C. Highlights
Highlights of
Research Progress
Progress
Prior research at Minnesota
Minnesota on expertise
expertise in
Accomplishments
Accomplishments of
of This
This Past
Past Year
Year -- Prior
diagnosis
diagnosis of
of congenital
congenital heart
heart disease
disease has resulted
resulted in a theory
theory of
of diagnosis
diagnosis and an
embodiment
embodiment of
of that
that theory
theory in the form
form of
of a computer
computer simulation
simulation model,
model, Galen,
Galen, which
which
diagnoses cases
cases of
of congenital
congenital heart
heart disease
disease [Thompson,
[Thompson, Johnson & Moen,
Moen, 1983].
19831.
Galen
Galen is descended from
from two
two earlier
earlier programs
programs written
written here at
at Minnesota:
Minnesota:
and
Deducer [Swanson,
a.nd Dedrrcer
(Swanson, 1977].
19771. Deducer
Deducer is a program
program that
that builds
builds hemodynamic
hemodynamic
models of
of the circulatory
circulatory system
system that
that describe specific
specific diseases.
diseases. The
The models are built
built, by
by
using
usirig knowledge
knowledge about
about how
how idealized
idealized parts
parts of
of the
the circulatory
circulatory system
system are causally
causally related.
related.
Diagnoser
program
Dia.gnoser is a recognition-driven
recognition-driven
program that
that performs
performs diagnoses by
by successively
successively
hypothesizing
hypothesizing one or
or more
more of
of these models and matching
matching them
them against
against patient
patient data.
data. The
The
models that
tha,t match
match best are used as the final
final diagnosis.
diagnosis. A
A series of
of experiments
experiments carried
carried
out at Minnesota
Minnesota have shown
shown that
that Diagnoser/Deducer
Diagnoser/Deducer performs
performs as well
well (and
(and sometimes
sometimes
out
better)
better) than
than expert
expert human
human cardiologists
cardiologists [Johnson
[Johnson et aI.,
al., 1981].
19811.
Dial]lloser
Diagnouer

Despite their
their early
early successes,
successes, Diagnoser
Diagnoser and Deducer
Deducer did
did not
not have a clear,
clear,
Despite
comprehensible
comprehensible structure
structure that
that is required
required for
for the
the kind
kind of
of experiments
experiments we wish
wish to
t,o perform.
perform.
Galen
Galen was built
built to remedy
remedy this
this problem,
problem, taking
taking advantage
advantage of
of the experience
experience gained
gained in the
of Diagnoser
Disgnoser and Deducer.
Deducer.
design of
Galen
Galen consists of
of four
four major
major components:
components: a working
working memory
memory called
called the scratchpad,
scratchpad,
a knowledge
proposer and a
knowledge base
base of
of rules
rules and hypotheses,
hypotheses, a procedure
procedure called
called the proposer
procedure called
called the reviewer.
reviewer.
procedure
The
The scratchpad
scratchpad contains
contains data
data about
about the problem
problem that
that Galen
Galen is trying
trying to
to solve and
the hypotheses
hypotheses that
that are being
being investigated
investigated to
to explain
explain that
that data.
data. In
In effect,
effect, the scratchpad
scratchpad
represents
represents Galen's
Galen’s current
current execution
execution state.
state.
Rules are pattern-action
pattern-action pairs.
pairs. The
The pattern
pattern part
part of
of a rule
rule describes a possible state
state
of the
t,he scratchpad.
scratchpad. Patterns
Patterns can contain
contain imbedded
imbedded logical
logical connectives
connectives (e.g. ANDs,
ANDs, ORs,
of
NOTs) and can be constructed
NOTsj
constructed to match
match at
at varying
varying levels of
of detail.
detail. The
The action
action part
part is a
procedure
procedure that
that is executed
executed if
if the pattern
pattern part
part matches
matches the scratchpad's
scratchpad’s contents.
contents. Each
Each
action
nct,ion part
part writes
writ,es an assertion
assertion on the
the scratchpad
scratchpad about
about a hypothesis,
hypothesis, together
together with
with the
evidence for
for making
making that
that assertion.
assertion. These assertions
assertions can express that
that a new hypothesis
hypothesis is
being considered,
considered, or
or that
that an old
old hypothesis
hypothesis has been accepted, rejected,
rejected, confirmed
confirmed or
or
disconfirmed.
Action
disconfirmed.
Action parts
parts can also assert that
that a hypothesis
hypothesis is sufficient
sufficient to
to solve the
the
current. problem,
problem, or
or that
that the problem
problem is not
not solvable.
solvable.
current.
Because the pattern
pattern parts
parts of
01 rules
rules can examine
examine anything
anything on the scratchpad,
scratchpad, it
it is
possible to express rules
rules about
about hypotheses
hypotheses as well
well as
as rules
rules about
about problem
problem data.
data. In
pa.rticular, this
this makes it
it possible to
to directly
directly examine
examine the accumulated
accumulated evidence for
for and
particular,
against
against each currently
currently contending
contending hypothesis,
hypothesis, making
making numerical
numerical measures of
of certainty
certainty
unnecessary.
unnecessary.

A hypothesis
hypothesis is simply
simply a named
named collection
collection of
of rules. The
The hypotheses
hypotheses in Galen's
Galen’s
A
knowledge
knowledge base
base can be thought
thought of
of as a directed
directed graph,
graph, in which
which vertices
vertices are hypotheses
hypotheses
and
snd edges
edges are rules. One hypothesis
hypothesis IIpoints
“points toll
to” another
another if
if the first
first hypothesis
hypothesis contains
contains a
rule
rule whose action
action part
part can assert something
something about
about the second.
E. A. Feigenbaum
Feigenbaum

160

SOLVER
SOLVER Project
Project

5P41
5P41 RR00785-11

The
The level of detail
detail of such aa knowledge
knowledge base
base leads
leads to serious problems
problems with
with the
computational
focuses its computational
computational resources
compllt,st.iona! complexit.y
comp1exit.y of search processes.
processes. Galen focuses
so
so that
that the knowledge
knowledge embodied
embodied in the graph
graph of hypotheses can be
be used
used in an efficient
efficient
manner.
manner. Successful diagnoses result
result from
from good first
first hypotheses about
about possible defects and
efficient
efficient mechanisms
mechanisms for refining
refining these hypotheses.
Galen
Gslen works
works by using the proposer
proposer and the reviewer
reviewer to investigate
investigate hypotheses
hypotheses (Le.
(i.e.
search the graph)
graph) by applying
applying rules (Le.
(i.e. following
following the edges
edges from
from one vertex
vertex to another).
another).
vVhenever
Whenever a new piece of problem
problem data
data is
is written
written on the scratchpad,
scratchpad, the proposer
proposer applies
all
a!! relevant
relevant rules specific
specific to the type
type of that
that piece of data.
data. These rules write
write assertions
assertions on
the scratchpad
scra.tchpa,d about
about new hypotheses, effectively
effectively identifying
identifying vertices
vertices in the graph
graph that
that are
worthy
Next, the reviewer
reviewer applies all relevant
relevant rules
worthy starting
starting points
point,s for further
further search. Next,
contained
contained in the hypotheses
hypotheses that
that are named in assertions on the scratchpad.
scratchpad. Successfully
Successfully
applying
applying one of these rules corresponds
corresponds to propagating
propagating the search along a specific
specific edge
edge of
the graph.
graph. The
The search is
is constrained
constrained because
because (1) only
only the most promising
promising vertices
vertices in the
graph
graph are ever used
used to initiate
initiate search; (2) only
only a small
small number
number of
of edges
edges are ever followed;
followed;
a.nd
and (:3)
(3) most
most rules in a hypothesis
hypothesis deal with
with evidence for
for and against
against the hypothesiS
hypothesis itself,
itself,
giving
giving a graph
gra.ph where
where the number
number of
of effectively
effectively outward-pointing
outward-pointing edges
edges at each vertex
vertex is
small.
small.
The
The read-propose-review
rea,d-propose-review cycle repeats
repeats in this
this way
way until
until some hypothesis
hypothesis has been
shown
shown correct,
correct, until
unt,il the
t,he problem
problem has been shown
shown unsolvable,
unsolvable, or
or until
until all the data
data has been
examined.
examined.
Currently,
Currently, data
data given
given to Galen
Galen is taken
taken from
from a (possibly
(possibly imaginary)
imaginary) patient's
patient’s
medical
Hypotheses
medical chart.
chart.
Hypotheses in the knowledge
knowledge base
base represent
represent the ten most
most commonly
commonly
occurring
physiological
occurring congenital
congenital heart
heart diseases
diseases and their
their variants,
variants, useful intermediate
intermediate physiological
findings,
findings, and classes
classes of
of hypotheses.
hypotheses. Since hypotheses
hypotheses are implemented
implemented as named
named teams
teams of
of
production rules, it
product,ion
it is also possible to
to represent
represent other
other kinds
kinds of
of hypotheses
hypotheses should
should the need
arise.
Moreover,
arise.
IMoreover, Ga.len
Ga,!en has been constructed
constructed so that
that its
its inference
inference engine does not
not contain
contain
any procedures
procedures specialized
pediatric cardiology.
specialized for
for pediatric
cardiology. It
It is therefore
therefore conceivable
conceivable to extend
extend
Galen
Gnlen to other
ot,her domains
domains if
if effective
effective knowledge
knowledge bases
bases for
for those domains
domains can be constructed.
constructed.
To
To determine
determine the generality
generality of
of our
our model
model of
of expertise
expertise in diagnostic
diagnostic reasoning,
reasoning, we
are
part of
a.re also investigating
investigating domains
domains outside
outside medicine.
medicine. As part
of this
t,his effort,
effort, we have developed
developed
a computational
program debugging
computational model of
of the fault
fault localization
localization process in program
debugging [Sedlmeyer,
[Sedlmeyer,
198:3]
with our
our work
work in congenital
congenital heart
heart disease,
disease,
19831 that
that is not
not, based directly
directly on Galen.
Galen. As with
concentrated on the design of
of mechanisms
mechanisms for
for structuring
structuring problem
problem specific
specific
we have concentrated
knowlt'dge a,nd
and for
for focusing
focusing limited
limited computational
computational resources.
knowledge
Progress -- Since human
human experts
experts are notoriously
notoriously poor
poor at
at describing
describing
Resra.rch in Progress
Rrsca.rch
their own
own knowledge,
knowledge, our
our work
work requires
requires the
the creation
creation of
of problem
problem solving
solving tasks
tasks through
through
t,heir
which expert.s
experts can reveal
reveal crit.eria
criteria for
for initiating
initiating specific
specific hypotheses
hypotheses and methods
methods for
for
which
those hypot.hcses.
hypotheses.
invest iga ting t,hose
investigating
Current techniques
t.echniques of
of representing
representing hypotheses
hypotheses and their
their expectations
expectations for
for diagnosis
diagnosis
Current
do not,
not, however,
however, provide
provide much
much detailed
detailed information
information about
about the
the control
control processes expert,s
experts
do
use to
to guide
guide their
t.heir reasoning.
reasoning. Such control
control processes typically
typically incorporate
incorporate highly
highly refined
refined
use
heuristics about
about. which
which the
the experts
experts are almost
almost wholly
wholly unaware.
unaware. To
To discover
discover the
the needed
heuristics
cont.rol knowledge,
knowledge, we ask experts
experts to
to complete
complete tasks
tasks in which
which we have systematically
systematically
cont,rol
perturbt~d aspects of
of the
the problem
problem data.
data. The
The data
data in
in these tasks
tasks are chosen so that
that
perturbed
members
of
an
overlapping
set
of
hypotheses
will
be
suggested
during
while
solving
the
members of
overlapping
of hypotheses will
during while solving the
problem.
Success
in
solving
such
problems
depends
on
the
ability
to
overcome
an
initially
problem. Success in solving
problems
on the ability to overcome
initially
plallsihle incorrect
incorrect hypot,hesis
hypothesis in favor
favor of
of a later,
later, more
more correct
correct alternative.
alternative.
plausible

161
161

A. Feigenbaum
Feigenbaum
E. A.

5P41 RR00785-11
RR00785-11

SOLVER
SOLVER Project
Project

Several
Several examples
examples illustrate
illustrate our
our approach.
approach. \Ve
We are studying
studying performance
performance of
of Galen
Galen
cases [Johnson
[Johnson &
& Thompson,
Thompson, 1981]
19811 that
that were initially
initially misdiagnosed
misdiagnosed in
on "garden
“garden path"
path” cases
hospit.al
hospit,al files. Analysis
Analysis of
of such cases
cases suggest that
that errors
errors are made because
because experts
experts rely
rely on
very efficient
efficient heuristics
heuristics that
that are not
not universally
universally correct.
correct. In
In one such example,
example, a seemingly
seemingly
very
plausible
plausible hypothesis
hypothesis is suggested early
early in the case.
case. Although
Although the hypothesis
hypothesis superficially
superficially
seems
seems to
t,o explain
explain what
what is observed
observed about
about the patient,
patient, the hypothesis
hypothesis is incorrect.
incorrect. Because
the incorrect
incorrect hypothesis
hypothesis seems
seems marginally
marginally adequate, it
it acts to prevent
prevent a more correct
correct
Success in such a case
case hinges on the ability
ability
hypothesis
hypothesis from
from being
being suggested in
in its
its place.
place. Success
to use
use the proper
proper set of
of competing
competing hypotheses
hypotheses in order
order to provide
provide more than
than one
to
Investigation of
of this
this phenomenon
phenomenon in human
human experts
experts has
explanation
explanation of
of the case
case data.
data. Investigation
linking disease
disease hypotheses
hypotheses in Galen.
Galen. It
It has
suggest.ed
suggested implementation
implementation of
of "transition
“transition rules"
rules“ linking
that watch
watch for
for potential
potential garden
garden
also suggested implementation
implementation of
of "monitor
“monitor hypotheses"
hypotheses” that
pat.h
path errors
errors and avoid
avoid them
them before
before they
they become serious.

We are also investigating
investigating several research questions
questions relevant
relevant to
to t.he
the architecture
architecture of
of
We
Galen.
Galen. \Ve
We have designed an interface
interface to
to Galen
Galen so that
that users who
who are unfamiliar
unfamiliar with
with the
inner workings
workings of
of the program
program can interactively
interactively enter
enter case
case data.
data. Designing
Designing the interface
interface
inner
questions about
about what
what forms
forms of
of data
data are necessary to adequately
adequately and completely
completely
raised questions
We are also studying
studying ways in which
which a causal reasoning
reasoning
represent
represent all possible cases.
cases. We
component.
Proposer
component, can be integrated
integrated with
with the prototypical
prototypical reasoning
reasoning components
components (the
(the Proposer
and Reviewer)
Reviewer) that
that are already
already present
present in Galen.
Galen. In
In particular,
particular, we are interested
interested in
st.udying
studying ways in which
which causal reasoning
reasoning can aid or
or replace prototypic
prototypic reasoning
reasoning when
when it
it
becomes inadequate
inadequate to reach a diagnosis.
diagnosis.
become~
In another
another project,
project, we are investigating
investigating methods
methods of
of probabilistic
probabilistic reasoning.
reasoning. Most
Most
sy:::tems
systems rely
rely on numerical
numerical schemes for
for weighting
weighting evidence or
or ranking
ranking observed
observed data.
data.
weights are often
often probabilistic
probabilistic in nature,
nature, but
but other
other schemes have also been used.
These weights
Myrin.
Mycin, for
for example.
example, uses
uses certainty
certainty factors
factors and PIP
PIP uses
uses likelihoods
likelihoods composed of
of matching
matching
scores and binding
binding scores. In
In contrast,
contrast, humans
humans do not
not seem to rely
rely upon
upon such numerical
numerical
techniques.
t,echniques. Research
Resea.rch has shown
shown that
that people are often
often quite
quite poor
poor at probabilistic
probabilistic
reasoning.
However,
reasoning.
However, experts
experts make
make decisions which
which involve
involve weighting
weighting evidence and
selecting
selecting from
from competing
compet.ing alternatives.
alternatives. They
They must
must utilize
utilize a reasoning
reasoning process which
which serves
as
as an alternative
alternative to a numerical
numerical weighting
weighting technique.
technique.

We believe the process of
of weighting
weighting alternatives
alternatives along various
various criterial
criteria] dimensions
dimensions
\Ve
is not
not a domain
domain specific
specific technique,
technique, but
but rather
rather a general process which
which is applied
applied in specific
specific
t,he coming
coming year,
year, we wiJI
will examine
examine this
t,his process in various
various domains
domains and
instances.
In the
instsnccs.
a.ttempt
a.tt,empt to ut.ilize the result·s
result,s in designing
designing more
more powerful
powerful reasoning
reasoning techniques.
techniques.
In
In the area
a.rea of
of law,
law, our
our work
work has focused
focused on the area of
of corporate
corporate law
law (the
(the problem
problem
of
a proposed
We
of structuring
structuring
proposed corporate
corporate acquisition).
acquisition).
We have collected
collected data
data from
from ~4
24
pra.ct.icing law.\"ers
lawyers and in the coming
coming year
year a PhD
PhD thesis will
will be completed
completed describing
describing this
this
practicing
work.
work. In the coming
coming year
year we will
will be also completing
completing a study
study of
of the corporate
corporate acquisition
acquisition
problem
problem in man~1!;ement
man:tgement in order
order to
to further
further refine
refine our
our knowledge
knowledge capturing
capturing tools.
tools.

D. List
Li.'3t of
Relevant Publications
Publications
D.
of Relevant

1. Connelly,
Connelly, D. and Johnson, P.E.:
P.E.: Medical
solving. Human
Human Pathology,
Pathology,
1.
fl.;fedical problem
problem solving.
11(5):41?-419,
11(5):41~-419.

1980.
1980.

2. Elstein,
Proposed Reaeatch
Research
Elstein, A.,
A., Gorry,
Gorry, A.,
A., Johnson, P. and Kassirer,
Kassirer, J.: Proposed
Efforts. IN
Efforts.
IN D.C.
D.C. Connelly,
Connelly, E. Benson and D. Burke
Burke (Eds.),
(Eds.), CLINICAL
CLINICAL
DECISION MAKING
MAKING AND
AND LABORATORY
LABORATORY
University of
of Minnesota
Minnesota
DECISION
USE. University
Press.
Press, 1982,
1982, pp. 327-334.

E. A. Feigenbaum
Feigcnbaum

162
152

SOLVER
SOLVER Project
Project

5P41 RR00785-11
RR00785-11

P.J.:
Knowledge based components
P.J.: Knowledge
components
diagnosis.
Learning
diagnosis. Learning Research and Development
Development
PDS-:!,
PDS-2, University
University of
of Pittsburgh,
Pittsburgh, September,
September, 1981.
1981.

3. Feltovich,
Feltovich,

of
in
of expertise
expertise
in
Center
Technical
Center Technical

medical
medical
Report
Report

Role and
4. Feltovich,
Feltovich, P.,I.,
P.J., Johnson,
Johnson, P.E.,
P.E., Moller,
Moller, J.H.
J.H. and Swanson, D.B.:
D.B.: The Role
and
Development of
J\Jedical Knowledge
Knowledge in
Diagnostic Ezpertise.
Expertise.
IN
IN
Development
of &fedical
in Diagnostic
W.
W’. Clancey
Clancey and E.H.
E.H. Shortliffe
Shortliffe (Eds.),
(Eds.), READINGS
READINGS IN
IN MEDICAL
MEDICAL AI.
AI. (In
(In
press)
5. Johnson,
Models of
Medical Problem
Problem Solvers.
Cognitive Models
of Medical
Solvers. IN
IN D.C.
D.C.
Johnson, P.E.:
P.E.: Cognitive
Connelly,
Connelly, E. Benson, D. Burke
Burke (Eds.),
(Eds.), CLINICAL
CLINICAL DECISION
DECISION MAKING
MAKING AND
AND
LABORATORY
USE. University
LABORATORY
University of
of Minnesota
Minnesota Press, 1982,
1982, pp. 39-51.
6. Johnson,
Johnson, P.E.:
P.E.:

}Vhat kind
should a system
system be? J. Medicine
What
kind of
of expert
ezpert should
Medicine and

Philosophy,
Philosophy, 8:77-97,
8:i7-97, 1983.

Expert Mind:
Mind: A new
for the Information
Information
7. Johnson, P.E.,
P.E., The Expert
new Challenge
Challenge for
SciWtl:st
IN
SYSTElvl
Scientist
IN Th.
Th. M.
M. A.
A. Bemelmans
Bemelmans (Ed.),
(Ed.), INFORMATION
INFORMATION
SYSTEM
DEVELOPMENT
FOR
EFFECTIVENESS,
Elsevier
DEVELOPMENT
FOR ORGANIZATIONAL
ORGANIZATIONAL
EFFECTIVENESS,
Elsevier
Science Publishers
Publishers B. V.
V. (North-Holland),
(North-Holland), 1984.
1984.
Design of
8. Johnson,
D.G. and Feltovich,
Feltovich, P.J.:
P.J.: Design
of decision
decision
Johnson, P.E.,
P.E., Severance, D.G.
support agatema
systems in
Rationale and
principles from
from the analysis
support
in medicine:
medicine: Rationale
and principles
analysis of
of
ph!Jsician expertise.
physician
expertise. Proc.
Proc. Twelfth
Twelfth Hawaii
Hawaii International
International Conference
Conference on System
System
Science, 'Western
Western Periodicals
Periodicals Co.
Co. 3:105-118,
3:105-118, 1979.
1979.
9. Johnson,
Johnson, P.E.,
P.E., Duran,
Duran, A.,
A., Hassebrock,
Hassebrock, F.,
F., Moller,
Moller, J., Prietula,
Prietula, M.,
M., Feltovich,
Feltovich,
P. and Swanson, D.: Expertise
Expertise and
in diagnostic
diagnostic reasoning.
reasoning. Cognitive
Cognitive
and error
error in
Science 5::!35-283,
5:235-283, 1981.
1981.
10. Johnson.
Jvfodels
Johnson, P.E.
P.E. and Hassebrock,
Hassebrock, F.:
F.: Validating
Validating Computer
Computer Simulation
Simulation Models
of
Expert
Reasoning.
IN
R.
Trappl
(Ed.),
CYBERNETICS
AND
SYSTEMS
of Expert Reasoning. IN
Trappl (Ed.), CYBERNETICS AND SYSTEMS
RESEARCH.
North-Holland Publishing
RESEARCH. North-Holland
Publishing Co., 19.82.
1982.
and Thompson,
down
garden path:
path:
Thompson, \V.B.:
W.B.: Strolling
Strolling
down the garden
and
from error
problem solving.
solving. Proc.
and recovery
recovery
from
error in
in expert
expert problem
Proc. Seventh
Seventh
IJCAI,
IJCAI, Vancouver,
Vancouver, B.C.,
B.C., August,
August, 1981,
1981, pp. 214-217.

11
11.. .lohnson,
Johnson,

P.E.
P.E.

Detection
Detecfion

P.E.,
Jvfultimethod study
study of
Moller. J.H.:
J.H.: bfultimethod
of
P.E., Hassebrock,
Hassebrock, F. and Moller,
clinical
judgement. Organizational
Behavior
clinical
judgement.
Organizational
Behavior and Human
Human Performance
Performa.nce
30:201-230, 1982.

12.
12. Johnson,
Johnson,

13. l\1011er,
New techniques
hfo!ler, J.H
J.H.,.. Bass, G.M.,
G.M., Jr. and Johnson, P.E.:
P.E.: New
techniques in
in the
of
patient management
problems. Medical
Education
ron.'ltrurtion
construction
of patient
management
problems.
Medical
Education
15:1.')0-153,
1981.
15:l.W153,
1981.
14.
simulation of
problem solving
solving in
14. Swanson, D.B.:
D.B.: Computer
Computer
simulation
of expert
expert problem
in medical
medical

diagnosis.
diclgnosis. Unpublished
Unpublished Ph.D.
Ph.D. dissertation,
dissertation, University
University of
of Minnesota,
Minnesota, 1978.
1978.
Psychological Analysis
Analysis of
15. Swanson, D.B.,
D.B., Feltovich,
Feltovich, P.J.
P.J. and Johnson, P.E.:
P.E.: Psychological
of
Physician
Expertise:
Implications
for
The
Design
of
Decision
Support
Physician Ecpertiae:
Implications
for
Design of Decision
Support
Systems.
In D.B.
North-Holland
Systems.
D.B. Shires and H. \Vold
\Vold (Eds.),
(Eds.), MEDINF077,
MEDINF077,
North-Holland
Publishing
Publishing Co., Amsterdam,
Amsterdam, 1977,
1977, pp. 161-164.
161-164.
15.
Recognition-based diagnostic
16. Thompson,
Thompson, W.B.,
W.B., .Johnson,
Johnson, P.E.
P.E. and Moen,
Moen, J.B.: Recognition-based
diagnostic
reasoning.
reasoning. Proc.
Proc. Eighth
Eight,h IJCAI,
IJCAI, Karlsruhe,
Karlsruhe, West
West Germany,
Germany, August,
August, 1983.
1983.

163
163

E. A.
A. Feigenbaum
Feigenbsum

5P41 RR00785-11
RR00785-11

SOLVER
SOLVER Project
Project

17.
17. Sedlmeyer,
Sedlmeyer, R.L.,
R.L., Thompson,
Thompson, W.B.
W.B. and Johnson, P.E.:
P.E.:
localization
in
debugging.
The
localization in debugging. The Journal
Journal of
of Systems
Systems and

Knowledge-based fault
fault
Knowledge-based
Software
Software (in
(in press).

18.
Diagnostic reasoning
18. Sedlmeyer,
Sedlmeyer, R.L.,
R.L., Thompson,
Thompson, W.B.
W.B. and Johnson, P.E.:
P.E.: Diagnostic
reasoning
in
software fault
fault localization.
Proc.
in software
localization.
Proc. Eighth
Eighth IJCAI,
IJCAI, Karlsruhe,
Karlsruhe, West
West
Germany,
Germany, August,
August, 1983.
19. Sedlmeyer,
Knowledge-Based Fault
Fault
Sedlmeyer, R.L.,
R.L., Thompson,
Thompson, W.B.,
W.B., 8l
& Johnson,
Johnson, P.E.:
P.E.: Kno,wledge-Baaed
Debugging. The
Localization in
Localization
in Debugging.
The Journal
Journal of
of Systems
Systems and Software
Software (in
(in press).

E. Funding
Funding and
E.
and Support
Support
\Vork
\Vork on the SOLVER
SOLVER project
project is currently
currently supported
supported by
by a grant
grant from
from the Control
Control
Data
to
Dat,a. Corporation
Corporation
to Paul
Paul Johnson
Johnson ($90,000;
($90,000; 1983-85) and by
by a grant
grant from
from the
Microelectronics
Microelectronics and Information
Information Sciences Center
Center at the University
University of
of Minnesota
Minnesota to
to Paul
Paul
Johnson, William
William Thompson
Thompson and two
two colleagues ($800,000;
($800,000; 1984-87).

II.
II.

INTERACTIONS
vVITH
THE
RESOURCE
INTERACTIONS
WITH
THE SUMEX-AIM:
SUMEX-AJM
RESOURCE
A. Medical
Afedical Collaborations
Program Dissemination
Dissemination via
A.
Collaborations and
and fiogram
via SUAIEX
SUMEX

\Vork
\Vork in medical
medical diagnosis
dia.gnosis is carried
carried out
out with
with the
the cooperation
cooperation of
of faculty
faculty and
students
Paul Ramsey
Ramsey Medical
Medical
students in tht'
the University
University of
of Minnesota
Minnesota Medical
Medical School and St. Paul
Center.
Cent,er.

B. Sharing
Interactions with
ProJects
B.
Sharing and
and Interactions
with Other
Other SUA-lEX-AIM
SUfifEX-AIM
Projects
A
A year
year ago, conversations
conversations were begun with
with \Villiam
William Clancey
Clancey at Stanford
Stanford University
University
regarding
regarding collaboration
collaboration on the study
study of
of current
current knowledge
knowledge capturing
capturing methods.
methods. We
We plan
plan
to develop
develop this
this collaboration
collaboration in the coming
coming year.
year.

C. Critique
Resource Management
Management
Critique of
o j Resource
(None)
(None)

ill.
RESEARCH
PLANS
III.
RESEARCH
PLANS
A. Project
Project Goals
Plans
A.
Goals and
and Plans
Near term
Near
term -- Our
Our research objectives
objectives in the near term
term can be divided
divided in three
three
parts. First,
parts.
First., we are committed
committed to
to the design, implementation,
implementation, and evaluation
evaluation of
of Galen,
Ga.len, as
as
described
We have completed
completed an interactive
interactive front
front end so that
that physicians
physicians can
described above. \Ve
directly
direct.& enter
enter patient
ont.ient data,
data, and Galen's
Galen’s knowledge
knowledge base
b%e is currently
currently being "tuned"
“tuned” with
with
tile
the help of Dr.
Dr. James Moller
Moller MD,
MD, an expert
expert physician
physician collaborator
collaborator from
from the University
University of
of
f\Iinnesota
r\linnesot,n Pediatric
Pediatric Cardiology
Cardiology Clinic.
Clinic. During
During the coming
coming year,
year, Galen's
Galen’s performance
performance will
will
be compareo
compwred with
wit,11that
thn.t of
of the Diagnoser
Dingnoser program
program and with
with expert
expert phYSicians.
physicians.
Our
Our second objective
objective consists of
of making
making extensions
extensions to the knowledge
knowledge capturing
capturing
strategies
loped in our
strategies deve
developed
our original
original work
work in medical
medical diagnosis. In
In the near term
term this
this work
work
will
will examine
examine descriptive
descriptive strategies
strategies in which
which experts
experts attempt
attempt to use
use a formalized
formalized language
to express what
production rules),
rules), observational
observational strategies
strategies in which
which
what they
they know
know (e.g. production
experts
perform
t;1sks
designed
to
reveal
information
from
which
a
theory
expert,s perform t.asks
reveal information from which
theory of
of task
task specific
specific
expl'rtise
expcrtize can be bnilt,
built, and intuitive
intuitive strategies
strategies in which
which either
either experts
experts behave as
as knowledge
knowledge
engineers or knowledge
knowledge engineers attempt
attempt to
to perform
perform as
as pseudo experts.
experts. In the coming
coming
year
year we ,viII
will also be attempting
attempting to develop a program
program to
to automate
automate the early
early stages of
of

E. A. Feigenbanm
Feigenbaum

154
164

SOLVER
SOLVER Project
Project

5P-11 RROO785-11

knowledge
knowledge capturing,
ca.pturing, analogous to the "prototype
“prototype stage"
stage” of design referred
referred to in software
software
engineering.
engineering.
Our
Our third
t)hird near term
term objective
objective will
will be
be to investigate
investigate one
one of the central
central problems
problems of
recognit.ion
classify problems
problems when solving
solving them.
them.
recognition ba.<;ed
based problem
problem solving,
solving, how to classify
Questions
Questions related
relat.ed to problem
problem classification
classification which
which we will
will be
be examining
examining include:
include: What
What
patterns
patterns do experts
experts and novices detect
detect in aa problem
problem that
that allows them
them to classify
classify it
it as
as an
instance
inst,ance of aa problem
problem type
type that
that is already
already known?
known? How does
does an expert
expert make an initial
initial
choice of the level of abstraction
abstraction to be
be used in solving
solving aa problem?
problem? How
How can an expert
expert
recover
recover from
from an
a.n initial
initial incorrect
incorrect choice of levels?
levels? How
How can the difference
difference between
between causal
and
a.nd prot,otypic
prot,ot,ypic modes of reasoning
reasoning be modeled as
as differences
differences in levels of abstraction,
abstraction, and
how can aa common
common model for
for these two
two types of reasoning
reasoning be
be constructed?
constructed? \Ve
We will
will be
be
pllrsuing
pllrsuing these questions in the area of physics problem
problem solving,
solving, as
as well
well as
as in medicine.
medicine.
Long range -- Our
objective is
is to improve
improve the methodology
methodology of the
Our long range objective
"knowledge
“knowledge capturing"
capturing” process that
that occurs in the early
early stages of the development
development of
expert
expert systems
systems when
when problem
problem decomposition
decomposition and solution
solution strategies
stra.tegies are being specified.
specified.
Several
Severs1 related
related questions
questions of interest
interest include:
include: What
What are the performance
performance consequences
consequences of
of
different
different approaches,
spproaches, how can these consequences
consequences be evaluated,
evaluated, and what
what tools
tools can assist
in making
making the best choice?
choice? How
How can organizations
organizations be determined
determined which
which not
not only
only perform
perform
well,
well, but,
but are structured
structured so as
as to facilitate
facilitate knowledge
knowledge acquisition
acquisition from
from human
human experts? In
t.he
t,he coming
coming year
year we will
will be exploring
exploring these questions in areas of
of design and management
management
a.'
as3 ','iell
well as
as in law, physics
physics and
a.nd medicine.
medicine.

B. Justification
for Continued
Jelati jication and
and Requirements
Requirements for
Continued SUMEX
SUMEX Use
Use
Our
Our current
current model
model development
development takes
takes advantage
advantage of
of the
the sophisticated
sophisticated Lisp
Lisp
programming environment
programming
environment, on SUMEX.
SUMEX. Although
Although much
much current
current work
work with
with Galen
Galen is done
A.,'( 11/780,
benefit from
using aa. version
version running
running on a local V
VAX
11/780, we continue
continue to benefit
from the
int.eraction
by the SUMEX
use
int,era.ct,ion with
with other
other researchers facilitated
facilitated by
SUMEX system.
system. We
We expect
expect to use
SU~IEX to allow
program. We
plan to continue
use
SU:,lEX
allow other
ot,her groups
groups access
access to t.he
the Galen
Galen program.
We also plan
continue use
of
of the knowledge
knowledge engineering
engineering tools
tools available
available on SUMEX.
SUMEX.

C. Needs
Needs and
Plans for
for Other
Resources Beyond
Beyond SUAfEX-AIA1
and Plans
Other Computing
Computing Resources
SUibfEX-Al&f
Paul
Paul .Johnson
Johnson is a member
member of
of the
the group
group of investigators
investigators who
who have recently
recently
submitt.ed
proposal to
submitt,ed a proposal
to establish
establish a national
national computer
computer network
network for
for cognitive
cognitive scientists
scientists
(COGNET). In addition,
addition. our
our current
current grant
grant will
will permit
permit the
the purchase
purchase of
of some single-user
single-user
(COGNET).
computers (we are currently
currently comparing
comparing several
several alternative
alternative machines).
machines).
SUMEX will
will
computers
SUMEX
continue to
to be used for
for collaborative
collaborative activities
activities and for
for program
program development
development requiring
requiring
cont.inue
tools not
not available
available locally.
locally.
tools

D. Recommendations
Recommendations for
for Future
Future Community
Community and
and Resource
Resource Development
Development
D.
(None)
(None)

165

A. Feigenbaum
Feigenbaum
E. A.

Pilot
Pilot Stanford
Stanford Projects
Projects

5P41 RR00785-11
RR00785- 11

II.A.3.
Pilot
Projects
II.A.3.
Pilot Stanford
Stanford
Projects
Following
pilot projects
projects currently
Following are descriptions
descriptions of
of the informal
informal pilot
currently using
using the
Stanford
portion of
resource,
pending funding,
St,anford portion
of the SUMEX-AIM
SUMEX-AIM
resource, pending
funding, full
full review,
review, and
authorization.
authorization.
In addition
progress reports
presented here, abstracts
project are
addition to the
the progress
reports presented
abstracts for
for each project
submitted
submitted on a separate
separate Scientific
Scientific Subproject
Subproject Form.
Form.

E. A.
A. Feigenbaum
Feigenbaum

156
166

CAMDA
CAMDA Project
Project

5P41
RfW0785-11
51’
41 RFt00585-11

IT.A.3.1. CAMDA
CAMDA Project
Project
II.A.3.1.
CAMDA Project
Project
CAMDA
CAMDA Research
Research Staff:
Staff:
CAMDA
Samuel Holtzman,
Holtzman, Co-PI
Co-PI
Samuel
Prof. Ronald
Ronald A.
A. Howard,
Howard, Co-PI
Co-PI
Prof.
Jack Breese
Breese
Jack
Dr. Emmet
Emmet Lamb
Lamb
Dr.
Dr. Robert
Robert Kessler
Kessler
Dr.
Dr. Frank
Frank Polansky
Polansky
Dr.

Engineering-Economic
Engineering-Economic Systems
Systems
Engineering-Economic Systems
Systems
Engineering-Economic
Engineering-Economic
Engineering-Economic Systems
Systems
School
School of
of Medicine
Medicine
School
School of
of Medicine
Medicine
School
School of
of Medicine
Medicine

Associated faculty:
faculty:
Associated
Engineering-Economic
Engineering-Economic Systems
Systems
Engineering-Economic
Engineering-Econom ic Systems
Systems

Prof. Edison
Edison Tse
Tse
Prof.
Prof. Ross
Ross Shachter
Shachter
Prof.

I.
SUMMARY OF
OF RESEARCH
RESEARCH PROGRAM
PROGRAM
I. SUMMARY
A. Project
Project Rationale
Rationale
A.
The
project is an attempt
The Computer-Aided
Comput,er-Aided Medical
hiedical Decision
Decision Analysis
Analysis (CAMDA)
(CAMDA) project
attempt to
t,o
develop
by taking
develop intelligent
int,elligeut medical
medical decision
decision systems by
taking advantage
advantage of
of the complementary
complementary
methodologies
methodologjes of
of decision
decision analysis
a.nalysis and artificial
artificial intelligence.
intelligence.

Medical Relevance
Relet'ance and
B. Medical
und Collaboration
Collaboration
The
primary effort
project during
The primary
effort of
of the CAMDA
CA>lDA project
during 1983 was focused on the design
and implementation
implement,st.ion of
of RACHEL,
RACHEL, an intelligent
intelligent decision system
system for
for infertile
infertile couples. This
This
effort
physicians and patients
patients deal with
effort is aimed
a.imed at
at helping
helping physicians
with difficult
difficult choices regarding
regarding
pertinent.
RACHEL is being developed in close
close cooperation
cooperation between
between
pertinent, medical
medical procedures.
procedures. RACHEL
t.he
depart,ment of Obstetrics
Obstetrics and
the department
depart*ment of Engineering-Economic
Engineering-Economic Systems, the department
Gynecology,
Gynecology, and the
t.he department
department of
of Surgery
Surgery (Urology
(Urology Division),
Division), all at Stanford.
Stanford.

C. Highlights
Highlights of
of Research
Research Progres8
Progress
C.l
past year
year
C.l Accomplishments
Accomplishments this
this pad
The
The CAMDA
CAbIDA project
project began in the summer
summer concentrated
concentrated our
our efforts
efforts on three
three
specific
specific tasks: the development
development of a formal
formal representation
representation for
for uncertain
uncertain decisions, the
design and implementation
implementat.ion of solution
solution algorithms
algorithms for
for formal
formal decision problems,
problems, and the
construction
constructZion of an inferential
inferential processor specifically
specifically tailored
tailored to the process of
of formalizing
formalizing
decision problems.
problems.
Most
Most of our
our research has been based on the concept
concept of an influence
influence diagram
diagram
(Howard
(Howard and Matheson,
Mstheson, 1984) which
which is
is generalization
generalization of decision trees as
as aa representation
representation
for decision problems.
Influence diagrams
diagrams (IDs)
(IDS) have several major
major features
fentures that
that make
problems. Influence
them
them attractive
attractive for use
use in intelligent
intelligent decision systems. Technically,
Technically, IDs
IDS prevent
prevent the loss
loss of
information
information often
often incurred
incurred in constructing
constructing asymmetric
asymmetric decision trees (Olmsted,
(Olmsted, 1984),

157
167

Feigenbaum
E. A. Feigenbaum

P41
P41 RR00785-11
RR00785-11

CAMDA
CAMDA Project
Project

wit,hout.
wit,hout suffering
suffering from
from the
the explicit
explicit exponential
exponential growth
growth of
of symmetric
symmetric decision
decision trees.
trees.
Furthermore,
Furt,hermore, unlike
unlike decision
decision trees,
trees, ID
ID decision
decision models
models ta.ke
take full
full advantage
advantage of
of probabilistic
probabilistic
inclependence
independence relat.ions,
relations, which
which can
can have
have aa significant
significant impact
impact on
on the
the simplicity
simplicity of
of the
the
decision
decision model,
model, and
and on
on the
the efficiency
efficiency oC
of its
its formal
formal solution.
solution.
A
A particularly
particularly useful
useful feature
feature of
of influence
influence diagrams
diagrams which
which we have recently
recently begun
investigating
investigating isis that
that fact
fact that
that they
they can
can be
be used
used to
to represent
represent deterministic
determinist.ic as
as well as
as
probabilistic
probabilistic relations
relations between
between model
model elements.
elements. In
In Cact,
fact, deterministic
deterministic relations
relations can be
be
exploited
exploited to
to describe
describe complex
complex probabilistic
probabilistic behavior.
behavior. This
This Ceature
feature allows the construction
construction
of simple knowledge
bases
(composed
primarily
of
deterministic
knowledge bases
primarily
deterministic statements)
statements) which
which can be
be
llsecl
used to create
crea.te problem-specific
problem-specific probabilistic
probabilistic decision models.
In addition
addition to their
their technical
technical advantages
advantages IDs
IDS have been
been empirically
empirically shown to be
intuitively
int.uit,ively appealing
a.ppealing to decision makers
makers (Owen,
(Owen, 1978),
1978), and to provide
provide an excellent
excellent means
of communication
communication between experts
experts in different
different fields. In the context
context of our
our own efforts,
effort,s,
we have found
found that
that the physicians
physicians who
who are participating
participating in the development
development of RACHEL
RACHEL
have ha.d
had liWe
little difficulty
difficulty using IDs
IDS as
as aa simple
simple representation
representation for
for expressing the
t,he decisions
they
they and their
their patients
pat!ients face.
face.
Another
Another important
important feature
feature of IDs
IDS is the Cact
fact that
that they
they are naturally
naturally constructed
constructed in
a backwards,
bsckwards. goal-directed
goal-directed fashion
fashion (decision
(decision trees usually
usually lead to
to a forward-reasoning
forward-reasoning
approach).
approach). Backward
Backward development
development of decision models has two
two important
importa.nt advantages
advantages for
for
our
our purposes. First,
First, it
it has a strong
strong attention-focusing
attention-focusing effect
effect since it
it encourages the
dt~ci"ion
decision maker
maker to first
first think
think of what
what he or she
she wants,
wants, and then
then about
about what
what can be done
to change the
t.he world
world according
according to the expressed preferences.
preferences. Decision
Decision trees usually
usually have
t,he
opposite
effect.
Thus,
they
often
lead
the
decision
process
along
paths
that
although
t,he opposite effect. Thus, they often
decision
along paths that although
releva.nt
attention-focusing effect
effect of
of
releva.nt to the decision
decision at
at hand,
hand, have little
little effect
effect on it.
it. The
The attention-focusing
IDs
its efficiency.
efficiency. The
The second
IDS on the
t,he decision making
making process tends to
to contribute
contribute to
to its
advantage
of intelligent
intelligent decision
decision
advantage of
of the goal-directed
goal-directed nature
nature of
of IDs
IDS for
for the
the construction
construction of
systems is that
amenable to
to computer-based
computer-based
that it
it makes the formulation
formulation of
of decision problems
problems amenable
automation
a,utomation as
as a rule-based
rule-based system.
system.
Having decided
decided on IDS
IDs as a means to
to represent
represent decision
decision problems,
problems, we have designed
Having
implemented several
algorithms to
to solve well-formed
well-formed influence
influence diagrams.
diagrams. This
This effort
effort
and implemented
several algorithms
resulted in the development
development of
of a powerful
powerful software
package which
which can generate
generate
has resulted
software package
optimal strategies
their certain
certain equivalent
equivalent directly
directly from
from an ID.
ID. This
This package is
optima.1
st.ra.tegics and their
beginning to
to be tested
tested and
a.nd augmented
augmented to
to make
make itit easier to
to use by
by researchers
researchers other
other tha.n
than
beginning
its developers.
developers. In part,
part, this
this package
package is based on the
the work
work of
of Olmsted
Olmsted (1983),
(1983), and
and on a
its
constructive proof
proof by
by Shacht.er
Shachter (1984)
(1984) that,
that, given
given certain
certain technical
technical features,
features, shows
shows that
that an
construct,ive
influence
diagram
can
always
be
solved
in
finite
time.
inflllence diagram
a.lways
solved in finite time.
An importSant
important feature
feature of
of RACHEL
RACHEL is that
that itit attempts
attempts to
to help
help its
its users in
in the
the
An
development of
of models
models for
for their
their decisions.
decisions. Thus,
Thus, unlike
unlike most
most other
other decision
decision analysis
analysis tools,
tools,
development
RACHEL is designed to
to use domain
domain knowledge.
knowledge. Therefore,
Therefore, a central
central element
element in
in the
the
RACHEL
archit,ecture of
of the
the RACHEL
RACHEL system
system is an
an algorithm
algorithm which
which performs
performs symbolic
symbolic inference.
inference.
archit.ecture
Although several
several general-purpose
general-purpose inference-engines
inference-engines exist
exist within
within our
our research
research environment,
environment,
hlt.holugh
found itit advantageous
advantageolls to
to implement
implement our
our own
own for
for reasons of
of efficiency
efficiency and
and
we have found
compatibility. Furthermore,
Furthermore, our
our inference
inference algorithms
algorithms are particularly
particularly well
well suited
suited for
for the
the
compatibility.
construction of
of decision-analytic
decision-analytic models.
models.
construcbion
Finally, from
from the
the standpoint
standpoint of
of computer
computer implementation,
implementation, we
we have
have developed
developed a
Finally,
d~tta structure
structure which
which allows
allows us to
t,o represent
represent a wide
wide class of
of multiple-entry
multiple-entry disconnected
disconnected
data
cyclical directed
directed graphs,
graphs, where
where both
both vertices
vertices and
and edges can
can be
be associated
associated with
with arbitrary
arbitrary
cyclical
data
structures
(such
as
frames).
For
short,
we
refer
to
these
graphs
as
WEBs
(as in
in aa
data structures (such
frames). For short, we refer to these graphs as WEBS (as
spider's),
and we
we have
have used
used them
them to
to represent
represent a multitude
multitude of
of small
small and
and medium-sized
medium-sized
spider’
s), and

E. A.
A. Feigenbaum
Feigenbaum
E.

158
168

5P4 1 RR00785-11
RROOi85- 11
5P41
objects such as
as
objects
tables,
help
text
tables,
text

CAMDA
CAMDA Project
Project

influence diagrams,
diagrams, medical
medical
influence
databases, and mathematical
mathematical

decision knowledge
knowledge bases,
bases, command
command parse
data
(e.g.,
vectors
and
matrices).
data
vectors
matrices).

169
159

A. Feigenbaum
Feigenbaum
E. A.

CAr-.mA
CAhlD~4 Project
Project

P41
P41 RR00785-11
RR00785-11

C.2
C.2 Research
Research in
in progress
progreso
The immediate
project
immediate goal of the
the CAMDA
CAMDA
project is
is to complete
complet,e aa pilot-level
pilot-level
implementation
implementation of the RACHEL
RACHEL system
system within
within the next few months.
months. As we define it, aa
pilot
pilot system
system isis aIle
one where the essential algorithms
algorithms work
work beth
bcth individually
individually and interactively
interactively
with
with one another,
another, operating
operating with
with knowledge
knowledge that
that is
is representative
representative of the system's
system’s domain.
domain.
Such a system
system lacks two
two important
important elements
elements that
that must
must exist
exist within
within a prototype-level
prototype-level
implementation:
implement,ation: an extensive
extensive knowledge
knowledge base,
base, and aa front
front end usable by trained
trained users who
who
may
the system.
system.
may not
not be
be familiar
fa.miliar with
with the details
details of the
To
To complete
complete aa pilot
pilot implementation
implementation of RACHEL,
RACHEL, we intend
intend to direct
direct our
our efforts
efforts
towards
towards the following
following four
four tasks: incorporating
incorporating aa medical
medical value model elicitation
elicitation facility.
facility,
strengthening
strengthening our
our influence-diagram
influence-diagram solution
solution procedure,
procedure, improving
improving the performance
performance of
RACHEL's
implementing an explanation
explanation module
module to justify
justify the
R.4C’HEL’s inference
inference engine, and implementing
deci~ion
decision model being developed. Once this
this implementation
implementation is completed,
completed, RACHEL
RACHEL will
will be
brollght
broiight. to the
t,he participating
participating physicians
physicians to begin to develop its knowledge
knowledge base.
base.

D.
D. Publication.,]
Publications
"IDEAS:
“IDEAS: Influence
Influence
Diagram
System",
Department of
of Engineering-Economic
Engineering-Economic Systems,
Diagram Elicitation
Elicitation
System “, Department
Stanford
Stanford University,
University, Stanford,
Stanford, California,
California, 1983.

1.
1. Breese, J.S., Davis,
Davis, D., Parnell,
Parnell, G.S., and Taneja,
Taneja, R.:

2. Holtzman,
Ana/ysi.,
Holtzman, S.:"A
S.: “A A/ode!
Model of
of the Decision
Decision
Analysis Process",
Process II, Department
Department of
of

Engineering-Economic
Systems, Stanford
Engineering-Economic
Stanford University,
University, Stanford,
Stanford, California,
California,
1981.
1981.
3. Holtzman,
Decision Aid
Aid for
for Patients
Patients with
End-Stage Renal
Renal Disease
Disease",“,
Holtzman, S.:"A
S.: aA Decision
with End-Stage

Department
Department of
of Engineering-Economic
Engineering-Economic Systems, Stanford
Stanford University,
University, Stanford,
Stanford,
California,
1983.
California,
4. Holtzman,
IrOn
Holtzman, S.: “On

the Use of
Formal Models
Models in
Decision Making
Making",“, Proc.
o j Formal
in Decision
Proc.

TIMS/ORSA
Nat. Mtg.,
TIhLIS/ORSA Joint
Joint Nat.
Mtg., San Francisco,
Francisco, May,
May, 1984.
5. ((*)
.. ) Holtzman,
Holt,zman,

Department
Depa.rtment
forthcoming.
fort,licoming.

S.:

of
of

"Intelligent
Decision Systems",
Dissertation,
“Intelligent
Decision
Systems “, Ph.D.
Ph.D.
Dissertation,
Engineering-Economic
Systems,
Stanford
University,
Stanford
Engineering-Economic
University,

Howard, R..4.,
R.A., and Matheson,
Matheson, J.E.:
J.E.: "Influence
Diagrams", in Howard,
Howard, R.A.,
R.A.,
6. Howard,
“Influence Diagram”“,
Matheson, J.E.
J.E. (Eds.):
(Eds.): “The
"The Principles
Principles and
and Applications
Applications of
of Decision
Decision
and Matheson,
Analysis, ”II Vol.
Vol. II,
II, Strategic
Strategic Decisions
Decisions Group,
Group, Menlo
Menlo Pa,rk,
Park, California,
California, 1984.
Analysis,
Olmsted, S.M.:
S.M.: “On
IrOn Representing
Representing and
and Solving
Solving Decision
Decision Problems”,
Problems", Ph.D.
Ph.D.
7. Olmsted,
Dissertation. Department
Department of
of Engineering-Economic
Engineering-Economic Systems,
Systems, Stanford
Stanford
Dissertation.
University, St,anford,
Stanford, California,
California, 1983.
University,
Owen, D.L.:
D.L.: "The
Influence
Diagrams in
in Structuring
Structuring Complex
Complex
8. Owen,
The Use oof
j In
jluence Diagrams
Deri.'3ion Problems
Problems",“, in Howard,
Howard, R.A.,
R.A., and Matheson.
Matheson, J.E. (Eds.):
(Eds.): “The
"The
Derision
Principles a.nd
and Applications
Applications of
of Decision
Decision Analysis,”
Analysis," Vol.
Vol. II,
Strategic Decisions
Decisions
Principles
II, Strategic
Group, Menlo
Menlo Park,
Park, California,
California, 1984.
Group,
Shachter, R.‘
R.:: “Evaluating
"Evaluating In
Influence
Diagrams",
working paper,
paper, Department
Department
9. Shacht.er.
jluence Diagrams
“, working
of Engineering-Economic
Engineering-Economic Systems,
Systems, Stanford
Stanford University,
University, Stanford,
Stanford, California,
California,
of
1984.

A. Feigenba,um
Feigenbaum
E. A.

170
170

CAMDA
CAMDA Project
Project

5P41
5Pal RROOi85-11
RR00785-11

E.
E. Funding
Funding Support
Support
The
However,
The CA~mA
CAniDA project
project does
does not
not yet
yet have
have direct
direct funding
funding support.
support.
However, in
in
addition
addition to
to SUMEX
SUAIEX computer
computer usage,
usage, the
the project
project has
has benefited
benefited from
from aa number
number of
of hardware
hardware
gifts
gifts and
and research
research support
support for
for individuals.
individuals.

E.1
E.l Stanford
Stan ford .Medical
Medical School
School
The
The dt'partment
department of
of Obstetrics
Obstetrics and
and Gynecology
Gynecology and
and the department
department of Surgery
Surgery
(Urology
Samuel
(Urology Division)
Division) have
have provided
provided various
various types
types of
of support
support to the project.
project.
Samuel
Holtzman
Holtzmsn has
has received research assistantship
assistantship awards for several quarters.
quarters. In addition,
addition, the
Inft'rtility
Infert.ility Clinic
Clinic at Stanford
Stanford has
has purchased
purchased several terminals
terminals for
for the specific
specific purpose of
developing
RACHEL
and
other
CAMDA
decision
systems.
developing RACHEL
other CAMDA

E.2
E.2 Decision
Decision Systems
Systems Laboratory
Laboratory
The CA~mA
CAhlDA project
project has access
access to the facilities
facilities of the Decision
Decision Systems Laboratory
Laboratory
(DSL)
Systems, and constitutes
(DSI,) in the Department
Department of Engineering-Economic
Engineering-Economic
constitutes the
laboratory's
laborat.ory3 most
most active
act,ive research project.
project. The DSL maintains
maintains several terminals,
terminals, printers
printers
and aa personal computer
computer for research on the development
development of computer-based
computer-based decision
systems. The majority
majority of the terminals
terminals and printers
printers were recently
recently donated
donated to the DSL
DSL by
by
Qume
MAD
Qume Corporation.
Corporation.
MAD Computer
Computer of Santa
Santa Clara
Clara has also contributed
contributed to
to the support
support
of the CAMDA
CAMDA project
project through
through the consignment
consignment of a MAD-l
MAD-l persona.l
personal computer,
computer, and
provision
provision of a.a research assistantship
assistantship for
for Samuel Holtzman.
Holtzman.

IT.
WITH
THE
RESOURCE
II. INTERACTIONS
INTERACTIONS
WITH
THE SlThtIEX-AIM
SUMEX-AIM
RESOURCE
II.A Medical
Afedical Collaborations
Dissemination Via
SUMEX
II.A
Collaborations and
and Program
Program Dissemination
Via SUMEX
Since its inception,
from an active
active relationship
relationship
inception, the CAMDA
CAMDA project
project has benefited
benefited from
between
decision
analysts,
computer
scientists,
and
members
of
the
Stanford
medical
between
analysts, computer scientists,
members of the Stanford medical
community.
In
particular,
RACHEL
is
being
developed
in
close
cooperation
with
particular, RACHEL
being developed
cooperation with
community.
physicians in the Infertility
Infertility Clinic
Clinic at
at Stanford.
Most of
of this
this cooperation
cooperation has, up
up to
to this
this
physicia,ns
Stanford. Most
point, consisted
consisted of
of an intense
intense mutual
mutual learning
learning experience for
for all
all project
project participants.
participants. The
The
point,
primary purpose
pmpose of
of this
this initial
initial effort
effort has been to
to develop
develop an effective
effective means to
to represent
represent
primary
medical decision
decision knowledge.
knowledge. As we have described
described above, this
this work
work has culminated
culminated in
in the
medical
definition of
of a representation
representation language based on influence
influence diagrams.
diagrams.
definition
\Vithin
the next
next few months,
months, RACHEL
RACHEL is expected
expected to
to attain
attain pilot-level
pilot-level
\\‘
ithin
the
performance, and its
its knowledge
knowledge base will
will begin
begin to
to be deveioped.
developed. At
At this
this point,
point, most
most of
of
performa.nce,
interaction involving
involving participating
participating physicians
physicians will
will shift
shift to
to the
the design and
and
the intera.ction
implementation of
of an infertility
infertility decision
decision knowledge
knowledge base.
This task
task will
will involve
involve
implementation
This
considerable
direct
use
of
the
SUMEX
facility
by
medical
personnel.
considerable direct, use of the SUMEX facility by medical personnel.
As an added
added benefit
benefit of
of the
the development
development of
of Rr\CHEL,
RACHEL, itit often
often occurs
occurs that
that specific
specific
,4s
subsystems become useful
useful in
in their
their own
own right.
right. For
For instance,
instance, a simple
simple program
program to
to aid
aid
subsystems
physicians in
in determining
det.ermining a course of
of action
action in
in cases
cases of
of idiopathic
idiopathic infertility
infertility has been
physicians
implemented and made available
available on SUMEX
SUMEX to
to the
the staff
staff of
of the
the Stanford
Stanford infertility
infertility clinic,
clinic,
implemented
who have used itit on an experimental
experimental basis.
who

IT.B. Sharing
Sharing and
and Interactions
Interactions with
with other
other SUMEX-AIM
SlThtIEX-AIM Projects
Projects
ILB.
II.B.l SUMEX-AIM
SUAIEX-AlN! 1988
1983 Workshop:
Workshop:
II.B.l
Samuel Holtzman
Holtzman chaired
chaired the
the working
working group
group on
on decision
decision analysis
analysis and
and artificial
artificial
Samuel
intelligence in
in medicine.
medicine. This
This group
group considered
considered the
the current
current status
status and
and future
future of
of medical
medical
intelligence

171

E. A.
A. Feigenbaum
Feigenbaum
E.

RR00785-11
P41 RR00785-11

CAMDA
CAh.IDA Project
Project

A full
full report
report of
of the
the working
working group's
group’s deliberations
deliberations and conclusions
conclusions is
decision
decision systems. A
online
SUMEX
file
< HOLTZMAN.CAMDA>AIM-DA-FINALHOLTZMAN.CAMDA
>AIM-DA-FINALavailable
at SUMEX
in tile
<
ova.ilable online
REPORT.TXT.
and should
REPORT.TXT,
should appear
appear in the
the forthcoming
forthcoming workshop
workshop report.
report.

lI.B.2
Participation in
Knowledge Representation
Representation seminar
II.B.2 Participation
in the Knowledge
seminar at Stanford
Stanford
As part
part of
of the CAMDA
CAMDA project,
project, we have made several presentations
presentations to
to the general
general
Stanford medical
medical and
and’'computer
computer science community.
community. These presentations
presentations have been made
Stanford
within
jointly by
within the context
context of
of the Knowledge
Knowledge Representation
Representation seminar,
seminar, held jointly
by the computer
computer
science department
department and the medical
medical school, and well
well attended
attended by
by other
other SUMEX
SUMEX
researchers
researchers at Stanford.
Stanford.
The
The speakers, and titles
titles of
of the
the most
most recent
recent presentations
presentations follow:
follow:
Samuel Holtzman:
HOI tzman:
Samuel

On the
the Design
Design and Implementation
Implementation of
of
On
Computer-Based
Computer-Based Decision
Decision Systems.
Systems.

Samuel
Samuel Holtzman:
Holtzman:

A Simple
Simple Representation
Representation
for Uncertain
Uncertain Knowledge
A
for

Prof.
Prof. Ross Shachter:
Shachter:

Influence Diagrams
Diagrams and their
their Use
Use in
Influence
Representing and Solving
Solving Complex DeciSion
Decision
Representing
Problems.
Problems.

II.C. Critique
Resource Management
Management
II.C.
Critique of
of Resource
The
The CAMDA
CAMDA project
project has been immeasurably
immeasurably aided by
by the availability
availability of
of the
SUt-.IEX
SUhlEX computing
computing resources. In general
general we find
find the overall
overall physical
physical facilities
facilities to
to be of
of
excellent
In
excellent quality.
quality.
In addition,
addition, we have been quite
quite impressed with
with the quality
quality of
of the
particular, we have found
found it
it to
to be a pleasure to deal with
with Ed
Ed
SUMEX
In particular,
SUMEX staff.
staff.
Pa.t.termann,
Pa.t,terma.nn, who
who has
ha.s been invariably
invariably courteous,
courteous, responsive
responsive to our
our needs,
needs, and effective
effective in
his actions.
act,ions. Pam
Pa.m Ryalls
Ryalls has also provided
provided much
much needed help in managing
managing the CAMDA
CAMDA
project.
project. in a manner
manner that
that is friendly
friendly and efficient.
efficient.

There are,
a.re, however,
however, two
two areas where
where we feel service and performance
performance could
could be
There
improved to the
t,he benefit
benefit of
of the entire
entire SUMEX
SUMEX community.
community. The
The first
first concerns
concerns the SUMEX
SUMEX
improved
facilities
facilities themselves.
themselves. the other
other refers
refers to
to our
our means of
of communicating
communicating with
with these facilities.
facilities.

lI.(U
II.C.l SU.~IEX
XrAIEX load
load
In the pt'riod
period that
that the CAMDA
CAMDA project
project has been active,
active, we have noticed
noticed a
significant
during
significa.nt increase
increa.se in the maximum
maximum machine
machine loading,
loading, particularly
particularly
during weekday
weekday
afternoons.
afternoons. Although
Although this
this is a normal
normal feature
feature of
of time-shared
time-shared computer
computer systems, the load
has become
sufficiently high
become.sufficiently
high in
in recent
recent months,
months, that
that it
it is beginning
beginning to
to be difficult
difficult to work
work
on SUMEX
SUMEX d1uing
during business hours.
hours. In addition,
addition, reliability
reliability has
ha5 been adversely
adversely affected
affected in
inst,a.nces. An
An increase in SUMEX
SUMEX computing
computing capacity.
capacity, or
or a means of
of preventing
preventing
some inst.ances.
overloading
overloading of
of the machine
machine should
should be considered.
considered. We
We believe that
that an emphasis on
distributing some of
of SUl'vlEX's
SUMEX’s computing
computing power
power away
away from
from a centralized
centralized mainframe
mainframe could
could
distributing
have a significant
significant effect
effect on reducing
reducing the system
system load.

II.C.
II.C.22 Ethernet
Ethernet
The
The CA~lDA
CALIDA project
project uses
uses SUMEX
SUMEX almost
almost exclusively
exclusively through
through Ethernet
Et.hernet software
software
hardware located
located in Terman
Terman Engineering
Engineering Center,
Center, where
and hardware
where the department
department of
of
Engineering-Economic Systems
Systems is located.
located. This
Engineering-Economic
This software
software has on occasion been extremely
extremely
unreliable for
for extended
extended periods
periods of
of time,
time, resulting
resulting in substantially
substantially reduced
reduced productivity
productivity for
for
unreliable
project. Adequate
Adequate communicat.ion
communication facilities
facilities at Stanford
Stanford are of
of critical
critical importance
importance t.o
to
the project.

E. A. Feigenbaum
Feigenbsum

172

CAMDA Project
Project
CAMDA

5P411 RR00785RR00785-11
5P4
11

the successful
successful conduct
conduct of
of our
our research.
research. Although
Although Stanford
Stanford Ethernet
Ethernet management
management is not
not
the
directly under
under the
the jurisdiction
jurisdiction of
of SUMEX,
SUMEX, in
in order
order for
for the
the SUMEX
SUMEX resource
resource to
to be utilized
utilized
directly
to the
the fullest,
fullest, the
the planning
planning and
and administration
administration of
of networking
networking at
at Stanford
Stanford needs to
to be
to
better coordinated.
coordinated. We
We have
have begun
begun to
to explore
explore several
several means to
to improve
improve the
the current
current
better
situation. and
and we believe
believe that
that explicit
explicit SUMEX
SUMEX support
support of
of our
our efforts
efforts would
would be quite
quite
situation,
beneficial.
beneficial.

m.

III.

RESEARCH PLANS
PLANS
RESEARCH
III.A Project
Project Goals
Goals and
and Plans
Plans
III.A

For the
the near
near term
t.erm future
future the
the primary
primary goals of
of the
the CAMDA
CAMDA project
project are to
to develop
develop a
For
"pilot" and
and then
then “prototype”
"prototype" version
version 01
of the
the RACHEL
RACHEL system.
system. Over
Over an extended
extended period,
period,
“pilot”
our objective
object.ive is to
to arrive
arrive at
at useable, fully-validated
fully-vaiidated and
and documented
documented systems
systems for
for support
support
our
of medical
medical decision
decision making
making in
in infertility
infertility and
and other
other domains.
domains.
of
Implementat.ion of
of a pilot
pilot system
system is primarily
primarily an int.egrative
integrative task
task at
at this
this point,
point,
Implementation
bringing together
together the
the medical
medical knowledge
knowledge base, symbolic
symbolic inference
inference procedure,
procedure, decision
decision
bringing
problem solution
solution procedure,
procedure, and
and influence
influence diagram
diagram data
data structures.
structures.
All of
of these
All
problem
components exist
exist independently.
independently. The
The pilot
pilot system
system will
will consist
consist 01
of these systems
systems interacting
interacting
components
to provide
provide a simplified
simplified version
version of
of infertility
infertility decision
decision counseling.
counseling.
to
The prototype
prototype implementation
implementation of
of RACHEL
RACHEL will
will include
include substantially
substantially greater
greater
The
amounts of
of medical
medical knowledge
knowledge than
than the
the pilot.
pilot. The
The major
major task
task at
at this
this stage will
will be the
the
amounts
in~orporation of
of expert
expert knowledge
knowledge regarding
regarding functional
functional relations,
relations, probability
probability distributions,
distributions,
incorporation
a TILl decision
decision alt,ernat,ives
alt.ernatives in the infertility
infertility domain.
domain. At
At this
this point
point in its
its development,
development, the
the
and
system
system will
will be available
available for
for use by
by participating
participating physicians
physicians at
at the
the infertility
infertility clinic
clinic on a
"test"
justification of
“test” basis, beginning
beginning the
the critical
critical phase of
of validation
validation and justification
of the
the system.
system.
A
A major
major goal of
of the
the project
project is to
to bring
bring RACHEL
RACHEL to
to a IIdefensible
“defensible” ll level of
of
performance
domain.
A
A working
working system
system with
wit.h full
full documentation,
documentation,
performance in the
the infertility
infertility
domain.
expla.nation
Over
Over the
the long
long term,
term,
cxpla.nation of
of it,s
it,s conclusions,
conclusions, and
and user interface
interface is envisioned.
envisioned.
infcrtilit.y
infertilit,y is but,
but a single
single example
example of
of the
the range of
of medical
medical decisions
decisions amenable
amenable to
to decision
decision
analytic
After RACHEL
RACHEL has been fully
fully implemented
implemented
analytic treatment
t.reatment in an automated
automated system.
system. After
and tested.
tested, other
other systems
systems focusing
focusing on cardiology
cardiology or
or oncology,
oncology, for
for example,
example, might
might be
These
systems
would
consist
of
a
common
core
of
procedural
knowledge
developed.
developed.
systems would consist of common
of procedural knowledge based
on decision
analysis,
and
be
instantiated
with
the
medical
decision analysis,
instantiated with
medical knowledge
knowledge of
of the particular
particular
domain.
domain.

IlI.B
and
for Continued
SUlHEX
1II.B Justification
Justification
and Requirements
Requirements for
Continued
SiYMEX Use
The
It
It draws
draws on elements
elements of
of decision
decision
The CAMDA
CAMDA project
project is truly
truly interdisciplinary.
interdisciplinary.
analYSis.
The project
project has the potential
potential to
to
analysis, artificial
artificial intelligence,
intelligence, and medical
medical science. The
contribute
SUMEX-AIM provides
provides the
contribute to each of
of these disciplines
disciplines in important
important ways.
ways. SUMEX-AIM
resonrces
resources to continue
continue this
this research
research with
with the necessary access
access to
to members
members of
of the Stanford
Stanford
research community.
communit,y.
The
The development
development of
of automated
automated decision
decision systems
systems has the
the potential
potential to greatly
greatly
increase the use
In the past,
past, although
although
use and acceptance
accept,ance of decision analysis
analysis methods.
methods.
decision
decision analysis
analysis has be~n
been shown
shown to be an extremely
extremely effective
effective means of asSisting
assisting in
dccision
effort involved
involved in
sad uncertain
uncertain domains,
domains, the cost and effort
decision m'aking
making in complex
complex and
producing
producing an analysis
analysis was prohibitive
prohibitive for
for most
most individuals.
individuals. Automated
Automated decision
decision analysis
analysis
can result
result in a much
much lower
lower cost per user, allowing
allowing decision
decision theoretic
theoretic techniques
techniques to achieve
much
much wider
wider application.
application.
The
The development
development of decision
decision systems
systems owes much
much to advancements
advancements in the fields of

173
173

E.
E. A. Feigenbaum
Feigenbaum

CA~1DA Project
Project
CA>lDA

P41 RR0078511
RR0078S-11
P41

artificial intelligence,
intelligence, expert
expert systems,
systems, and
and knowledge
knowledge engineering.
engineering.
One continuing
continuing
One
artificial
challenge in
in these
these fields
fields has
has been
been representation
representation and
and reasoning
reasoning with
with probabilistic
probabilistic
challenge
knowledge. The
The representation
representation of
of knowledge
knowledge in
in influence
influence diagrams,
diagrams, and
and the
the use
use of
of
knowledge.
decision analysis
analysis in
in probabilistic
probabilistic reasoning
reasoning are
are both
both significant
significant topics
topics of
of research
research being
being
decision
pursued within
within the
the CAMDA
CAMDA project.
project.
pursued
For the
the medical
medical community
community the
the CAMDA
CAMDA project
project has
has the
the potential
potential for
for providing
providing
For
tools
and
techniques
that
greatly
improve
the
quality
of
decision
making
in
medicine.
t.ools and techniques that greatly improve the quality of decision making in medicine.
RACHEL
explicit.ly
considers
uncertainty,
decision
alternatives,
and
patient
preferences
in
RACHEL explicitly considers uncertainty, decision alternatives, and patient preferences in
developing
recommendations.
The
objective
is
to
develop
insight
and
understanding
The objective is to develop insight and understanding
developing recommendations.
regarding t,radeoffs
tradeoffs and
and alternatives,
alternatives, both
both for
for the
the patient
patient and
and the
the attending
attending physician.
physician.
rega.rding
SUMEX-AIM provides
provides aa unique
unique resource
resource for
for the
the continuation
continuation of
of the
the CAMDA
CAMDA
SUMEX-AIM
project.
The
available
computing
resources,
plus
access
to
the
Stanford
AI
and
medical
project. The available computing resources, plus
to the Stanford AI and medical
communities
are
of
critical
importance
for
the
successful
completion
of
the
research.
communities are of critical importance for the successful completion of the research.
.
III.C
Needs
and
Plans
for
other
Computing
Resources
beuond
SUAIEX-AIM
III.C Needs and Plans for other Computing Resources beyond SUMEX-AIM
\Ve are
are pursuing
pursuing the
the purchase
purchase or
or donation
donation of
of several
several computing
computing resources
resources for
for
We
the Decisions
Decisions System
System Laboratory.
Laboratory. Our
Our primary
primary need at
at present
present is for
for a
inst,allation in
in the
installation
LISP machine
machine (e.g.,
(e.g., Symbolics
Symbolics 3600),
3600), enabling
enabling us to
to perform
perform local
local processing
processing and
and increase
increase
LISP
our graphics
graphics capabilities.
capabilities.
our
At present
present the
the project
access to
to one MAD-l
MAD-l personal
personal computer
computer (IBM-PC
(IBM-PC type).
type).
At
project has access
\Ve are considering
considering various
various other
other PC/workstation
PC/workstation facilities
facilities to
to use as front
front ends for
for
We
CAMDA products.
products.
CAA4DA

IlI.D Recommendations
Recommendations for
for Future
Future Community
and Resource
Resource Development
Development
III.D
Community and
Increases in distributed
distributed computing
computing capabilities
capabilities on the
the SUMEX-AIM
SUMEX-AIM system
system is a
Increases
primary
II.C.l, distributed
distributed file
file editing
editing and
and
primary need at this
this point.
point. As we mentioned
mentioned in Section
Section II.C.l,
graphics
the mainframe.
mainframe. At
At this
this time,
time,
gra.phics capabilities
capabilities would
would simultaneously
simultaneously reduce load on the
we are pn.rtiClllarly
environment where
where a
particularly interested
interested in the possibility
possibility of designing
designing an environment
cen~ralized
interact at
at a high
high level
level with
with a
centralized processor
processor (such
(such as the
the SUMEX
SUMEX 20/60)
20/60) would
would interact
mllch
much less
less powerful
powerful dedicated
dedicated processor
processor (such as
as a SUN
SUN workstation,
workstation, or
or an Apple
Apple Lisa
Lisa or
or
Macintosh)
A?acint.osh) wit,h
with specific
specific capabilities
capabilities such as
as bit-mapped
bit-mapped graphics
graphics and special
special purpose
purpose
hardware.
hardware.

E.
E. A.
A. Feigenbaum
Frigenbsum

174
174

5P41 RR00785-11
RR00785-11
5P41

MENTOR Project
Project
MENTOR

ll.A.3.2. MENTOR
MENTOR Project
Project
II.A.3.2.
MENTOR Project
Project
MENTOR
Stuart M.
M. Speedie,
Speedie, Ph.D.
Ph.D.
Stuart
Terrence F.
F. Blaschke,
Blaschke, M.D.
M.D.
Terrence
Department of
or Medicine
Medicine
Department
Division of
of Clinical
Clinical Pharmacology
Pharmacology
Division
Stanford University
University
Stanford

SUM1viARY OF
OF RESEARCH
RESEARCH PROGRAM
PROGRAM
I.I. SUMMARY
.'L Project
Projat Rationale
Rationale
A.
The goal of
of the
the MENTOR
MENTOR (Medical
(Medical EvaluatioN
EvaluatioN of
of Therapeutic
Therapeutic ORders)
ORders) project
project is
The
to design
design and
and develop
develop an expert
expert system
system for
for monitoring
monitoring drug
drug therapy
therapy for
for hospitalized
hospitalized
to
patients t,hat
that will
will provide
provide appropriate
appropriate advice
advice to
to physicians
physicians concerning
concerning the
the existence
existence and
and
patients
management of
of adverse
adverse drug
drug reactions.
reactions. The
The computer
computer as a recording-keeping
recording-keeping device
device is
management
becoming increasingly
increasingly common
common in hospital-based
hospital-based health
health care, but
but much
much of
of its
its potential
potential
becoming
remains unrealized.
unrealized. Furthermore,
Furthermore, this
this information
information is provided
provided to
to the
the physician
physician in
in the
the
remains
form
of
raw
data
which
is
often
difficult
to
interpret.
The
wealth
of
raw
data
may
The wealth of raw data may
form of raw data which
often difficult to interpret.
effectively
important information
the patient
patient from
from the
the physician.
physician. This
This is
effectively hide important
information about
a.bout the
part.icularly true
by
particularly
true with
with respect
respect to
to adverse reactions
reactions to
to drugs
drugs which
which can only
only be detected
detected by
simultaneous
simultaneous examinations
examinations of
of several
several different
different types
types of
of data
data including
including drug
drug data,
data,
laboratory
laboratory tests and clinical
clinical signs.
In
In order
order to
to detect
detect and appropriately
appropriately manage
manage adverse drug
drug reactions,
reactions, sophisticated
sophisticated
medical
medical knowledge
knowledge and
and problem
problem solving
solving is required.
required. Expert
Expert systems
systems offer
offer the possibility
possibility of
of
embedding
embedding this
this expertise
expertise in a computer
computer system.
system. Such a system
system could
could automatically
automatically gather
gather
the approprhte
appropriate information
information from
from existing
existing record-keeping
record-keeping systems
systems and continually
continually monitor
monitor
for
for t.lle
t,he occurrence
occurrence of adverse drug
drug reactions.
reactions. Based on a knowledge
knowledge base of
of relevant
relevant data,
data,
it
it, could
could analyze
a.nalyze incoming
incoming data
data and inform
inform physicians
physicians when
when adverse reactions
reactions are likely
likely to
occur
The MENTOR
MENTOR project
project is an attempt
attempt to explore
explore the
occur or when they
they have occurred.
occurred. The
problems
problems associated with
with the development
development and implementation
implementation of such a system
system and to
implement
implement a prototype
protot,ype of a drug
drug monitoring
monitoring system
system in a hospital
hospital setting.
setting.

B.
R. Medical
Medicnl Relevance
Relevance and
and Collaboration
Collaboration
A number
number of independent
independent studies
studies have confirmed
confirmed that
that the incidence
incidence of adverse
reactions
reactions t,o
to drugs
drugs in hospitalized
hospitalized patients
patients is
is significant
significant and that
that they
they are for the most
most
part
pa.rt preventable.
preventable. Moreover,
Moreover, such statistics
statistics do not
not include
include instances
instances of suboptimal
suboptima,l drug
drug
t.herapy
extended length-of-stay,
length-of-stay, or ineffective
ineffective
therapy which
which may
may result
result in increased costs, extended
therapy.
thera.py. Data
Data in these areas are sparse, though
though medical
medical care evaluations
evaluations carried
carried out
out as
as
part
of
hospital
quality
assurance
programs
suggest
that
suboptimal
therapy
is
common.
part of hospital quality assurance programs suggest that suboptimal therapy is common.
Other.
Other computer
computer systems
systems have been
been developed
developed to influence
influence physician
physician decision
decision
making
by
monitoring
patient
data
and
providing
feedback.
However,
most of
of these
these
making
monitoring patient data and providing feedback. However, most
~ystems
This shortcoming
shortcoming involves
involves the
systems suffer
suffer from
from aa significant
significant structural
structural shortcoming.
shortcoming. This
evaluation
In all
all cases,
cases, these criteria
criteria consist
consist of
of
evaluation rules that
that are
are used to generate
generate feedback.
feedback. In
discrete,
discrete, independent
independent rules.
rules. Yet,
Yet, medical
medical decision
decision making
making isis aa complex
complex process
process in
in which
which
many
many factors
factors are
are interrelated.
interrelated. Thus
Thus attempting
attempting to
to represent
represent medical
medical decision-making
decision-making as
as aa

175
175

E.
E. A.
A. Feigenbaum
Feigenbaum

5P41
11
5P41 RR00785RR00785-11

MENTOR Project
Project
MENTOR

discrete set
set of
of independent
independent rules,
rules, no
no matter
matter how
how complex,
complex, is a task
task that
that can,
can, at
at best,
best,
discrete
result in
in a first
first order
order approximation
approximation of
of the
the process. This
This places an inherent
inherent limitation
limitation on
on
result
the quality
quality of
of feedback
feedback that
that can
can be provided.
provided. As
As a consequence itit is extremely
extremely difficult
difficult to
to
the
develop feedback
feedback that
that explicitly
explicitly takes
takes into
into account
account all
all information
information available
available on
on the
the
develop
patient. One
One might
might speculate
speculate that
that the
the lack
lack of
of widespread
widespread acceptance
acceptance of
of such
such systems
systems may
may
patient.
be due to
to the
the fact
fact that
that their
their recommendations
recommendations are often
often rejected
rejected by
by physicians.
physicians. These
These
systems must
must be made
made more
more valid
valid ifif they
they are to
to enjoy
enjoy widespread
widespread acceptance
acceptance among
among
systems
physicians.
physicians.
The proposed
proposed MENTOR
MENTOR system
system is designed
designed to
to address the
the significant
significant problem
problem of
of
The
adverse drug
drug reactions
reactions by
by means of
of a computer-based
computer-based monitoring
monitoring and
and feedback
feedback system
system to
to
adverse
inflnence physician
physician decision-making.
decision-making. ItIt will
will employ
employ principles
principles of
of artificial
artificial intelligence
intelligence to
to
influence
create a more
more valid
valid system
system for
for evaluating
evaluating therapeutic
therapeutic decision-making.
decision-making.
create
The work
work in the
the MENTOR
MENTOR project
project is intended
intended to
to be a collaboration
collaboration between
between Dr.
Dr.
The
at Stanford
Stanford and
and Dr.
Dr. Speedie at
at the
the University
University of
of Maryland.
Maryland. Dr.
Dr. Speedie is
Blaschke 3.t
Blaschke
spending t,he
the 1983-84
academic year
year on sabbatical
sabbatical with
with Dr.
Dr. Blaschke
Blaschke in the
the Division
Division of
of
spending
1983-8-I academic
Pharmacology at
at Stanford
University. While
While at
at Stanford,
Stanford, Dr.
Dr. Speedie has been
Clinical Pharmacology
Clinical
Stanford University.
strengthening his expertise
expert.ise in the
the area
area of
of artificial
artificial intelligence
intelligence and establishing
establishing links
links in the
the
strengthening
AI community.
community. Dr.
Dr. Speedie has begun
begun work
work on the
the development
development of
of the
the MENTOR
MENTOR system
system
AI
pilot project
project on the
t.he SUMEX-AIM
SUMEX-AIM facility.
facility. Over
Over the
the past
past nine months,
months, Drs.
Drs. Blaschke
Blaschke and
and
pilot
have worked
worked closely
closely together
together to
to design the
the MENTOR
MENTOR project.
project. The
The blend
blend of
of
Speedie ha.ve
previous experience,
experience. medical
medical knowledge,
knowledge, computer
computer science knowledge
knowledge and evaluation
evaluation
previous
expertise they
they represent
represent is vital
vital to
to the
the successful
successful completion
completion of
of the
the activities
activities in the
the
design expert,ise
MENTOR project.
project.
h!ENTOR

C. Highlights
Highlights of
Research fiogrcsa
Progress
of Research
The
The MENTOR
MENTOR project
project was initiated
initiated in December
December 1983. The
The work
work to date
date has
consisted
National Center
consisted of
of preparation
preparation of
of a grant
grant proposal
proposal for
for the National
Center for
for Health
Health Services
Research and initial
initia.1 exploration
exploration of
of the problem
problem of
of designing
designing the MENTOR
MENTOR system.
system. Work
Work
has begun
begun on constructing
constructing a system
system for
for monitoring
monitoring potassium
potassium in patients
patients with
with drug
drug
therapy
therapy that
thjt can adversely
adversely affect
affect potassium
potassium levels.

E. Funding
Funding Support
E.
Support
Application
Applica.tion for
for grant
grant support
support is pending.
pending.

ll.
WITH
RESOURCE
II. INTERACTIONS
INTERACTIONS
WITH THE
THE SUMEX-AIM
SUMEX-AIM
RESOURCE
A
i\ledical Collabor.ations
via
.-I...•Medical
Collaborations and
and Program
Rogram Dissemination
Dissemination
via SUMEX
SUMEX
This
This project
project represents
represents a collaboration
collaboration between
between faculty
faculty at Stanford
Stanford University
University
Medical
hizdical Cent.er
Center and t.he
the University
University of
of Maryland
Maryland School of
of Pharmacy
Pharmacy in exploring
exploring
SUMEX, through
through its communications
communications
computer-based
SUMEX,
computer-based monitoring
monitoring of
of drug
drug therapy.
therapy.
returns to the University
University of
of
capabilities.
capabilities, will
will facilitate
facilitate this
this collaboration
collaboration when
when Dr.
Dr. Speedie returns
Maryland
Maryland in August
August of
of 1984.

B. Sharing
Projects
Sharing and
and Interactions
Inteiactions with
with Other
Other SUAIEX-AIM
SUMEX-MM
Projects
Interactions
projects
Interactions with
with other
other SUMEX-AIM
SUMEX-AIM
projects has been on an informal
informal basis.
bcasis.
Personal
Personal contacts
contacts have been made with
with individuals
individuals working
working on the ONCOCIN
ONCOCIN project
project
concerning
concerning issues
issues related
related to the formulation
formulation of the previously
previously mentioned
mentioned proposal.
proposal. We
expect
expect interactions
interactions with
with other
other projects
projects to increase significantly
significantly once the groundwork
groundwork has

E. A.
‘4. Feigenbaum
Feigenbaum

176
176

MENTOR
hlENTOR Project
Project

SPH
SF’41 RR00785-11
RRO0785-11

been
been laid
laid and
and issues
issues directly
directly related
related to
to AI
AI are
are being
being addressed.
addressed. Given
Given the
the geographic
geographic
separation
separation of
of the
the investigators,
investigators, the
the ability
ability to
to exchange
exchange mail
mail and
and programs
programs via
via the
the SUt\lEX
SUMEX
system
system as
as well
we!! as
as communicate
communicate with
with other
other SUMEX-AIM
SUMEX-AIM projects
projects isis vital
vita! to
to the
the success
success of
of
the
ect.
t’he proj
project.

C.
C. Critique
Critique of
of Resource
Resource AJanagement
A4anagement
To
To date,
dn.te, the resources of
of SUMEX
SUMEX have been
been fully
fully adequate
adequate for
for the needs
needs
project.
The staff
staff have been
been most helpful
helpful with
with any
any problems
problems we have had and
and
project. The
fully
fully sat.isfied
sat,isfied with
with the current
current resource management.
management. The
The only
only concern
concern we have
to the
t.he stat·e
st.8t.eof the documentation
documentation on the system.

of
of this
we are
are
relates

ill.
PLANS
III. RESEARCH
RESEARCH
PLANS
A. Project
Project Goals
Gouls and
and Plans
Plans
To
To accomplish
accomplish the goa.ls
goa.!s described in the Project
Project Rationale,
Rationale, aa number
number of tasks will
will
be
be underta.ken.
underta.ken. The short-term
short-term task
task is to develop an initial
initial prototype
prototype of the medical
medical
knowledge
knowledge base
base and inference mechanisms
mechanisms for arriving
arriving at appropriate
appropriate therapy
therapy monitoring
monitoring
decisions. This
This initial
init,ial work
work focuses
focuses on monitoring
monitoring for
for hyperkalemia
hyperkalemia and the decisionmaking
making process wit.h
with respect to ordering
ordering potassium
potassium levels. We will
will then
then attempt
attempt to
construct
const,ruct a system
system combining
combining frames and rules that
that will
will model
mode! this
this process. The
The purpose
purpose
of this
this initial
initial exercise is to explore
explore the problems
problems involved
involved in constructing
constructing an AI
AI system
system
that
that me('ts
meet)s the needs
needs of drug
drug therapy
therapy monitoring
monitoring and to establish
establish development
development guidelines
guidelines
for
ect.
for the larger
larger proj
project.
The
project depend on the outcome
The long-range
long-range plans for
for the
the MENTOR
MENTOR project
outcome of
of the
funding
project has the
funding decision. However,
However, assuming
assuming a favorable
favorable decision,
decision, the full
full project
following
following goals:
1.
prototype computer
patient drug
1. Implement
Implement a prototype
computer system
system to
to continuously
continuously monitor
monitor patient
drug

thernpy
t,hera.py in a hospital
hospit,al setting.
setting. This
This will
will be an expert
expert system
system that
that will
will use
use a
modular,
frame-oriented
form
of
medical
knowledge,
separate
inference
frame-orient.ed
form
of
medical
knowledge,
a
separate
inference
modular,
engin(' for
for applying
applying the
th(' knowledge
knowledge to
to specific
specific situations
situations and automated
automated
engine
collection of
of data
data from
from hospital
hospital information
information systems to
to produce
produce therapeutic
therapeutic
collectlion
ad visories.
advisories.
2. Select a small
small number
number of
of important
important and frequently
frequently occurring
occurring medical
medical settings
settings
!?.
combination therapy
therapy with
with cardiac
cardiac glycosides
glycosides and di:retics)
diuretics) that
that can lead
(e.g., combination
to therapeutic
therapeut.ic misa,dventures,
misadventures, construct
construct a comprehensive
comprehensive medical
medical knowledge
knowledge
to
base necessary to
to detect
detect these situations
situations using
using the
the information
information typically
typically found
found
b:lse
comput.erized hospital
hospital information
information system
system and generate
generate timely
timely advisories
advisories
in a comput,erized
intended to
to a!t,er
alter behavior
behavior and
and avoid
avoid preventable
preventable drug
drug reactions.
reactions.
intended
3. Seiect
Seiect and test
test several
several methods
methods of
of formulating
formulating and
and providing
providing
physicians
in
order
to
find
an
optimal
method
of
feedback
that
physicians in order to find
optimal method of feedback that
and useful
useful to
to physicians
physicians and
and is feasible to
to implement.
implement.
and

advisories to
to
advisories
is acceptable
acceptable

Design and
and begin
begin to
to implement
implement an evaluation
evaluation of
of the
the impact
impact of
of the
the prototype
prototype
4. Design
MENTOR system
system on
on physicians’
physicians' therapeutic
therapeutic decision-making
decision-making as well
well as on
on
MENTOR
outcome measures
measures related
related to
to patient
patient health
health and
and costs of
of care.
outcome

B. .Justification
Justification and
and Requirements
Requirements for
for Continued
Continued StJMEX
SUMEX Use
B.
This project
project needs
needs continued
continued use
use of
of the
the SUMEX
SUMEX facilities
facilities for
for two
two reasons.
reasons. First’
First is
is
This
that itit provides
provides access
access to
to an
an environment
environment specifically
specifically designed
designed for
for the
the development
development of
of AI
AI
that

177

E. A.
A. Feigenbaum
Feigenbaum
E.

rvtENTOR
MENTOR Project
Project

RR00785-11
5P41 RR00785-11

systems. The
The MENTOR
MENTOR project
project focuses
focuses on the development
development of
of such as
as system
system for
for drug
drug
monitoring that
t,hat will
will explore
explore some neglected
neglected aspects of
of AI
AI in medicine.
medicine. Access to
to SUMEX
SUMEX
monitoring
is necessary for
for t.imely
t,ime!y development
development of
of the MENTOR
MENTOR system,
system, as
as well
well as advice and
a£sistance
assistance in the design and development
development of
of a well-designed
well-designed and efficient
efficient system.
system. Access
to SUMEX
SUMEX is also necessary to
to support
support the collaborative
collaborative effort
effort in this
this project
project as described
described
to
previously.
previously.

and Pfans
Other Computing
Computing Resources
SUMEX-AIM
C. Needs
Needs and
Plans for
for Other
Resources Beyond
Beyond SUAIEX-AIM
A
A major
major long-range
long-range goal of
of the MENTOR
MENTOR project
project is to
to implement
implement this
this system
system on a
independent
It
independent hardware
hardware system
system of
of suitable
suitable architecture.
architecture.
It is recognized
recognized that
that the full
full
monitoring
monit.oring system
system will
will require
require a large patient
patient data
data base
base as well
well as
as a sizeable medical
medical
knowledge base
base and must
must operate
operate on a close to
to real-time
real-time basis. Ultimately,
Ultimately, the SUMEX
SUMEX
knowledge
facilil·ies
Thus
facilities will
will not
not be s11itable
suitable for
for these applications.
applications.
Thus we intend
intend to transport
transport the
the
prototype
prot.otype system
system to
to a dedicated
dedicated hardware
hardware system
system that
that can fully
fully support
support the the planned
planned
system
system and which
which can be integrated
integrated into
into the SUMe
SUMC Hospital
Hospital Information
Information System.
System.
However, no firm
firm decisions have been made about
about the requirements
requirements for
for this
this system
system since
However,
many
many specification
specification and design decisions remain
remain to
to be made.

D. Recommendations
Recommendations for
for Future
Future Community
Resource Development
Development
D.
Community and
and Resource
In the brief
brief time
time we have been associated with
with SUMEX,
SUMEX, we have been generally
generally
In
p!ca,sed with
with the facilities
facilities and services. However,
However, it
it is evident
evident that
that disk
disk space is a critical
critical
pleased
factor in the functioning
functioning of
of the facility.
facility. It
factor
It would
would seem wise to
to increase disk
disk storage
storage in
order to
to meet
meet the needs of
of the users. Our
Our experience also indicates
indicates that
that an attempt
attempt, needs
order
to
with
to be made to organize
organize and update
update the
the documentation
documentation associated
associated
with the
the various
various
SUMEX systems. Being
Being new users, we found
found that
that paths
paths to
to useful
useful software
software was somewhat
somewhat
SUMEX
longer
longer t.han
than one might
might, expect.
expect. An
An expanded
expanded introduction
introduction to the system
system that,
that, at
at least,
briefly
briefly described
described the software
software available
available on SUMEX
SUMEX would
would be useful.

E. A. Feigenbaum
Feigenbaum

178

Protein
Protein Secondary
Secondary Structure
Structure Project
Project,

5P41
5P41 RH00785-11
RR00785-11

ll.A.3.3.
Protein
Secondary
Structure
Project
lI.A.3.3.
Protein
Secondary
Structure
Project

Protein
Project
Protein Secondary
Secondary Structure
Structure
Project
Robert
M.D.
Robert M.
M. Abarbanel,
Abarbanel,
M.D.
Section
Science
Section on
on Medical
Medical Information
Information
Science
University
of
Medical
University
of California
California
Medical Center
Center
University
of
at
University
of California
California
at San
San Francisco
Francieco

I.
I.

SUM1vL<\.RY
PROGRAM
SXJMMARY OF
OF RESEARCH
RESEARCH
PROGRAM
A.
A. Projcct
Project Rationalc
Rationale

Development
Development of a protein
protein structure
structure knowledge
knowledge base
base and tools for
for manipulation
manipulation of
that
knowledge
to
aid
in
the
investigation
of
new
structures.
System
that knowledge
investigation
structures.
System to include
include
cooperating
cooperat,ing knowledge
knowledge sources that
that work
work under
under the guidance
guidance of other
other system
system drivers
drivers to
find
Evaluations of structure
structure predictions
predictions using
find solutions
solutions to protein
protein structure
st,ructure problems.
problems. Evaluations
known
known proteins
prot,eins and other
other user feedbacks available
available to aid user in developing
developing new methods
methods
of prediction.
prediction.

B. Afcdical
Medical Relcvancc
Relevance and
wad Collaboration
Collaboration
!\!any
Many important
important proteins
proteins have been sequenced
sequenced but
but have not,
not, as
as yet,
yet, had their
their
secondary
The systems
systems developed here would
would aid medical
medical
secondary or
or tertiary
tertiary structures
structures revealed.
revealed. The
scientists
particular configurations,
scientists in the search for
for particular
configurations, for
for example,
example, around
around the active
active sites
in enzymes. Predictions
Predictions of
of secondary
secondary structure
structure will
will aid in the determination
determination of
of the
the full
full
"natural"
configuration
of
important
biological
materials.
Development
of
systems
“natural” configuration of important biological materials. Development of systems such
3S
as these will
will contribute
cont,ribute to
to our
our knowledge
knowledge of
of medical
medical scientific
scientific data
data representation
representation and
retrieval.
retrieval.

Highlights of
of Research
Rcsearch Progress
Progrcss
C. Nighlights
The prediction
prediction of
of beta-alpha
beta-alpha protein
protein structures
structures is complete.
complete. The
The system
system was
The
developed on a.
a VAX
VAX 11
/750 at
at the University
University of
of California,
California, San Francisco,
Francisco, to
to allow
allow
developed
111750
rese3Tchers to
to describe
describe patterns
patterns of
of amino
amino acid
acid residues that
that will
will be sought
sought in the
the sequences
sequences
researchers
under st*udy.
study. The
The presence or
or absence of
of these “primary”
"primary" patterns
patterns are then
then combined
combined
under
with ot’
otht'r
measures of
of structure,
structure, like
like hydrophobicity,
hydrophobicity, to
to suggest possible alpha
alpha helix
helix or
or
wit,11
her measnres
beta sheet
sheel or
or turn
turn configurations.
configurations.
beta
The segments
segments of
of a sequence
sequence between
between turns
turns are then
then analyzed
analyzed to
to determine
determine the
the
The
allowable extent
extent of
of the
the possible
possible secondary
secondary structure
structure assignments.
assignments. Any
Any segments
segments remaining
remaining
allowable
to generate
generate al!
all possible
possible complete
complete structures.
structures. Only
Only two
two beta
beta strands
strands with
with the
the
are t.hen used to
character of
of sheet edges are allowed
allowed in
in any
any prediction.
prediction. This
This hierarchical
hierarchical generation
generation and
and
character
pruning result,s
r,~sults in nearly
nearly 95%
95% t,urn
turn prediction
prediction accuracy,
accuracy, and
and excellent
excellent delimiting
delimiting of
of helices
pruning
and sheets. In
In some cases,
cases, one and
and only
only one secondary
secondary structure
structure is predicted.
predicted.
and
Research in
in Progress
Progress --- At
At this
this time,
time, work
work is under
under way
way to
to extend
extend this
this a/P
a/ f3
Research
assignment work
work to
to a set
set of
of cancer
cancer causing
causing viral
viral proteases.
proteases. These
These proteins
proteins are believed
believed t,o
to
.lulsant, B. and
and Servan-Schreiber,
Servan-Schreiber, D.:Knowledge
D.:Knowledge engineering:
engineering:
daily
14.
AA daily
acit'vity on a hospital
hospital ward,
wurd, Computers
Computers in Biomedical
Biomedical Research, 1984.
activity

Spitzer, R. L.,
L., Endicot.t,
Endicott, J. and
and Robins,
Robins, E.: Research
Research diagnostic
diagnostic criteria,
criteria, (2d
(2d
15. Spitzer,
cd.) New
New York
York State
Sta.te Department
Department of
of Mental
Mental Hygiene,
Hygiene, New
New York
York Psychiatric
Psychiatric
ed.)
Institute, Biometrics
Biometrics Research
Research Division,
Division, 1975.
Institute,
Spitzer, R. L.:
L.: (Ed.).Diagnostic
(Ed.).Diugnostic and
and statistical
stutistical manual
manuul of
of mental
mental disorders,
disorders,
16. Spit,zer,
(3d ed.). Washingt,on,
·Washington, D.
D. C.:
C.: American
American Psychiatric
Psychiatric Associa.tion,
Association, 1980.
(3d
17. Vsn
Van Melle,
Melle, W.:The
W.:The EIZffCIN
EJvfYCIN Manual,
Manual, Computer
Computer Science Department,
Department,
Stanford
University,
Report
HPP-81-16,
1981.
Stanford University, Report HPP-81-16,

E. Funding
Funding Support
Support
E.
\Ne have
have
We
at
the
Univ
of
at t,he Univ of

submitted
submitted
Michigan,
hfichiga.n,

an application
application for
for support
support to
to the
the Vice-President
Vice-President for
for Research
Research
who
has
funds
for
"seed
money"
for
faculty
research
(Total
who has funds for “seed money” for faculty research (Total

199

E. A.
A. Feigenbaum
Feigenbaum
E.

5P41 RR00785-11
RR00785-11

RXDX Project
Project
RXDX

Direct
Direct Costs
Costs =
= $3215). \Ve
We have prepared
prepared a grant
grant application,
application, to
to be sent to
to the NIH
NIH
"Small
Program for
for the May
Ma.y I,
1, 1984 deadline
deadline (Total
(Total Direct
Direct Costs =
= $13,850).
$13,850).
“Small Grants"
Grants” Program
These funds
funds should
should enable us to
to gather
gather the pilot
pilot data
data we will
will need as
as part
part of
of a major
major
gran
Iication.
grsnt t app
application.

ll.
WITH
RESOURCE
II. INTERACTIONS
INTERACTIONS
WITH THE
THE SUMEX-.AIM:
SUMEX-AIM
RESOURCE
A.
Collaboration and
and Program
via SUMEX
SUMEX
A. Medicai
AJediwl Collaboration
Program Dissemination
Dissemination via
\Ve
We are collaborating
collaborating via
via SUMEX
SUMEX with
with Dr.
Dr. Jon Heiser, who
who worked
worked with
with Ruven
Ruven
Brooks
Brooks on HEADMED
HEADMED in the late 1970's.
1970’s. We
We are sharing
sharing a common
common SUMEX
SUMEX account,
account, and
Dr. Heiser
Heiser will
will write
write the section
section of
of the expert
expert
communicating
communicating using computer
computer mail.
mail. Dr.
system
system dealing
dealing with
with the treatment
treatment of
of depression (and
(and eventually
eventually of
of other
other psychiatric
psychiatric
disorders)
disorders) while
while Drs. Feinberg
Feinberg and Lindsay
Lindsay work
work on the diagnostic
diagnostic parts
parts of
of the system.
system.

B. Sharing
with
Projects
B.
Sharing and
and Collaboration
Collaloration
with other
other SUAIEX-AIAf
SUMEX-AIM
Projects
\Ve
\%‘e are also collaborating,
collaborating, although
although more
more loosely, with
with Mess~s.
Ivfessrs. Benoit
Benoit Mulsant
Mulsant and
David
David Servan-Schreiber.
Servsn-Schreiber. They
They wrote
wrote an expert
expert system
system (BLUEBOX)
(BLUEBOX) for
for the
the diagnosis
diagnosis and
treatment
treatment of
of depression
depression which
which was a first
first step in the direction
direction we are going.
going. We
We have
access to BLUEBOX
BLUEBOX through
through SUMEX,
SUMEX, and have been able to learn
learn from
from its
its successes
successes and
access
failures.
David will,
will, we expect,
expect, be able to offer
offer us many
many helpful
helpful suggestions
suggestions on
failures. Ben and David
our
our expert
expert system
system (RXDX)
(RXDX) as
as they
they pursue
pursue their
their training
training in Psychiatry
Psychiatry and continue
continue their
their
work in AI
AI in medicine.
medicine.
work

C. Critique
Resource Management
AJanagement
Critique of
of Resource
We
We have been using
using EMYCIN
EMYCIN to
to set up our
our knowledge
knowledge base, and have found
found this
this
program
pr0gra.m invaluable,
invaluable, since it
it has saved us many
many hours
hours of
of programming
programming in LISP.
LISP. There
There are
problems with
with EMYCIN,
EMYCIN, many
many of
of which
which center
center around
around discrepancies
discrepancies between
between the
some problems
the version
version of
of EMYCIN
EMYCIN described
described in the manual
manual and the version
version actually
actually running
running on
SUMEX. vVe
We would
would suggest that
that EMYCIN
EMYCIN be more
more strongly
strongly supported
supported than
than is now
now the
SUMEX.
case. if
if it
it and SUMEX
SUMEX are to
to be even more
more useful
useful to
to beginners
beginners in AI
AI in Medicine.
Medicine. This
This
cast'.
may involve
involve added expense,
expense, such as would
would be involved
involved in the purchase
purchase of
of an updated
updated
may
version
version of
of EMYCIN,
EMYCIN, but
but we would
would certainly
certainly be able to
to make
make use
use of
of the updated
updated version.
version.
We
SUivlEX
SUMEX itself
itself has been invaluable.
invaluable.
We don't
don’t have easy access
access to any
any other
other
machine
ma.chine of
of equal
qua.1 computing
computing power
power which
which also has a strongly
strongly supported
supported LISP
LISP available.
available.
Specifically,
Specifically, the Dandelion
Dandelion LISP
LISP machine
machine at Michigan
Michigan is not
not easily
easily accessible, while
while t.he
t.he
LISP compiler
compiler available
available on the Amdahl
Amdahl 5860 here differs
differs from
from those used at major
major AI
AI
LISP
centers such as
as Stanford
Stanford and MIT.
MIT. We
We have also made good use
use of
of the ARPANET
ARPANET
centers
connections
connections that
that SU~IEX
SUMEX offers.
offers. Feinberg
Feinberg will
will spend a month
month of
of his sabbatical
sabbatical working
working
with Prof.
Prof. Peter
Peter Szolovits
Szolovits at
at MIT,
MIT, learning
learning about
about AI
AI in Medicine.
Medicine. (This
(This is an
a;n obvious
obvious and
with
necessary step for
for any
any physician
physician wanting
wanting to
to begin work
work in the
the field.)
field.) This
This visit
visit was
Lindsay and Feinberg
Feinberg will
will be able to
arranged
arranged using computer
computer mail
mail through
through SUMEX.
SUMEX. Lindsay
continue
continue t,heir
their collaborative
collaborative work
work while
while the latter
latter is in Cambridge,
Cambridge, using
using the same
medium.
medium. The
The alternative
alternative would
would be days lost
lost in the mails
mails and many
many dollars
dollars spent
spent on
phone calls. vVe
We have also been able to
to get rapid
rapid help with
with problems
problems that
that arise with
with
Er-.-IYCIN
EMYCIN using computer
computer mail,
mail, saving
saving days and/or
and/or dollars.
dollars.

ill.
III. RESEARCH
RESEARCH PLAN
PLAN

A.
Goals and
and Plans
A.. Project
Project Goals
Plans
Our immediate
immediate objective
objective is to
to develop an expert
expert system
system which
which can
cau differentiate
differentiate
Our
patients
pat.ient,s with
wit,11 the various
various subtypes
subtypes of
of depressive disorder,
disorder, and prescribe
prescribe appropriate
appropriate
Feigenbaum
E. A. Feigenbaum

200

RXDX
RXDX Project
Project

5P41
SP41 RR0078.')-1l
RR0078.5-11

treatment.
This
This system
system should
should perform
perform at
at about
about the
the level
level of
of aa board-certified
board-certified
treatment.
psychiatrist,
psychia.t.rist, i.e.
i.e. better
better than
than an
an average
average resident
resident but
but not
not as
as well
well as
as aa human
human expert
expert in
in
depression.
depression. Eventually,
Eventually, we
we plan
plan to
to enlarge
enlarge the
the knowledge
knowledge base
base so
so that
that the
the expert
expert system
system
can
can diagnose
diagnose and
and prescribe
prescribe for
for aa wider
wider range
range of
of psychiatric
psychiat)ric patients,
patients, particularly
particularly those
those
with
with illnesses
illnesses which
which are
are likely
likely to
to respond
respond to
to psychopharmacological
psychopharmacological agents. We will
will
design
design the
the system
syst,em so
so that
that itit could
could be
be used
used by
by non-medical
non-medical clinicians
clinicians or by non-psychiatrist
non-psychiat.rist
MD's
MD’s as
as an
an adjunct
adjunct to
to consultation
consultation with
with aa human
human expert.
expert.

B.
. ~tification and
for continued
continued SUMEX
StJMEX use
R. Ju
Justification
and Requirements
Requirements for
This
This project
project is
is entirely
entirely dependent
dependent on access
access to SUMEX.
SUMEX. We are using the EMYCIN
EMYCIN
syst.em
on
SUMEX.
That
software
is
not
available
to
us anywhere
system
SUMEX. That software is
available
anywhere else.
else. We also make
extensive use
EX as
use of SUM
SUMEX
as aa means of communication
communication and file-sharing
file-sharing with
with our
our
consnltant"
consulta.nt3, Jon Heiser, and with
with David
David Mulsant
Mulsant and Benoit
Benoit Servan-Schreiber.
Servan-Schreiber. The
The access
access
to SUMEX
S’CJMEX resources isis essentially
essentially our
our sale
sole means of maintaining
maintaining contact
contact with
with the
community
community of researchers working
working on applications
applications of AI
AI in medicine.
medicine.
\Ve
We ant.icipate
ant,icipate that
that our
our requirements
requirements for computing
computing time
time and file space
space will
will
continue
continue to
t,o grow
grow as
as the system
system evolves.

C. Needs and
and Plans
Plans for
for Other
Other Computing
Computing Resources
Resources
As our
our project
project evolves and we run
run into
into the limitations
limitations of
of EMYCIN
EMYCIN and the timetimeshared SUMEX
SUMEX facility,
facility, we anticipate
anticipate employing
employing different
different expert
expert systems software.
software. At
At
this
this time,
time, we are not
not at a stage to say exactly
exactly what
what that
that will
will be, but
but our
our project
project is not
not
sufficiently
sufficient,ly large that
that we will
will be able to mount
mount such a software
software development
development project
project
ourselves, so we will
depend
on
development
and
support
elsewhere.
Ultimately,
when our
our
will
development
support
Ultimately, when
consultant
is
made
available
for
field
trials
and
clinical
use,
it
will
need
to
be
transported
consulta.nt
available for field trials
clinical use, it will
to
transported
to a personal
yet inexpensive
inexpensive enough
enough to
to
personal computer
computer that
that is large enough support
support the system
system yet
be widely
\Vhile current
current prices
prices of
of the
widely available.
ava.ilable. A
A LISP
LISP machine
machine is an obvious
obvious candidate.
candidate. While
necessa.ry
to drop.
drop. Our
Our design
necessa.ry hardware
hardware are too
too high,
high, computer
computer prices are continuing
continuing to
st.rategy
to that
that which
which is affordable
affordable
st,rategy is to
t,o avoid
avoid limiting
limiting ourselves and our
our aspirations
aspirations to
today; instead
inst.ead we will
will attempt
attempt to
to project
project the growth
growth of
of our
our project
project and the
the pricepricetoday:
performance curve
curve of
of computing
computing such that
that they
they meet at
at some reasonable point
point in the
performance
fut.nre.
future.

D. Recommendations
Recommendations for
for Future
Future Community
and Resource
Resource Development
Development
D.
Community and
Valuable as
a.s the
the present
present SUMEX
SUMEX facilities
facilities are to
to us, they
they are in
in many
many ways
ways limited
limited
Valuable
and awkward
awkward to
to use. The
The need for
for more
more and more
more computer
computer cycles and memory
memory
cont.inues to
to grow,
grow, of
of course. However,
However, the
the major
major limitation
limitation we feel is the
the difficulty
difficulty and
and
continues
sometimes t,hr
the impossibility
impossibility of
of making
making contact
contact with
with everyone
everyone who
who could
could be of
of value
value to
to us.
sometimes
greater emphasis will
will be put
put on internetwork
internetwork gateways.
gateways. ItIt is important
important not
not
\Vee hope that
that, greater
W’
only t,o
t.o establish
est.ablish more
more of
of these, but
but to
to develop
develop consistent
consist.ent and
and convenient
convenient standards
standards for
for
only
elert.ronic mail,
mail, electronic
electronic file
file transfers,
transfers, graphic
graphic information
information transfer,
transfer, national
national archives
archives
elect.ronic
and dsta
data ba.ses,
ha:oes, and personal
personal filing
filing and
and retrieval
retrieval (categorization)
(categorization) systems.
systems. The
The present
present
and
state of
of the
the art
art is quite
quite limiting,
limiting, now
now that
that the
the basic concepts
concepts of
of comput.er
computer networking
networking have
st.ste
available and
and have proved
proved their
their potential.
potential.
become available

201

E. A.
A. Feigenbsum
Feigenbaum
E.

Books, Papers, and Abstracts
Abstra.cts

5P41
5P41 RR00785-11
RR00785-11

II.B.
and
ILB. Books,
Books, Papers,
Papers,
and Abstracts
Abstracts
Publications
Publications for the various
various collaborative
collaborative projects
projects are summarized
summarized in their
their
respective
They also have been submitted
submitted separately
separately on the Scientific
Scientific
respective progress reports.
reports. They
Subproject
Subproject Form
Form lIB.
IIB. They
They are not
not reproduced
reproduced here to avoid
avoid redundancy.
redundancy.

IT.C.
Resource
Summary
Table
KC.
Resource
Summary
Table
Detailed
is
Detailed resource usage
usage information
information
is summarized
summarized starting
starting on page 30.
Tabulations
of
this
information
also
have
been
submitted
separately
Tabulations
this information
submitted separately on the requested
request.ed
Scientific
Subproject
Form.
These
are
not
reproduced
Scientific Subproject Form.
not reproduced here to avoid
avoid redundancy.
redundancy.

A. Fcigenbaum
Feigenbanm
E. A.

202
202

Appendix
Appendix A
A

SP41
5P41 RR0078S-11
RR00785-11

Appendix
A
Appendix
A
AIM
Committee
Membership
AIM Management
Management
Committee
Membership
Following
are the
Following
management
mana.gement committees:
committees:

current
current

membership
membership

lists
lists

of
of

ALr..J Erecutiue
Executive Committee:
AIM
Committee:
FEIGENBAUM,
FEIGENBAUM, Edward
E.dward A.,
A., Ph.D.
Ph.D. (Chairman)
(Chairman)
Principal
Principal Investigator
Investigator - SUMEX
SUMEX
Heuristic
Heuristic Programming
Programming Project
Project
Department
Department of
of Computer
Computer Science
Margaret
Margaret Jacks Hall
Hall
Stanford
Stanford University
University
Stanford.
Stanford, California
California 94305
(415) 497-4879
LEDERBERG,
LEDERBERG, Joshua, Ph.D.
Ph.D.
President
President
The
The Rockefeller
Rockefeller University
University
1230 York
York Avenue
Avenue
New York,
New York
New
York, New
York 10021
10021
(212) 570-8080, 570-8000
KULIKOWSKI,
Casimir,
KULIKOWSKI,
Casimir, Ph.D.
Ph.D.
Department
Department of
of Computer
Computer Science
Rutgers
Rutgers University
University
New Brunswick,
New Jersey 08903
New
Brunswick, New
(201) 932-2005
932-2006

203

the

various
various

SUMEX-AIM
SUMEX-AIM

Appendix AA
Appendix

5P41 RR00785RR00785-11
5P41
1I

LINDB8RG, Donald
Donald A.B.,
A.B., M.D.
M.D.
LINDBERG,
605 Lewis
Lewis Hall
Hall
605
University of
of Missouri
Missouri
University
Columbia, Missouri
Missouri 65201
65201
Columbia,
(314) 882-6966
882-6966
(314)

(Adv Grp
Grp Chrmn)
Chrmn)
(Adv

MYERS, Jack
Jack D.,
D., M.D.
M.D.
MYERS,
School
of
Medicine
School of Medicine
Scaife Hall,
Hall, 1291
1291
Scaife
University
of
Pittsburgh
University of Pittsburgh
Pittsburgh,
Pennsylvania
15261
Pittsburgh, Pennsylvania 15261
(412) 624-2649
624-2649
(412)
SHORTLIFFE, Edward
Edward H.,
H., M.D.,
M.D., Ph.D.
Ph.D.
SHORTLIFFE,
Co-Principal Investigator
Investigator -- SUMEX
SUMEX
Co-Principal
Division of
of General
General Internal
Internal Medicine,
Medicine, TC117
TC 117
Division
Stanford University
University Medical
Medical Center
Center
Stanford
Stanford, California
California 94305
Stanford,
(415) 497-6970
(415)

204
204

5P41
5P41 RR00785-11
RR00785-11

Appendix
Appendix A
A

AIM Advisory
Advisory Group:
AIM
Group:
LINDBERG,
LINDBERG, Donald
Donald A.B.,
A.B., M.D.
M.D.
505
Lewis
Hall
605 Lewis Hall
University
University of Missouri
Missouri
Columbia,
Columbia, Missouri
Missouri 55201
65201
(314)
(314) 882-5955
882-6966

(Chairman)
(Chairman)

AMABEL,
AhIAREL, Saul, Ph.D.
Ph.D.
Depart.ment
Depart,ment of Computer
Computer Science
Science
Rutgers
Rutgers Universit.y
University
New Jersey 08903
New Brunswick,
Brunswick, New
(201)
(20 1) 932-3546
932-3546
BAKER,
(Exec. Secretary)
Secretary)
BAKER, \Villiam
William R., Jr.,
Jr., Ph.D.
Ph.D.
Biotechnology
Resources
Program
Biotechnology Resources Program
National Institutes
National
Institutes of Health
Health
Building
31,
Room
Building
Room 5B43
9000 Rockville
Rockville Pike
Pike
Bethesda, Maryland
Maryland 20205
20205
(301)
(301) 496-5411
496-5411
FEIGENBAUM,
FEIGENBAUM, Edward
Edward A.,
A., Ph.D.
Ph.D. (Ex-officio)
(Ex-officio)
Principal
Principal Investigator
Investigator - SUMEX
SUMEX
Heuristic
Heuristic Programming
Programming Project
Project
Department
Department of Computer
Computer Science
Science
Margaret
Margaret Jacks Hall
Hall
Stanford
Stanford University
University
Stanford,
Stanford, California
California 94305
94305
(415)
(415) 497-4879
497-4879
KULIKOWSKI,
KULIKOWSKI, Casimir,
Casimir, Ph.D.
Ph.D.
Department
Department of Computer
Computer Science
Science
Rutgers
Rut.gers University
University
New Jersey 08903
New Brunswick,
Brunswick, New
(201) 932-2006
932-2006
LEDERBERG,
LEDERBERG, Joshua, Ph.D.
Ph.D.
President
President
The
The Rockefeller
Rockefeller University
University
1230
1230 York
York Avenue
Avenue
New York,
York, New York
York 10021
(212)
(2 12) 570-8080,
570-8080, 5iO-8000
570-8000
MIi\SKY.
MINSKY, Marvin.
Marvin, Ph.D.
Ph.D.
Artificial
Artificial Intelligence
Intelligence Laboratory
Laboratory
Massachusetts
Massachusetts Institute
Institute of
of Technology
Technology
545 Technology
Technology Square
Cambridge.
Cambridge, Massachusett.s
Massachusetts 02139
(517)
(617) 253-5854
253-5864

205

Appendix
Appendix A
A

5P41 RR00785-11
RR00785- 11

f"IOHLER,
MOHLER, William
William C., M.D.
M.D.
Associate
Associate Director
Director
Division
Division of
of Computer
Computer Research and Technology
Technology
National
Institutes
National Institutes of
of Health
Health
Building
Building 12A, Room
Room 3033
9000 Rockville
Rockville Pike
Pike
Bethesda,
Bethesda, Maryland
Maryland 20205
(30l)
(301) 496-1168
MYERS,
MYERS, Jack
Jack D., M.D.
M.D.
School of
of Medicine
Medicine
Scaife Hall,
Hall, 1291
1291
University
University of
of Pittsburgh
Pittsburgh
Pittsburgh,
Pittsburgh, Pennsylvania
Pennsylvania 15261
15261
(412) 624-2549
624-2649
PAUKER,
PAUPER, Stephen
Stephen G., M.D.
M.D.
Department
Department of
of Medicine
-Medicine - Cardiology
Cardiology
Tufts
New England
Tufts New
England Medical
Medical Center
Center Hospital
Hospital
171
171 Harrison
Harrison Avenue
Avenue
Boston,
Boston, Massachusetts
Massachusetts 02111
(517)
(617) 956-591O
956-5910
SHORTLIFFE,
(Ex-officio)
SHORTLIFFE, Edward
Edward H., M.D.,
M.D., Ph.D.
Ph.D.
(Ex-officio)
Co-Principal
Co-Principal Investigator
Investigator - SUMEX
SUMEX
Division
Division of
of General
General Internal
Internal Medicine,
Medicine, TC117
TC117
Stanford
University
Medical
Stanford University Medical Center
Center
Stanford,
Stanford, California
California 94305
(415)
(415) 497-0970
497-6970
SIMON,
SIMON, Herbert
Herbert A.,
A., Ph.D.
Ph.D.
Department
Depa.rt,ment of
of Psychology
Psychology
Baker
Baker Hall,
Hall, 339
Carnegie-Mellon
Carnegie-hlellon University
University
Schepley
Schenley Park
Park
Pit.tsburgh.
Pittsburgh. Pennsylvania
Pennsylvania 15213
(412)
(4 12) 578-2787,
578-2787, 578-2000
578-2000

205
206

Appendix
Appendix A
A

RR00785-11
5P41 RRO0785-11

Stanford
Community Advisory
Advisory Committee:
Committee:
Stan
ford Community
FEIGENBAUM, Edward
Ed'ward A.,
A., Ph.D.
Ph.D. (Chairman)
(Chairman)
FEIGENBAUM,
Heuristic Programming
Programming Project
Project
Heuristic
Department of
of Computer
Computer Science
Department
Margaret Jacks Hail
Hall
Margaret
Stanford University
University
Stanford
Stanford, California
California 94305
Stanford,
(415) 497-4879
(415)
DJERASSI, Carl,
Carl, Ph.D.
Ph.D.
DJERASSI,
Department of
of Chemistry,
Chemistry, Stauffer
Stauffer I-106
1-106
Department
Stanford University
University
Stanford
Stanford, California
California 94305
Stanford,
(415) 497-2783
(415)
~'IAFFLY, Roy
Roy H., M.D.
M.D.
MAFFLY,
of Nephrology
Nephrology
Division of
Division
Veterans Administration
Administration Hospital
Hospital
Veterans
Miranda. Avenue
Avenue
3801 hfiranda
Palo Alto,
Alto, California
California 94304
Palo
8.58-3971
(415) 858-3971

SHORTLIFFE, Edward
Edward H., M.D.,
M.D., Ph.D.
Ph.D.
SHORTLIFFE,
Co-Principal Investigator
Investigator - SUMEX
SUMEX
Co-Principal
Division of
of General
General Internal
Internal Medicine,
Medicine, TC117
TC117
Division
Stanford
Stanford University
University Medical
hjedical Center
Center
Stanford,
Stanford, California
California 94305
(415) 497-5970
497-6970

207
207

5P‘1I RR00785-11
RR00785- 11
5P41

Appendix
Appendix B

Appendix
B
Appendix
B
Community
Growth
and Project
Project
Abstracts
Community
Growth
and
Abstracts
This appendix
appendix contains
contains a graphical
graphical display
display of the development
development of the SUMEX-AIM
SUMEX-AIM
This
community over
over the years and abstracts
abstracts of
of currently
currently active
active projects.
projects. Figure
community
Figure 15
15 below
illustrat,es the substantial
substantial growth
growth in the cumulative
cumulative number
number of projects
projects in the Stanford,
Stanford,
illustrates
Nat,ional AIM,
AIhf, and Rutgers-AIM
Rutgers-AIM communities
communities since the resource began operation
operation in 1974
1974
National
until this
this past year.
year. The
The recent
recent decrease
decrease in the total
total number
number of projects
projects is due to the
up until
severa,l long time
Lime SUMEX-AIM
SUMEX-AIM projects,
projects, namely
namely Dendral,
Dendral, Puff/Vm,
Puff/Vm. Act,
Act, and
closure of several
Protein. Activity
Activity in the community
community however
however remains
remains high, as
as evidenced by the number
number of
Prot,ein.
pilot projects
projects (5
(5 Stanford
SLa.nford pilots,
pilots, 22 Aim
Aim pilots,
pilots, and 11 Rutgers
Rutgers pilot)
pilot) currently
currently active
active in the
pilot
SUMES-AIM community.
community.
SUMCX-AIM

25
25

Numbzr
of Projects
Projects
Number of

Rutgers Projects
Projects
Rutgers

20

p=-j
4J+------L

IS

National AIM
AIM Projects
Projects
National

10

~--------~'----

r--------------------~

5

~/

P974

1975

1976

1977

Stanford Projects

1978

Figure
Figure 15:
15:

1:)79

1930

"l!JS1

19B;?

1D83

198'~

1985

SUh?EX-AIM
SUMEX-AIM Growth
Growth by
by Community
Community

209
:::!09

A. Feigenbaum
Feigenbaum
E. A.

5P41 RR00785-11
RR00785-11

Appendix
Appendix B

National AIM
National
AIM Project:
Project:

CADUCEUS
CADUCEUS (formerly
(formerly INTERNIST)
INTERNIST)

Principal
Principal Investigators:
Investigators:

Jack D. Myers,
Myers, M.D.
M.D. (MYERS@SUMEX-AIM)
(MYERSBSUMEX-AIM)
Harry
Harry E. Pople,
Pople, Ph.D.
Ph.D. (POPLE@SUMEX-AIM)
(POPLEQSUMEX-AIM)
University
University of
of Pittsburgh
Pittsburgh
Pittsburgh,
15261
Pittsburgh, Pennsylvania
Pennsylva.nia 15261
Dr.
Dr. Pople:
Pople: (412) 624-3490
6243490

The major
major goal of
of the CADUCEUS
CADUCEUS Project
Project is to
to produce
produce a reliable
reliable and adequately
adequately
The
complete
complete diagnostic
diagnostic consultative
consultative program
program in the field
field of
of internal
internal medicine.
medicine. Although
Although this
this
program is intended
intended primarily
primarily to
to aid skilled
skilled internists
internists in complicated
complicated medical
medical problems,
problems,
program
the program
program may
may have spin-off
spin-off as a diagnostic
diagnostic and triage
triage aid to
to physicians'
physicians’ assistants,
assistants,
rural health
health clinics,
clinics, military
military medicine
medicine and space travel.
travel. In
In the
the design of
of CADUCEUS
CADUCEUS and
rural
its
I, we have attempted
its predecessor INTERNIST
INTERNIST
attempted to
to model
model the
the creative,
creative, problemproblemformulation
formulation aspect of
of the clinical
clinical reasoning
reasoning process. The
The program
program employs
employs a novel
novel
heuristic procedure
procedure that
that composes differential
differential diagnoses, dynamically,
dynamically, on the basis of
of
heuristic
clinical
clinical evidence. During
During the course of
of a CADUCEUS
CADUCEUS or
or INTERNIST
INTERNIST I consultation,
consultation, it
it is
not
not uncommon
uncommon for
for a number
number of
of such conjectured
conjectured problem
problem foci
foci to
to be proposed
proposed and
investigated, with
with occasional
occasional major
major shifts
shifts taking
taking place in the program's
program’s conceptualization
conceptualization
investigated,
of
of the task
task at hand.
hand.

SOFTWARE
SOFTWARE AVAILABLE
AVAILABLE ON
ON SUMEX
SUMEX

Versions of
of INTERNIST
INTERNIST
available for
for experimental
experimental use,
use, but
but the project
project
Versions
are available
continues
continues to
to be oriented
oriented primarily
primarily towards
towards research and development;
development; hence, a stable
stable
production version
version of
of the system
system is not
not yet
yet available
available for
for general
general use.
use.
production

REFERENCES
REFERENCES
Pople,
DIALOG model
The DIALOG
model of
of diagnostic
diagnostic
Pople, H.E.,
H.E., Myers,
Myers, J.D. and Miller,
Miller, R.A.:
R.A.: The
logic
logic and
and its
its use in
in internal
internal medicine.
medicine. Proc.
Proc. Fourth
Fourth IJCAI,
IJCAI, Tbilisi,
Tbilisi, USSR,
September,
September, 1975.
in diagnostic
diagnostic
problem
Pople, H.E.:
H.E.: The formation
of composite
composite hypotheses
hypotheses in
Pople,
formation of
problem
solving:
exercise in
in synthetic
synthetic reasoning.
reasoning. Proc.
Proc. Fifth
Fifth IJCAI,
IJCAI, Boston,
Boston,
An exercise
solving: An
August,
August, 1977.

Feigenbaum
E. A. Feigenbaum

210
210

5P41
5P41 RR00785-11
RR00785-11

Appendix
Appendix BB

National
National AIM
AIM Project:
Project:

SECS
SECS --- SIMULATION
SIMULATION AND
AND EVALUATION
EVALUATION
OF
OF CHEMICAL
CHEMICAL SYNTHESIS
SYNTHESIS

Principal
Principal Investigator:
Investigator:

W.
W. Todd
Todd Wipke,
Wipke, Ph.D.
Ph.D.
Department
Department ot
of Chemistry
Chemistry
University
University ot
of California
California at
at Santa
Santa Cruz
Cruz
Santa
Santa Cruz,
Cruz, California
California 95064
95064
(408)
(408) 429-2397
429-2397 (WIPKE~SUMEX-AIM)
(WIPKE@SUMEX-AIM)

The
The SECS
SECS Project
Project aims at developing
developing practical
practical computer
computer programs
programs to assist
investigators
in
designing
syntheses
of
complex
organic
molecules
of
biological
investigators
complex organic
biological interest.
interest.
Key
Key features
features of this research include
include the use
use of computer
computer graphics
graphics to allow
allow chemist
chemist and
computer
computer to work
work efficiently
efficiently as
as aa team, the development
development of knowledge
knowledge bases
bases of chemical
chemica.1
reactions,
reactions, and the formation
formation ot
of plans to reduce the search for solutions.
solutions. SECS is
is being
used
used by the pharmaceutical
pharmaceutical industry
industry for designing syntheses of drugs.
A spin-off
spin-off project,
project, XENO,
XENO, is aimed at predicting
predicting the plausible
plausible metabolites
metabolites of
foreign
foreign compounds
compounds for carcinogenicity
carcinogenicity studies.
studies. First,
First, the metabolism
metabolism is simulated;
simulated; then
then
the metabolites
met,abolites are evaluated
evaluated for
for possible carcinogenicity.
carcinogenicity.
SOFTWARE
SOFTWARE AVAILABLE
AVAILABLE ON
ON SUMEX
SUMEX
SECS--

An
An organic
organic synthesis
synthesis design prograr.1
progra,m available
available with
with a reaction
reaction library
library
over
500
reactions.
The
program
is
accessible
to
users
via
over
reactions. The program
to
via a teletype
teletype
DEC
DEC GT40
GT40 type
type graphics
graphics terminal.
terminal.

of
of
or
or

XENO-XENO--

A
of xenobiotic
xenobiotic compounds.
compounds.
A program
program for
for prediction
prediction of metabolites
metabolites of
Although
early development
development stages, this
this
Although the project
project is still
still in the early
program
testing.
program is available
available for
for preliminary
preliminary exploration
exploration and testing.

PRXBLD-PRXBLD--

A
molecular models
models
A facility
facility for
for building
building approximate
approximate 3-dimensional
3-dimensional molecular
from
The program
program employs
employs an
from their
their 2-dimensional
2-dimensional representations.
representations.
The
energy minimization
minimization approach
approach and is available
available both
both stand-alone
stand-alone and as
energy
part of
of SECS.
part

QED-QED--

A domain-independent
domain-independent inference
inference engine which
which represents
represents knowledge
knowledge in
A
first order
order predicate
predicate calculus.
calculus.
first

FSECS-FSECS--

forward-working synthesis
synthesis prototype
prototype program
program for
for finding
finding starting
starting
A forward-working
A
material oriented
oriented syntheses.
material

SST-SST--

A program
program for
for searching
searching through
through a library
library of
of possible
possible starting
starting materials
materials
A
to
suggest
potential
starting
materials
for
a
given
target
molecule.
to
potentia.1 starting materials for given target molecule.

REFERENCES
REFERENCES
\Vipke, W.T.,
W.T., Rogers,
Rogers, D.:
D.: Rapid
Rapid Subgraph
Subgraph Search
Search Using
Using Parallelism.
Parallelism.
\Vipke,
Chern. If.
If. Comput.
Comput. Sci. (Submitted
(Submitted April
April 24, 1984).
J. Chem.
W.T.: An
An Integrated
Integrated System
System for
for Drug
Drug Design,
Design, in
in COMPUTERS
COMPUTERS
\Vipke, W.T.:
Wipke,
A-Z: AA Manufacturer’
Manufacturer's
Guide to
to Hardware
Hardware and
and Software
Software for
for the
the
A-Z:
s Guide
Pharmaceutical Industry,
Industry, Aster
Aster Publishing
Publishing C.,
C., Springfield,
Springfield. Oregon.
Oregon.
Pharmaceutical
(In press)
press)
(In
Wipke, W.T.,
\V.T., and
and Rogers,
Rogers, D.:
D.: Artificial
Artificial Intelligence
Intelligence in
in Organic
Organic
Wipke,

211
211

E. A.
A. Feigenbaum
Feigenbaum
E.

5P41 RR00785-11
RR00785-11

Appendix
Appendix B

Synthesis.
An
Synthesis. SST:
SST: Starting
Starting !vfaterial
Material Selection
Selection Strategies.
Strategies. An
Application of
Application
of Superstructure
Superstructure Search.
Search. J. Chern.
Chem. Inf.
Inf. Comput.
Comput. Sci.,
24:0000, 1984.
Wipke,
prediction
Wipke, W.T.,
W.T., Ouchi,
Ouchi, G.!.
G.I. and Chou,
Chou, J.T.:
J.T.: Computer-assisted
Computer-assisted prediction
of
CORRELATIONS
of metabolism.
metabolism. IN
IN L. Goldberg
Goldberg (Ed.),
(Ed.), STRUCTURE-ACTIVITY
STRUCTURE-ACTIVITY
CORRELATIONS
AS A
PREDICTIVE.
TOOL
IN
TOXICOLOGY.
Hemisphere
Publishing
A PREDICTIVE.TOOL
IN TOXICOLOGY.
Hemisphere Publishing Corp.,
Corp.,
New York,
New
York, 1983, pp 151-169.
Wipke,
Wipke, \V.T.,
W.T., Ouchi,
Ouchi, G. and Krishnan,
Krishnan, S.: Simulation
Simulation and
and evaluation
evaluation of
of
chemical
synthesis--SECS. An
An application
intelligence
chemical synthesis--SECS.
application of
of artificial
artificial
intelligence
techniques.
techniques. Art.ificial
Artificial Intelligence
Intelligence 10:999, 1978.

E. A.
-4. Feigenbaum
Feigenbaum

212

Appendix
Appendix B

5P41
5P41 RR00785-U
RR00785-11
National
National AIM
AIM Project:
Project:

CLIPR
CLIPR -- HIERARCHICAL
HIERARCHICAL MODELS
MODELS
OF HUMAN
HUMAN COGNITION
COGNITION

Principal
Principal Investigators:
Investigators:

Walter
Walter Kintsch,
Kintsch, Ph.D. (KINTSCH@SUMEX-AIM)
(KINTSCH@SUMEX-AIM)
Peter
Peter G. Polson,
Poison, Ph.D.
Ph.D. (POLSON@SUMEX-AIM)
(POLSON@SUMEX-AIM)
Computer
Computer Laboratory
Laboratory for Instruction
Instruction
in Psychological
Psychological Research (CLIPR)
(CLIPR)
Department
Department of Psychology
Psychology
University
University of Colorado
Colorado
Boulder,
Boulder, Colorado
Colorado 80302
80302
(303) 492-6991
492-6991
Contact:
Contact: Dr.
Dr. Peter
Peter G. Polson (Polson@SUMEX-AIM)
(Polson@SUMEX-AIM)

The CLIPR
CLIPR Project
Project isis concerned with
with the modeling
modeling of complex psychological
psychological
processes.
It is comprised
comprised of two
two research groups. The
The prose comprehension group has
has
processes. It
completed
completed a project
project that
that carries out
out the microstructure
microstructure text
text analysis described by
by Miller
Miller
a.nd
and Kintsch
Kintsch (1980),
(1980) yielding
yielding predictions
predictions of the recall and readability
readability of that
that text
text by
human
More recently,
recently, this
this group
group has been
been interacting
interacting with
with the Heuristic
Heuristic
human subjects.
subjects.
Programming
Programming Project
Project at Stanford,
Stanford, using the AGE
AGE and UNITS
UNITS packages to build
build a more
complex
model
of
the
knowledge-based
processes
characteristic
of
prose
comprehension.
complex
processes characteristic
The
The planning
planning group is working
working toward
toward a model of the planning
planning processes
processes used
used by
by expert
expert
computer
comput,er software
software designers.
designers. The initial
initial processes
processes involved
involved in learning
learning to use
use computers
computers
and other
other complex devices.

SOFTWARE
SOFTWARE AVAILABLE
AVAILABLE ON
ON SUMEX
SUMEX
A
programs has been developed
perform the microstructure
A set of
of programs
developed to
to perform
microstructure text
text analysis
described in Kintsch
Kintsch and van Dijk
Dijk (Psychological
(Psychological Review, 1978) and Miller
Miller and Kintsch
Kintsch
(1980). The
program accepts a propositionalized
propositionalized text
The program
text as
as input,
input, and produces indices that
that
can be used.
based in
used to estimate
estirnate the text's
text’s recall
recall and readability.
readability. A
A more complex
complex model based
AGE
AGE and UNITS,
UNITS, which
which emphasizes
emphasizes the knowledge-based aspects of
of comprehension, is
currently
under development.
currently under
development.

REFERENCES
REFERENCES
Jeffries, R., Turner,
Turner, A.A.,
A.A., Polson, P.G.
P.G. and Atwood,
Atwood, M.A.:
M.A.: The F’
Processes
Jeffries,
rocesses
Involved in
in Designing
Des!'gning So
Software.
IN J.R. Anderson
Anderson (Ed.),
(Ed.), COGNITIVE
COGNITIVE SKILLS
SKILLS
Inuoieled
jt,ware. IN
AND THEIR
THEIR ACQUISITION.
ACQUISITION. Hillsdale,
Hillsdale, NJ,
NJ, L. Erlbaum
Erlbaum Assoc., 1981.
(Forthcoming)
AND
1981. (Forthcoming)
Kieras, D.E.
D.E. a.nd
and Poison,
Polson, P.G.:
P.G.: The formal
formal analysis
analysis oof
Kiera.s,
j
user
complexity.
Int.
J.
Man-Machine
Studies,
In Press.
user complexity. Int.
Man-Machine Studies, In
Kintsch, W.:
W.: On
On modeling
modeling comprehension.
comprehension. Educ.
Educ. Psychologist,
Psychologist, 14:3-14, 1979.
Kintsch,
fI.,tiller, J.R.
J.R. and Kintsch,
Kintsch, W.:
W.: Readability
Readability and
and recall
recall oof
short prose
prose
Miller,
j short
passages: A
A. theoretical
theoretical analysis.
analysis. J. Experimental
Experimental Psychology:
Psychology:
passages:
Human Learning
Learning and Memory,
Memory, 1980.
1980. (In
(In press)
Human

213
213

A. Feigenbaum
Feigenbaum
E. A.

5P41 RR00785-11

Appendix
Appendix B

Rutgers
Rutgers AIM
AIM Project:
Project:

RUTGERS
RUTGERS RESEARCH
RESEARCH RESOURCERESOURCE
COMPUTERS
COMPUTERS IN
IN BIOMEDICINE
BIOMEDICINE

Principal
PI aincipal Investigators:
Investigators:

Saul Amarel,
Amarel, Ph.D.[1982-83],
Ph.D.[1982-831, Casimir
Casimir Kulikowski,
Kulikowski, Ph.D.
Ph.D.
Sholom
Sholom M. Weiss, Ph.D.[H)83-84j
Ph.D.1 1983-841
Department
Department of Computer
Computer Science
Science
Rutgers
Rutgers University
University
New Brunswick,
Brunswick, New Jersey 08903
(201) 932-3546 (AMAREL@RUTGERS)
(AMARELQRUTGERS)
(201) 932-2006 (KULIKO\VSKI@RUTGERS)
(KULIKOWSKI@RUTGERS)
(201) 932-2379 (WEISS@RUTGERS)
(WEISS@IRUTGERS)

The
The Rutgers
Rutgers Research Resource provides
provides the research support
support with
with artificial
artificial
intelligence
intelligence systems, and the computing
computing support
support with
with its DEC2060
DEC2060 facility
facility to a large
number
There are currently
currently 86 investigators
investigators
number of biomedical
biomedical scientists
scientists and researchers. There
associated with
with the Resource. Research activities
activities are concentrated
concentrated in three
three major
major areas:
expert
expert medical
medical systems, models for
for planning
planning and knowledge
knowledge acquisition,
acquisition, and general
general AI
AI
systems
systems development.
development.
One of
of the most
most significant
significant achievements
achievements in bringing
bringing the work
work of
of the Resource to
to
bear
bear on clinical
clinical research and practice
practice lies in the transfer
transfer of
of technology
technology from
from our
our large
DEC20
DEC20 machine
machine to microprocessor
microprocessor compatible
compatible representations.
representations. The
The initial
initial breakthrough
breakthrough
came with
with the automatic
automatic translation
translation of
of a serum
serum protein
protein electrophoresis
electrophoresis interpretation
interpretation
model
model so that
that a version
version could
could be incorporated
incorporated in an instrument
instrument - the
t,he scanning
scanning
densitometer
densitometer (CliniScan)
(CliniScan) produced
produced by
by Helena
Helena Laboratories.
Laboratories. After
After testing,
testing, it
it was
disseminated
disseminated commercially,
commercially, marking
marking the
the first
first successful transfer
transfer of
of technology
technology from
from the
Resource to
being used in over
to general
general availability
availability in the clinical
clinical community.
community. It
It is now
now being
over one
hundred
hundred clinical
clinical locations.
locations.
During
period, we have started
project with
During the current
current period,
started a new project
with long
long term
term
implications
implications for
for the impact
impact of
of AIM
AIM technology:
technology: the development
development of
of a hand-held
hand-held
microcomputer
microcomputer version
version of
of an expert
expert consultation
consultation system
system for
for front-line
front-line health
health workers.
workers. In
collaboration
with
collaboration
with Dr.
Dr. Chandler
Chandler Dawson
Dawson (UCSF),
(UCSF), Director
Director of
of the \Vorld
World Health
Health
Organization's
Organization’s Collaborative
Collaborat,ive Centre
Centre for
for the Prevention
Prevention of
of Blindness
Blindness and Trachoma,
Trachoma, we
have developed
prototype model
primary eye care. This
developed a prototype
model for
for consultation
consultation on primary
This has been
orient.ed
problems of
oriented at problems
of injury,
injury, infection,
infection, malnutrition
malnutrition and cataract
cataract in situations
situations where
where an
opht,hamologist is unavailable.
unavailable. In
In most
most developin,
developingc nations,
nations, the incidence
incidence of
of blindness
blindness is
ophthamologist
10%
to 40%
40% higher
higher than
than in the
the USA
USA because of
of these kinds
kinds of
of problems.
problems. With
With the
the help of
of
10% to
grant from
from the
the CJSAID,
USAID, we are developing
developing the
the systems needed for
for management
management of
of eye
a gra.nt
by front-line
front-line health
health workers
workers in developing
developing nations,
nations, and outlying
outlying parts
parts of
of the
the USA.
USA.
disease by
disease
Another significant
significant technology
technology transfer
transfer experiment
experiment involves
involves a very
very large
Another
consultation model.
model. The
The rheumatology
rheumatology knowledge
knowledge base developed
developed by
by our
our collaborators
collaborators
consultation
Drs. Lindberg
Lindberg and Sharp
Sharp at
at the
the University
University of
of Missouri
Missouri has been transferred
transferred by
by us to
to the
the
Drs.
MC58000 microprocessor
microprocessor based system,
system, and
and in the
the past
past year
year testing
testing has begun
begun at
at their
their
MC68000
sit.e. This
This represents
represents a major
major step in bringing
bringing the
the results
results of
of artificial
artificial int,elligence
intelligence research
research
sit,e.
to the point
point where
where clinical
clinical researchers
researchers who
who do not
not have access
access to
to large research
research machines
machines
to
will be able to
to make
make use of
of the
the results.
results. We
We are designing
designing a specialized
specialized rheumatology
rheumatology
will
marhine
which
can
carry
out
the
same
sophisticated
reasoning
that
now
the
machine which
carry out the
sophisticated reasoning that now needs the
Resource DEC20
DEC20 , but
but will
will cost
cost little
little over
over $10,000.
$10,000. Because the
the transfer
transfer has been
Resource
accomplished we can continue
continue to
to develop
develop large scale models
models using
using the
the full
full facilities
facilities of
of the
the
accomplished
Resource DEC“0
DEC20
but wit,h
with the
the confidence
confidence that
that they
they can then
then move
move out
out into
into clinical
clinical
Resource
- 9, but
research environments
environments when
when completed.
completed.
research

A. Feigenbaum
Feigenbaum
E. A.

214

5P41
5P41 RR00785-11
RR00785-11

Appendix
Appendix BB

REFERENCES
REFERENCES
Weiss,
Weiss, S.M.
S.M. and
and Kulikowski,
Kulikowski, C.A.
C.A. A Practical
Practical Guide
Guide to
to Designing
Designing
Expert
Expert Systems,
Systems, Rowman
Rowman and
and Allanheld,
Ailanheld, H184.
1984.
Kulikowski,
Kulikowski, C.A.
C.A. contributor
contributor to the Knowledge
Knowledge Acquisition
Acquisition chapter
chapter
edited
edited by B.
B. Buchanan
Buchanan in
in the book Building
Building Expert
Expert Systems
Systems (F.
(F. HayesHayesRoth,
Roth, et
et aI.,
al., eds)
eds) Addison-Wesley,
Addison-Wesley, 1983
1983 (in
(in press).
Yao, Y. and KUlikowski,
of
Strategies
of Reasoning
Reasoning
Kulikowski, C.A.,
C.A., It-fultiple
Multiple Strategies
for
for Expert
Expert Systems,
Systems, Proc.
Proc. Sixteenth
Sixteenth Hawaii
Hawaii International
International Conference
Conference on
Systems Sciences,
Sciences, pp. 510-514,1983.*
510-514 , 1983.)
Kulikowski,
in Expert
Ezpert AI
AI Medical
Medical Consultation
Consultation Systems:
Systems:
Kuiikowski, C.A.Progress
C.A.Progresa in
1980
IQ880- 1988,
198.9, Proc.
Proc. MEDINFO
MEDINFO '83
‘83 , pp. 499-502, Amsterdam,
Amsterdam, August
August 1983.*
1983.”
Kastner,
for
Kastner, J.K.,
J.K., Weiss, S.M.,
S.M., and Kulikowski,
Kulikowski, C.A.,An
C.A.,An Efficient
Efficient Scheme for
Time-Dependent
Time-Dependent Consultation
Consultation Systems,
Systems, Proc.
Proc. MEDINFO
MEDINFO '83,
‘83, pp.619-622,
pp.619-622, 1983.*
Kulikowski,
Kuiikowski, C.A.Expert
C.A.Ezpert Medical
Medical Consultation
Consultation Systems,
Systems, Journal
Journal of
of Medical
Medical
Systems, v.7, pp. 229-234, 1983.*

in
Weiss, S.M.,
Ezpertise
in
S.M., Kulikowski,
Kulikowski, C.A.,
C.A., and Galen, R.S.,Representing
R.S.,Representing Expertise
a Computer
Computer Program:
Program: The Serum
Serum Protein
Protein Diagnostic
Diagnostic Program,
Program, Journal
Journal of
of
Clinical
Clinical Laboratory
Laboratory Automation,
Automation, v.3,
v.3, pp. 383-387, 1983.
1983.**
Kastner,
Expert System
for
Kastner, J.K.,
J.K., Weiss, S.M.,
S.M., and Kulikowski,
Kulikowski, C.A.,
C.A., An
An Ezpert
System for
Primary Eye Care,
Front-Line Health
Health Workers
Front-Line
Workers in
in Primary
Care, Proc.
Proc. Seventeenth
Seventeenth Hawaii
Hawaii
International
International Conference
Conference on Systems Sciences,
Sciences, pp. 162-166, 1984.
1984.**
Kulikowski, C.A.
C.A. Knowledge
Knowledge Acquisition
Acquisition and
and Learning
Learning in
in EXPERT,
EXPERT,
Kulikowski,
Proc. 1983 \Vorkshop
\Vorkshop on Machine
Machine Learning,
Learning, Univ.
Univ. of
of Iilinois,Champaign-Urbana
Illinois,Champaign-Urbana
Proc.
1983.
Indicate by
by an asterisk
asterisk (*)
(*) that
that the
the resource
resource was given
given credit.
credit.
Indicate

215
215

E. A.
A. Feigenbaum
Feigenbaum
E.

5P41 RR00785-11

Appendix
Appendix B

National
National AIM
AIM Project:
Project:

SOLVER
SOLVER -- PROBLEM
PROBLEM SOLVING
SOLVING EXPERTISE
EXPERTISE

Principal
Principal Investigators:
Investigators:

Paul
Paul E. Johnson, Ph.D.
Ph.D.
Center
Center for
for Research in Human
Human Learning
Learning
205 Elliott
Elliott Hall
Hall
University
University of Minnesota
Minnesota
Minneapolis,
Minneapolis, Minnesota
Minnesota 55455
(612) 373-5302 (PJOHNSON@SUMEX-AIM)
(PJOHNSON@SUMEX-AIM)
William
William B. Thompson,
Thompson, Ph.D.
Ph.D.
Department
Department of Computer
Computer Science
136 Lind
Lind Hall
Hall
University
University of Minnesota
Minnesota
Minneapolis,
Minneapolis, Minnesota
Minnesota 55455
(612) 373-0132 (THOMPSON@SUMEX-AIM)
(THOMPSON@SUMEX-AIM)

The
The Minnesota
Minnesota SOLVER
SOLVER project
project focuses
focuses upon
upon the development
development of
of strategies
strategies for
for
discovering
discovering and representing
representing the knowledge
knowledge and skill
skill of
of expert
expert problem
problem solvers. Although
Although
in the last 15 years considerable
considerable progress has been made in synthesizing
synthesizing the expertise
expertise
required
for
solving
complex
problems,
most
expert
systems
required for solving complex problems, most expert systems embody
embody only
only the limited
limited
amount
amount of
of expertise
expertise that
that individuals
individuals are able to report
report in a particular
particular constrained
constrained
lan~uage
language (e.g., production
production rules).
rules). What
What is still
still lacking
lacking is a theoretical
theoretical framework
framework capable
of reducing
reducing dependence upon
upon the
the expert's
expert’s intuition
intuition or
or on the near exhaustive
exhaustive testing
testing of
possible organizations.
organizations. Our
Our methodology
methodology consists of: (1) extensive
extensive use
use of
of verbal
verbal thinking
thinking
aloud
aloud protocols
protocols as
as a source of
of information
information from
from which
which to
to make inferences
inferences about
about
underlying
processes; (2) development
underlying cognitive
cognitive structures
structures and processes;
development of
of computer
computer models
models as
az a
means of
protocol studies;
of testing
testing the
the adequacy
adequacy of
of inferences
inferences derived
derived from
from protocol
studies; (3) testing
testing and
refinement
refinement of
of the cognitive
cognitive models
models based upon
upon the study
study of
of human
human and model
model
performance in experimental
problem-solving
performance
experimental settings.
settings. Currently,
Currently, we are investigating
investigating problem-solving
expertise
physics, engineering,
expertise in domains
domains of
of medicine,
medicine, physics,
engineering, management,
management, and law.
law.
SOFTWARE
SOFTWARE AVAILABLE
AVAILABLE ON
ON SUMEX
SUMEX
A
A redesigned
redesigned version
version of
of the Diagnoser
Diagnoser simulation
simulation model,
model, named
named Galen,
Galen, has been
implemented
implemented on SUMEX.
SUMEX.
REFERENCES
R.EFERENCES
Johnson. P.E.:
P.E.: What
kind of
of expert
expert should
should a sycltem
system be?
Johnson,
What kind
Medicine and
and Philosophy,
Philosophy. 8:77-97, 1983.
J. Medicine
Johnson. P.E.,
P.E.. Duran,
Duran. A.,
A .. Hassebrock,
Hassebrock. F.,
F .• Moller,
Moller, J., Prietula,
Prietula. M.,
M ..
Johnson,
Feltovich. P. and
and Swanson,
Swanson. D.: Expertise
Expertise and
and error
error in
in diagnostic
diagnostic
Feltovich,
reasoning. Cognitive
Cognitive Science 5:235-283,
5:235-283. 11981.
981.
reasoning.
Thompson. W.B.,
\V.B .. Johnson,
Johnson. P.E.
P.E. and Moen,
Moen. J.B.:
J.B.: Recognition-based
Recognition-based
Thompson,
diagnostic reasoning.
reasoning. Proc.
Proc. Eighth
Eighth IJCAI,
IJCAI, Karlsruhe,
Karlsruhe, West
West
diagnostic
Germany. August,
August. 1983.
1983.
Germany,
Sedlmeyer. R.L.,
R.L.. Thompson,
Thompson. W.B.
\V.B. and
and Johnson,
Johnson, P.E.:
P.E.: Diagnostic
Diagnostic
Sedlmeyer,
reasoning in
in software
software fault
fault localization.
localization. Proc.
Proc. Eighth
Eighth IJCAI,
IJCAI.
reasoning
Karlsruhe. West
West Germany,
Germany. August,
August. 1983.
1983.
Karlsruhe,

Feigenbaum
A. Feigenbaum
E. A.

216
216

5P41
5P41 RR00785-11
RR00785-1 I

Appendix
Appendix BB

Johnson,
Johnson. P.E.,
P.E., "The
“The Expert
Expert Mind:
Mind: A
A New
New Challenge
Challenge for
for the
the Information
Information
scientist."
scientist,” in
in Beyond
Beyond Productivity:
Productivity: Information
In formation System
System Development
Development
for
for Organizational
Organizational Effectiveness,
Effectivenese, Th.
Th. M.
M. A.
A. Bemelmans
Bemelmsns (editor),
(editor),
Elsevier
Elsevier Science
Science Publishers
Publishers B. V. (North-Holland),
(North-Holland), 1984.
1984.

217
217

E. A.
A. Feigenbaum
Feigenbaum
E.

Appendix
Appendix B

5P41
5P41 RR00785-11
RR00785- 11

National
National AIM
AIM Project:
Project:

Computer-Aided
Computer-Aided Diagnosis
Diagnosis of
Malignant
Malignant Lymph
Lymph Node Diseases
Diseases (PATHFINDER)
(PATHFINDER)

Principal
Principal Investigator:
Investigator:

Bharat
Bharat Nathwani,
Nathwani, M.D.
M.D.
Department
Department of Anatomical
Anatomical Pathology
Pathology
City
City of Hope National
National Medical
Medical Center
Center
Duarte,
Duarte, California
California
(213)
(213) 359-8111
359-8111 x 2456
2456 (NATHWANI@SUMEX-AIM)
(NATHWANIQSUMEX-AIM)
Lawrence
Lawrence M. Fagan, M.D.,
M.D., Ph.D.
Ph.D.
Department
Department of Medicine
Medicine
Stanford
Stanford University
University Medical
Medical Center
Center - Room
Room TC135
TC135
Stanford,
Stanford, California
California 94305
(415) 497-6979 (F
AGAN@SUMEX-AIM)
(FAGANQSUMEX-AIM)

\Ve
to assist in the
We are building
building a computer
computer program,
program, called
called PATHFINDER,
PATHFINDER,
diagnosis
dia,gnosis of lymph
lymph node pathology.
pathology. The
The project
project is based at the City
City of Hope National
National
medical
medical center
center in collaboration
collaboration with
with the Stanford
Stanford University
University Medical
Medical Computer
Computer Science
Group.
Group. A
A pilot
pilot version
version of
of the program
program provides
provides diagnostic
diagnostic advice on 45 common
common benign
benign
and malignant
malignant diseases
diseases of
of the lymph
lymph node based on 77 histologic
histologic features.
features. Our
Our research
plans are to develop a full-scale
full-scale version
version of
of the computer
computer program
program by
by substantially
substantially
increasing
increa.sing the quantity
quantity and quality
quality of
of knowledge
knowledge and to develop techniques
techniques for
for knowledge
knowledge
represent.ation
represent,ation and manipulation
manipulation appropriate
appropriate to this
this application
application area. The
The design of
of the
program
jCADUCEUS program
strongly influenced
influenced by
by the INTERNIST
INTERNIST/CADUCEUS
program
program has been strongly
developed
developed on the SUMEX
SUMEX resource.

SOFT\V
ARE AVAILABLE
S0FTWAR.E
AVAILABLE ON
ON SUMEX
SUMEX
PATHFINDER-A
program is available
for
PATHFINDER-A version
version of
of the
the PATHFINDER
PATHFINDER
program
available
for
experimentation
on
the
DEC
2060
computer.
This
version
is
a
pilot
experimentation
DEC
computer.
This version
pilot
version
program, and therefore
version of
of the program,
therefore has not
not been completely
completely tested.
tested.

A. Feigenbaum
Feigenbaum
E. A.

218
218

Appendix
Appendix B

5P41
5P41 RROOi85-U
RR00785-11

Stanford
Stanford Project:
Project:

EXPEX
EXPEX -- EXPERT
EXPERT EXPLANATION
EXPLANATION

Principal
Principal Investigator:
Investigator:

Edward
Edward H. Shortliffe,
Shortliffe, M.D.,
M.D., Ph.D.
Ph.D.
Departments
Departments of Medicine
Medicine and Computer
Computer Science
Science
Stanford
University
Medical
Center
Stanford University Medical Center - Room
Room TC135
TC135
Stanford,
Stanford, California
California 94305
(415) 49i-6979
497-6979 (SHORTLIFFE@SUMEX-AIM)
(SHORTLIFFE@SUMEX-AIM)

EXPEX
EXPEX isis a recent
recent Stanford
Stanford research project
project that
that is involved
involved with
with the development
development
of new representation
This
representation schemes
schemes to facilitate
facilitate knowledge
knowledge acquisition
acquisition and explanation.
explanation. This
includes
includes not
not only
only the study
study of fundamental
fundamental representational
representational formalisms
formalisms but
but also the
encoding
encoding of various
various types
types of knowledge,
knowledge, such as
as causal information
information and user models. The
The
research effort
effort deals with
with medical
medical domains
domains and is being undertaken
undertaken on SUMEX
SUMEX or
or on
professional
professional workstations
workstations linked
linked to the central
central resource.
Our
Our interest
interest in explanation
explanation derives
derives from
from the insights
insights we gained
gained in developing
developing
explanatory
that system
system and its descendants, we
expia,natory capabilities
ca.pabilities for
for the MYCIN
MYCIN system.
system. In that
were able to generate
generate intelligible
intelligible explanations
explanations by
by taking
taking advantage
advantage of
of a rule-based
rule-based
representation
The limitations
limitations of
of the justifications
justifications generated
generated using MYCIN's
MYCIN’s
representation scheme. The
explanation
explanat.ion capabilities
capabilities have become increasingly
increasingly obvious,
obvious, however,
however, and have led to
improved
improved characterization
characterization of
of the kinds
kinds of
of explanation
explanation capabilities
capabilities that
that must
must be developed
developed
if
EXPEX is devoted
devoted to
if clinical
clinical consultation
consultation systems are to be accepted
accepted by
by physicians.
physicians. EXPEX
the development
practical and theoretical
problem.
development of
of new practical
theoretical insights
insights into
into this
this problem.

REFERENCES
REFERENCES
Teach,
An analysis
physician att-itudes
Teach, R.L.
R.L. and Shortliffe,
Shortliffe, E.H.:
E.H.: An
analysis of
of physician
att,ltudes
systems. Comput.
regarding
regarding computer-based
computer-based clinical
clinical consultation
consultati.on systems.
Comput. Biomed.
Biomed.
Res. 14:542-558, H18l.
1981.

Explanatory power
power for
for medical
Wallis,
Wallis, J.\V.
J.W. and Shortliffe,
Shortliffe, E.H.:
E.H.: Explanatory
medical expert
expert
systems:
Studies
in
the
representation
of
causal
relationships
for clinical
clinical
systems: Studies in the representation of causal relationships
for
corlsultations. Meth.
Meth. Info.
Info. Med.
Med. 21:127-136, 1982.
consultations.
Langlotz, C.P.
C.P. and Shortliffe,
Shortliffe, E.H.:
E.H.: Adapting
Adapting a consultation
consultation
La.nglotz,
system to critique
critique user
user plans.
plans. International
International
system
Journal of
of Man-Machine
Man-Machine Studies,
Studies, 19:479-496 (1983)
Journal
Tsuji, S. and Shortliffe,
Shortliffe, E.H.:
E.H.: Graphical
Graphical access to the knowledge
knowledge base
Tsuji,
of a medical
medical consultation
consultation system.
system. Proceedings
Proceedings of
of AAMSI
AAMSI Congress
of
83, pp.
pp. 551-555, San Francisco,
Francisco, Ca.,
Ca., May
May 1983.
W., Clancey,
Clancey, W.
W. J., and
and Rennels, G.:
G.:
Hasling, D. W.,
Hasling,
Strateg£c
explanations
for
a
diagnostic
consultation
system.
Strategic explanations for
diagnostic consultation system.
International Journal
Journal of
of Man-Machine
Man-Machine Studies,
Studies, Spring
Spring 1984 (in
(in press).
International

219

E. A.
A. Feigenbaum
Feigenbaum

5P41 RR00785-11
RR00785- 11

Appendix
Appendix B

Stanford
Stanford Project:
Project:

GUIDON/NEOMYCIN
-GUIDON/NEOMYCIN
KNOWLEDGE
KNOWLEDGE ENGINEERING
ENGINEERING
FOR
FOR TEACHING
TEACHING MEDICAL
MEDICAL DIAGNOSIS
DIAGNOSIS

Prinripal
Principal Investigators:
Invest,igators:

\Villiam
William J. Clancey,
Clancey, Ph.D.
Ph.D.
701 \Velch
Welch Road
Road
Department
Department of
of Computer
Computer Science
Stanford
Stanford University
University
Palo
Palo Alto,
Alto, California
California 94304
(415) 497-1997 (CLANCEY@SUMEX-AIM)
(CLANCEY@SUMEX-AIM)
Bruce
Bruce G. Buchanan,
Buchanan, Ph.D.
Ph.D.
Computer
Computer Science Department
Department
Margaret
Margaret Jacks Hall
Hall
Stanford
Stanford University
University
Stanford,
Stanford, California
California 94305
(415) 497-0935 (BUCHANAN@SUMEX-AIM)
(BUCHANAN@SUMEX-AIM)

SOFTWARE
SOFTWARE AVAILABLE
AVAILABLE ON
ON SUMEX
SUMEX
GUIDON--A
GUIDON--A system
system developed
developed for
for intelligent
intelligent computer-aided
computer-aided instruction.
instruction. Although
Although
it
it was developed
developed in the context
context of
of MYCIN's
MYCIN’s infectious
infectious disease
disease knowledge
knowledge base,
base, the tutorial
tutorial
rules will
will operate
0perat.e upon
upon any
any EMYCIN
EMYCIN knowledge
knowledge base.
base.
NEOMYCIN--A consulation
NEOMYCIN--A
consulation system
system derived
derived from
from MYCIN,
MYCIN, with
with the knowledge
knowledge base
base
greatly
In
In contrast
contrast with
with MYCIN,
MYCIN,
greatly extended
extended and reconfigured
reconfigured for
for use in teaching.
teaching.
dia,gnostic
dia.gnostic procedures,
procedures, common
common sense
sense facts,
facts, and disease
disease hierarchies
hierarchies are factored
factored out
out of
of the
basic finding/disease
finding/disease associations.
associations. The
The diagnostic
diagnostic procedures
procedures are abstract
abstract (not
(not specific
specific to
to
any
a,ny problem
problem domain)
domain) and model
model human
human reasoning,
reasoning, unlike
unlike the exhaustive,
exhaustive, top-down
top-down
approach
approach implicit
implicit in MYCIN's
MYCIN’s medical
medical rules. This
This knowledge
knowledge base
base will
will be used in the
GUIDON2
GUIDON2 family
family of
of instructional
instructional programs.
programs.

REFERENCES
REFERENCES
Clancey,
Clancey, \V.J.:
W.J.: Overview
Overview of
of GUIDON.
GUIDON. In
In A.
A. Barr
Barr and E.A.
E.A. Feigenbaum
Feigenbaum (Eds.),
(Eds.),
THE
Vol.
THE HANDBOOK
HANDBOOK OF
OF ARTIFICIAL
ARTIFICIAL INTELLIGENCE,
INTELLIGENCE,
Vol. 2. William
William Kaufmann
Kaufmann Assoc.,
Los Altos,
Altos, CA,
CA, 1982. (Also
(Also to
to appear
appear in J. of
of Computer-based
Computer-based Instruction)
Instruction)
Clancey,
Methodology for
for building
system.
Clsncey, W.J.:
W.J.: Methodology
building an intelligent
intelligent tutoring
tutoring system.
In Kintsch,
Kintsch, Polson,
Poison, and Miller,
Miller, (Eds.),
(Eds.), METHODS
METHODS AND
AND TACTICS
TACTICS IN
IN COGNITIVE
COGNITIVE
SCIENCE.
NJ. (In
SCIENCE. L. Erlbaum
Erlbaum Assoc., Hillsdale,
Hillsdale, NJ.
(In press) (Also
(Also STAN-CS-81-894,
STAN-CS-81-894,
HPP
HPP 81-18)
81-18)
Clancey,
Acquiring, representing,
Clancey, W.J.:
W.J.: Acquiring,
representing, and
and evaluating
evaluating a competence
competence
model
of
diagnosis.
In
Chi,
Glaser,
and
Farr
NATURE
model of diagnosis. In Chi, Glaser,
Farr (Eds.),
(Eds.), THE
THE NATURE
OF
EXPERTISE.
In
preparation.
HPP-84-2.
OF E?
Source Exif Data:
File Type                       : PDF
File Type Extension             : pdf
MIME Type                       : application/pdf
PDF Version                     : 1.6
Linearized                      : No
XMP Toolkit                     : Adobe XMP Core 4.2.1-c041 52.342996, 2008/05/07-21:37:19
Format                          : application/pdf
Creator                         : StreetI
Description                     : 
Title                           : 
Create Date                     : 2001:07:17 15:29:44Z
Creator Tool                    : Acrobat Capture Server 2.01
Modify Date                     : 2018:03:20 08:52:17-07:00
Metadata Date                   : 2018:03:20 08:52:17-07:00
Keywords                        : 
Producer                        : Adobe Acrobat 9.0 Paper Capture Plug-in
Document ID                     : uuid:128fb35a-42bb-1040-ae6a-5143bd225111
Instance ID                     : uuid:25363a4d-6f61-1f46-a4f0-d61979dbc07d
Page Layout                     : SinglePage
Page Count                      : 233
Author                          : StreetI
Subject                         : 
EXIF Metadata provided by EXIF.tools

Navigation menu