Proceedings of the

WESTERN JOINT COMPUTER CONFERENCE

May 9-11, 1961

EXTENDING MAN'S INTELLECT

Los Angeles, California

Sponsors:

THE INSTITUTE OF RADIO ENGINEERS
Professional Group on Electronic Computers

THE AMERICAN INSTITUTE OF ELECTRICAL ENGINEERS
Committee on Computing Devices

THE ASSOCIATION FOR COMPUTING MACHINERY

Vol. 19

Price $4.00

PROCEEDINGS OF THE
WESTERN JOINT COMPUTER CONFERENCE

PAPERS PRESENTED AT
THE JOINT IRE-AIEE-ACM COMPUTER CONFERENCE
LOS ANGELES, CALIF., MAY 9-11, 1961

Sponsors
THE INSTITUTE OF RADIO ENGINEERS

Professional Group on Electronic Computers
THE AMERICAN INSTITUTE OF ELECTRICAL ENGINEERS

Committee on Computing Devices
THE ASSOCIATION FOR COMPUTING MACHINERY

Published by
WESTERN JOINT COMPUTER CONFERENCE

ADDITIONAL COPIES
Additional copies may be purchased from the following sponsoring
societies at $4.00 per copy. Checks should be made payable to any
of the following societies:
INSTITUTE OF RADIO ENGINEERS
1 East 79th Street, New York 21, N.Y.
AMERICAN INSTITUTE OF ELECTRICAL ENGINEERS
33 West 39th Street, New York 18, N.Y.
ASSOCIATION FOR COMPUTING MACHINERY
2 East 63rd Street, New York 21, N.Y.

© 1961 by
National Joint Computer Committee

The ideas and opinions expressed herein are solely those of the authors,
and are not necessarily representative of, or endorsed by, the WJCC
Committee or the NJCC Committee.

Manufactured in the U.S.A. by
Griffin-Patterson Co., Inc., Glendale, Calif.

WESTERN JOINT COMPUTER CONFERENCE COMMITTEE
General Chairman

Walter F. Bauer, Ramo-Wooldridge, Canoga Park, California

Vice Chairman

Keith W. Uncapher, The RAND Corp., Santa Monica, California

Conference Administrator

Robert W. Rector, Space Technology Laboratories, Los Angeles, Calif.

Program Chairman

Cornelius Leondes, UCLA, Los Angeles, California

Associate Program Chairman

Paul Armer, The RAND Corp., Santa Monica, California

Associate Program Chairman

J. D. Madden, System Development Corp., Santa Monica, Calif.

Associate Program Chairman

John McLeod, Convair-Astronautics, San Diego, California

Publications

Glenn Morgan, IBM Corp., Los Angeles, California

Hotel Arrangements

William Dobrusky, System Development Corp., Santa Monica, Calif.

Public Relations

Santo Lanzarotta, DATAMATION, Los Angeles, California

Finance

William S. Speer, United Aircraft Corp., Costa Mesa, California

Registration

Marvin Howard, Ramo-Wooldridge, Canoga Park, California

Exhibits

Richard H. Hill, Ramo-Wooldridge, Canoga Park, California

Printing and Mailing

L. C. Hobbs, Aeronutronic Div. of Ford Motor Co., Newport Beach, Calif.

Trips

Joel Herbst, Ampex Computer Products Co., Culver City, California

Women's Activities

Phyllis Huggins, Bendix Corp., Computer Div., Los Angeles, California

Public Relations Consultant

Lynn-Western, Los Angeles, California

Exhibits Manager

John Whitlock, Oakton, Virginia


NATIONAL JOINT COMPUTER COMMITTEE
Chairman
Morris Rubinoff
Moore School of Engineering
University of Pennsylvania
Philadelphia, Pennsylvania

Vice Chairman
J. D. Madden
System Development Corp.
Santa Monica, California

Secretary-Treasurer
Margaret R. Fox
National Bureau of Standards
Department of Commerce
Washington, D.C.
AIEE Representatives
Morris Rubinoff
Moore School of Engineering
University of Pennsylvania
Philadelphia, Pennsylvania

IRE Representatives
Richard O. Endres
Rese Engineering Co.
Philadelphia, Pennsylvania
Frank E. Heart
Lincoln Laboratories
Lexington, Mass.

G. L. Hollander
Hughes Aircraft
Fullerton, California

Charles W. Rosenthal
Bell Telephone Laboratories
Murray Hill, New Jersey

R. R. Johnson
General Electric Co.
Phoenix, Arizona

Willis H. Ware
The RAND Corporation
Santa Monica, California

C. A. R. Kagan
Western Electric Co., Inc.
Princeton, New Jersey

ACM Representatives
Paul Armer
The RAND Corporation
Santa Monica, California

Ex-Officio Representatives
Harry D. Huskey
University of California
Department of Mathematics
Berkeley, California

Walter M. Carlson
E. I. du Pont de Nemours
Wilmington, Delaware

R. A. Imm
IBM Corporation
Rochester, Minn.

J. D. Madden
System Development Corp.
Santa Monica, California

Arnold A. Cohen
Remington Rand UNIVAC
St. Paul, Minnesota

R. W. Hamming
Bell Telephone Laboratories
Murray Hill, New Jersey

Bruce Gilchrist (ACM)
IBM Research Center
Yorktown Heights, New York

Headquarters Representatives
R. S. Gardner (AIEE)
American Institute of Electrical Engineers
New York, New York


L. G. Cumming (IRE)
The Institute of Radio Engineers
New York, New York

FOREWORD
The technical papers included in these Proceedings represent all the technical presentations made at the Western Joint Computer Conference. In addition, there is included a message from the Chairman of the National Joint Computer Committee, Dr. Morris Rubinoff. We are proud of the excellent contributions recorded here, so many of which directly support the 1961 WJCC theme "Extending Man's Intellect." Also, we are happy that we have been able to make these Proceedings available at the time of the Conference, thus enhancing the benefits of the Conference to registrants and making available the information in a timely manner.

It should be recognized, however, that the papers presented herein have not been selected by the usual procedures wherein a paper is refereed as to its appropriateness and edited for its content. Neither the NJCC nor the WJCC Committee can take responsibility for the accuracy of the facts or opinions expressed. We are confident, however, that the overwhelming majority of the papers presented here are responsible in all ways. Many papers were called but few were chosen; we are happy to record them here for the continuing advance and lasting annals of information processing technology.

WALTER F. BAUER
General Chairman
1961 Western Joint Computer Conference


MESSAGE FROM NJCC CHAIRMAN

This is an historic occasion. The close of this 1961 Western Joint Computer Conference will signal the change-over in administration of Joint Computer Conferences from the National Joint Computer Committee to the American Federation of Information Processing Societies (AFIPS), with broader scope and greater flexibility. As you know, AFIPS is a society of societies organized to represent through a single body the professional societies of the American computer and data processing world. The enthusiastic response to the formation of AFIPS is highly gratifying and lends encouragement, confidence and a sense of mission to those whom you have charged with conducting its activities.

There are times when the path to the future is best appreciated through a re-examination of the past. I would like to quote from a letter dated December 15, 1959, written by the late Chairman of NJCC, Professor Harry Goode, who contributed so much both to NJCC and to the birth of AFIPS:

"I believe the major objective in the formation of the society is to provide for information flow in all other instances than those provided for by the individual societies to their members.

"There are four types of such flow:
(1) Information flow between members of information processing societies nationally.
(2) Information flow between our national information processing society and foreign information processing societies.
(3) Information flow between societies in the information processing profession and other professions.
(4) Information flow from the information processing societies to the general and educational public.

"If we can recognize a firm set of objectives such as these (which of course need to be rewritten into a proper set of words), then what the society is to do is relatively clear-cut.

"The functions follow immediately from the objectives:
(1) Act as the American representative body on matters related to computing application and design, in a broad area of computational and information processing sciences.
(2) Advance the field by stimulating research into new aspects of computer sciences emphasizing the cross-pollination of ideas among member societies.
(3) Prepare, publish, and disseminate information of a tutorial nature to laymen, high school teachers and students, government offices and officials, etc.
(4) Maintain relations among American and foreign technical societies through conferences and symposia, cooperation with other societies in organizing sessions at their conferences, provide reference material to other societies on the computational sciences.
(5) Maintain membership in the International Federation of Information Processing Societies (IFIPS).
(6) Aid in certain actions of member societies involving participation and cooperation by more than one society.
(7) Sponsor the JCC's."

The Constitution of AFIPS reflects these views in their entirety. With your frequently demonstrated cooperation and support, the Board of Governors of AFIPS will continue to conduct our successful Joint Computer Conferences and to represent the United States in our International Federation, IFIPS. As new societies join the Federation, it will gradually provide the hoped-for broad representation of the American information processing profession. We will seek to establish AFIPS as the information center on data processing, including not only bibliographies of written material, but also a calendar of events of computer activities in the United States and throughout the world, a roster of individuals active in information processing, and a current file of developments in progress or recently consummated. We plan to establish a speakers' bureau to carry information on the information processing field to educational institutions and professional societies. We plan to establish a public information committee which, through the media of personal contacts, press releases and tutorial articles, will make available to laymen, to government agencies, to affiliated and member societies and to the profession as a whole, the present status and the probable future of information processing in the United States.

I trust that with your continued cooperation and support our efforts will meet with a long string of successes.

Respectfully submitted,

Morris Rubinoff, Chairman
National Joint Computer Committee


TABLE OF CONTENTS

DIGITAL SIMULATION
1.1  "Simulation: A Survey" by H. H. Harman, System Development Corp. .... 1
1.2  "Management Games and Computers" by J. M. Kibbee, Remington Rand UNIVAC .... 11
1.3  "An On-Line Management System Using English Language" by A. Vazsonyi, Ramo-Wooldridge .... 17
1.4  "Application of Digital Simulation Techniques to Highway Design Problems" by A. Glickstein, S. L. Levy, Midwest Research Institute .... 39
1.5  "The Use of Manned Simulation in the Design of an Operational Control System" by M. A. Geisler, W. A. Steger, The RAND Corp. .... 51

MICROSYSTEM ELECTRONICS
2.1  "A Survey of Microsystem Electronics" by Peter B. Meyers, Semiconductor Products Division of Motorola, Inc. .... 63
2.2  "Testing of Micrologic Elements" by R. Anderson, Fairchild Semiconductor Corp. .... 75
2.3  "Interconnection Techniques for Semiconductor Networks" by J. S. Kilby, Texas Instruments, Inc. .... 87
2.4  "Microsystem Computer Techniques" by E. Luedicke, A. Medwin, Radio Corporation of America .... 95

MODELING HUMAN MENTAL PROCESSES
3.1  "Modeling Human Mental Processes" by H. A. Simon, Carnegie Institute of Technology .... 111
3.2  "The Simulation of Verbal Learning Behavior" by E. Feigenbaum, University of California .... 121
3.3  "Simulation of Behavior in the Binary Choice Experiment" by J. Feldman, University of California .... 133
3.4  "Programming a Model of Human Concept Formulation" by C. I. Hovland, E. B. Hunt, Yale University .... 145

RECENT ADVANCES IN COMPUTER CIRCUITS
4.1  "Parallelism in Computer Organization: Random Number Generation in the Fixed Plus Variable Computer System" by M. Aoki, G. Estrin, University of California, T. Tang, National Cash Register Co. .... 157
4.2  "The CELLSCAN System -- a Leucocyte Pattern Analyzer" by Kendall Preston, Jr., Perkin-Elmer Corp. .... 173
4.3  "Application of Computers to Circuit Design for UNIVAC LARC" by G. Kaskey, N. S. Prywes, H. Lukoff, Remington Rand UNIVAC .... 185
4.4  "Wide Temperature Range Coincident Current Core Memories" by R. S. Weisz, N. Rosenberg, Ampex Computer Products Co. .... 207

PROBLEM SOLVING AND LEARNING MACHINES
5.1  "Descriptive Languages and Problem Solving" by Marvin Minsky, Massachusetts Institute of Technology .... 215
5.2  "Baseball: An Automatic Question-Answerer" by Bert F. Green, Jr., Alice K. Wolf, Carol Chomsky, Kenneth Laughery, Massachusetts Institute of Technology .... 219
5.3  "A Basis for a Mathematical Theory of Computation" by John McCarthy, Massachusetts Institute of Technology .... 225

INFORMATION RETRIEVAL
6.1  "Information Retrieval: State of the Art" by Don R. Swanson, Ramo-Wooldridge .... 239
6.2  "Technical Information Flow Pattern" by M. M. Kessler, Massachusetts Institute of Technology .... 247
6.3  "A Screening Method for Large Information Retrieval Systems" by Robert T. Moore, Princeton University .... 259

AUTOMATA THEORY AND NEURAL MODELS
7.1  "What Is an Intelligent Machine?" by W. Ross Ashby, University of Illinois .... 275
7.2  "Analysis of Perceptrons" by H. D. Block, Cornell University .... 281
7.3  "Physiology of Automata" by Murray L. Babcock, University of Illinois .... 291

NEW HYBRID ANALOG-DIGITAL TECHNIQUES
8.1  "Combined Analog-Digital Computing Elements" by Hermann Schmid, Link Division, General Precision, Inc. .... 299
8.2  "Optimization of Analog Computer Linear System Dynamic Characteristics" by C. H. Single, E. M. Billinghurst, Beckman Instruments, Inc. .... 315
8.3  "Design and Development of a Sampled-Data Simulator" by J. E. Reich, J. J. Perez, Space Technology Laboratories, Inc. .... 341
8.4  "Digital Clock Delay Generators and Run Counter for a Repetitive Analog Computer" by T. Brubaker, H. Eckes, University of Arizona .... 353

LARGE COMPUTER SYSTEMS
9.1  "Trends in Design of Large Computer Systems" by C. W. Adams, Charles W. Adams Associates .... 361

AUTOMATIC PROGRAMMING
10.1  "Current Problems in Automatic Programming" by Ascher Opler, Computer Usage Company .... 365
10.2  "A First Version of UNCOL" by T. B. Steel, Jr., System Development Corporation .... 371
10.3  "A Method of Combining ALGOL and COBOL" by J. E. Sammet, Sylvania Electric Products .... 379
10.4  "ALGY -- An Algebraic Manipulation Program" by M. D. Bernick, E. D. Callender, J. R. Sanford, Philco Corporation .... 389
10.5  "A New Approach to the Functional Design of a Digital Computer" by R. S. Barton, Computer Consultant .... 393
10.6  "The JOVIAL Checker" by M. Wilkerson, System Development Corporation .... 397

MEMORY DEVICES AND COMPONENTS
11.1  "Factors Affecting Choice of Memory Elements" by Claude F. King, Logicon, Inc. .... 405
11.2  "A Nondestructive Readout Film Memory" by R. J. Petschauer, R. D. Turnquist, Remington Rand UNIVAC .... 411
11.3  "Tunnel Diode Storage Using Current Sensing" by E. R. Beck, D. A. Savitt, A. E. Whiteside, Bendix Corp. .... 427
11.4  "The Development of a New Nondestructive Memory Element" by A. W. Vinal, Federal Systems-IBM .... 443
11.5  "High-Speed Optical Computers and Quantum Transition Memory Devices" by L. C. Clapp, Sylvania Electronic Systems .... 475

APPLIED ANALOG TECHNIQUES
12.1  "Optimization of a Radar and Its Environment by GEESE, General Electric Electronic System Evaluator Techniques" by L. Berger, R. M. Taylor, General Electric Defense Systems Department .... 490
12.2  "The Spectral Evaluation of Iterative Differential Analyzer Integration Techniques" by M. Gilliland, Beckman Instruments .... 507
12.3  "An Iteration Procedure for Parametric Model Building and Boundary Value Problems" by Walter Brunner, Electronic Associates, Princeton University .... 519
12.4  "Analog Simulation of Underground Water Flow in the Los Angeles Coastal Plain" by D. A. Darms, EAI Computation Center, H. N. Tyson, IBM Corp. .... 535

PATTERN RECOGNITION
13.1  "A Self-Organizing Recognition System" by R. J. Singer, Aeronutronic Div., Ford Motor Co. .... 545
13.2  "A Pattern Recognition Program that Generates, Evaluates and Adjusts its Own Operators" by L. Uhr, University of Michigan, C. Vossler, System Development Corp. .... 555
13.3  "An Experimental Program for the Selection of Disjunctive 'Hypotheses'" by M. Kochen, IBM Corp. .... 571
13.4  "Time-Analysis of Logical Processes in Man" by U. Neisser, Brandeis University .... 579

COMPUTERS IN CONTROL
14.1  "Computer-Based Management Control" by A. J. Rowe, Hughes Aircraft Co. .... 587
14.2  "American Airlines 'SABRE' Electronic Reservations System" by M. N. Perry, W. R. Plugge, American Airlines .... 593
14.3  "Real-Time Management Control at Hughes Aircraft" by D. R. Pardee, Hughes Aircraft Co. .... 603
14.4  "The 465L (SACCS) Computer Application" by P. D. Hildebrandt, System Development Corp. .... 609

THE "HUMAN" SIDE OF ANALOG SYSTEMS
15.1  "The Computer Simulation of a Colonial, Socio-Economic Society" by W. D. Howard, General Motors Corp. .... 613
15.2  "X-15 Analog Flight Simulation -- Systems Development and Pilot Training" by Norman Cooper, North American Aviation, Inc. .... 623
15.3  "Analog-Digital Hybrid Computers in Simulation with Humans and Hardware" by O. F. Thomas, Simulation and Computer Center, U.S. Naval Ordnance Test Station .... 639
15.4  "The Automatic Determination of Human and Other System Parameters" by T. F. Potts, G. N. Ornstein, A. B. Clymer, North American Aviation, Inc. .... 645

LIST OF EXHIBITORS .... 661


1.1

SIMULATION: A SURVEY
Harry H. Harman
System Development Corporation
Santa Monica, California

Introduction

Simulation may be traced back to the beginning of time -- be it the make-believe world of the child at play, or the adult make-believe world of the stage. The impetus for modern scientific simulation came with the development of analog computers in the 1930's, and progressed even further when the electronic digital computers were created.

The very definition of an analog computer contains the notion of simulation, viz., a device which simulates some mathematical process and in which the results of this process can be observed as physical quantities, such as voltages, currents, or shaft positions. While there is no doubt that the analog computer represents one aspect of simulation, the truly new simulation advances came with the digital computer. In the past two decades, since the development of Mark I by Howard Aiken and since Eckert and Mauchly designed the ENIAC, tremendous strides have been made in science and technology ascribable directly to the flourishing new computing discipline.

The revolutionary impact of the electronic computer on our society may well be equal to that of atomic energy -- and may actually surpass it in the long run. A direct consequence of the computer is the burgeoning activity which collectively goes under the name "simulation". The growing awareness and popularity of this field of activity is evidenced by a recent article in Business Week in which a parallel is drawn between the group of simulation experts and the group of painters known as the Futurists. Just as the art works might bear no direct resemblance to the subjects for which they were named, so the mathematical formulas, flow diagrams, and computer outputs bear no direct resemblance to the physical world which they simulate. Moreover, this symbolic art "represents a massive assault on tradition -- in this case, the traditional art of managing large organizations."16 This assault -- involving scientific systems analysis and simulation techniques -- first occurred on military systems problems, but more recently has found its way into business and industrial systems problems as well.

Definitions

To appraise the current work in simulation and to apprise you of the general status of this subject is the raison d'etre of this session. In my review of the work in this area, I came across John Harling's paper, "Simulation Techniques in Operations Research -- A Review".5 From the title it would appear that my work had been done for me. His opening remarks draw attention to the fact that "simulation" is a somewhat ill-defined subject and that considerable confusion exists in the terminology employed, and he goes on to say: "The term 'Monte Carlo' is presently somewhat fashionable; the term 'simulation' is to be preferred, because it does not suggest that the technique is limited to what is familiar to statisticians as a sampling experiment." (p.307) He equates "simulation" with "Monte Carlo methods" and thereby implies a much more restrictive usage of simulation than is intended in the present Survey.

The term "simulation" has recently become very popular, and probably somewhat overworked. There are many and sundry definitions of simulation, and a review and study of some of these should help us gain a better perspective of the broad spectrum of simulation. Webster only provides the fundamental notion that simulation is an act of "assuming the appearance of, without the reality". Thomas and Deemer20 suggest the following paraphrase of Webster: "to simulate is to attain the essence of, without the reality." Note that the substitution of "essence" for "appearance" makes the vital difference between the scientific and the casual use of simulation. Not only is it not necessary that the simulator "appear" as its real-life counterpart, but frequently attempts to imitate reality closely may be detrimental to the purposes of the simulation. For example, to expedite the training of pilots a relatively accurate duplication of the cockpit is necessary for the trainer, but to duplicate the bulky whole of the airplane would defeat the purpose of the simulator. Thomas and Deemer advise that "we should deplore the tendency to introduce trappings and ornaments in simulation to gain the 'appearance' of reality when it is the 'essence' which we need." (p.5)

In a technical dictionary7 the term "simulator" is defined as follows:

    A physical system which is analogous to a model under study (as, for instance, an electric network in which the elements are in correspondence with those of an economic model). The variables of interest in the model appear as physical variables (such as voltages and currents) and may be studied by an examination of the physical variables in the simulator. (p.267)

This definition covers what we normally would consider simulation when accomplished by analog or digital computers. Nonetheless, it is not the universally accepted definition, alternatives being proposed by practically each separate field of application.

Thus, in the area of Operations Research, Harling5 states: "By simulation is meant the technique of setting up a stochastic model of a real situation and then performing sampling experiments upon the model. The feature which distinguishes simulation from a mere sampling experiment in the classical sense is that of the stochastic model." (p.307) As noted above, this definition of simulation is equivalent to the Monte Carlo technique and is, in fact, almost identical with the definition of the latter provided by A. S. Householder:6

    The Monte Carlo method may briefly be described as the device of studying an artificial stochastic model of a physical or mathematical process.... The novelty [of the Monte Carlo method] lies rather in the suggestion that where an equation arising in a nonprobabilistic context demands a numerical solution not easily obtainable by standard numerical methods, there may exist a stochastic process with distributions or parameters which satisfy the equation, and it may actually be more efficient to construct such a process and compute the statistics than to attempt to use those standard methods (p.v).

While this represents a very powerful and useful technique in simulation, Monte Carlo does not encompass all the legitimate scientific aspects of simulation.
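Householder's description can be made concrete with a minimal sketch (an editorial illustration in present-day Python, not part of the original paper): a purely deterministic quantity, the area under a curve, is obtained as the expected value of a stochastic process by sampling the integrand at random points and averaging.

```python
import random
import math

def monte_carlo_integral(f, a, b, n_samples, seed=1):
    """Estimate the integral of f over [a, b] by averaging f at uniformly
    distributed random points -- a stochastic process whose expected value
    satisfies the original, nonprobabilistic equation."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_samples):
        total += f(rng.uniform(a, b))
    return (b - a) * total / n_samples

if __name__ == "__main__":
    # The target quantity is deterministic: the area under exp(-x**2) on [0, 1].
    estimate = monte_carlo_integral(lambda x: math.exp(-x * x), 0.0, 1.0, 100_000)
    print(f"Monte Carlo estimate: {estimate:.4f}")   # approaches about 0.7468
```

The estimate converges on the true value as the number of samples grows, which is precisely the sense in which a stochastic process "satisfies" a nonprobabilistic equation.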
In their book, System Engineering,4 Goode and Machol give a half-dozen or more examples of simulation in which the Monte Carlo method is used in queueing problems. They do not, however, take the foregoing definition. Instead, they define simulation to be "the study of a system by the cut-and-try examination of its mathematical representation by means of a large-scale computer". (p.403) While some people (not in this audience) might object to the qualifier that a "large-scale computer" be the means of the study, they would certainly grant its modus operandi. This is an operational definition, and as such it proposes more or less exact procedures to be followed in executing a program of simulation. Specifically, Goode and Machol propose a series of steps (pp. 404-7) including the choice of computer (analog or digital, in particular); construction of the computational flow diagram (it being assumed that the mathematical model of the system has been formulated); determination of preliminary (analytical) solutions; choice of cases to be treated, with a view toward reducing the number of runs; data reduction and analysis (some to be done run by run); and consideration of the simulation of human beings (by some simple analytical function or by actual inclusion in the simulation).
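As an illustration of this cut-and-try procedure (a hypothetical sketch in present-day Python, not Goode and Machol's own example), the following program examines a mathematical representation of a single-server queue, runs a small, pre-selected set of cases to hold down the number of runs, and reduces each run to a single figure of merit.

```python
import random

def mean_wait(service_time, n_customers=10_000, mean_interarrival=1.0, seed=7):
    """One 'run': Lindley's recursion W(n+1) = max(0, W(n) + S - A(n)) for a
    single-server queue with exponential interarrival times and a fixed
    service time.  The run is reduced to a single datum, the average wait."""
    rng = random.Random(seed)
    wait, total = 0.0, 0.0
    for _ in range(n_customers):
        total += wait
        gap = rng.expovariate(1.0 / mean_interarrival)   # time to next arrival
        wait = max(0.0, wait + service_time - gap)
    return total / n_customers

if __name__ == "__main__":
    # Cut-and-try: a few candidate cases chosen in advance, each examined
    # through its reduced figure of merit.
    for service_time in (0.5, 0.7, 0.8, 0.9):
        print(f"service time {service_time:.1f}: mean wait {mean_wait(service_time):.2f}")
```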
A type of working definition is proposed in the field of Management Science. Here, simulation is conceived as "the science of employing computational models as description for the purposes of (1) learning, (2) experimenting, (3) predicting in management problems."19 A similar definition, which more specifically delimits the area of consideration, is the following:1

    The systematic abstraction and partial duplication of a phenomenon for the purposes of effecting 1) the transfer of training from a synthetic environment to a real environment; 2) the analysis of a specific phenomenon; or 3) the design of a specific system in terms of certain conditions, behaviors, and mechanisms. (p.6)

The behavioral scientist, accustomed to laboratory experimentation, puts it even more directly:8 "By simulation, we mean a technique of substituting a synthetic environment for a real one -- so that it is possible to work under laboratory conditions of control."

The foregoing definitions range in emphasis from a sampling plan (which distorts distributions in order to obtain relatively efficient estimates of the parameters) and the mere use of a large-scale computer, to a simple delineation of the area of inquiry. What they have in common is an attempt to substitute other elements for some or all of the real elements of a system. Perhaps the simplest and most direct definition of simulation is merely the act of representing some aspects of the real world by numbers or other symbols that can be easily manipulated in order to facilitate its study. In this sense, simulation is one of the oldest analytical tools.
Classifications

However simulation is defined, there remains the problem of selecting the appropriate elements of a system to be simulated. Which aspects are represented, and how they are represented, constitute the distinguishing characteristics of the different types of simulation. Hopefully, these considerations should also provide for the meaningful classification of simulation types.

After an exhaustive search of the literature, and several months' cogitation, the writer was reluctantly forced to conclude that there is no completely adequate taxonomy of simulation types. Perhaps some day a reasonable basis will evolve for classifying simulation types into major and subordinate categories, and the practitioner will be assisted thereby; but at the present time, we can do very little in that direction.

About the best that has been proposed (see, for example, I. J. Good3) is a single continuum on which the model is classified according to its degree of abstraction from the real-life system, operation, or procedure. Thus, the focus is on the simulation model and its relationship to its real-life counterpart. This conceptual basis for ordering simulation types follows:
(1) In the most extreme instance (ultimate or trivial, depending on your point of view), the real system can be used as the "model" to gain knowledge about itself. However direct and simple it might sound, it is usually neither practical nor feasible to determine the inherent properties of a system by observing its operations. Limited time and resources often force the use of other, less expensive methods than the "identity simulation".

(2) Only one step removed from the real-life instance is the attempt to replicate it with the highest degree of fidelity, by means of an operational model of the system in its normal environment. A SAC mission flown to test the air defenses of the United States is an example of an essential replication of a war situation. Enemy bombers are replaced by SAC bombers; ADC fires no weapons. Such "replication simulation" really involves very little abstraction from reality, and also provides very little gain, except to make possible the limited study of selected dangerous or future situations. A subcategory of this classification might involve essential replication of operational gear while employing abstracted inputs. A case in point is the Air Defense Command's System Training Program (discussed below).

(3) Next along our continuum, the replication might be attempted in the laboratory instead of in the field. Here it is necessary to choose the relevant features of the real system for representation in the laboratory, and also to decide on the means of such representation. A system may be made up of such diverse elements as people, hardware, operating procedures, mathematical functions, and probability distributions. A laboratory model might consist of the actual replication of some elements and the abstraction and substitution by symbolic representation of others. It should be noted that every kind of substitution is possible: people are often simulated by hardware, but the reverse is also done. A wide range of simulation types is encompassed by "laboratory simulation", and perhaps is best exemplified by operational gaming.

(4) More clear-cut abstraction from reality is involved in the complete "computer simulation" of a real system. In some circles this is the only admissible type of simulation. There is no room for human beings or real hardware components in this model of the system. All aspects of the system must be reduced to logical decision rules and operations which can be programmed. If the model of the system consists only of mathematical functions, the simulation is said to be deterministic; if it also includes probability distributions then it is stochastic (a small illustrative sketch of this distinction follows the classification below). This type of simulation is quite common in operations research, with a popular example being a "computer simulation" of a (hypothetical) business firm.

(5) The highest degree of abstraction leads to the complete "analytical simulation", wherein the real system is represented completely by means of a mathematical model and a solution (at least theoretically) can be obtained by analytical means. Essentially, the problem here is that of solving a set of equations. Even if a closed form is not available, approximate methods (including Monte Carlo) can be employed to get a solution. The least and the highest degrees of abstraction -- "identity simulation" and complete "analytical simulation" -- may not be of much experimental value, but they do provide useful conceptual bounds for the simulation continuum.
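The deterministic-stochastic distinction of category (4) can be illustrated with a toy model of a hypothetical business firm (an editorial sketch; the decision rules and all numbers are invented for illustration). The same decision rules are run once as a purely functional model and again with a probability distribution added to the demand.

```python
import random

def simulate_firm(n_months=12, stochastic=False, seed=3):
    """Toy model of a hypothetical firm in which all behavior is reduced to
    programmable decision rules.  With stochastic=False the model contains
    only mathematical functions (deterministic); with stochastic=True a
    probability distribution is added to the demand."""
    rng = random.Random(seed)
    cash, inventory = 100.0, 50
    for month in range(n_months):
        demand = 40 + 5 * (month % 3)            # deterministic seasonal rule
        if stochastic:
            demand += rng.randint(-10, 10)       # added probability distribution
        sales = min(demand, inventory)
        cash += 2.0 * sales                      # revenue at a price of 2 per unit
        inventory -= sales
        inventory += 45                          # fixed production decision rule
        cash -= 45.0                             # production cost of 1 per unit
    return round(cash, 2)

if __name__ == "__main__":
    print("deterministic run:", simulate_firm(stochastic=False))
    print("stochastic run 1 :", simulate_firm(stochastic=True, seed=1))
    print("stochastic run 2 :", simulate_firm(stochastic=True, seed=2))
```

Repeated stochastic runs with different random numbers yield a distribution of outcomes rather than a single answer, which is what makes the sampling-experiment view of simulation appropriate for models of this kind.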

Need for further classification.-- While the foregoing considerations provide a fundamental (philosophical) continuum on which simulation types might be ordered, it is not sufficiently discriminating. The bulk of the simulation studies reported in the literature would fall into one or two categories only. Further, more detailed distinctions could lead to generalized principles and thus to the full development of a discipline of simulation. The additional dimensions of simulation cannot be adequately determined at the present rudimentary stage of development of this field.
Dichotomous classifications.-- What is frequently done as an alternative is to break the total field of simulation into two classes. Commonly encountered examples of such dichotomy, or polarity, are deterministic-stochastic; deductive-inductive; analytical-physical; computerized-manual; or one of the many variants of these. An important consideration is the absence or presence of at least one human being in the simulated model. While this seems to offer a real distinguishing characteristic, it does not help nearly as much as anticipated. There can still be stochastic models which are simulated entirely in a computer, or by means of a computer and people. For this reason, the writer discarded an earlier plan in which the primary dichotomy was into "automaton-simulation" and "bio-simulation". Differences in simulations that are fully computerized and those that involve human beings may be useful, but should probably be subordinated to more fundamental classification concepts.


Even this crude classification scheme may provide a useful guide in planning a simulation experiment. As a general rule, increasing experimental control can be attained by moving in the direction of a complete mathematical model, but unfortunately this usually is associated with decreasing realism. The more that is known about the properties of an element of a system, the better it can be simulated. Imperfectly understood system elements probably should be used "as is" in the model rather than approximated in a probabilistic manner or by decision rules. Adequate simulation of a system in the laboratory requires a detailed systems analysis with particular attention paid to the functional structure of the various tasks and the operations to be performed by the human beings in the system. Since the human actions are certainly of a stochastic nature, realistic simulation of a man-machine system can best be accomplished by having the human elements in the model.
Classification by objective.-- An alternative breakdown of simulation activities can be made according to the purpose or objective of the simulation. The principal categories usually employed are evaluation, training, and demonstration. With the emergence of very large military command and control systems, the old trial-and-error method had to give way to simulation as the primary technique for the design and development of such systems, as well as for the evaluation of alternative solutions to system problems. Again, in the implementation and operation of such systems, simulation has been found to be a very effective device for training. Not only have simulators been employed for individual flight instruction in place of expensive and dangerous procedures, but similar efficiencies have been realized in training groups in total system operations through simulation. This is one of the chief objectives of management games as well as the specific training programs of military systems. In the demonstration role, simulation serves as a means of indoctrination -- to exhibit the feasibility of a complex system.
Simulation as a research tool
While this very brief account of the uses of simulation for evaluation, demonstration, and training immediately points up its value, some more definite indication of the advantages of simulation as a research tool in the study of complex systems seems to be in order. First of all, the real system in the field is not as amenable to control as a simulation of it. At the same time there is no interruption of the on-going activities in order to conduct the research. Also, productive research requires the taking of quantitative measurements, which again can better be accomplished in a simulation study than by observation of the actual system.

These primary advantages are really the advantages of the laboratory over the field, regardless of whether it is a chemistry laboratory or a digital-computer laboratory. Simulation as a research technique has more specific advantages:

(1) It can compress or expand real time. A business operation of a year can be simulated in minutes in order to study long term trends or to study the operations under varying alternatives. On the other hand, the process can be slowed down to permit the more detailed study of critical situations. (A brief sketch of such time compression and replication appears after this list.)

(2) It provides the ability to experiment, test, and evaluate new systems or proposed changes to existing systems in advance of having to make firm commitments. Aside from great economy of time, simulation of this type makes it possible to consider hypothetical systems which may be dangerous or impossible to try any other way. An interesting example involves the procedure the Cornell Aeronautical Laboratory employed in designing and constructing the Mark I perceptron for the automatic identification of simple patterns. They first demonstrated by simulation on a computer (IBM 704) that such an experimental machine could be built.

(3) It makes for more economical experimentation, both in time and money. A complete "computer simulation" of a system usually can be run in very short time once the program has been developed. However, the cost of creating a large-scale computer simulation program can be prohibitive. Usually it is justified because of continued experimentation with the model, but on occasion the payoff may be so great as to justify even a single trial.

(4) It permits the replication of experiments under different conditions. An important example is the replication of economic time-series, which just could not be accomplished without simulation.
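As a sketch of advantages (1) and (4) above (an editorial illustration with invented figures, not drawn from the paper), the following program compresses a year of machine failures and repairs in a hypothetical plant into a fraction of a second of computer time, and replicates the same year under different conditions, here the number of repair crews.

```python
import heapq
import random

def year_of_operations(n_machines=20, n_crews=2, seed=11):
    """Event-by-event simulation of one year (in simulated hours) of machine
    failures and repairs for a hypothetical plant.  Returns total downtime."""
    rng = random.Random(seed)
    HOURS = 365 * 24
    # Each machine starts with a randomly scheduled failure; repairs take a
    # random time and occupy one crew.
    events = [(rng.expovariate(1 / 200.0), "fail", m) for m in range(n_machines)]
    heapq.heapify(events)
    free_crews, waiting, downtime, down_since = n_crews, [], 0.0, {}
    while events and events[0][0] < HOURS:
        time, kind, m = heapq.heappop(events)
        if kind == "fail":
            down_since[m] = time
            waiting.append(m)
        else:                                   # a repair has finished
            free_crews += 1
            downtime += time - down_since.pop(m)
            heapq.heappush(events, (time + rng.expovariate(1 / 200.0), "fail", m))
        while free_crews and waiting:           # assign idle crews to broken machines
            free_crews -= 1
            broken = waiting.pop(0)
            heapq.heappush(events, (time + rng.expovariate(1 / 8.0), "repair", broken))
    return downtime

if __name__ == "__main__":
    # Replication of the same simulated year under different conditions.
    for crews in (1, 2, 3):
        print(f"{crews} crew(s): {year_of_operations(n_crews=crews):8.0f} machine-hours down")
```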
Review of simulation activities
Extent of literature.-- The acceptance of simulation evidently has been widespread -- as witness the increasing number of simulation studies in the last decade. Prior to 1951 there was nothing in the scientific literature on this subject. The most recently published bibliography15 contains 344 entries (including 6 other bibliographies) and, except for one reference ("A Simplified War Game," 1897), the earliest article is dated 1951. Two other bibliographies merit special mention. Malcolm11 presents what he terms "a fair sampling of simulation literature to date". Concerned primarily with the application of simulation to management problems, he subdivides the 165 titles into industrial and military applications and separates simulation games from the rest. The other,12 while not specifically addressed to simulation, presents 477 references to the closely allied subject of systems research. One of the interesting aspects of the latter bibliography is that it also contains a topical outline of the field and each reference is assigned to one or more of the classification categories. The extent of the literature on simulation has grown to such immense proportions, in so short a time, that the truly scholarly exploration of this field looms as a formidable effort for all but the most serious student.

No attempt will be made here to review the content of different simulation studies. The objective is only to indicate the scope of such studies. One such collection of 17 studies appears in the "Report of Systems Simulation Symposium," published in 1958. These include typical inventory-control, scheduling, cargo handling, and waiting-line problems on the industrial side; related problems on logistics systems peculiar to the military, as well as military "laboratory simulations" incorporating systems of men and equipment; and even some methodological considerations directed at increasing the speed of simulation and statistical problems associated with Monte Carlo sampling.

As regards the technical aspects of simulation, the results of current research activities appear, principally, in the Operations Research journal, specialized statistical journals, and publications of various research institutes. Of special interest is the report of the first Symposium on the Monte Carlo Method6 and two subsequent symposia17,18 on the same subject.
Operational gaming.-- The simulation studies that have attracted the most attention in recent years may be described by the generic term "games" -- intended to cover such activities as war gaming, business management games, and operational gaming in general. In their excellent article, Thomas and Deemer20 first distinguish the basic concepts of simulation, Monte Carlo, and operational gaming; present a brief review of some of the theory of games of strategy; and then compare the approaches of gaming and non-gaming techniques to competitive situations. The role of operational gaming is best expressed in their words:

    Although simulation and Monte Carlo methods are often used in gaming we feel that the essence of operational gaming lies rather in its emphasis on the playing of a game. There is playing to formulate a game, playing to solve a game, and playing to impart present knowledge of a game. Thus we define operational gaming as the serious use of playing as a primary device to formulate a game, to solve a game, or to impart something of the solution of a game. (p.6)

In practical applications, the technique of gaming is aimed principally at providing practice in working through alternative sequences in considerable detail. Within the framework of a particular game certain input parameters can be altered to provide innumerable variations. When human teams participate in such games, they not only gain practice in comprehending the consequences of particular moves and sequences of events, but also gain some insight into the perspective of the participants.

The development and present usage of management games is reviewed by Joel Kibbee in the following paper on this Program. He stresses the importance of computers in this area, and discusses the building of models and programming of management games. It should be remembered that non-computer or manual business games (e.g., as developed by Stanley Vance at the University of Oregon and by John L. Kennedy at Princeton University) have considerable merit as tools for management training and development as well.
Management control.-- Perhaps one of the most powerful tools for management control of large-scale programs is the activity known as PERT (Program Evaluation Review Technique). This system of charting the key milestones into a network for the accomplishment of an objective, dependent on many and diverse factors, was first developed in conjunction with the Polaris program.10 As a result of such management control, the Polaris program became operational two years earlier than originally anticipated. A similar technique developed for the Air Force by Douglas Aircraft Company in conjunction with the Skybolt program is PEP (Program Evaluation Procedure). The PERT/PEP program evaluation techniques now are being extended to almost all Army, Navy, and Air Force weapons systems.9 Among other computer-based methods for monitoring schedules being developed is SCANS (Scheduling and Control by Automated Network Systems) at System Development Corporation. The aspect of these techniques which is especially germane to this Session is the optimization of networks through simulation. By devising a "computer simulation" of the scheduling technique, alternative management decisions can be tried, and from the output an optimal solution can be determined (a small sketch of this idea follows below). Closely related to these types of programs is the Decision Gaming work on which Dr. Vazsonyi reports later in this Session.
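The idea of optimizing a schedule network through simulation can be sketched as follows (an editorial illustration; the activity network, the duration estimates, and the use of present-day Python are assumptions, not details of PERT, PEP, or SCANS). Each trial draws a duration for every activity and takes the longest path through the milestone network; alternative management decisions are compared by their expected completion times.

```python
import random

# A small, hypothetical PERT-style network: each activity has (optimistic,
# most likely, pessimistic) duration estimates in weeks and a list of
# predecessor activities that must finish first.
NETWORK = {
    "design":    ((4, 6, 10), []),
    "tooling":   ((3, 5, 9),  ["design"]),
    "fabricate": ((6, 8, 14), ["design"]),
    "test":      ((2, 3, 6),  ["tooling", "fabricate"]),
    "deliver":   ((1, 1, 2),  ["test"]),
}

def completion_time(network, rng):
    """One simulated schedule: draw a duration for every activity and take
    the longest (critical) path through the milestone network."""
    finish = {}
    for name in network:                       # insertion order lists each
        (a, m, b), preds = network[name]       # activity after its predecessors
        start = max((finish[p] for p in preds), default=0.0)
        finish[name] = start + rng.triangular(a, b, m)
    return max(finish.values())

def expected_completion(network, trials=10_000, seed=5):
    rng = random.Random(seed)
    return sum(completion_time(network, rng) for _ in range(trials)) / trials

if __name__ == "__main__":
    print(f"baseline plan:           {expected_completion(NETWORK):.1f} weeks")
    # Try an alternative management decision: expedite fabrication.
    faster = dict(NETWORK, fabricate=((5, 6, 10), ["design"]))
    print(f"expedited fabrication:   {expected_completion(faster):.1f} weeks")
```

Running the network under each candidate decision and comparing the reduced outputs is the "cut-and-try" optimization the text describes.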
Social behavior.-- Turning to another area, Ellis Scott14 calls attention to dozens of studies on simulation of social processes being carried out in universities and research laboratories from coast to coast. His survey is concerned with research in the behavioral sciences which use computers in the simulation of social behavior. The studies range from experiments in interactions and conformity of small groups, to intergroup relations in the community, to the behavior of an entire society and international relations.

Vehicular traffic.-- Still another area which is receiving more and more attention is that of vehicular traffic control. While the earliest works, by H. H. Goode, G. F. Newell, and others, only date back about six years, the activity has been gaining considerable momentum since then. Research is going on in all parts of the country. The extent of the national interest is evidenced by the conference on transportation research convened by the National Academy of Sciences last fall. About 150 participants from government, industry, universities, and research institutions met to review and formulate a program of research on transportation in the United States. A more recent conference13 was devoted exclusively to the utilization of simulation as a research tool in the areas of highway and vehicle improvement, traffic control and enforcement, and driver and safety education.

An example of a physical model for studying driver performance, car construction, and road design is the "driving simulator" at the UCLA Institute of Transportation and Traffic Engineering. The cab of this simulator consists of a standard station wagon on a treadmill of steel rollers, which faces a 10-ft high semicircular screen, with a small screen on the car's rear window. Movie projectors throw traffic scenes on both screens, and a battery of instruments records changes in steering-wheel movement, acceleration, and braking, and in the driver's breathing rate and emotional stress.

Although the ultimate goal is to consider the total system, including the driver and the traffic, at this stage of development of methodology it seems wise to distinguish "driving simulation" from "traffic simulation". Early work on traffic simulation was restricted to one or two lanes of very short stretches of highway, and required inordinate amounts of computer time. Nonetheless, such work pointed to the feasibility of running simulation studies of traffic flow. A much more extensive model of expressway traffic flow has been developed at the Midwest Research Institute, and is reported by Glickstein and Levy later in this Session.
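The flavor of such one-lane traffic-flow models can be conveyed by a minimal time-stepped sketch (an editorial illustration, not the Midwest Research Institute model; the car-following law and all constants are invented). Each driver adjusts speed toward a value set by the gap to the car ahead, and the average speed falls as the density of cars on the test stretch rises.

```python
def simulate_lane(n_cars=10, road_length=500.0, dt=0.5, steps=600):
    """Time-stepped car-following on a single circular lane.  Returns the
    average speed over the run, a crude measure of traffic flow."""
    spacing = road_length / n_cars
    positions = [i * spacing for i in range(n_cars)]
    speeds = [10.0] * n_cars                 # m/s, arbitrary starting speed
    sensitivity, desired_headway = 0.5, 1.5  # hypothetical driver law
    total_speed = 0.0
    for _ in range(steps):
        new_speeds = []
        for i in range(n_cars):
            ahead = (i + 1) % n_cars
            gap = (positions[ahead] - positions[i]) % road_length
            target = min(25.0, gap / desired_headway)       # speed limit 25 m/s
            new_speeds.append(speeds[i] + sensitivity * (target - speeds[i]) * dt)
        speeds = [max(0.0, v) for v in new_speeds]
        positions = [(p + v * dt) % road_length for p, v in zip(positions, speeds)]
        total_speed += sum(speeds) / n_cars
    return total_speed / steps

if __name__ == "__main__":
    for cars in (5, 10, 20, 40):
        print(f"{cars:2d} cars on 500 m: mean speed {simulate_lane(n_cars=cars):5.1f} m/s")
```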
Simulation in man-machine laboratory research

The foregoing review points to many exciting and challenging activities -- emerging as a result of the development of the digital electronic computer, the use of simulation, and the increased awareness of the "systems approach". Thus, the study of large, complex man-machine systems has become possible.

Just as trial-and-error experimentation has been a respected technique in the development of the classical sciences, so in the study of complex systems the new techniques of simulation may be employed to explore and to define the problem itself. The direction and course of study of a man-machine system should be permitted (at least in the early stages) to be altered and restructured during the simulation and according to insights gained from the simulation itself. This use of simulation as a new kind of research tool is perhaps the outstanding feature of such laboratories as RAND's Logistics Systems Laboratory and SDC's Systems Simulation Research Laboratory discussed below.
NEWS.-- Entire laboratories have been built to exploit simulation for teaching purposes and evaluation of systems. Perhaps the first such facility to be conceived (in 1945), but which was not funded until 1950 and then took eight years to build, is the simulator at the U.S. Naval War College at Newport, Rhode Island. This facility and the exercise conducted in it is called NEWS (Naval Electronic Warfare Simulator). At the heart of the system is a very large analog computer (known as the Damage Computer) which is designed primarily to assess damage and to provide feedback to the several forces playing, to indicate their remaining effectiveness. The exercise is primarily a training device -- used in war gaming, in the final stages of tactical training of naval officers from the fleet.
SRL.-- Another laboratory in which simulation was employed as the principal tool was the Systems Research Laboratory (SRL) of The RAND Corporation. From 1951 to 1954 this laboratory employed simulation to generate stimuli for the study of information processing centers. The essential features of a radar site were created in the laboratory, and by carefully controlling the synthetic inputs to the system and recording the behavior of the group it was possible to study the effectiveness of various man-machine combinations and procedures.
STP.-- The research in SRL eventually gave rise to the Air Defense Command's System Training Program (STP) -- probably the largest-scale simulation effort ever attempted. STP is now in operation throughout the United States, as well as in Alaska, Canada, and Europe. Training exercises are conducted in the normal working environment at the radar sites, direction centers in the SAGE system, Division Headquarters, and higher commands. Fundamental to this vast program is the creation of problem materials by means of an IBM 709 and special off-line and EAM equipment. Through these means synchronized radar pictures for large areas of the country are simulated along with other inputs required by the operating system, e.g., flight plan information, intelligence and weather information, and commands from higher headquarters. Also, various lists and maps are prepared for the trainers to assist them in observing and recording crew actions in order to furnish feedback on system performance to the crew immediately after each exercise. Through simulation of this type it is possible to provide exercise of air defense procedures and regulations, applicable either in peace or in war situations, at a fraction of what it would cost with "replication simulation".
LSL.-- In 1956, the Logistics Systems Laboratory (LSL) was established at RAND under Air Force sponsorship. The first study in this laboratory involved the simulation of two large logistics systems for purposes of comparing their effectiveness under different governing policies and resources. The system consisted of men and machine resources together with policy rules on the use of such resources in simulated stress situations such as war. The simulated environment required a certain amount of aircraft in flying and alert states, while the systems' capability to meet these objectives was limited by malfunctioning parts, procurement and transportation delays, etc. The human participants represented management personnel, while higher echelon policies in the utilization of resources were simulated in the computer. The ultimate criteria of the effectiveness of the systems were the number of aircraft in commission and dollar costs. While the purpose of the first study in LSL was to test the feasibility of introducing new procedures into an existing Air Force logistics system and to compare the modified system with the original one, the second laboratory problem has quite a different objective. Its purpose is to improve the design of the operational control system through the use of simulation. The complete description of this study is presented by Dr. Steger later in this program.
ASDEC.-- A somewhat different type of facility in which simulation is employed to test and evaluate electronic systems is the Applied Systems Development Evaluation Center (ASDEC) of the Naval Electronics Laboratory at San Diego. Recently the Navy Tactical Data System was being evaluated. The operational system was simulated by means of actual hardware components such as the Univac M460 computer and cardboard mockups of display and control equipment. The facility includes an analog-to-digital computer which generates synthetic radar data used in the testing of operational systems.
NBS Study.-- Perhaps the largest single step in the exploitation of simulation for research purposes was the recent Feasibility Study conducted by the National Bureau of Standards. The broad objectives of this study are best indicated in its opening paragraph:

    This report presents the results of a study of the feasibility, design, and cost of a large-scale tool to be used in a research program on man-machine systems. This tool facilitates the simulation of complex weapon systems for purposes of laboratory experimentation with human subjects in the system feedback loops. It is intended to aid in the optimization of system performance through studies of man-machine dynamics. It incorporates capabilities which represent a substantial advance over those of existing facilities for research on man-machine systems.

Feasibility was demonstrated through the actual design, implementation and operation of a scale model of the desired facility. The work done at the National Bureau of Standards provided the fundamental guidelines and philosophy for the more ambitious laboratory facility being built by the System Development Corporation in Santa Monica.
SSRL.-- Recognizing the importance of recent
work in simulation, as well as recognizing the
need for continued and expanded support for the
further development of this area, with particular
emphasis on its use in the study of complex man-machine systems, SDC decided to create a general-purpose, computer-based facility in which such
research could be conducted. Plans for the Systems Simulation Research Laboratory (SSRL) were
initiated about fifteen months ago and are about
to come to fruition.
My report on SSRL is in
greater detail because of my involvement and familiarity with it.
The physical facility, covering about 20,000
square feet, has just been completed. The main
experimental operations space is a room approximately 45 x 50 feet with 20-foot clearance from
floor to ceiling. It is completely surrounded by
an elevated observation area. This large room may
be divided into appropriate smaller areas by means
of movable walls. Adjacent to the large, high-ceiling space are smaller, standard-height experimental areas, which also may be adjusted in size
and shape to accommodate the operations and observation requirements of specific projects.
A basic concept in planning a laboratory of
this kind is the distinction between universal-type and project-specific type equipment. Of the
former type, the most important is the general-purpose digital computer. A Philco 2000 system was
selected and is now installed. Another major
piece of equipment is a transducer that permits
human beings and other real-time elements of a
system to communicate with the computer. Such a
real-time switch and storage unit (RL-101) has
been designed and built at SDC and will be ready
for integration with the computer next month. An
internal telephone system (up to 120 stations), a
public address system, recording facilities for
any audio line, and a closed-circuit
television
system round out the general-purpose equipment of
the laboratory at this time. The specific hardware requirements for the first couple of projects
are now being determined.
Another basic concept is a general-purpose
programming system. Perhaps some day we will have
a general-purpose simulation program which will
greatly facilitate the execution of research projects. For the present, however, we refer to the
basic utility program system for the Philco 2000
operating with the RL-101. At SDC we are using a
problem-oriented language, known as JOVIAL, which
is patterned after Algol (the International Algebraic Language). The principal effort involves
the preparation of a JOVIAL Translator for the
Philco 2000, which has been written in such
a manner that preliminary testing and actual compilation could be done on an IBM 709.
Also, an
executive control program has been developed which
takes cognizance of the requirements introduced
by the RL-101 and of the unusual nature of the
applications of the Philco 2000. The programming
for the initial research projects is proceeding
concurrently with the utility programming.
The new laboratory is expected to enhance the
present research efforts of SDC and to open entirely new avenues of research endeavor. In the
former category are a number of research projects
that have necessarily been limited in scope, but
which can now be broadened because of the new facilities.
One such area is that of automated
teaching. Successful research in this area has
been conducted at SDC in the last two years, but
the constraint of a single student to the teaching machine has been a severe limitation. This
made the gathering of statistical data very time-consuming.
Also, any potential application of
automated teaching techniques in the academic or
the military or industrial organizations would
certainly require more efficient means than individual tutoring.
Thus, the next stage in this
research effort is to create a Computerized Laboratory School System (CLASS), a project which will
be studied in SSRL very soon.
Another example of present research at SDC
which can be expanded through the medium of the
new laboratory is the study of Management Control
Systems. At the present time, the research consists of a "computer simulation" of the behavior
of a business system.
This model enables the
study of the reaction of the organization
to
specific changes under alternative sets of decision rules. As interesting as the computer simulation might be, it will be found lacking in a
basic ingredient insofar as acceptance by realworld managers is concerned. That ingredient is
the true human variability in decision making. The
particular model certainly can be made more valid
-- albeit, more complex and less controllable -by introducing human decision makers at certain
critical points in place of decision rules. Such
a "laboratory simulation" model, at a later phase
of the research, will be possible in the SSRL.
The first new research endeavor to exploit
the SSRL facilities is a study of a terminal air
traffic control system operating in a post-1970
air environment. Projected increases in traffic
volume and aircraft speeds indicate that terminal
control zones will increase in size and will therefore include many airports within a single complex.
Coordination among many airports of the control
of high density traffic of widely differing performance characteristics poses significant problems of organization and planning. It is believed that in order to effect the safe, orderly and
expeditious flow of air traffic in a terminal complex, there will be a need for a new planning agency in addition to the control agencies in intimate
contact with the details of the environment. The
general purpose of this project is to investigate
the functional interactions among the control agencies, and to evolve alternative hypotheses regarding superordinate planning agencies.

In the first phase of the project the configuration simulated is an air traffic control system for a two-airport terminal complex. The system consists of the operators and equipment representing the following agencies for each of the
two airports: Stack Control, Approach Gate Control,
Approach Control, Departure Control, and Flight
Data Processing.
Some of these agencies include
human operators while others are represented by
completely automatic processes.
The objectives
of Phase I are to study inter-airport coordination
problems and to identify significant variables for
future systematic investigation. Additional planning and coordination functions will be added
in subsequent configurations as they are indicated by Phase I results. This project -- involving
a "laboratory simulation" model --is an excellent
example of the utilization of the best aspects of
the broad range of simulation techniques in order
to experiment with a complex man-machine system.

In this wide range of simulation work which we have reviewed, two distinct activities stand out, neither one taking much cognizance of the other. On the one hand, simulation work is being done in the Operations Research field which may be classified largely as "computer simulation". On the other hand, there is the group of behavioral scientists, experimental psychologists in particular, engaged in the simulation of environmental conditions, which may be called "laboratory simulation". Each of these groups could learn a
great deal from the other. Furthermore, there is
increasing evidence that "pure" simulation will
have to be modified if it is to stand the test of
validation. What is necessary is the marriage of
the two approaches -- a realistic possibility in
the new man-machine system laboratory.
References
1. Bogdanoff, E., et al. Simulation: An Introduction to a New Technology. TM-499. Santa Monica, Calif.: System Development Corp., 1960.
2. Ernst, Arthur A. Feasibility Study for a Man-Machine Systems Research Facility. WADC Technical Report 59-51. ASTIA Doc. No. AD 213589.
3. Good, I. J. Discussion in "Symposium on Monte Carlo Methods," J. Royal Stat. Soc., B 16 (1954), 68-69.
4. Goode, Harry H., and Machol, Robert E. System Engineering. New York, N.Y.: McGraw-Hill Book Co., Inc., 1957. Pp. xii + 551.
5. Harling, John. "Simulation Techniques in Operations Research - A Review," Operations Research, 6 (1958), 307-19.
6. Householder, A. S., Forsythe, G. E., and Germond, H. H. (eds.). Monte Carlo Method. Natl. Bur. of Standards Applied Math. Series, No. 12, 1951.
7. Kendall, Maurice G., and Buckland, William R. A Dictionary of Statistical Terms. New York, N.Y.: Hafner Publishing Co., 1957. Pp. ix + 493.
8. Kennedy, J. L., Durking, J. E., and Kling, F. R. "Growing Synthetic Organisms in Synthetic Environments." Unpublished paper, Princeton University, 1960.
9. Klass, Philip J. "PERT/PEP Management Tool Use Grows," Aviation Week, Vol. 73, No. 22 (Nov. 28, 1960), 85-91.
10. Malcolm, D. G., Roseboom, J. H., Clark, C. E., and Fazar, W. "Application of a Technique for Research and Development Program Evaluation," Operations Research, 7 (1959), 646-69.
11. Malcolm, D. G. "Bibliography on the Use of Simulation in Management Analysis," Operations Research, 8 (1960), 169-77.
12. McGrath, Joseph E., and Nordlie, Peter G. Synthesis and Comparison of System Research Methods, App. B (1960). ASTIA No. AD 234463.
13. National Conference on Driving Simulation. Sponsored by Automotive Safety Foundation, Public Health Service. Santa Monica, Calif., Feb. 27 to March 1, 1961.
14. Scott, Ellis. Simulation of Social Processes: A Preliminary Report of a Survey of Current Research in the Behavioral Sciences. TM-435. Santa Monica, Calif.: System Development Corp., 1959. Pp. 15.
15. Shubik, Martin. "Bibliography on Simulation, Gaming, Artificial Intelligence and Allied Topics," J. Am. Stat. Assoc., 55 (1960), 736-51.
16. Silk, L. J. "The Gentle Art of Simulation," Business Week, Nov. 29, 1958, 73-82.
17. "Symposium on Monte Carlo Methods," J. Roy. Stat. Soc., B 16 (1954), 23-75.
18. Symposium on Monte Carlo Methods (Herbert A. Meyer, ed.). New York, N.Y.: Wiley and Sons, 1956. Pp. 382.
19. The Institute of Management Science Bulletin, Vol. 5, No. 2 (1958), p. 3.
20. Thomas, Clayton J., and Deemer, Walter L., Jr. "The Role of Operational Gaming in Operations Research," Operations Research, 5 (1957), 1-27.

1.2

MANAGEMENT GAMES AND COMPUTERS
By
Joel M. Kibbee
Director of Education
Remington Rand Univac
Division of Sperry Rand Corporation
Management Games
Management games, although a relatively new
educational technique, are being widely utilized,
and much discussed. They are primarily of concern
to the educator and to the research scientist, but
since many of these games are played with the aid
of an electronic computer, they should be of interest to computer people in general. In addition to the use of a computer for existing games,
new games are being developed and will require programming. Many papers have been published on the
educational aspects of management games; this
paper has been written primarily to arouse interest in them as a computer application.
The first management game to become widely
known was one developed by the American Management Association in 1956. It continues to be
used, along with four or five other games since
developed by A.M.A., as part of their management
education courses and seminars. Over one hundred
different management games now in use are listed
in a forthcoming book on the subject*, and more
than 30,000 executives have participated in at
least one of them.
It seems worth-while to give a brief description of a typical game play for those who have
never participated. The Remington Rand Univac
Marketing Management Simulation will be used as
an example.
The game session begins with a briefing. At
this time the instructor describes to the participants the type of company they are about to
manage, the economic environment, the general
nature of the product, and the competitive forces
they will face. He also discusses the scope of
their authority, the functions to be filled, the
decisions to be made, and the information they
will receive.
The participants are divided into management
teams, and after the briefing, the various teams
meet to develop an organization, set objectives,
and decide upon policies and procedures. In a
typical game, involving perhaps forty to fifty
executives, there might be six teams each with
seven or eight members.

*Management Games, by Joel M. Kibbee, Clifford J. Craft, and Burt Nanus, soon to be published by the Reinhold Publishing Company.
Games are played in "periods," with a period,
depending on the particular game, being a simulated day, week, month, quarter or year. The
Univac Marketing Game takes place in months. The
participants are given operating statements for
December and begin by making decisions for January. In addition to the operating statements
they are also provided with a case history, sales
forecast, and data on material and operating
costs, production facilities, and shipping times.
The decisions for January are processed by
the computer, and operating reports are produced
and returned to the participants. Decisions
are now made for February, and so forth, perhaps
for one simulated year. In a typical play the
companies have a half-hour in which to make decisions, and reports are returned about ten minutes
after the decisions have been submitted.
In the Univac Marketing Game, each company
manufactures one product and markets it in three
different regions, East, West, and South. All
companies are competing in the same consumer
market. The managers set price, spend money on
advertising, hire or fire salesmen, set the
salesmen's compensation rate, set production level,
engage in special market research projects, etc.
The total market is shared among the companies
according to their pricing and advertising policies, according to the number of salesmen and
their degree of training, and so forth. The operating reports show the sales obtained, the net
profit achieved, inventory on hand, etc. The report also shows the number of salesmen on hand;
companies can pirate experienced salesmen away
from one another.
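
As a purely illustrative sketch of such a market-sharing rule, the short program below (written in present-day Python; the weighting form, the constants and the function names are invented for this illustration and are not those of the Univac Marketing Game) divides a fixed total demand among the teams in proportion to a weight computed from price, advertising and sales force.

    # A hypothetical market-sharing rule: each team's slice of a fixed total
    # demand is proportional to a weight computed from its decisions. The
    # weighting form and constants are invented for this illustration.

    def market_shares(decisions, total_demand=10000):
        """decisions: {team: {"price": ..., "advertising": ..., "salesmen": ...}}"""
        def weight(d):
            # Lower price, heavier advertising, and more salesmen raise the weight.
            return (1.0 / d["price"]) ** 2 * d["advertising"] ** 0.5 * d["salesmen"]

        weights = {team: weight(d) for team, d in decisions.items()}
        total = sum(weights.values())
        return {team: round(total_demand * w / total) for team, w in weights.items()}

    if __name__ == "__main__":
        print(market_shares({
            "A": {"price": 10.00, "advertising": 4000, "salesmen": 20},
            "B": {"price": 11.00, "advertising": 9000, "salesmen": 15},
        }))

Any monotone weighting of the decision variables would serve the same training purpose; the point is only that the shares respond plausibly to the decisions submitted.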
Each management attempts to achieve the
largest possible accumulated net profit, and a
"winner" might be proclaimed. This is usually
discouraged, however, as good performance in a
management game, as in real business, depends on
many factors such as return on investment, share
of market, personnel policy, and numerous others
that contribute to success. At the end of the
game play, a discussion session takes place.
This "critique" is held to focus attention on the

12

1.2
lessons which were to be taught, and it gives the
participants the opportunity to review their performance, discuss management principles with
other members of the group, and receive feedback
from the game administrator and other observers.
Existing management games vary widely in the
types of models used. The original A.M.A. Game,
and similar games developed by IBM, UCLA, Pillsbury and other organizations, are concerned with
general management. The Carnegie Tech Management
Game is based on the detergent industry. Other
games exist for banking, petroleum, telephone exchanges, insurance companies and super markets.
Some games concentrate on a particular management
function, such as marketing, materials management
or manufacturing. There is a game concerned with
the management of a gas station, and three different existing games are based on an auto dealer
model. The military have, of course, been playing
war games for many centuries, and they are now
utilizing computers for this purpose.
Most games are played by one or more management teams, where a team might be made up of anywhere from one to twenty executives. Since most
games contain some obvious measure of performance
there is always a "rivalry" between teams. There
may or may not be a direct "interaction." In a
marketing game various teams may be in competition for a common market, and the action of one
team, say price of its product, will affect all
other teams. In an inventory control game, on
the other hand, each team is attempting to achieve
the best performance beginning from the same conditions. A game with interaction is like tennis,
a game without interaction is like golf. Both
such games engender rivalry.

The word "competitive" has often been used in the classification of games, usually as synonymous with interaction. There is, however, an economic meaning for "competition," namely, competing for a share of a common market. A marketing game would include competition, but such competition can be either of an interactive form, with the competitors being other participating teams, or a non-interactive form, in which the economic competition is built into the model itself.
Games also differ as to the level of management for which they are intended. They differ widely as to the complexity of the model. Some are meant to be played quickly, others require considerable analysis. In general, then, a management game is a dynamic case history in which the participants, faced with a simulated business situation, make decisions, and are fed back reports based upon these decisions.

Manual and Computer Games

Management games, like the business situation they simulate, require that information be processed and calculations made. The extent of the computations depends on the particular game, and may require anything from a pencil to a large computer. This has given rise to an obvious classification into manual games and computer games. Computers are used for management games for the same reasons they are used in any business application, primarily to perform computations speedily, accurately, and automatically. Computer games can also be more flexible, as will be discussed more fully below.
Some rather odd advantages have been attributed to manual games. It has been stated, for example, that manual games can be made very simple and easy to play. Obviously a computer game can be made just as simple, though there is a temptation to use the full capacity of the computer with a resultant complexity that is not needed to satisfy the educational objectives. Similarly, one published statement claims that an advantage of manual games is less time pressure on the participants. Just because the computations are made rapidly does not mean that the participants must make their decisions quickly. Manual games do have advantages. They usually cost less initially, do not require special facilities, and can be scheduled as desired. Good design is required to keep down the number of clerks and administrators needed. There are so many advantages to computer games, however, that, at least for this author, they seem well worth the development and operating costs.
Since games vary widely in complexity, it is not possible to state how much computer time or programming is generally involved. However, several moderately complex games which the author helped develop, and which were designed for a one or two day management exercise, involve something of the order of 3,000 to 5,000 instructions, and took from three to nine man-months to program. The total development cost of a game includes more than programming, however. More or less time can be spent on the creation of the model, consulting fees can be required, as well as the cost of materials and test plays. Most often the programmer is involved in the development of the model as well as the computer coding. These moderately complex games are usually designed for a one day management exercise and might involve five to eight hours of computer time, although the computer might actually be used only part of the time and be free for other processing between game periods. As an example, a play of the Univac Marketing Management Simulation, lasting for perhaps eight to ten simulated months, and accommodating about fifty executives, might cost $300 to $500 for computing time. The cost is about the same if the game is played discontinuously using a Univac Service Center. However, a group playing a game might also have non-computer expenses for special facilities, materials or staff.
It is not necessary to have a computer on the premises to conduct a management game. Recently, five different cities in the Midwest simultaneously played the Univac Marketing Game through the use of leased telephone lines. Of
more interest, however, is what we call the "discontinuous" mode of play. Decisions are made
perhaps once a week and mailed to a service center
and the reports are mailed back. This enables the
game to be played without infringing upon regular
production time since the game can be run at any
odd 15 minutes at the convenience of the center.
Many educational, industrial and professional
organizations are at this moment engaged in discontinuous plays of management games.
Management Games And Computer Personnel
Because of the widespread use of management
games, and their ever increasing growth, it is
likely that most computer installations will at
one time or another find themselves involved in
running a game session. Several computer manufacturers have developed management games and
happily supply the programs to their users. Most
computer games developed by other business or educational organizations are also available.
Directors of Training or of Management Development
are now generally interested in this new tool and
will probably be getting in touch with the manager
of the computer installation if they have not already done so.
The computer installation may also be called
upon to help develop and program a new management
game. Irrespective of their interest in education,
computer personnel will find that management games
can provide an excellent orientation to data processing for top management. It is sometimes difficult to get a company president to watch a payroll demonstration, or a matrix being inverted,
but, as personal experience has shown, he can become very much interested in the computer as a result of his involvement in a management game.
The data processing manager might consider
the use of management games for training personnel within his own department. A game called
SMART for systems and procedures managers was developed a few years ago, and the System Development Corporation developed a game called STEP for
use in training programming supervisors.
The Educational Advantages Of Management Games
Very little research has been done on the
validity of management games as an educational
tool, but a similar statement can probably be
made about most educational techniques now in use.
The most striking thing about a management game is
the involvement on the part of the participants.
One is continually impressed with the way in
which executives will work "after hours" in planning the operations for their simulated companies,
and it is generally necessary to bring in sandwiches and coffee rather than to attempt to interrupt the play for a luncheon or dinner. Motivation is an important aspect of learning, and
management games are sometimes used at the start
of a course or seminar merely to stimulate
"students" towards a greater receptivity for
lectures or other types of training that will

follow.
Management games are superior to other educational techniques for demonstrating the importance of planned, critically timed decisions; the
necessity of flexible organized effort; and the
significance of reaching a dynamic balance between interacting managerial functions. They can
also demonstrate the need for decision-assisting
tools, such 'as forecasts, control charts and
budgets. They can demonstrate to management the
power of a scientific approach to decision making.
Management games are educational tools, and
to be effective should be used along with other
techniques as part of an overall course or seminar.
The briefing and the critique are as important as
the actual play. Furthermore, most game sessions
involve many "incidents" which are not actually
part of the computer model. The participants
might be asked to formulate a personnel policy, or
to submit special reports. Job rotation, promotions, appraisals, and so forth can all be made
part of the exercise. Much of the activity that
takes place -- in organizing, planning, communicating and controlling -- is in addition to the
numerical decisions which are submitted to the
computer. Management games continue to be demonstrated for a variety of reasons, but a demonstration is not a course, and a participant
should not form his opinion as to their educational merit from a few hours engaged in a demonstration play.
Building A Management Game
Management games are constructed for educational purposes, and their construction is premised on a set of educational objectives. Working
from the objectives, a model which simulates a
business situation is constructed. One of the
most important constraints on the model is a need
for simplicity.
A game must be simple to play. This does
not mean that it needs to be easy to make good
decisions, but the participants should not have
to devote considerable time and energy to learning
the rules. It requires skill and experience to
abstract from the real world those elements of
major importance so that a playable game will result. It is here that a programmer must work
closely with the team that is building a game,
and vice versa, the team should get the programmer
into the act as early as possible. Skillful programming can do much to simplify the mechanics of
play for the participants. A good program facilitates the manner in which the players submit their
decisions, and attempts to set up procedures which
will keep clerical errors to a minimum, and even
possibly have the computer edit the decisions for
obvious errors.
In the Univac Manufacturing Game, for example, there are quite a few shipping decisions.
Suppose a company decides to ship 100 Clanks to
the Western Region, and 100 Clanks to the Eastern
Region, but only has 160 Clanks on hand. The
computer automatically interprets the decision as
a desire to ship equally to both regions and 80
are sent, with no interruption in the game session. Similarly in games which have a limited
cash on hand, the computer can issue emergency
loans at some interest rate to accommodate an
error on the part of the participants in spending too much money.
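
The kind of decision editing just described can be indicated by the following sketch, written in present-day Python purely for illustration; the function names, the interest figure and the sample quantities are invented and are not taken from the Univac Manufacturing Game.

    # Hypothetical decision editing: shipment requests are scaled to the stock
    # on hand, and an overdrawn cash balance is covered by an emergency loan.
    # Names, the 6 per cent rate, and the sample figures are all invented.

    def edit_shipments(requested, on_hand):
        """requested: {region: units}. Scale every request by the same factor
        when the total exceeds the units actually on hand."""
        total = sum(requested.values())
        if total <= on_hand or total == 0:
            return dict(requested)
        factor = on_hand / total
        return {region: int(units * factor) for region, units in requested.items()}

    def cover_overdraft(cash, spending, interest_rate=0.06):
        """If spending exceeds cash, issue an emergency loan for the shortfall,
        to be repaid next period with interest."""
        shortfall = max(0.0, spending - cash)
        return shortfall, shortfall * (1.0 + interest_rate)

    if __name__ == "__main__":
        print(edit_shipments({"West": 100, "East": 100}, on_hand=160))  # 80 and 80
        print(cover_overdraft(cash=5000.0, spending=7000.0))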
In some games, it is only necessary for the
participants to circle code numbers on a decision form, rather than write out quantities with
possible errors in the number of trailing zeros.
The object of games is seldom to teach the
participants to be less careless in their clerical tasks, and a good computer program can do
much to eliminate clerical chores entirely so
that the teams can concentrate on their decision
problems.
While in university courses, and often in
military games, the time spent on the game session might be lengthy, in most business applications only a day or so of a course or seminar can
be devoted to the game exercise and simplicity
for the participant is extremely important. This
is one of the main constraints on the model, and
even more on the computer program.
Experience has shown that the mathematical
model used in management games can be extremely
simple. One may begin with a curve, perhaps relating a demand to advertising expenditure,
which is defined by a table of fifteen points.
Later it is found that if only five points are
used, the play from the standpoint of the
participants is identical, and the author has
actually used games in which all relationships
are linear -- though there are many interacting
ones -- without any apparent difference in the
training experience. In the usually short time
a company has to make its decisions, the mere
fact that demand might depend on six or seven
marketing decisions presents adequate complexity
without the use of complex curves.
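
The following sketch, in present-day Python with invented table values, shows how little machinery such a table requires: demand is obtained by straight-line interpolation over a five-point table of advertising expenditures.

    # Demand defined by a five-point table of advertising expenditures, with
    # straight-line interpolation between the points. All values are invented.

    import bisect

    ADVERTISING_POINTS = [0, 2000, 5000, 10000, 20000]   # dollars spent
    DEMAND_POINTS      = [400, 900, 1500, 1900, 2100]    # units demanded

    def demand_from_advertising(spend):
        """Linear interpolation over the table; flat beyond either end."""
        if spend <= ADVERTISING_POINTS[0]:
            return float(DEMAND_POINTS[0])
        if spend >= ADVERTISING_POINTS[-1]:
            return float(DEMAND_POINTS[-1])
        i = bisect.bisect_right(ADVERTISING_POINTS, spend)
        x0, x1 = ADVERTISING_POINTS[i - 1], ADVERTISING_POINTS[i]
        y0, y1 = DEMAND_POINTS[i - 1], DEMAND_POINTS[i]
        return y0 + (y1 - y0) * (spend - x0) / (x1 - x0)

    if __name__ == "__main__":
        print(demand_from_advertising(7500))   # falls between 1500 and 1900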
From the standpoint of the computer computations, it is not usually important whether a more
or less complex curve is used, though extreme
mathematical complexity could lengthen the computation time even in a large computer. However,
throughout the complete model a surprising
amount of simplification can be introduced without violating the educational objectives.
Simplicity is important for the game administrator as well as the participants, since it should
not be necessary to have a large staff to conduct
game sessions.
As one continues to emphasize simplicity,the
question of realism always arises. Some games
are used to teach a specific skill or technique,
perhaps production planning and control. In such
cases a realistic and adequately complex model
might be necessary. But most management games
are used to teach general principles, for instance

the importance of planning and control rather
than the specific relationship between inventory
carrying costs and stockout costs. For such
games verisimilitude and not realism is the most
important attribute.
Verisimilitude is the appearance of reality.
It is as important in management games as in the
theater. As long as the relationship between
price and demand, for example, seems similar to
what goes on in the real world, and sufficiently
engrosses the participant in the exercise, it is
not important that the actual curve used, assuming even that it was known, is identical with
that obtained from a detailed study of real data.
Usually one is attempting to train a manager in,
for example, marketing principles, and not the
way in which a particular product with which he
is concerned will behave in the real market.
The word "Management Game" has generally
been used with a training implication, and the
term "Simulation" used when the object is to
solve a problem, or actually guide management
decision making. Under this terminology a
simulation model must have validity, in the
sense of an ability to predict the future, if it
is to be of 'value; the management game model,
used for training, must more often stand the
test of verisimilitude. In fact, it is possible
that an over-concern with realism can produce a
game that is too complex, too difficult to play,
and can actually destroy verisimilitude, and the
involvement on the part of the participants
which is so important.
The programmer working together with a
team that is constructing a management game can
do much in the cause of simplicity and verisimilitude. It is this very ability which makes
computer games, in general, superior to manual
games. The reports returned to the participants
can resemble the actual reports which they
obtain in everyday life. Furthermore, there is
essentially no limit to the amount of special information that can be provided to simplify the
role of the participant. Thus, a report can give
total costs, unit costs, and percentages. It can
provide a variety of research type information
and statistics. Little of this is usually possible in a manual game. The American Management
Association's Top Management Decision Making
Simulation permits management to engage in
market research studies, at a suitable cost,
which will provide them with the answer to
questions concerning the number of orders for a
product that might have been received had a different price been set. From the computer standpoint, it is only necessary to loop through the
same set of computations using the research
price rather than the actual price.
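
The following sketch (present-day Python; the demand relationship is an invented stand-in for whatever the game model actually contains) indicates the idea of looping through the same computations with the research price.

    # "Market research" by re-running the same demand computation at each
    # hypothetical price. The constant-elasticity form is an invented stand-in.

    def demand_at(price, base_demand=1000, reference_price=10.0, elasticity=1.5):
        return base_demand * (reference_price / price) ** elasticity

    def market_research_report(actual_price, research_prices):
        """One row per price: what the orders would have been at that price."""
        return {price: round(demand_at(price))
                for price in [actual_price, *research_prices]}

    if __name__ == "__main__":
        print(market_research_report(10.0, research_prices=[9.0, 11.0, 12.0]))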
A good computer program can also simplify
the task of the game administrator. In the
American Management Association's General Management Simulation, it is possible during the play
to develop additional products through a program
of research and development. When a product has
been developed the staff sends a pre-printed
letter to the company informing them of this.
However, the computer program has been arranged
so that a special report to the administrator is
prepared each period, and the computer itself, on
this report, informs the administrator when to
send out a particular letter. This same report,
incidentally, informs the administrator about
various aspects of the company's performance so
that he is better able to control the session,
and to provide feedback at the critique.
There are many ways in which the relationship between quantities can be introduced into
the model. The most common approach is to use a
mathematical function; this itself may take the
form of a graph, a table, or an algebraic expression. Another approach is the use of judges.
There might be one or more "expert" judges who
make a subjective evaluation of the effect of
particular policies. Such a judge should not be
confused with a clerk, sometimes called a judge,
umpire or referee, who, in a manual game, merely
performs the arithmetic computations according to
pre-arranged formulas. This is the function of
the "computer," whether it be a man or a machine.
The "expert" judge is a person with specific
knowledge and experience, who arrives at a
subjective evaluation of the decisions, hopefully
without bias towards the particular participants.
The judges themselves may be a group of
participants, and can be considered to be one
of the playing teams. One might have five
competing companies in a general management game,
together with two teams of competing bankers who
may invest money in the various companies. A
model may use various combinations of equations,
judges, etc., and in the example just cited only
the capitalization aspect is relegated to judges.
In some games only certain non-quantifiable
factors may be left to observers who can directly
influence company performance in the role of
judges.

In one game based on a real product, the judges consisted of several members of top management, and their own deliberations and arguments as to the evaluation to be placed on the various marketing policies being exhibited by the teams playing the game proved to be, in their own opinion, an extremely valuable educational experience. At the other extreme, one might use judges who did not even know that a game was being played. For example, marketing policies together with advertising copy could be presented to an external group of simulated "customers," who would express their own preferences among the products being offered.
A very objective and unbiased manner for incorporating the necessary relationships in a model is by the use of bidding techniques. For example, each company might submit a closed bid as to how many units they could supply at a particular price, and the demand would be awarded on the basis of lowest price. Similarly, raw material could be offered to the highest bidder, and the cost would then not be based on a specific value built into the model. Bidding techniques have been highly successful in many games. A particularly good example is the "Management Business Game" produced by the Avalon Hill Company, an excellent "parlor" game that can have serious uses.

Parameters

The mathematical meaning of parameter applies mainly to the constants used in the algebraic relationships between quantities, but for games it is convenient to extend this concept to all constants. For instance, the cost of carrying inventory in a game might be a function of the opening value of the inventory A and the closing value of inventory B, namely, .05 (rA + sB). The .05 is also a parameter, but it has been given a specific value for this illustration. If we set r = 1 and s = 0, we have a cost dependent only on opening values. We might prefer to set r = 1/2 and s = 1/2, which is slightly more realistic. Thus, we are changing the nature of the model by our choice for r and s. Similarly, the cost of raw material is a parameter which may be changed for different game plays. The inclusion or omission of certain factors can be controlled by parameters acting as switches. Whether or not certain information is to be included on the report can be controlled by a parameter which can take on the values of 0 or 1. In well-designed computer games, extensive tables of such parameters are used, and in this way seemingly different games can arise from the same model by the choice of a particular parameter set.
One normally attempts to compute in advance the parameter values that will be used in a particular game, but this is usually followed by actual parameter studies once the program has been checked out, and, of greater importance, by numerous test plays. There is always a range of feasible parameter values, and the particular set to be used in a particular game session will depend on the game administrator. A game should be packaged with many such sets (and they may be labeled "highly stable, somewhat price insensitive industrial product" or "highly competitive, very price sensitive consumer product," and so forth). Thus a single game program can be used for different games, and ideally one can imagine one large model with sufficient parameters to allow it to be adapted to a variety of industries and situations.

Extreme Values And Limits

Like any good program, the game program must be prepared to handle any decisions, no matter how ridiculous or extreme. Many a program has "hung up" during a game session because the game designers were convinced that only a particular range of decisions might rationally be
made. Not only must the program handle extreme
decisions, but a rational result must be obtained.
For example, a product might be priced somewhere
around $10.00, but the program should be ready to
accept prices from $0 to $999999999 or whatever
the upper limit is that is imposed by the input
design. At some suitably high price the demand
will go to zero, but it must remain there, and
a curve must not be used that departs from zero
again above $999 just because one does not expect
such a decision to be made. It must also be decided what the demand will be if a price of $0 is
set, even though the company that makes it will
be losing money on every item sold.
The American Management Association's
General Management Simulation is immune to participants who might take extreme decisions. It is
possible for a company to fire all of its workers,
close down plants, etc. The program not only
will handle this, but will change its overhead
cost distribution procedure accordingly! As in
any computer program, the programmer must be prepared to have any quantity, unless limited by input format (though that is just another method
under his control) take on any value from minus
infinity to plus infinity.
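
Both of these points -- a table of parameters, some of which act as 0-or-1 switches, and relationships clamped against absurd inputs -- are indicated in the sketch below. It is written in present-day Python, and every name and figure in it is invented for illustration; it does not reproduce any particular game's model.

    # A parameter table drives the model; some entries act as 0-or-1 switches,
    # and the demand relationship is clamped against absurd price decisions.
    # Every name and figure below is invented for illustration.

    STABLE_INDUSTRIAL = {
        "inventory_rate": 0.05, "r": 0.5, "s": 0.5,       # carrying cost parameters
        "price_sensitivity": 0.8, "show_unit_costs": 1,   # last entry is a switch
    }
    COMPETITIVE_CONSUMER = {
        "inventory_rate": 0.05, "r": 1.0, "s": 0.0,
        "price_sensitivity": 1.5, "show_unit_costs": 0,
    }

    def carrying_cost(opening, closing, p):
        return p["inventory_rate"] * (p["r"] * opening + p["s"] * closing)

    def demand(price, p, base=2000, cutoff=25.0):
        """Falls with price, reaches zero at a finite price, and stays at zero
        for any higher price -- even $999,999,999. A $0 price still yields a
        defined (maximum) demand."""
        if price <= 0.0:
            return float(base)
        return max(0.0, base * (1.0 - p["price_sensitivity"] * price / cutoff))

    if __name__ == "__main__":
        p = STABLE_INDUSTRIAL
        print(carrying_cost(8000, 6000, p))                        # 350.0
        print(demand(10.0, p), demand(999999999.0, p), demand(0.0, p))
        print(demand(10.0, COMPETITIVE_CONSUMER))                  # a "different game"

Exchanging one parameter set for another yields what appears to the players to be a different game, although the program and the model are unchanged.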
The Future Of Management Games
Management Games are only a few years old,
but one can already look back with fondness on
their infancy, and look ahead with confidence to
their maturity. Like most babies, the first few
games were very much alike; they usually modeled
the marketing or manufacturing of a durable good
and were slanted at higher management. Today
new games, like new teen-age singers, are
arriving on the scene with increasing rapidity.
Management games have been used primarily as
an educational tool. Their use in training is increasing and will increase, and will also spread
well beyond the area qualified by the word
"management." In addition, they will undoubtedly
have considerable application in research,
problem solving, personnel testing, and as a
direct aid to management decision making.
There are still no management games for
mining companies, the fishing industry, or the
mink farm. The entertainment field -- TV,
publishing, motion picture studios -- needs
management as well as "talent." Government --
city, state or national -- provides a large area
of application of management games for training.
Labor unions, universities and professional
associations also have managers. It is fairly
easy to write down hundreds of training situations which could well use this new educational
tool. And perhaps somebody ought to build a
game to teach people how to build a game.
Games completely different from those now in
use can be expected. A super-game could be constructed which included manufacturing companies,
financial institutions, service organizations,

suppliers of raw material, and even a couple of
management consulting firms. Management games
are having extensive use in management education,
but there is probably an even greater need for
new tools in supervisory training.
Psychologists and sociologists have long
used humans, as well as animals, to study human
behavior. Much work has been done with small
task performing groups. The computer opens the
possibility of new uses of simulation in the
life
sciences, and one can expect an increase
in the number of laboratories now doing such research. Management games can also be expected to
play an important role in economic research.
While the simple manual management game has
a purpose, and is extremely useful in many
training situations, one can safely predict an
increasing use of computers in the management
game area. This paper was presented because of
the increasing importance of management games to
computer people. It is hoped that the interested
reader will read elsewhere for those more
important aspects of management games related to
their construction, their educational utilization,
etc.
And it is also hoped that he will find the
opportunity of playing one -- it is not only fun,
it is educational.

1.3

AN ON-LINE MANAGEMENT SYSTEM
USING ENGLISH LANGUAGE
Andrew Vazsonyi
Ramo-Wooldridge
Canoga Park, Calif.

Summary
The demonstration model of an on-line management system presented in this paper aims to
provide increased management capability to executives charged with planning and control of
large scale research development and production
programs. The technique is formulated as an
exercise in Decision Gaming, and special emphasis is laid on the problem of providing capability
for quick and optimum reprogramming of dollars,
manpower, facilities and other resources. The
task of planning and control is structured into
two components: the more routine tasks are assigned to the equipment, whereas problems
requiring executive judgment are delegated to
players of the Decision Game. Through the use
of mathematical models and computer routines
the consequences of proposed reprogramming
actions are presented to the players in terms of
financial and manpower requirements, facilities
loading, etc. Through a step by step man-machine process, optimum programs and the
best utilization of resources are reached. Management data is retrieved and manipulated on an
on-line basis and all operations of the equipment
are executed through everyday English commands.
All data is displayed on cathode ray tubes and
projection screens, including instructions to the
players on how to operate the equipment and how
to play the Decision Game. Input to the equipment is provided through (1) a permanently
labeled keyboard, (2) a blind keyboard that can
be provided with appropriate labels through a set
of plastic overlays. The computer action resulting from depressing of keys must be programmed
and is not wired permanently. By providing a set
of independently operated input-output consoles,
connected on-line to the same computer system,
a significant advance in the art of the design of
management systems is provided.
Introduction
The management planning and control technique
described in this presentation has been developed
for certain military and civilian activities with
the purpose of assisting executives in evaluating
and re-programming complex activities. However, it is believed that the technique is equally
applicable to the planning and control of other
large scale research, development, production
and construction programs.
In order to apply the planning and control technique to an activity it is necessary to divide the

activity into "elementary" programming blocks.
The technique requires that first alternate scheduling and financial data on each of these planning
blocks be developed. In addition, it is necessary
to formulate explicitly the inter-relationships
between the planning blocks. These relationships
specify the permissible time phasing of the elementary programming blocks, the associated
dollar and manpower requirements, facilities
requirements and other financial requirements.
The planning and control technique employs a network analysis of the various activities involved
and permits the exploration of a large number of
planning combinations.
The primary purpose of the management
planning and control system is to assist executives
in re-programming. As an illustration, suppose
that plans are compared with progress, a deviation is observed and re-programming of the different activities is required. For instance, it
might be necessary (1) to cancel a program, (2)
to stretch another one out, (3) to accelerate one,
or (4) to decrease or increase production quantities. Another situation where the need for re-programming arises is when there is a budgetary
change and financial trade-offs between various
programs must be considered. For instance,
executives may want to know, if a particular
deadline is postponed by six months, how many
dollars and what manpower can be made available
to another program, and by how much can this
other program be accelerated.
The actual program analysis and re-programming activity is carried out through the medium
of a Management Decision Game.
Brief Description of Decision Gaming Technique
The Decision Game is to be played in three
steps. As a first step the players of the Decision
Game gather in the Control Room where the Game
is to be played and where the various information
displays can be retrieved with the aid of the computer system. The displays present all the important planning factors relating to the activities
to be re-programmed. The time phasing of
various missions, deadlines and goals, and the
associated loading of various facilities can all be
displayed. The associated financial information
can also be shown with sufficient detail so that
the financial consequences of re-programming decisions can be assessed. Provisions are made to
retrieve further back-up information when requested, from a file of status of progress and
alternatives.
As a first step of re-programming a comprehensive analysis of the status of the programs is
carried out. The computer system is provided
with the capability of furnishing status information on a real-time basis and in everyday English.
Information related to all matters pertaining to
the progress of various programs is displayed in
cathode-ray tubes or on a projection screen. After
this status analysis is completed, the players
have adequate information to perform the second
step of the Game.
This second step of the Game consists of making a "move". Such a "move" may involve a time
shift of some of the deadlines, milestones or
goals and/or a change in the delivery quantities
involved in the program. As suggestions for re-programming "moves" are made, the proposed
changes are put into the computer system through
the use of an appropriate keyboard and Communication Display Tube. At the direction of the
operator the computer and associated equipment
takes over and the third step of the Game, that
is, the re-programming computations, is
carried out.
This third step of the Decision Game is executed by the computer in accordance with mathematical models and associated computer routines
stored in its memory. Within a time span of
seconds the computer prepares a new program,
including all the new deadlines, the new phasing
of sub-programs, facilities loading, manpower
and financial implications. When the computer
finishes the computations, the data is presented
to the players through cathode-ray tubes and/or
slide projections. By examining the various
displays and by retrieving more detailed information, the players can evaluate whether the suggested solution to the re-programming problem
is satisfactory.
In most situations the first suggested program
will result in conditions that are not acceptable
to the players of the Game. Therefore, after
considering the results of the "move" and discussing further implications of the data, a new
proposal for re-programming will emerge and a
new cycle of the Decision Game will be entered
upon. By a series of steps it is possible to
develop a final program that is acceptable to the
players.

At the termination of the gaming exercise all the implications of the final program are recomputed with greater accuracy. It is not expected that this re-computation will result in major changes, but only that the re-computation will provide an accurate, acceptable and detailed plan. All communications between man and machine are performed in a real-time manner and in everyday English.

The File of Status and Alternatives

The Gaming Technique described here allows the examination of a panorama of alternatives. The analysis can be performed only if, in the memory of the computer, techniques for examining many alternate possibilities are stored and programmed. It is recognized that it is impossible to store all the possible alternatives and, therefore, a method to study alternatives must be provided. The analysis is made possible by the application of mathematical modeling techniques and by the storage of certain basic system parameters. The mathematical model uses the elementary programming activities as basic building blocks and relates these activities to each other through mathematical relationships. For instance, alternate ways to accomplish a basic programming block can be associated with various estimates of completion dates and costs. The mathematical model summarizes the relationships, and also relates the different activities to each other through equations and inequalities. Manpower and financial requirements appear as dependent variables, whereas the time phasing of the various activities appears as the independent variables.
In order to avoid the necessity of manipulating a large number of parameters, sub-optimizing techniques are introduced. For instance, it might be required that certain types of sub-programs be accomplished at a minimum cost, and this policy can be embodied in a system of equations through mathematical programming techniques. By such relationships, the majority of the variables of the system can be made to depend on a few control variables. With the aid of mathematical models and sub-optimization techniques it becomes possible for the players to manipulate only a few of the major variables and still examine a large number of alternate plans.
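
The structure just described can be indicated by the sketch below, written in present-day Python purely as an illustration; the programming blocks, figures and function names are invented. Each block has alternate schedules, the required completion date acts as the control variable, a crude sub-optimization picks the cheapest alternative meeting that date, and dollar and manpower requirements then fall out as dependent quantities.

    # Alternate schedules per "elementary" programming block; a crude
    # sub-optimization picks the cheapest alternative meeting the required
    # date, and dollars and manpower fall out as dependent quantities.
    # The blocks, figures and names are invented.

    ALTERNATIVES = {                 # months to complete, cost in $M, peak manpower
        "airframe": [{"months": 18, "cost": 40.0, "manpower": 600},
                     {"months": 12, "cost": 55.0, "manpower": 950}],   # crash schedule
        "guidance": [{"months": 24, "cost": 30.0, "manpower": 400},
                     {"months": 16, "cost": 44.0, "manpower": 700}],
    }

    def suboptimize(required_months):
        """For each block, choose the cheapest alternative finishing in time."""
        plan = {}
        for block, options in ALTERNATIVES.items():
            feasible = [o for o in options if o["months"] <= required_months]
            if not feasible:
                return None          # the proposed acceleration is not feasible
            plan[block] = min(feasible, key=lambda o: o["cost"])
        return plan

    def requirements(plan):
        """Dollar and manpower requirements as dependent variables of the plan."""
        return (sum(o["cost"] for o in plan.values()),
                max(o["manpower"] for o in plan.values()))

    if __name__ == "__main__":
        plan = suboptimize(required_months=16)
        print(plan and requirements(plan))      # (99.0, 950)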
Equipment Requirements
There is no equipment on the shelf today that
can carry out in all its details the management
planning and control technique described here.
However, there is equipment available, which
with minor modifications could possess the capability required. A detailed study of the Ramo-Wooldridge Polymorphic Computer System and
Display Analysis Console, for instance, shows
that essentially all the required features could be
made available in a short time. This computer
system has been described elsewhere, and in this
discussion equipment details will not be included.

Detailed Description of Decision Gaming System
The fundamental concepts underlying the
Decision Game are shown pictorially in a simplified form in Figure 1. Three displays enable
the players to communicate with the computer.
The first of these is a visual representation of the
time phasing of all the important missions and
goals. The information on this display is
schematically represented in the upper part of
Figure 1 and is to be displayed in the "Program
Network Tube" of Figure 2 (projection capability
can be provided if desired so that a group of participants can analyze the data). Sufficient details
will be shown so that all milestones of importance
are displayed, but the data will not be so detailed
as to confuse the players. As the Game starts,
various questions will arise which will not be immediately answerable by the displayed material.
To meet this condition, back-up displays will be
stored which can be retrieved by the players as
requested. By this technique, it will be possible
for the players to go into any degree of detail in
the time phasing of the missions and goals without
making the presentation too cumbersome or confusing.
A part of the display on the "Program Network
Tube" is the visual representation of the utilization and loading of the different facilities ass ociated with the programs considered. This display
is shown by the third item from the top in Figure
1. All the previous comments made in connection
with the visual representation of Programs A and
B apply for the Facilities Loading displays, too.
Sufficient detail will be given so that the player
can appraise the state and progress of various
programs, and again sufficient back-up information will be available through retrieval.
The second display refers to dollars, costs,
manpower, and other resources. These are
represented graphically in the lower part of Figure 1 and are to be displayed on the "Resources
Requirements Tube" of Figure 2. The dollar and
manpower profiles as they unfold in time will be
represented in sufficient detail so that all the important information for the players will be furnished. In addition, when it is required, the
players will be furnished with hard copies of
printed financial information.
The display capability so far described furnishes the players of the Game with such pertinent
information as past history, status, and future
projections of programs. Particular emphasis is
placed on the preparation of this information in
such a form that organizational structure and
responsibilities are directly tied in to the information displayed.
The lower right corner of Figure 2 shows the
"Man Machine Communication Display". This is
the tube that offers choices of instructions to the
player in plain English. This tube is used mostly
for non-standard types of instruction to the player,
as ordinary instructions (say: "Machine Is Busy")
are provided through the illumination of status
lights.
So far we have described the display systems
and the type of information stored. We are now
ready to proceed to the description of how the
Decision Game is to be played. In order to be
able to speak in more specific terms, we take
the hypothetical problem of a new requirement,
that a particular mission is to be accomplished
one year ahead of schedule. This new requirement requires the acceleration of a major program and a reorientation of the resources

available.
When such a problem arises, various discussions take place at different managerial levels.
We do not propose that the Decision Game is to
replace these conferences. However, after a
preliminary consideration of the problem the
appropriate management group gathers in the control room to play the Decision Game. By a step
by step procedure, they evaluate, modify and
sharpen the preliminary ideas that have risen in
connection with this problem of advancing the
completion date of a major mission.
When the group meets the first time in the
control room, the players begin by retrieving a
number of different displays to update and verify
their knowledge of the status of the programs. Such a
review consists of inspecting the principal displays
associated with the problem and of retrieving
various back-up information. After such a preliminary discussion, a proposed first solution to
the reprogramming problem is suggested and information defining the proposed change is keypunched into the computer.
At the instruction of the players the computer
begins to carry out the routine associated with the
particular reprogramming problem introduced.
The computer consults the Data and Program
Reservoir containing the file of status and alternatives shown on the lower left-hand side in
Figure 2, and on the basis of stored information
and routines, computes dollar and manpower
requirements. In addition, facilities requirements
and loading are checked and computations are
made to determine whether the desired acceleration is feasible at all.
As the computer proceeds through its routine,
it might find that the proposed acceleration is
impossible or impractical. It is possible that
even if all projects are put on a crash basis the
mission could not be accomplished within an
acceptable date. It may be that, for instance, manpower is not available, even if more shifts are
employed. Under such conditions, the computer
will indicate that the plan is not feasible and it
will display on the "Communication Display Tube"
a warning signal, which shows in detail why the
proposed solution to the reprogramming problem
is not feasible.
At this point, a group discussion follows to
determine whether by a higher order of decision
a solution could be found. For instance, it might
be decided that another facility can be built or
made available, or that another contractor can be
called in. Information available to the decision
maker will not always be programmed into the
computer and, consequently, feasibility indicated
by the computer will occasionally be considered
as tentative.

If, indeed, a need for such a new alternative
way of proceeding with the problem exists, this
information must be put into quantitative form and
fed into the machine. On the other hand, if the
computer indicates general feasibility, then the
players can immediately proceed to further evaluation of the proposed program.
When the program modification is feasible, the
players are primarily concerned with resource
requirements and with dollar and manpower profiles associated with the program. It is very
likely that the first solution proposed will not be
acceptable from the point of view of budgetary
considerations. It is likely that the costs at certain phases of the program will be beyond possible
funding, and perhaps at some other times there
will be an indication of surplus funds. This, then,
is the point where the players reconsider the
time phasing of the mission and goals and propose
an alternative. When the players agree on the
next trial of the program phasing, information is
fed into the computer and the computer proceeds
with computations to prepare a new program.
Again, the computer first explores feasibility and
then proceeds to the detailed generation of the
resource requirements.
It is seen that through a step by step process
of deliberation, discussion and computer routines,
the players will reach better and better solutions
to the reprogramming problem. It is envisioned
that programming computations will be carried
out first by a "quick and dirty" method and then
by a more accurate routine. This will allow the
players to explore tentative alternatives rapidly
and there will be no unnecessary delay in waiting
for accurate computations which would not be
utilized in actual program plans. The computer
will carry out accurate computations either
automatically (when computing time is available)
or at the special direction of the player. This
approach allows the decision makers to make
rapid changes and explore and evaluate dozens of
different program proposals. As the Decision
Game progresses, more and more satisfactory
solutions to the reprogramming problem will be
found. Towards the terminal phase of the gaming
exercise, the players may desire highly accurate
estimates of the various program details. If
this is so, it may be necessary to direct the
computer to carry out more accurate special
program computations, and it may then be necessary for the players to wait for a longer period of
time to get the phasing of programs and the
resource requirements. Finally, the computer
is directed to develop and print a definitive program which will be used as a planning document.
Computation of such a program may require hours,
and consultation with other agencies and
contractors.
So far, we have given only an outline of how
the Decision Game is to be played and described
only those phenomena that will be observed by
the players. Now we proceed to take a look inside the equipment and see how the various
logical steps, routines and computations are
carried out.

Illustration of Reprogramming Computations
The basic principle in carrying out reprogramming operations is to provide the computer with
data on possible alternatives and also with the
myriads of details on how these alternatives can
be combined into programs. The computer can be
programmed to go through a large number of calculations in an efficient fashion, and therefore
alternate programs can be generated by the computer in a matter of seconds. In order to illustrate the techniques, we will describe an extremely
simple but still significant reprogramming problem.
Figure 3 is a chart showing six different jobs
and the time phasing of the start and completion
dates of each of these jobs. In this simplified
programming Game, we are concerned only with
the monthly dollar expenditures which are shown
at the bottom of Figure 3. Suppose the player
desires (1) to accelerate by two months the accomplishment of Goal B (that is, the terminal dates
of Jobs No. 3 and 5); (2) to accelerate by three
months the final completion of the mission, that
is, of Goal A; and (3) to leave all other goals unchanged.
The computer is to determine whether such an
acceleration in the program is feasible, and what
kind of dollar expenditures would be associated
with this accelerated program.
As this reprogramming information is keyed
into the machine, the machine examines all jobs
to see which are immediately affected by the acceleration of Goals A and B. The computer selects
Jobs 3, 5 and 6 and evaluates the possibility of
accelerating those three jobs. It finds that the
time spans of Jobs 3 and 5 are to be compressed
by two months and that of Job 6 by one month.
At this point, the computer seeks information
on alternative ways of accomplishing Jobs 3, 5,
and 6. As the computer consults the file of alternatives, it finds for each job the time -cost relationship shown in Figure 4. The horizontal axis
shows alternative time spans allowed for the job,
the vertical axis shows the total dollars that must
be expended, if the job is to be accomplished in
the time specified. It is seen, for instance, that
a crash program--doing the job in the shortest
possible time--requires more total funds than a
more orderly and efficient execution of the task.
In the case of a stretch-out, due to overhead and
some other supporting activities, the total cost of
the job would also increase. The computer also
finds how these dollars would be expended in time.
(Dotted lines in Figure 4.) The file of alternatives
has curves of this type for each of the jobs and
therefore the computer can establish that the jobs
can indeed be accelerated to the desired time span,
but that a higher expenditure of funds is required.
Using this information, the computer can replace
the previous budgets for Jobs 3, 5, and 6 with the
new budgets and determine a new dollar profile
associated with the accelerated program. We see
that when the computer reprograms, it first proceeds through these computational steps and then
transmits the information to the display devices.
The player can visually observe the required
funding associated with the accelerated program.
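The elementary reprogramming computation just described can be pictured as a small, table-driven routine. The sketch below is a modern illustration only, not the system's actual program; the job names, the time-cost figures, and the rule of spreading each job's budget evenly over its months are invented for the example.

# A minimal sketch of the elementary reprogramming step described above:
# each affected job's budget is replaced by the total cost read from its
# time-cost curve at the compressed time span, and a new monthly dollar
# profile is produced.  All numbers and the even-spreading rule are
# hypothetical, for illustration only.

# File of alternatives: job -> {duration in months: total dollars}
time_cost = {
    "Job 3": {10: 120, 8: 150},   # compressed by two months
    "Job 5": {12: 200, 10: 240},  # compressed by two months
    "Job 6": {6: 90, 5: 105},     # compressed by one month
}

# Original and accelerated durations (months) for the affected jobs.
original = {"Job 3": 10, "Job 5": 12, "Job 6": 6}
accelerated = {"Job 3": 8, "Job 5": 10, "Job 6": 5}


def monthly_profile(plan):
    """Spread each job's total cost evenly over its duration (assumed rule)."""
    horizon = max(plan.values())
    profile = [0.0] * horizon
    for job, months in plan.items():
        rate = time_cost[job][months] / months
        for m in range(months):
            profile[m] += rate
    return profile


old_profile = monthly_profile(original)
new_profile = monthly_profile(accelerated)
print("old monthly dollars:", [round(x, 1) for x in old_profile])
print("new monthly dollars:", [round(x, 1) for x in new_profile])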
We recognize that in a real problem we would
deal with a much more complicated set of routines. Manpower profiles would have to be computed, facilities loadings would have to be checked,
and many other items of information on compatibility
would have to be considered. In the case of prototype production, or in other tasks where quantities are involved, relationships dealing with
"quantity made" would have to be included in the
analysis. Basically, however, these considerations would only complicate (admittedly to a
great extent) the routines that the computer would
have to go through, but, conceptually, reality
would not add significant new difficulties to the
method of solution.
The time-cost relationships as shown in Figure
4 form the basis of the file of alternatives that a
computer has to consult. As we already mentioned, there are types of problems where more
complex mathematical models form the building
blocks for the file of alternatives. However, for
purposes of our discussion, we will concentrate
on the concept of time-cost relationships and we
will show how such relationships can be generated.
We will show how the basic input data is to be
obtained and how these data can be built into the
appropriate files for representing various alternatives that the programming task may require.

Concept of Alternatives
Let us reiterate the type of information we
seek. The player moves some of the gaming goals
in time, and certain jobs must be performed within the time limits indicated by the player. We
need to find a way to determine the dollar requirements associated with the various alternatives.
Let us begin by considering a relatively simple
job or task. Suppose that there is a single manager in charge, and let us assume that this manager has a good grasp of all the details involved
in this particular task. The manager does his
own planning with paper and pencil and by discussions with his associates. We ask him to determine how much it would cost to perform this job
in an "orderly" fashion. After studying the problem, he estimates manpower, material, overhead
and dollar requirements. In Figure 5, the financial information is shown in graphical form. On
the horizontal axis we show the time allowed to
complete the task; on the vertical axis, we show
the associated effort (say, dollars per week) required. "Orderly" performance of the task is
represented by the "most efficient" point in the
chart. We also ask the manager to determine
what it would take to complete the job on a crash
basis. He would need more men and more resources,
and he would require a larger effort, but he could complete the job in a shorter time. This crash program is shown in our chart in Figure 5 by the
"minimum time" point. We can also ask him to
determine the minimum level of effort required
to do the job at all. He needs two mechanical
engineers, an electronic expert, a technician, and a secretary. This establishes his minimum effort
level and gives the "minimum effort" point in
Figure 5. We connect the three points by a curve
and obtain a time-effort relationship, and we
assume that we could also operate at intermediate points on this curve. With the aid of the
curve shown in Figure 5, we can determine
the time-cost relationship shown in Figure 4.
All we have to do is to multiply the rate of effort
by the time required for the job to get total
costs.

We see, then, that we have a technique to get
time-cost relationships, at least for relatively
simple jobs. However, if we want to extend
this technique to more complex tasks, we run
into problems. It is difficult or impossible to
find managers who have all the details of a complex job. Consequently, in order to make cost
estimates, the manager must work with his subordinates and must combine in a complex fashion
many items of information. This combination of
data is a tedious and difficult job, but it is precisely
the kind of task that computers can execute with
great efficiency. Therefore, we propose to prepare time-cost curves for complex jobs with the
aid of computers. We will show how, with the
aid of mathematical models and a sub-optimization
technique, one can construct time-cost relationships.
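The conversion from the manager's three estimates to a time-cost curve can be sketched as follows. The particular week and dollars-per-week figures are hypothetical, and linear interpolation between the estimated points is an assumption made only for this illustration.

# A minimal sketch of turning a time-effort curve (Figure 5) into a
# time-cost curve (Figure 4): total cost = rate of effort x time allowed.
# The three anchor points (minimum time, most efficient, minimum effort)
# and the linear interpolation between them are assumed for the example.

# (time allowed in weeks, effort in dollars per week)
anchors = [(2.0, 1800.0),  # "minimum time" (crash) point
           (4.0, 500.0),   # "most efficient" (orderly) point
           (6.0, 400.0)]   # "minimum effort" point


def effort_rate(weeks):
    """Interpolate dollars per week between the estimated anchor points."""
    (t0, e0), (t1, e1), (t2, e2) = anchors
    if weeks <= t1:
        lo, hi = (t0, e0), (t1, e1)
    else:
        lo, hi = (t1, e1), (t2, e2)
    frac = (weeks - lo[0]) / (hi[0] - lo[0])
    return lo[1] + frac * (hi[1] - lo[1])


def total_cost(weeks):
    """Time-cost relationship: rate of effort multiplied by time for the job."""
    return effort_rate(weeks) * weeks


for weeks in (2, 3, 4, 5, 6):
    print(weeks, "weeks ->", round(total_cost(weeks), 1), "dollars")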
Sub-Optimization Considerations

Let us take a simple combination of two jobs
which have to be performed in sequence. Various
alternate time spans are allowed either for Job
No. 1 or No.2. This implies a number of combinations of ways that the two jobs can be performed. In Figure 6 we show the problem in a
graphic way. Suppose tentatively we select a
certain duration for Job No.1, and we determine
the associated dollars required with the aid of
the time-cost relationship. In Figure 6 this time-cost relationship is represented by point A. Now
by starting with this time span, we can assign
different time spans to Job No.2. A possible
representation for Job No. 2 is point B. It is
seen that we can combine the two time-cost
curves in many different ways. In Figure 7, the
various possible time-cost curves for Job No. 2
are shown by dotted lines. Now we need a policy
to select, out of these many possibilities, the
desirable ones.
Suppose we agree that we want to complete
the two jobs within a given time span, but with
the least amount of money. Let us recognize
that when the combined time span for the two
jobs is specified, there are still many ways to
do the two jobs; out of these many possibilities
there is one that yields the lowest cost. In
Figure 7, these low-cost combinations are represented by the envelope of the dotted curves. We
say, then, that this envelope corresponds to our
policy of minimum cost, and this envelope is the
combined time-cost relationship for the two jobs
to be performed. For instance, if we wish to
complete the two jobs at point P in Figure 7, we

draw the vertical line from point P until we reach
the envelope at point Q. This gives the combined
cost of the two jobs. Working backwards from
point Q, we can get point R which represents the
time and cost requirements of Job No. 1.
The policy we used here is to perform the two
jobs with the lowest possible cost. If there is
another policy, such as, say, a constant manpower
requirement or the utilization of a facility,
each such policy would have to be programmed into the computer. The important point,
however, is that even if complex policies are
formulated, due to the high-speed capability of
the computers, consequences of these policies can
be deduced efficiently.
Actually, the computer would not construct the
envelope of the curves, but would solve the appropriate mathematical problem. It is easy to show
that the two jobs are to be combined in such a
fashion that the following equation holds:

dC1/dT1 = dC2/dT2     (1)

Here on the left-hand side we have the derivative
of the time-cost relationship for the first task and,
on the right-hand side, the derivative of the time-cost relationship for the second task.
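A sketch of the reasoning behind equation (1), written in LaTeX notation and supplied here for clarity (it is not spelled out in the text above), under the assumption that the combined time span T = T1 + T2 is held fixed and the total cost C1(T1) + C2(T2) is minimized:

% Minimize the combined cost of the two jobs for a fixed overall time span T.
% C_1, C_2 are the time-cost relationships; T_1 + T_2 = T.
\begin{align*}
  \min_{T_1+T_2=T}\; C_1(T_1) + C_2(T_2)
  &\;\Longrightarrow\;
  \frac{d}{dT_1}\Bigl[C_1(T_1) + C_2(T - T_1)\Bigr] = 0 \\
  &\;\Longrightarrow\;
  \frac{dC_1}{dT_1} - \frac{dC_2}{dT_2} = 0
  \quad\Longrightarrow\quad
  \frac{dC_1}{dT_1} = \frac{dC_2}{dT_2}.
\end{align*}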
The computer would compute these derivatives,
select the appropriate combinations of the tasks
and generate the new time-cost relationships.
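The envelope construction can also be carried out numerically. The sketch below brute-forces the minimum-cost split of a fixed combined time span between two jobs, which amounts to tracing the envelope of Figure 7; the two quadratic cost curves are hypothetical stand-ins for entries in the file of alternatives, not data from the system.

# A minimal numerical sketch of the sub-optimization step: for each combined
# time span T, find the split T1 + T2 = T that minimizes C1(T1) + C2(T2).
# The quadratic time-cost curves below are hypothetical examples only.

def c1(t1):            # time-cost relationship for Job No. 1 (assumed)
    return 100.0 + (t1 - 6.0) ** 2 * 8.0

def c2(t2):            # time-cost relationship for Job No. 2 (assumed)
    return 80.0 + (t2 - 4.0) ** 2 * 12.0

def combined_cost(total, step=0.01):
    """Minimum-cost way of doing both jobs within `total` weeks (the envelope)."""
    best = None
    t1 = step
    while t1 < total:
        cost = c1(t1) + c2(total - t1)
        if best is None or cost < best[1]:
            best = (t1, cost)
        t1 += step
    return best

for total in (6, 8, 10, 12):
    t1, cost = combined_cost(total)
    # At the optimum the marginal costs dC1/dT1 and dC2/dT2 agree
    # (equation (1)), apart from the granularity of the search.
    print(f"T = {total}: T1 = {t1:.2f}, T2 = {total - t1:.2f}, cost = {cost:.1f}")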
In Figure 8, we show a somewhat more complicated problem when a sequence of jobs is to be
performed. Here it can be shown that the following equation must hold:

dC1/dT1 = dC2/dT2 = dC3/dT3 = ...

Figure 1. Basic Gaming Model

Figure 2. Man-Machine System (program network tube, resources requirements tube, control panel, computer, data and program reservoir, man-machine communication tube)

Figure 3. Elementary Programming Game (Jobs No. 1-6 and monthly dollar expenditures, 1961-1965)

Figure 4. Time-Cost Relationships (total dollars versus time in weeks)

Figure 5. Time-Effort Relationships (dollars per week versus time in weeks; minimum time, most efficient, and minimum effort points)

Figure 6. Concept of Gaming Goals

Figure 7. Positioning of Slave Goals

Figure 8. Suboptimization Technique (condition of optimality: dC1/dT1 = dC2/dT2 = dC3/dT3)

Figure 9. Positioning of Slave Goal Through Subsidiary Gaming

Figure 10. Multistage Man-Machine Decision System

Figure 11. Confidence Limits for Goals

Figure 12. Cascading Probabilities

Figure 13. Control Panel
Figure 14. Weapon Systems Expenditures (D and P and Sys-Conn expenditures, in millions of dollars, for ATLAS, MINUTEMAN, B-70, B-52, B-58, B-47, KC-135, and GAM-87, with totals for all weapons systems, limitations, and non-system totals)

Figure 15. Atlas Program (expenditures, delivery schedule, and squadron data for the ATLAS program, with 1st, 2nd, and 3rd alternates)
1.4
APPLICATION OF DIGITAL SIMULATION TECHNIQUES
TO HIGHWAY DESIGN PROBLEMS

Aaron Glickstein
Midwest Research Institute
Kansas City, Missouri

S. L. Levy
Midwest Research Institute
Kansas City, Missouri

A study of the operating characteristics of the driver-vehicle combination has yielded a general digital simulation model. This simulation model, which can duplicate traffic flow on a 17,000-ft. section of a freeway including two on-ramps and two off-ramps, can be used to economically evaluate alternate design criteria.

The simulated vehicle in the model, following decision rules based on actual traffic behavior, is allowed to maneuver through the section of freeway under study. The effects of changes in traffic volume, traffic velocity, freeway configuration, etc., can then be evaluated by noting changes in the computer output of traverse time, waiting time on ramp, volume-velocity relationship, weaving complexity, etc.

The computer simulation thus creates a duplication of the real situation at a small fraction of the cost of studying the real system. Furthermore, the simulation allows: (1) the evaluation of various freeway configurations without the expense of their construction, and (2) the performance of controlled experiments impossible to perform with the actual traffic.

Introduction

In recent years, because of the growth of the highway building program, traffic engineers have become concerned over the lack of knowledge about the nature of traffic flow. This lack of knowledge has hampered the design engineer in his ability to design an expressway which will move traffic smoothly and efficiently at a minimum cost. All too frequently, after a highway has been opened it has been found that traffic problems arise which require extensive redesign and construction.

For example, the following questions have still to be answered:

1. What should be the length of an acceleration lane?
2. What is the effect on traffic flow of locating interchange areas at various distances from one another?
3. What is the effect of various speed minimums and maximums on the traffic flow?

During the past decade a number of descriptive theories concerning vehicular traffic have been put forward. In an attempt to classify these theories, Haight7 proposed three types. The first, an analytical and deterministic model, considers the geometrical and physical characteristics of the automobile and the assumed behavior of the driver. When specific values of velocity, acceleration, and vehicle-following behavior are postulated, it is possible to describe the motion of a number of cars in one lane. The results, although precise, are limited to a small number of vehicles. Work in this area has been carried out by Pipes;15 Chandler, Herman, and Montroll;1 and Herman, Montroll, Potts, and Rothery.9

A second approach has been to consider traffic as a stochastic process and to treat it by the theory of queues. This approach has been utilized by Newell13 and Tanner.16 The theory of queues can, however, only be applied when vehicles move at essentially the same speed and when all vehicles enter the system at one point. Reasonable results have been obtained when traffic is actually queued, velocity is uniform, and the driver has few free decisions.

The third approach describes traffic flow in a continuum. This approach has been utilized by Newell,11,12 Tse-Sun Chow,18 Greenberg,6 and Lighthill and Whitham.10 All three approaches are limited by the fact that it is
assumed that vehicles do not overtake or pass
each other.
Simulation Models
It is, however, possible to construct a
simulation model for the study of traffic flow
which is much more general than the three types
mentioned. Simulation has been defined by
Harling8 as "the technique of setting up a stochastic model of a real system which neither
oversimplifies the system to the point where the
model becomes trivial nor incorporates so many
features of the real system that the model becomes untractable or prohibitively clumsy."
Early work in the field of traffic simulation was carried out by Gerlough,2 Goode,4
Wong,19 and Trautman, Davis, et al.17 Gerlough in a
pilot study simulated two lanes of a highway system one-fourth of a mile long. In this simulation model the vehicles were allowed to choose
their speeds from a three-increment distribution
of desired speeds. This program, run on the SWAC
computer, required approximately 35 to 38 seconds
of computer time to simulate one second of real
time.
Goode5 in his simulation model represented a single intersection of a simple type.
Provisions were made for the origination of cars
in lanes approaching the intersection, and for a
stochastic mechanism for the determination of the
direction of travel of any particular car entering the intersection (i.e., right, straight
ahead, or left). The measure of effectiveness
used was the average delay per car. The ratio of
computer time to real time was 3½:1.
Although these early models were of
rather limited extent, they indicated that simulation procedures could be used to run controlled
experiments in order to gain a better understanding of the nature of traffic flow.
The Expressway Interchange Model
In 1958 the Midwest Research Institute,
under a grant from the Bureau of Public Roads,
started a research program on the simulation of
expressway traffic flow. The first phase of this
program was devoted to the simulation of an on-ramp area of the expressway.14 That model simulated a 1,700-ft. section of a three-lane expressway including one on-ramp. Based on the experience of this pilot study a more general model
was developed. That simulation model is described in this paper.

The Study Area

The portion of the expressway under
study is set up in a 4 x N matrix (N ≤ 999),
Fig. 1. The four rows represent: (1) the three
through lanes 1, 2, and 3, and (2) the ramp, acceleration, and deceleration Lane 5. Each of the
N blocks represents a 17-ft. section of freeway,
the approximate length of an automobile. For the
simulation runs on the computer, any value of
N ≤ 999 can be utilized. Locations of interchanges are designated as follows:

A = ramp input location
B = nose of on-ramp
C = end of acceleration lane
D = beginning of deceleration lane
E = nose of off-ramp
F = off-ramp output location

The program is very flexible and permits the on- and off-ramps to be located at any point on the
section of freeway under investigation.
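The 4 x N block matrix lends itself to a very simple representation. The sketch below is illustrative only: the value of N, the lane occupancy encoding, and the interchange block numbers (which only roughly follow Configuration I of Fig. 3) are assumptions, not the original program.

# A minimal sketch of the 4 x N study-area matrix (N <= 999): rows for the
# three through lanes and the ramp/acceleration/deceleration lane, each block
# a 17-ft. section of freeway.  0 = empty block; a vehicle id occupies a block.
# N and the interchange locations below are placeholders.

N = 200                       # number of 17-ft. blocks (N <= 999)
LANES = (1, 2, 3, 5)          # through lanes 1-3 and ramp lane 5

# matrix[lane][block] -> 0 if empty, otherwise a vehicle identifier
matrix = {lane: [0] * N for lane in LANES}

# Interchange locations, given as block numbers (Fig. 1 notation).
locations = {
    "A": 150,   # ramp input location
    "B": 145,   # nose of on-ramp
    "C": 110,   # end of acceleration lane
    "D": 90,    # beginning of deceleration lane
    "E": 55,    # nose of off-ramp
    "F": 50,    # off-ramp output location
}

def place_vehicle(vehicle_id, lane, block):
    """Put a vehicle into a block if that 17-ft. section is free."""
    if matrix[lane][block] == 0:
        matrix[lane][block] = vehicle_id
        return True
    return False

place_vehicle(vehicle_id=1, lane=5, block=locations["A"])   # a car arrives on the ramp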
Input Factors for Simulation Model
In order to obtain a true duplication
of actual traffic behavior on the freeway the
simulation model should contain all factors which
influence traffic behavior. In this model the
following factors are considered:

1. The volume of entering and exiting vehicles.
2. The distribution of vehicles to lanes.
3. The velocity distribution of vehicles.
4. The gap acceptance distribution of merging and weaving vehicles.
5. The acceleration of entering vehicles.
6. The deceleration of exiting vehicles.
7. The distribution to lanes of exiting vehicles.

In addition, all vehicles are allowed to shift
lanes in order to pass slower-moving vehicles in
front of them.
Simulation Procedure

Basically, the procedure consists of
simulating the arrivals of cars into the section
of highway under consideration and then controlling the action of the vehicle by a set of decision processes. During each second of real time
each vehicle in the matrix is examined. The
vehicle is allowed to advance, weave, merge, accelerate, decelerate, or exit according to logical rules describing the behavior of actual vehicle-driver combinations. Just prior to examining all vehicles at each second, each of the
input locations is evaluated. Inspection starts
at the vehicles closest to the end of the section of
highway under examination and proceeds to the vehicles in the input location.
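The scanning order just described (nearest the downstream end first, input locations evaluated just before the scan, once per simulated second) can be sketched as follows. This is a self-contained illustration only; the single lane, the arrival rate, the velocity choices, and the simple blocking rule are drastic simplifications of the detailed logic of Fig. 2.

# A minimal, self-contained sketch of the per-second scanning procedure:
# the input location is evaluated first, then every vehicle is examined
# starting from the downstream end of the section.  The movement rule
# (advance by the desired velocity unless blocked) is an assumed,
# greatly simplified stand-in for the weaving/merging rules of Fig. 2.

import random

N = 200                                   # blocks of 17 ft. each

class Vehicle:
    def __init__(self, vid, lane, block, velocity):
        self.vid, self.lane, self.block, self.velocity = vid, lane, block, velocity

def blocked(vehicles, lane, block):
    return any(v.lane == lane and v.block == block for v in vehicles)

def simulate_second(vehicles, second):
    # Evaluate the input location (block 0 of lane 1) before scanning vehicles.
    if random.random() < 0.3 and not blocked(vehicles, 1, 0):
        vehicles.append(Vehicle(f"car-{second}", lane=1, block=0,
                                velocity=random.choice([2, 3, 4])))
    # Scan vehicles nearest the end of the section first.
    for v in sorted(vehicles, key=lambda veh: veh.block, reverse=True):
        target = min(v.block + v.velocity, N - 1)
        while target > v.block and blocked(vehicles, v.lane, target):
            target -= 1                   # decelerate behind a slower vehicle
        v.block = target
    # Vehicles that reach the end of the section leave it.
    vehicles[:] = [v for v in vehicles if v.block < N - 1]

vehicles = []
for second in range(60):
    simulate_second(vehicles, second)
print("vehicles still in the section after 60 sec.:", len(vehicles))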

A description of the over-all logic involved in processing a vehicle through the system
is given in the flow diagram (Fig. 2). Detailed
flow diagrams for the computer can be obtained in
Reference No. 3.

Input Data Preparation

The following parameters are used in
the flow diagrams:

V = total volume in Lanes 1, 2, and 3, in vehicles per hour
Vi = vehicles per hour in the ith lane, where V is the sum of V1, V2, and V3
Bn = block number of the vehicle under inspection
iBn-1 = block number of the vehicle in the ith lane preceding the vehicle under inspection
iBn = block number of the vehicle in the ith lane parallel to or behind the vehicle under inspection
vn = velocity of the vehicle under inspection
ivn = velocity of the vehicle in the ith lane, parallel to or behind the vehicle under inspection
w = time gap

Simulation Output

The present model was programed so that
the following information can be obtained about
each simulation run:

1. The volume of vehicles traversing the system in each lane.
2. The volume of vehicles entering the freeway through each on-ramp.
3. The volume of vehicles exiting the system at each off-ramp.
4. The number of vehicles which stop on the acceleration lane.
5. The length of the queue on the acceleration lane.
6. The number of vehicles that desire to exit but cannot.
7. The distribution of through-vehicle traverse times.
8. The distribution of ramp-vehicle traverse times.
9. The average vehicle velocity in each lane.
10. The number of weaves from:
    a. Lane 1 to Lane 2
    b. Lane 2 to Lane 1
    c. Lane 2 to Lane 3
    d. Lane 3 to Lane 2

Computer Operating Note

To operate the program, an IBM 704 computer is needed which has at least 8,192 words of
core storage. One magnetic tape unit is needed
(Logical Tape 0), that one being used to produce
an output tape for printing on an off-line printer.

On the Computer Console, Sense Switch 4
should be depressed if it is desired to obtain in
the problem output a listing of all problem input
parameters.

To run the program, assemble the program deck with a series of Control Cards immediately following the deck. The number and type of
Control Cards will control the analyses to be run.

There are 13 different Control Cards
that can be entered, which insert the necessary
setup data describing the system configuration
and velocity and weaving distributions. Appropriate Control Cards should immediately follow
the program deck. A Problem Card, containing
data peculiar to each analysis, then follows and,
after this Problem Card, change cards for any of
the control cards and other Problem Cards may be
entered to control the running of a series of
traffic problems.
Control Card Format

The 13 Control Cards are divided into
four groups. Five cards provide the velocity
distribution data for Lanes 1, 2, 3, and the two
possible on-ramps. A second group of five cards
provides data concerning gap acceptance for weaving operations. A single card specifies the geometry of the model: the length of the through
lanes, the ramp entrance and acceleration lane
geometry for both on-ramps, and the deceleration
lane geometry and ramp exit for both off-ramps.
A fourth group, containing two cards, gives data
concerning the exiting decision process; one card
for off-ramp No. 1 and the other card for off-ramp No. 2.

The Problem Card contains a test identification number, the length of time the
analysis is to be run, the volume of through
traffic, and the input volume to each of the two
on-ramps.

Detailed description of Control Card
format is given in Reference No. 3.

Study of Interchange Configuration

Various types of controlled experiments
can be carried out using this simulation model.
For example, experiments on the effects on traffic
flow of (1) various on-ramp vehicle volumes, (2)
various acceleration lane lengths, (3) various velocity distributions, (4) various geometric configurations, and (5) combinations of the above,
can be carried out with this model.

Experiments have been carried out on
the effect on traffic flow of the spacing between an
on-ramp and an off-ramp, under varying traffic
volumes.

Interchange Configuration

Two interchange configurations (Fig. 3)
were examined. Each configuration was 200 blocks,
or 3,400 ft., long and contained one on- and off-ramp combination. Each acceleration and deceleration lane was 595 ft. long. In Configuration I,
exiting vehicles have 2,465 ft. to travel to the
nose of the off-ramp, while in Configuration II
the distance is 3,230 ft. In Configuration I,
the distance between the acceleration and deceleration lane is 340 ft., and in Configuration II it is 1,870 ft. All exiting vehicles were designated at
block number 199.

Input Data

Volume of traffic: For both configurations, the input to the simulation was 750
ramp vehicles per hour. For Configuration I, experiments with through-lane volumes of 2,000,
3,000, 4,000, 5,000 and 6,000 vehicles per hour
were conducted. For Configuration II, experiments
with through-lane volumes of 1,000, 2,000, 3,000,
4,000, 5,000 and 6,000 vehicles per hour were conducted. Two tests
were conducted at each volume.

Distribution of volume to lanes: In
all experiments carried out, the traffic volumes
were assigned to the three lanes according to the
following relationships:

P1 = 0.43693 - 0.22183α + 0.05730α² - 0.00046α³     (1)

P2 = 0.48820 - 0.03136α + 0.00006α² + 0.00024α³     (2)

P3 = 0.07487 + 0.25319α - 0.05736α² + 0.00382α³     (3)

where

Pi = proportion of total volume in the ith lane
α = total freeway volume in thousands of vehicles per hour

Velocity distributions: Three different velocity distributions were utilized in the
simulations. One velocity distribution was used
for the on-ramp vehicles, a second for the vehicles in Lane 1, and a third for vehicles in Lane 2
and Lane 3. These velocity distributions are
presented in Table I.

TABLE I

Input Velocity Distributions

Vehicle Velocity          Cumulative Per Cent
(blocks/sec.)        Ramp      Lane 1    Lanes 2 and 3
1.50                 0.026     0.001     0.003
2.00                 0.061     0.010     0.015
2.38                 0.116     0.040     0.034
2.81                 0.314     0.180     0.064
3.25                 0.683     0.435     0.138
3.69                 0.888     0.737     0.322
4.13                 0.979     0.899     0.759
4.56                 0.994     0.989     0.970
5.00                 0.998     0.999     0.998
5.44                 1.000     1.000     1.000
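Relationships (1)-(3) above and the cumulative distributions of Table I translate directly into a sampling step for each generated vehicle. The sketch below is illustrative only: the random draws, the linear search over the cumulative table, and the renormalization of the three proportions (a guard against rounding in the printed coefficients) are assumptions, not the authors' routine.

# A minimal sketch of assigning an entering through vehicle a lane from
# relationships (1)-(3) and a desired velocity from Table I.  The sampling
# mechanics and the renormalization step are assumed for illustration.

import random

def lane_proportions(alpha):
    """P1, P2, P3 versus total freeway volume alpha (thousands of veh/hr),
    renormalized so the three proportions sum to one."""
    p1 = 0.43693 - 0.22183 * alpha + 0.05730 * alpha**2 - 0.00046 * alpha**3
    p2 = 0.48820 - 0.03136 * alpha + 0.00006 * alpha**2 + 0.00024 * alpha**3
    p3 = 0.07487 + 0.25319 * alpha - 0.05736 * alpha**2 + 0.00382 * alpha**3
    total = p1 + p2 + p3
    return p1 / total, p2 / total, p3 / total

# Table I, Lanes 2 and 3 column: (velocity in blocks/sec., cumulative per cent).
LANES_2_3 = [(1.50, 0.003), (2.00, 0.015), (2.38, 0.034), (2.81, 0.064),
             (3.25, 0.138), (3.69, 0.322), (4.13, 0.759), (4.56, 0.970),
             (5.00, 0.998), (5.44, 1.000)]

def sample_lane(alpha):
    p1, p2, _ = lane_proportions(alpha)
    r = random.random()
    return 1 if r < p1 else (2 if r < p1 + p2 else 3)

def sample_velocity(cumulative_table):
    r = random.random()
    for velocity, cum in cumulative_table:
        if r <= cum:
            return velocity
    return cumulative_table[-1][0]

alpha = 4.0                     # 4,000 vehicles per hour of through traffic
print("lane:", sample_lane(alpha), "velocity:", sample_velocity(LANES_2_3))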

Gap acceptance distributions: Three
different gap acceptance distributions were utilized in the simulation tests. The distributions
for merging vehicles (both stopped and moving)
are presented in Table II. The gap acceptance
data for vehicles weaving between lanes are presented in Table III.
TABLE II

Gap Acceptance for Merging Vehicles

Size of Gap     % of Total No. of Moving     % of Total No. of Stopped
(sec.)          Vehicles Accepting Gap       Vehicles Accepting Gap
0.00-1.00                12.5                          0.0
1.01-2.00                56.8                          2.8
2.01-3.00                77.0                         17.4
3.01-4.00                95.1                         36.0
4.01-5.00                97.0                         64.9
5.01-6.00                98.0                         95.0
6.01-7.00               100.0                        100.0
7.01-8.00               100.0                        100.0
8.01-9.00               100.0                        100.0
9.01-10.00              100.0                        100.0
TABLE III

Gap Acceptance Distribution for Weaving Vehicles

Length of Gap (w)     Cumulative Probability
(sec.)                of Acceptance
0.00-0.25                    0.00
0.26-0.50                    0.00
0.51-0.75                    0.00
0.76-1.00                    0.00
1.01-1.25                    0.10
1.26-1.50                    0.30
1.51-1.75                    0.60
1.76-2.00                    1.00
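A merging or weaving decision can be read off tables of this kind as a random draw against the acceptance probability for the measured gap. The sketch below is illustrative only; the bracket lookup and the Bernoulli draw are assumptions, not the authors' program.

# A minimal sketch of a gap-acceptance decision based on Tables II and III:
# the measured time gap selects an acceptance probability, and a random
# draw decides whether the merge or weave is made.

import random

# Table II, per cent of vehicles accepting a gap, by 1-second gap brackets.
MERGE_MOVING  = [12.5, 56.8, 77.0, 95.1, 97.0, 98.0, 100.0, 100.0, 100.0, 100.0]
MERGE_STOPPED = [0.0, 2.8, 17.4, 36.0, 64.9, 95.0, 100.0, 100.0, 100.0, 100.0]

# Table III, cumulative probability of acceptance, by 0.25-second gap brackets.
WEAVE = [0.00, 0.00, 0.00, 0.00, 0.10, 0.30, 0.60, 1.00]

def accepts_merge(gap_seconds, stopped):
    table = MERGE_STOPPED if stopped else MERGE_MOVING
    bracket = min(int(gap_seconds), len(table) - 1)     # 0.00-1.00 -> index 0, ...
    return random.random() < table[bracket] / 100.0

def accepts_weave(gap_seconds):
    bracket = min(int(gap_seconds / 0.25), len(WEAVE) - 1)
    return random.random() < WEAVE[bracket]

print(accepts_merge(3.5, stopped=True))    # 36.0 per cent chance of acceptance
print(accepts_weave(1.6))                  # 0.60 probability of acceptance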

Exiting vehicles: For both freeway configurations, 10 per cent of the through vehicles
were designated as exiting vehicles. These exiting vehicles were further allocated to the three
lanes according to the following schedule:

1. 90 per cent of exiting vehicles in Lane 1 at Block No. 199;
2. 9 per cent of exiting vehicles in Lane 2 at Block No. 199; and
3. 1 per cent of exiting vehicles in Lane 3 at Block No. 199.

Experimental Results

The controlled experiments described in
the previous section were carried out on a 704
digital computer. The ratio of computer time to
real time varied from 1 to 5 for low traffic volumes, to almost 1 to 1 for the higher volumes.

The over-all results of the experiments
indicated that there were no significant effects
on the traffic flow patterns of the freeway as a
result of this change in geometric configuration.
A comparison between the through-vehicle traverse
times for Configurations I and II is shown in
Fig. 4. The effect of increased volume on the
number of weaves between lanes is shown in Fig.
5. The computer output also indicated that the
number of vehicles stopping on the acceleration
lane increased with increased volumes of traffic,
Fig. 6.
Conclusions and Recommendations

This study has shown that digital simulation can be used to faithfully duplicate actual
traffic flow in an on-ramp and off-ramp area of a freeway. The output of the simulation, furthermore,
gives measures of effectiveness which can be used
to evaluate alternate highway designs.

The simulation experiments performed
indicate the need for research and experimentation in a wide variety of areas to answer questions such as the following:

1. What is the effect of various vehicle distributions to lanes on the traffic flow?
2. What is the effect on traffic flow of various distances between adjoining on-ramps?
3. What is the effect of various desired velocity distributions on traffic flow?
4. At what volume of traffic do weaving movements between lanes become hazardous?
5. What is the effect on traffic flow of various volumes of commercial vehicles?

The simulation model developed in this
study can serve as an efficient tool to answer
these and other problems in the continuous quest
for means of moving traffic safely and efficiently.
Acknowledgement

The authors would like to acknowledge
the valuable work contributed by Mr. L. Findley
in programing the computer.

Bibliography
1. Chandler, Herman, and Montroll, "Traffic Dynamics: Studies in Car Following," Operations Research Journal, March-April, 1958.

2. Gerlough, D. L., "Simulation of Freeway Traffic by an Electronic Computer," Proceedings of the Highway Research Board, Vol. 35, 1956.

3. Glickstein, A., Findley, L. D., and Levy, S. L., "A Study of the Application of Computer Simulation Techniques to Interchange Design Problems," paper presented at the 40th Meeting of the Highway Research Board, January 9-13, 1960.

4. Goode, H. H., and True, W. C., "Vehicular Traffic Intersections," paper presented at the 13th National Meeting of the Association for Computing Machinery, June 11-13, 1958.

5. Goode, H. H., "The Application of a High-Speed Computer to the Definition and Solution of the Vehicular Traffic Problem," Operations Research Journal, December, 1957.

6. Greenberg, H., "Analysis of Traffic Flow," Operations Research Journal, January-February, 1959.

7. Haight, F. A., "Towards a Unified Theory of Road Traffic," Operations Research Journal, November-December, 1958.

8. Harling, J., "Simulation Techniques in Operations Research," Operations Research Journal, May-June, 1958.

9. Herman, Montroll, Potts, and Rothery, "Traffic Dynamics: Analysis of Stability in Car Following," Operations Research Journal, January-February, 1959.

10. Lighthill and Whitham, "On Kinematic Waves, I and II," Proceedings, Royal Society of London, May, 1955.

11. Newell, G. F., "Mathematical Models for Freely Flowing Highway Traffic," Brown University, December, 1954.

12. Newell, G. F., and Bick, J. H., "A Continuum Model for Traffic Flow on an Undivided Highway," Division of Applied Mathematics, Brown University, July, 1958.

13. Newell, G. F., "Statistical Analysis of the Flow of Highway Traffic Through a Signalized Intersection," Quarterly of Applied Mathematics, January, 1956.

14. Perchonok, P., and Levy, S. L., "Application of Digital Simulation to Freeway On-Ramp Traffic Operations," Proceedings of the Highway Research Board, Vol. 39, 1960.

15. Pipes, L. A., "An Operational Analysis of Traffic Dynamics," Journal of Applied Physics, March, 1953.

16. Tanner, J. C., "A Problem of Interference Between Two Queues," Biometrika, June, 1953.

17. Trautman, D. L., Davis, H., Heilfron, J., Er-Chun Ho, Mathewson, J. H., and Rosenbloom, A., "Analysis and Simulation of Vehicular Traffic Flow," Institute of Transportation and Traffic Engineering, University of California, Research Report No. 20, 1954.

18. Tse-Sun Chow, "Operations Analysis of a Traffic Dynamics Problem," Operations Research Journal, November-December, 1958.

19. Wong, S. Y., "Traffic Simulation with a Digital Computer," Proceedings, Western Joint Computer Conference, February, 1956.
Figure 1. Study Area Matrix for Computer Simulation of an Interchange Area (Lanes 1, 2, 3, and 5; A = ramp input location, B = nose of on-ramp, C = end of acceleration lane, D = beginning of deceleration lane, E = nose of off-ramp, F = off-ramp output location, I = through-lane input, O = through-lane output)

Figure 2. Flow Diagram of Over-all Computer Logic (read data cards; clear output data registers; move all cars ahead by one time increment and perform necessary weave operations; accumulate queue data)

Figure 3. The Configuration of the Two Sections of Freeway Being Examined (each 200 blocks, or 3,400 ft., long; Configuration I: B at 2,465 ft. and C at 1,870 ft.; Configuration II: B at 3,230 ft. and C at 2,635 ft.)

Figure 4. Cumulative Distribution of Through Vehicle Traverse Times at a Freeway Volume of 2,000 V.P.H. (Lanes 1 and 3, Configurations I and II)

Figure 5. The Number of Weaves in 5 min. on a 3,400-ft. Section of Freeway as a Function of Total Through Volume (Configuration I)

Figure 6. The Relationship Between the Per Cent of Vehicles Stopping on the Acceleration Lane (Y) and the Total Through Volume (X), ramp input 750 V.P.H. (Configuration I: Y = -8.3100 + 0.0059X; Configuration II: Y = -0.2954 + 0.0035X)
1.5
THE USE OF MANNED SIMULATION IN THE DESIGN
OF AN OPERATIONAL CONTROL SYSTEM
M. A. Geisler
W. A. Steger
Logistics Department
The RAND Corporation
Santa Monica, Calif.
Summary
This paper describes the general features of
the planning and operations phases of a new weapon
system. The uncertainties inevitable in planning
mean that considerable effort is made during the
operations phase to adjust the weapon system and
its resources to the actual environment it finds
so as to attain the desired level of operational
capability. The adjustment mechanism is called
an operational control system in this paper.
Elements of such an operational control system are
described.
The proposal is made that a better control
system can be designed if simulation is used to
help design it during the planning phase. The
use of simulation will not only produce a better
control system earlier but it will permit the
planners to adjust the other resources provided
for the weapon system so that they are compatible
with the environment and the control system. An
example of such a study is described in this paper.
I. Introduction

The military structure is going through tremendous technological change. Successive generations of new weapon systems are coming along at
a fast rate, and each contains such different
characteristics that in many ways past experience
is of little use. Furthermore, the competition
among nations in military technology is causing
efforts to reduce the length of the defense
planning cycle, which complicates the necessary
process of coordination and interaction that is
required to produce consistent and compatible
hardware, operational plans, and support plans.
These two factors, compression in planning and
rapid technological change, mean that the planning
process is becoming more difficult and its results
therefore less dependable. The purpose of this
paper is to suggest that the design of better
operational control systems is one way of contending with this planning difficulty, and further,
that the design of such systems will be considerably aided by the use of simulation techniques.
In developing this thesis, this paper covers first
a discussion of such a system, the use of simulation for designing a specific system, an example
of such a designed system, how the simulation of
the operational process was used to improve the
planning decisions, the role of computers in the
simulation and finally, some of the limitations
of the technique.

II. Need for an Operational Control System

It takes a long time to develop, procure,
and install modern weapons that are needed to
satisfy a future operational requirement. In
Chart 1, we try to define in general terms the
process that leads to the creation of such systems.
As can be seen, there are two phases: a planning
phase and an operational phase. The planning for
new weapon systems takes place over several years.
It usually begins with a formulation of the future
environment for which a new weapon is to be developed. This environment may describe the operational Situation, or requirements foreseen, and
possible enemy intentions and capability. This
environment is then used as a guide to define the
hardware characteristics of the projected weapon
system, the operational plans for use of the
weapon, and the logistics or support plan of the
weapon for supporting its operation. This planning involves many organizations, many people,
and as we have said, takes a long time, perhaps
several years, because it is an involved process.
Necessarily, many of the factors developed
for planning use are subject to much uncertainty,
and it is very difficult to define all the relevant relationships affecting both the operating
and support characteristics and functions of the
weapon system. Nevertheless, these plans are used
to determine resource and budget estimates for the
new weapon system. These estimates then go
through a budget and procurement cycle which takes
additional years. We thus see that the planning
for new weapon systems involves many organizations,
and requires many assumptions about the uncertain
future, technical capabilities, and enemy intentions.
The operations phase begins when the resources
in the form of weapons, people, equipment, etc.,
resulting from the planning cycle are produced.
These resources are now confronted with the actual
environment. For many reasons, there are many
difficulties involved in fitting the actual environment and existing resources together to achieve
a satisfactory level of operational capability.
For one thing, the predicted environment and the
actual environment differ simply because of predictive errors. The enemy capabilities are now
different from what we expected; the operational
situation is either more or less demanding than
that predicted; furthermore, the actual resources
may differ appreciably both in quality and quantity
from those planned. The budget cycle provided
funds for fewer resources than expected; the
reliability of the weapon system is not as great
as projected; and the cost of the weapons and
other resources are greater than planned.
Thus, the major activity during the operations phase is to mesh and adjust the actual
environment and the existing resources so that the
resulting operational capability is as great as
possible. The system performing this meshing and
adjusting is called an operational control system.
It performs this activity, first, by using the
resources available at any one time to obtain as
high a level of operational capability as possible, and second, by specifying the adjustments in
future resources that will improve operational
capability. Thus, the operational control system
enters directly into the determination of both
operational capability and efficient resource
determination and utilization. In this respect,
the control system can be treated as another
resource of the total system; and therefore, its
contribution to operational capability and its
cost can be weighed against that of all other
resources.
With this in mind, there are three main
points we should like to make about an operational
control system. First, it is an integral part of
the process that determines operational capability,
and its use is an important means for contending
with planning uncertainty. Second, it is a complex system in its own right, and its creation
may take a long time and substantial resources. Third,
if one has a means of predicting the operational capability, this information can be used
during the planning cycle to adjust resource requirements. A more responsive, accurate, and comprehensive control system may require less total
resources to achieve a given level of operational
capability.
It is the premise of this paper that simulation early in the life of a weapon system can
predict (to some degree) the contribution of the
operational control system to the preferred mix of
resources (including the control system) for the
operational weapon. If the composition of this
preferred mix can be known during the planning
process, such information can influence resource
policies and future requirements.

III. General Description of an Operational Control System

Chart 2 helps to describe a simplified version of the elements and functions of such an
operational control system. At any given point in
time, the status of the environment and the existing resources represent what we call the current
system status. These include such data as the
operational condition of the weapon, work assignments of personnel, equipment location and uses,
etc. The operational control system works on this
status to improve the future capability of the
weapons system. Since the actual status may be a
complex of vast geographic distances, thousands of
personnel, many million dollars of weapons, equipment, and faCilities, the control system creates
an abstract representation to help it to Inanipulate and to improve the actual system.
This abstract representation is obtained
through an information system which receives inputs as changes in the current system status occur,
such as weapon malfunctions, personnel availability, etc. These inputs are then put through a
data-processing system which contains a series of
rules and procedures for manipulating the resulting
data, using appropriate computing equipment to
produce relevant outputs. These outputs are given
in the form of reports or displays to appropriate
managers in the control system. These may show
weapon statuses, personnel shortages, equipment
downtime, etc. Thus, a major function of the
information system is to boil down or transform
the very complex real system to a few elements
that can be grasped and evaluated by a management
group or organization. This evaluation leads to
decisions which then result in management actions
that change the future system status in a way that
is intended to increase operational capability.
Thus, the control system consists primarily of a
set of policies or decision rules for achieving
increased operational capability, some of which
are automated as part of the information system,
an organization for decision making and taking
management actions, and an information system.
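The cycle just described (status inputs, data processing by rules, reports to managers, decisions, and actions that change future status) can be pictured as a loop. The sketch below is purely illustrative; the status fields, decision thresholds, and actions are invented for the example and are not taken from the system under study.

# A minimal sketch of the operational control loop described above: status
# changes flow through a data-processing step that applies decision rules,
# the resulting report goes to management, and the chosen actions modify
# the future system status.  All fields, rules, and thresholds are assumed.

system_status = {
    "missiles_on_alert": 8,
    "missiles_down": 2,
    "spare_guidance_units": 1,
}

def process_inputs(status):
    """Data-processing system: boil the status down to a few report elements."""
    return {
        "alert_rate": status["missiles_on_alert"]
                      / (status["missiles_on_alert"] + status["missiles_down"]),
        "spares_short": status["spare_guidance_units"] < status["missiles_down"],
    }

def management_decision(report):
    """Decision rules (some of which could be automated) acting on the report."""
    actions = []
    if report["alert_rate"] < 0.9:
        actions.append("expedite repair of down missiles")
    if report["spares_short"]:
        actions.append("requisition additional spare guidance units")
    return actions

def apply_actions(status, actions):
    """Management actions change the future system status."""
    if "requisition additional spare guidance units" in actions:
        status["spare_guidance_units"] += 2
    if "expedite repair of down missiles" in actions:
        repaired = min(1, status["missiles_down"])
        status["missiles_down"] -= repaired
        status["missiles_on_alert"] += repaired

report = process_inputs(system_status)
apply_actions(system_status, management_decision(report))
print(report, system_status)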
IV. Difficulties in Designing
Control Systems for the Future

The control systems that are being developed
or conceived by the Air Force to enable major
commands to control lower echelons have characteristics which far transcend the hardware elements that such systems must necessarily possess.
They are man-machine systems, and although the requirements and availability of hardware to be used
in such systems is obviously very relevant, the
demands upon the man in the system are at least as
important to determine.

In other words, an essential requirement in
the design of control systems is the need to define
the role of man and his relationship to the hardware. The difficulty in doing this arises from
several circumstances. First, the control systems
require several years to develop and produce, the
most ambitious taking from 5 to 10 years. It is
therefore necessary for the designer to project
his requirements at least that far in advance.
With the rapidly changing military technology, the
environment within which the control systems must
operate will be radically new and will create conditions that are very different from those existing
today. Much higher alert requirements are likely,
with a further need for fast response to critical
situations, under conditions of wide dispersion.
Radical weapons, including missiles, possibly
space vehicles, and nuclear powered vehicles are
in the offing.

Not only will the environment be very different, but because of the potentialities of new
communicating hardware, such as data processing
and communications equipment, and the developing
research in organization and information theory,
the designer finds it very difficult to project
himself readily into this highly complex and different situation in order to take maximum advantage of these new factors in the design of the
system. The relationships involved in these complex man-machine systems are often far too involved for intuitive or analytical treatment alone.
Although we can be sure that the environment
of the next 5 to 10 years will be very different,
we do not know the exact specifications of it.
There are many uncertainties in the rate at which
new weapons will be developed, both of an offensive and defensive nature, and the size and kind
of enemy threat. These conditions as well as
many others, can differ enough so that the demands
on the men and machine in the control systems can
only be estimated with uncertainty. However, the
designer needs to know in detail the impact of
this uncertainty upon the specifications of the
control system so that the hardware manufacturers,
the programmers, the training activities, etc. can
carry out their assignments in a timely, coordinated, and comprehensive way. Thus, the core
problem in the design of control systems is planning complex man-machine systems, requiring several
years to produce, under conditions of uncertainty.
In the remaining portion of this paper, we will
discuss some of the general aspects of the design
of control systems, and how simulation, as used
in RAND's Logistics Systems Laboratory, can help
in treating this core problem.
V. Use of Simulation

The definition of the elements of a control
system in some detail produces the design of a
feasible and compatible operational control system.
The creation of this design is no simple task
because it requires the designer to specify with
some precision just how the decision maker in the
operating system will act to increase his operational capability. The designer, therefore, has
to visualize how the operating system would work.
If he had an operating system to study, he could
obtain such a visual aid, but the one he is designing will not exist for several years, and he cannot wait. Typically, in the past this lack of an
operating system to study and manipulate has led
to inappropriate, incomplete, and infeasible
system designs. The gaps and defects were then
left to actual operations to correct. This is
both an inefficient and lengthy procedure which
results in delays and difficulties in obtaining
the maximum capability with the weapon system.
The point of this paper is that if the operating
system can be simulated in appropriate detail and
under realistic conditions, the simulation can help fill the void at least partially and perhaps,
therefore, help produce better designs, with resulting lesser stresses for the real-world adjustment.
In recent work at the RAND Logistics Systems

Laboratory, efforts were made to use simulation to
help in this planning process and to design an
operational control system. We reasoned that if
we could simulate the environment of a future
weapon system in sufficient detail we could then
take the proposed hardware characteristics, operating plans, and support plans and determine their
mutual compatibility, and, furthermore, that this
would help us to estimate the operational capability they would produce for the weapon system.
We used simulation because no real-world opera~ng
system existed, and yet we wanted to study an operating system early enough in the planning process
to be able to help evaluate and possibly influence
the proposed plans. Since we realized that our
simulation of a future environment was subject to
uncertainty and error, we varied it over a range
of foreseeable conditions and hoped, in this way,
to allow for our ignorance of the future. Second,
we exposed the policies of the system to the
range of simulated environments, and in this way
determined their compatibility and overall performance. Third, we also used the simulated environment to help specify the details of the control
system design, and by using this control system
in conjunction with the other policies, we were
able to determine how the particular control system which we employed modified the other policies,
and therefore affected the performance effectiveness and resource requirements of the total system.
We believe that this study produced many useful results, both in the specific context of the
weapon system we used, and in the techniques that
evolved, which we believe have an application to
many other such weapon systems, since each of
these weapons is characterized by the planning
difficulties we have described. We should like,
therefore, in the remainder of this paper to
describe the simulation experiment that helped to
produce a design and an evaluation of an operational control system. This experiment is known
as LP-II, and one of its primary objectives was
the design and evaluation of a control system for
an ICBM weapon system of the 1963-1965 time period.
The control system that was developed is for a
tactical unit, such as a squadron or wing, for two
reasons: first, the ultimate effectiveness of the weapon system depends on performance of this unit,
and second, the bulk of the resources and cost
of the weapon system are also in the tactical
unit.
VI. Description of Simulated System

Chart 3 contains a schematic diagram of the
simulated system. The first requirement was to
define the hardware of the weapon system under
study. Engineering study and analysis gave us
estimates of the missile configuration likely in
the 1963-1965 time period. The long time required
to convert hardware specifications into field
equipment gave us some assurance that we had captured the essence of the missile hardware, and
further, that for practical purposes it would be
most realistic to assume that this part of the
system was basically fixed. We found that we had
to describe the hardware in what we called "Support Unit" terms in order to capture the hardware characteristics in sufficient detail to permit interaction of the operations and support
activities in the tactical unit. A support unit
was defined as a module, black box, or a functionally integrated component. For example, the
checkout and malfunction detection equipment was
basically designed to detect failures within
support units, so that the support unit normally
would be that module or component to be removed
and replaced on the missile to correct a malfunction. To describe the missile, its ground
support (launching and monitoring) equipment, and
launch facilities in this level of detail, we
found took 1,500 support units: 800 on the missile, 250 on the ground support equipment, and
450 in the facilities.
To represent adequately the operations-support interactions of this tactical unit in a
minute-to-minute way, we found that about a
hundred different parts characteristics were required for each
support unit. These characteristics covered such
elements as the skills and equipment needed to
perform remove-and-replace activities; missile
system checkouts; diagnosis of an ambiguous malfunction as reported by the checkout equipment;
calibrations, and so forth. A very significant
characteristic of each support unit was its reliability. Since this is one of the most uncertain
parameters, its value was represented by a range
from minimum-acceptable to maximum-likely reliability. In the course of the experiment, we
examined the effect upon the control system of
varying the reliability over this range.
A "failure II model had to be constructed which
would be consistent with the reliability values,
and which would define for each support unit the
probability that it would fail under a given type
of stress. We defined six types of stresses
likely to cause failure for each of the 1,500
support units. The operational and maintenance
statuses were defined for each of the 40 conditions in which the missile system could be placed:
various degrees of alert, launching, countdown,
checkout, replacement, etc. The description of
each status defined the stresses that applied to
each of the 1,500 support units by 15-minute intervals of time. Therefore, given a particular missile system status, the probability of failure for
each support unit could be established for each
15-minute interval, and by making random draws
from a probability distribution, a malfunction
generation model was created. This model generated
a significant portion of the workload for the
squadron (identified by the dashed line in Chart 3).
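The failure model lends itself to a compact Monte Carlo statement. The following Python sketch is purely illustrative: the unit counts, the stress mapping, and the failure probabilities are assumed stand-ins, not the tables actually used in LP-II.

```python
import random

# Illustrative sketch of the malfunction-generation logic described above.
# All names and numbers are assumptions, not the actual LP-II data: we assume
# a failure probability can be looked up from (support unit, stress type) and
# that each missile status maps each support unit to one of six stress types
# for a 15-minute interval.

NUM_SUPPORT_UNITS = 1500          # missile + ground equipment + facilities
STRESS_TYPES = range(6)           # six kinds of stress assumed in the model

# Hypothetical failure-probability table; in the experiment these values came
# from the reliability estimates and the status descriptions.
fail_prob = {(unit, stress): 0.0005 * (stress + 1)
             for unit in range(NUM_SUPPORT_UNITS)
             for stress in STRESS_TYPES}

def stress_for_status(status, unit):
    """Return the stress type applied to a support unit in a given
    missile-system status (one of the 40 conditions). Illustrative only."""
    return (status + unit) % 6

def generate_malfunctions(status, rng=random):
    """One 15-minute interval: draw a random number per support unit and
    report the units that malfunction under the stresses of this status."""
    failed = []
    for unit in range(NUM_SUPPORT_UNITS):
        stress = stress_for_status(status, unit)
        if rng.random() < fail_prob[(unit, stress)]:
            failed.append(unit)
    return failed

# Example: malfunctions generated while the system sits in an assumed status 7.
print(generate_malfunctions(status=7))
```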
The squadron contained several launch complexes and a squadron headquarters. A full
squadron complement would normally contain several
hundred people, but for the purposes of our study
we were interested primarily in the management
personnel. We identified the launch complex
commander for each launch complex and the operations, maintenance, and supply officers of the
squadron headquarters as being the key management
personnel. Therefore, they were the "people"

of the simulated squadron, and personnel to staff
these six positions were brought to the laboratory
from Strategic Air Command missile organizations.
This squadron organization was given a set of
operational requirements, policies and resources
by the embedded organizations who represented
higher headquarters. The embedded organizations
were staffed by laboratory personnel. These embedded policies specified the operations, maintenance, supply, manning, and data-processing
requirements to be met by the squadron. The
resources given to them were initially derived
from Air Force manning documents, equipping documents, supply tables, etc., that were in effect
at the start of the experiment. These resources
were represented by punch cards, which could be
assigned to jobs in the performance of operational
or support activities by the managers of the
squadron.
The squadron managers also had access to a
data-processing and analysis center. One function
of this center was to maintain the maintenance and
supply-status data of the squadron, and to perform
various logistical functions, such as automatic
resupply notifications, required engineering
changes, maintenance personnel and equipment
availability and utilization, etc.
The experiment was operated by giving the
squadron a set of operational requirements to meet,
such as maintaining a specified amount of target
coverage and a number of missiles on alert. The
squadron was also given certain preventive maintenance and engineering tasks to perform. As the
squadron managers tried to satisfy these operational and logistical requirements, they had to
schedule the missiles to be in various situations
or statuses. These statuses in turn caused
stresses to be applied to the support units of the
missile system, which caused malfunctions. The
squadron managers then used the resources of the
squadron to correct these malfunctions; and,
throughout the entire exercise, the squadron tried
to schedule its activities with the given resources
so that maximum operational capability was attained.
As the experiment proceeded through several
runs,* the operational and support policies were
changed, the assumed reliabilities were varied,
the resources provided were modified, and the
organizational structure was altered. At the
same time, the decisions made by the managers
were studied, both by observation of the results
of their decisions and by interviewers. Such
analysis produced insights into the nature of the
decision processes, and suggested possibly preferred decision rules. For some of these decision
*Each run lasted one simulated week. We
found that such a time period was sufficient to
permit most of the stochastic elements in the
system to "settle down." A simulated week took
about one week of real time, but time compression
was achieved since we simulated the 24-hour days,
7-day weeks, during a real working week of about
35 hours.

processes, we used mathematical models or heuristic
programs to develop the decision rules. The use
of such rules in turn suggested required information and organizational arrangements for making
decisions. This led to an evolving operational
control system, which is the result of experience
by Air Force people in the simulated environment
plus analysis of the results obtained by various
techniques tried during the experiment.
VII. Resulting Operational Control System

Referring to Chart 2, the control process
illustrated is the classical one of information
and feedback.* The novel problem in LP-II was to
adapt this process to the military environment
posed by ballistic missiles in the 1963-1965 time
period. We can foresee a situation in which
missiles are dispersed over a wide geographic area,
maintained on a high level of alert, and responsive to sudden emergencies. The cost of these
missile systems is very great, and so every effort
will be made to keep the resources and costs they
need to do their job to the minimum. One means of
achieving this goal is to be able to shift resources (within the constraints of possible sudden
war) from one place to another responsively, since
the demands for resources at any one place are
typically sporadic and low. Further, the support
costs in equipment and facilities are so large a
fraction of total system cost that minimum missile
system downtime is desirable not only for military
reasons, but also for economics, since support
resource levels are almost fixed, in the short
run, for a missile base. Since the cost of a
weapon down is so great (if we value it at system
cost per time period), the major problem for a
tactical unit is to use its fixed resources to
maximize alert.
This, then, is the environment in which the
simulated control system had to function. The
scheduling rules assigning missiles to various
degrees of alert, and assigning the resources to
repair missiles requiring preventive or other
prescribed maintenance were found to be the critical decisions in the squadron. The formulation of
precise scheduling rules by mathematical analysis
was very difficult because of the many combinations of conditions and constraints that had to be
considered. In effect, the implications of the
schedule had to be determined by support units, and
there were over 50,000 such support units on the
operational missile system in a tactical unit.
Further, the schedule was affected, to some extent,
each 15 minutes as a missile or a resource changed
status.
The useful scheduling rules have been developed by trying many rules of thumb during the experiment. The rules that seem to work best are
*RM-2131, Management Information for the
Maintenance and Operation of the Strategic Missile
Force, D. Stoller and R. Van Horn, The RAND Corporation, April 30, 1958, describes the requirements of management and information systems for a
missile environment.

called "opportunistic scheduling." Under this
rule, all prescribed maintenance is deferred as
long as the policies permit. Then, when a missile
malfunctions, as much prescribed maintenance
as can be done concurrently is accomplished at
that time. In this way, missiles are taken off
alert as little as possible for prescribed maintenance.
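The rule can be stated compactly. The sketch below is an illustrative rendering in Python, not the laboratory's actual procedure; the task representation and the compatibility test are assumptions.

```python
from dataclasses import dataclass, field

# Illustrative sketch of "opportunistic scheduling": prescribed maintenance
# is deferred until a malfunction forces the missile off alert, and every
# deferred task that can be done concurrently with the repair is then bundled
# into the same downtime.

@dataclass
class Missile:
    ident: str
    deferred: list = field(default_factory=list)   # prescribed tasks waiting

def opportunistic_package(missile, repair_task, compatible):
    """Build the work package for a malfunction: the repair itself plus all
    deferred prescribed maintenance compatible with it."""
    package = [repair_task]
    still_deferred = []
    for task in missile.deferred:
        if compatible(task, repair_task):
            package.append(task)      # done "for free" during this downtime
        else:
            still_deferred.append(task)
    missile.deferred = still_deferred
    return package

# Example: here any task distinct from the repair is assumed compatible.
m = Missile("A-3", deferred=["calibrate guidance", "inspect launcher"])
print(opportunistic_package(m, "replace unit 417",
                            compatible=lambda task, repair: task != repair))
```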
Clearly, such a rule requires a highly
responsive organization and information system
because it must be ready to react instantaneously
to any missile malfunction, change in resource
status, etc., under conditions of wide dispersion.
The structure of the information system that
has been designed to meet this requirement of
unscheduled and rapid responsiveness contains the
following: the current status of each missile
and resource, a complete listing of all missile
and maintenance situations and the resources required to accomplish them, and a listing of all
prescribed maintenance awaiting performance on
each missile. As a missile or resource changes
status, the information system is updated through
appropriate inputs. As a new situation occurs,
each of the managers interrogates the information
system for the implications it has for his function (such as operations, supply, or maintenance),
revises his schedule accordingly, and assigns
available resources to the urgent activities.
During the experiment, the information system
was able to respond instantaneously to such interrogations. This was made feasible by placing all
the necessary information in a very large random-access memory that could be interrogated very
quickly. Since the system status changed very
rapidly, an input system had to be devised which
would quickly update the status, and this further
required the minimizing of input data. This was
done by requiring only a single input of each
necessary data element, which served all managers.
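The single-input, shared status-file idea can be illustrated as follows; the record layout and query interface are assumed for the example and are not the LP-II formats.

```python
# Illustrative sketch (assumed names) of the single-input status-file idea:
# every change in a missile's or resource's status is entered once, and the
# same record then answers interrogations from operations, maintenance, and
# supply alike.

status_files = {}    # key: item name, value: its current status record

def post_status_change(item, **fields):
    """Single input of a status change; the record serves all managers."""
    status_files.setdefault(item, {}).update(fields)

def interrogate(function_name, predicate):
    """A manager (operations, supply, maintenance, ...) asks which items
    currently matter to his function."""
    hits = {k: v for k, v in status_files.items() if predicate(v)}
    return function_name, hits

post_status_change("missile A-3", status="down", cause="unit 417 failed")
post_status_change("crew 2", status="available", skill="guidance")
print(interrogate("maintenance", lambda rec: rec.get("status") == "down"))
```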
Since the organization had to respond very
quickly to emergencies, a good deal of functional
integration was required. Operations, maintenance,
and supply in the squadron headquarters had to
work as a close team since each was affected by
such emergencies. In addition, close communication
between the launch complexes and the squadron
headquarters had to be maintained. Report formats
evolved which helped to present to each manager
the consequences of another function's decisions
on him, and the consequences of his decisions on
them. In this way, each manager was able to react
rapidly to decisions that might affect his plans.
Further, the squadron headquarters found it
could not centrally handle all of the decisions,
since many of them were localized to a particular
launch complex. It therefore worked out conditions
under which the launch complex could act without
prior approval of the squadron headquarters, and
when such prior approval would be necessary. These
conditions have also been further developed by the
laboratory staff into rules of thumb, since the
relationships appear to be too complicated for
analytic solution. Such heuristic techniques were
found to be very useful during the experiment.
The data produced by the squadron managers in
the course of their minute-to-minute operational
decision making were also used by them for longer-run or planning decisions. Thus, these data were
used to estimate resource demands and utilization
which provided the basis for determining required
manning, equipping, and stockage of the launch
complexes and squadron echelons. These data reflected the particular decision rules used to
schedule the employment of the resources, and so
there was a direct relationship between the short-run operational decision rules, like scheduling,
and the longer-run planning decisions, like determination of the resource requirements of the
squadron.
Thus, the simulation helped to produce and
evaluate a control system that might fit the
peculiar environment of the 1963-1965 missile era.
The nature of the decisions, information system,
and organization for such a control system has
been developed in considerable detail, and as such
it may provide a good guide to the design of such
a system for the real world. Further, this blueprint is available early enough to permit its
appearance in tactical units by 1963. We cannot
say that this is an optimal control system, but
it appears to be feasible and compatible with the
type of environment experienced in the laboratory.
Simulation thus has helped produce a reasonably
detailed design of a control system for a future
radically different environment.
VIII. Computers and the LP-II Experience

There are two aspects to computers and a
simulation like LP-II: first, computers are used
in the modelling process and runs; and, secondly,
we learn something about the computer needs of the
management system under study.
Only a few words need be devoted to the first
aspect, here. In addition to standard punch card
equipment, two general purpose digital computer
systems were used: one with magnetic tapes capable of microsecond arithmetic speeds (in our case,
an IBM 704 system) and a data-processing machine
with relatively slow processing speeds but with a
large disk memory capable of rapid access to any
of several million characters (in our case, an IBM
305 RAMAC). In the laboratory, we have, occasionally, used the high-speed computer more or less
"on-line" with the simulation -- men making decisions which are fed into the computer which in
turn quickly "plays the decisions" through various
models and returns to the relevant men the system
impacts of these decisions, which in turn causes
new decisions, etc. But in LP-II, most of our
operations on the high speed computer were not
"on-line". 'Ihey were either for analysiS purposes or for computing lists of "potential failure
'rates" which were then interpreted by laboratory
personnel to reflect the minute-to-minute decisions
of the laboratory partiCipants (see the description of the failure model above).
As a result, the entire project consumed only a few
hundred IBM 704 hours, a relatively small portion
of the total cost of the operation, which also
required more than 70 man-years. In other manned
simulations where the participants and the high-speed computer "talked" to one another many times
daily, the computer costs were a considerably
larger proportion of the project's total costs.1
The computer with the large memory (the RAMAC)
on the other hand, was used primarily on-line,
minute-to-minute, during the runs, and served
mainly2 as an information storage and interrogation device for the laboratory participants.
Study of this computer's programs and use permitted us to make statements about the kinds of
computers such a tactical unit might need if it
were to obtain the same effectiveness as our simulated unit: The RAMAC operation was basically a
"status of resources" system and served to provide
the resource manager with "up-to-the-minute" data
needed to make decisions on the assignment of
these resources. Hence, all of its internal files
were "status files" in the areas of maintenance,
supply and operations. The inputs to the system
were changes in status (personnel movements, maintenance actions, etc.). The outputs were status
reports (generated in response to the manager's
inquiries) and a comprehensive set of history
cards. The history cards served as a historical
file, and were also used to generate certain summary and history reports. The system had two
basic components: the disc files, which were the
information acted upon, and the stored programs
which did the acting.3
The system contained both "on-line" and "off-line" programs. Off-line programs operated independently of each other and of any central control.
They were used to perform functions of an occasional, isolated nature such as loading the initial
files. To use an off-line program, it was necessary to interrupt the normal 305 operative mode.
On-line programs were all controlled by a Master
Routine. These were the programs which processed
the normal input cards. All on-line programs (and
the Master Routine) were stored in the disc files.
The Master Routine was normally on the drum and
operating, and the processing routines were read in
and operated as needed. Several of the on-line
programs used subroutines. The subroutines were
stored on the disc files and were read in and controlled by the programs which used them.
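The control pattern just described -- a resident master routine that examines each input, pulls the matching processing routine from the disc files, and runs it -- can be sketched as below. The program names and card types are assumptions for illustration, not the actual 305 RAMAC routines.

```python
# Illustrative sketch (assumed names) of the on-line control pattern: a
# resident Master Routine examines each input card, loads the matching
# processing routine from the disc-file program library, and operates it.

disc_files = {   # program library held on the disc, keyed by input type
    "maintenance action": lambda card: f"update maintenance status: {card}",
    "personnel movement": lambda card: f"update manning status: {card}",
    "inquiry":            lambda card: f"produce status report for: {card}",
}

def master_routine(input_cards):
    """Dispatch each normal input card to the proper on-line program."""
    for card_type, card in input_cards:
        program = disc_files[card_type]          # "read in" the routine
        print(program(card))                     # operate it as needed

master_routine([
    ("maintenance action", "unit 417 replaced on missile A-3"),
    ("inquiry", "missiles currently off alert"),
])
```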
1We have been able to do this several times
during the course of an 8-hour day, which simulated
several daily or weekly "time periods".
2It was, also, invaluable for analysis purposes, later, following the runs.
3The remainder of this section is largely due
to the efforts of J. Tupac and K. Labiner. See
their RAND Research Memorandum, RM-2572, The LP-II
Data Processing System, September, 1960.

The LP-II data system did not unduly tax the
machine's storage capacity, using less than 90%
at peak usage.1
However, the machine's slow processing and output speeds did influence the system's performance considerably, and suggested the
infeasibility of doing all that had been hoped for
with little or no information lag. To quote from
the report on the LP-II data system: 2
"Slow processing and output speeds strongly
influenced LP-II's system design. As mentioned,
almost all machine time was used in processing the
"on-line" status changes and interrogations. As a·
result, we observed three consequences. First, the
machine did not generate all the summary reports.
Had we used a faster machine, all summary reports
could have been generated in step with its other
operations and thus furnished managers with some
summary data on an inquiry basis. Greater direct-printing capacity and speed would also have been
required. Secondly, the machine did not make
decisions. We found it was difficult to define
decision rules when the system was being designed.
In addition, a machine with much higher internal
processing speed and computing capability than
that available for the experiment would be necessary in order to incorporate decision functions.
This would have been true even if the rules could
have been established at the outset. Finally,
there was not as much internal checking of data as
one would have liked."
To summarize, the major outputs of LP-II with
respect to control system design are as follows:
1. An integrated control system for a
large missile tactical organization, combining
the organization, policies, and data system with the
novel operational and weapon system environment of
future years.
2. Estimates of workloads, resource
utilization, delays, missile alert performance,
etc., resulting from the interactions of environment, policies and decision-making of the organization.

3. Frequency of, and kinds of techniques used by the management personnel for making
critical decisions, such as targeting, scheduling
of missile status, and resource assignment,
tailored to the special demands of the large tactical organizations.

4. The information requirements for
making critical decisions and performing management activity. The reports requested by the
participants changed considerably as the organization was exposed to different situations, such
as higher alerts, engineering changes, reduced
resources, and revised requirements; the reports
thus generated provided an explicit estimate of the data displays
1This was partially due to the scale used
in the simulation models, however.

2RM-2572, op. cit., pp. 37-38.

that the new organization may require.

5. A detailed description of the data
processing system for providing the required
information. This system was based upon several
new concepts that imply greater automation, new
methods of reporting data, and a different type of
data organization. These concepts were made operational in the simulation so that many of the specific techniques used may be directly transferable to
a real-world control system. Also, estimates were
obtained for the frequency and urgency with which
different data elements were required. These may
be useful in developing specifications for data
processing hardware.
IX. Limitations and Future Research

The number of alternatives facing the designer of the information portion of a management
control system is enormous, particularly during
the planning stages. Chart 4 gives some idea of
the number and kinds of choices that a system
designer must resolve. For a given budget, he is
expected to choose the right man-machine resource
mix for each of the alternatives mentioned. Anyone who has ever helped design an information
system can supply examples of the kinds of choices
that Chart 4 represents.
We have said, so far, that manned simulation,
properly combined with all-computer simulation
models, can help the designer make these choices.
LP-II, if nothing else, should increase the designer's belief that this is a possibility. However, many problems remain which must be satisfactorily resolved before this becomes part of the
everyday tool-kit of a system design program.
1. The presently high costs of a simulation like LP-II. The sizeable scope, great detail, and the use of humans in simulations are all
factors which make this sort of experiment usable,
primarily, for designing larger, complex "man-machine" systems. Of course, some of these costs
would have to be borne by any sizeable study of a
control system but others can be reduced by turning to suitable compiler programs, and using only
the degree of detail necessary to obtain the desired results.

2. Experimental Design Problems.
Manned simulation experiments cannot produce, for
the same research budget, the same number and
length of runs that their all-computer brethren
can. This makes it difficult to perform the desired amount of sensitivity testing or run long
enough to wipe out, fully, the effects of initial
conditions. Work is proceeding at RAND, and elsewhere, on experimental design techniques to mitigate these effects.

3. Inability to Produce Optimal Designs.
Simulations, by their very nature, produce at best
highly preferred -- but not optimal -- policies.
Significant betterment in management control design
is, of course, nothing to belittle. The development of all-computer simulation and analytic

models based on the manned simulation as a breadboard model should help, considerably, to produce
even better results.

4. Programming Task Synchronization.
Coordinating the many tasks through time that have
to be performed by the computer portion of the
simulation has, in the past, ordinarily required
setting the basic time unit of the simulation
equal to the minimum time unit of all tasks. The
possibility of using variable time synchronization -- different tasks running at different
time units -- is being investigated. This would
permit running the simulation activity at differential speeds as a function of the question the
particular simulation run is addressing.1
5. Defining a Suitable Level of Scope
and Detail. The system designer might wish the
simulation activity to produce answers to highly
specific questions but this requires the simulation activity to represent great amounts of
detail and many functions and organizations.
This is costly and makes analysis more difficult.
Therefore, we must better understand how to deal
with management control problems in an abstract
way for simulation purposes, yet in a realistic
enough manner to produce valid design factors for
real-world application.

X. Conclusion

Notwithstanding these difficulties, we nevertheless believe that manned simulation will become a very useful technique for those seeking
to make choices in a system context, between
alternative management control resource mixes.
The problem described in this paper is probably characteristic of many large system-design
problems. Certainly the need for such operational control systems is expanding in the military, and in both civilian and military the need
for improved management systems has been ever
present. Simulation techniques can prove of much
help in these problems by providing a means of
pooling and integrating knowledge from many
sources and by providing the opportunity to iterate and vary the many variables and parameters
that compose such systems. Although most published simulation experiences have involved all-machine models, we have found much value in man-machine simulation when the problems have involved
organizational interactions, the design of information systems, and conflicting or interacting
decision rules, since these undergo considerable
development during the simulation process.
1Jack Little of RAND's Computer Science
Department is working on this problem.


Chart 1
PLANNING-OPERATIONS RELATIONSHIPS FOR A FUTURE WEAPON SYSTEM
[Diagram: legible box labels include Hardware Characteristics; Projected Future Environment; Operational Plans and Policies; Resource Plans and Policies; System Requirements; Budget and Program Cycles; Actual Environment; Operational Control System; Available Resources; Operational Capability; grouped under Planning Phase and Operations Phase.]

Chart 2
ELEMENTS OF AN OPERATIONAL CONTROL SYSTEM
[Diagram: legible box labels include Actual Environment; Available Resources; Current System Status; Management (System Inputs, Decisions, Actions); Data Processing; Reports and Displays; Operational Capability.]

Chart 3
MODEL OF SIMULATED WEAPON SYSTEM
[Diagram: legible box labels include Hardware and Parts Characteristic Data; Failure Model; Operational and Maintenance Problem; Malfunction Generator; Background (Research, Hardware, Operations, Maintenance, Manning); Launch Complex; Supply; Data Processing; Squadron; Data Processing and Analysis Center; Higher Headquarters.]

CHART 4
FEATURES OF ASSUMED INFORMATION SYSTEM DEVELOPMENT

INPUTS
1. SPECIFICATION OF DATA ELEMENTS
2. LEVEL OF AUTOMATION
3. MINIMIZE REPETITION
4. RETAIN BASIC RELATIONSHIP

DATA PROCESSING STRUCTURE
1. ON-LINE VERSUS ANALYSIS NEEDS
2. INTEGRATION ACROSS FUNCTIONS, ECHELONS, TIME AND DISTANCE
3. ALTERNATIVE PROGRAMMING TECHNIQUES
4. MECHANIZATION OF DECISION RULES

OUTPUTS
1. CONTENT FLEXIBILITY
2. INTEGRATION OF REPORTS
3. DISPLAY FORMATS
4. LEVEL OF AUTOMATION
5. SPECIFICATION FOR EXCEPTION REPORTING

DATA PROCESSING HARDWARE
1. DEVELOPMENT AND TESTING
2. MEMORY CAPACITY
3. COMPUTING SPEED
4. INPUT AND OUTPUT CHARACTERISTICS

A SURVEY OF MICROSYSTEM ELECTRONICS
Peter B. Myers
Motorola Inc., Semiconductor Products Division
Phoenix, Arizona
Summary
The history of microsystem electronics is
briefly traced through the successive stages of:
miniaturization; subminiaturization; microminiaturization; thin-film integrated circuits; semiconductor integrated circuits; and finally morphological integrated circuits or functional blocks.
The term integrated circuit is defined as the
combination, on or within a single chunk of material, of multiple electrical elements to perform
a desired circuit or systems function. Improvement in reliability, decrease in size and weight,
decreased power consumption, and lower cost are
discussed as motivations for Integrated Circuitry.
Fabrication techniques, interconnection, and
accessibility are identified as major problems. A
catalogue of techniques for fabrication of Integrated Circuitry is presented.
What Is Microsystem Electronics?
We define Microsystem Electronics as that
entire body of electronic art which is connected
with or applies to the realization of extremely
small systems. As such it includes power supplies,
input and output transducers, and various forms
of memory where applicable, as well as digital
and analog electronic circuitry.
This paper surveys the techniques of microsystem electronics that apply to the arithmetic
or logical elements of computers and to the electronic circuitry of communications equipment. The
electronics industry has not yet agreed on a name
for this part of microsystem electronics but the
term Integrated Circuitry seems to have wide and
growing acceptance as a generic name.
Integrated Circuitry
Definition
We define an integrated circuit as a combination, on or within a single chunk of material,
of a number of basic electrical elements to perform a circuit function.
The concept and terms used can be illustrated
by a simple four-box picture of electronic technology. Let us arbitrarily divide electronic
technology into the following four mutually exclusive and exhaustive categories: Materials,
Components, Circuits, and Systems. If we investigate the emissive properties of a heated tungsten
wire or the dielectric constant of polyethylene
we are clearly studying the properties of
Materials. If we combine a tungsten wire, some
bits of nickel wire and sheet, put it all in a
glass envelope which we then evacuate, we have
made a vacuum tube -- a Component. If we properly
connect our vacuum tube, an inductor, a capacitor
and a battery, we have an oscillator -- a Circuit.

Finally, if we combine an oscillator, a modulator,
an amplifier, a power supply, and an antenna, we
have a transmitter -- a System.
Note that in the above example we were engaged in four distinctly different types of
activity. In the specialization of today's electronic industry these activities are performed by
four different men. In the example, and more
generally in our four-box picture of electronic
technology, we find communication and interaction
-- bi-lateral feedback -- between each type of
activity and its immediate neighbors. The components man, for example, takes the output of the
materials man, asks him for materials with specific characteristics, and tries to feed back the
results of his experiments in such a way that the
materials man can modify or optimize his output.
The components man also communicates with the
circuits man, listening to his requirements and
providing or designing components to fit. He
receives feedback from the circuit designer as to
how well his components work and how they should
be changed.
Similarly, the circuits man goes to the components man for his building-blocks and to the
systems man about the finished product he hopes to
supply. Each man, in addition to his own problems
and language, must be able to communicate with his
neighbors sufficiently well to exchange the information necessary for his work.
Note, however, that in this simple picture
there is no need for the materials man to talk to
the circuits man or to the systems man, and in
practice he hardly ever does. In fact, they have
so little in common in our conventional electronic
technology that they don't really speak the same
language. Likewise the components man has no real
need to understand the problems of the systems
man, and the latter tends to think of components
only as black boxes with certain failure rates
which limit the reliability of his system.
This partitioning is a result of the need for
specialization in today's technology and of the
finite amount of time and interest the average
technical man has for exploring outside his own
speciality. It is unfortunate in that it leads to
inefficiency and lost opportunity in a growing
number of instances where a new requirement is met
by an adaptation of an old technique instead of by
a fresh examination of the wide range of unexploited possibilities.
Integrated Circuitry removes the internal
partitions and integrates the boxes into a single
field. Integrated Circuitry still has its
specialists, but each must have a working knowledge of the problems and objectives of areas which
yesterday would have been the exclusive domain of
other specialists. For example, the circuits man
may no longer limit his consideration to the black
box characteristics of existing components. He
must be aware of the characteristics of the

64
2.1
materials which go into his components, of the
limitations imposed by the laws of the solid
state, and of the possibilities it opens up. The
components man may no longer accept blindly the
requirements set by the circuit designer.
He has
a responsibility to examine the overall requirements on the system to find the most satisfactory
way of accomplishing the desired function. As an
example, a conventional circuit designer might
build a power supply with a transformer, a diode,
a choke, a couple of capacitors -- or a simple
RC filter, if you prefer -- in either case he has
used a minimum of five components. Our integrated
circuit designer might choose to build the same
power supply from a single piece of silicon.
Alternating current flowing through one part of
the silicon encounters resistance and generates
heat which filters through the silicon to a
thermoelectric area, where Seebeck Effect produces
filtered direct current. This particular example
happens to be one in which the Integrated Circuit
bears little resemblance to its conventional
counterpart. The basic electrical elements used
include resistance for the generation of heat,
transistance in the conversion by Seebeck Effect
of heat to direct current, and thermal and electrical insulation at appropriate places.
Basic Electrical Elements
New as it is, Integrated Circuitry is made up
of the same six basic electrical elements which
long have been the ingredients of the electronics
art. They are insulation, conduction, resistance,
capacitance, inductance, transistance, and some
special combinations of these. By insulation we
simply mean the prevention of current flow, or
the isolation of electric or magnetic fields.
Conduction signifies the free flow of electric
current, and resistance, capacitance, and inductance have their customary meanings. Transistance
is a new and highly useful term which describes
the gain of active elements, or their ability to
achieve precise control. It applies to all conventional forms of transistors, diodes, and other
Solid State active elements. Finally, by special
combinations we mean elements which can only be
approximated by networks in our conventional
technology. The prime example is the distributed
resistance-capacitance network realized in
Integrated Circuitry by a thin-film resistance
deposited upon a dielectric layer, which in turn
has been deposited on a conducting medium.
Evolution of Microsystem Electronics
To gain perspective in our survey of microsystem electronics let us look at its history and
evolution.
The trend was set by miniaturization which
made use of smaller forms of conventional components interconnected by conventional wires and
solder. In every case the discrete nature of the
individual component and usually its conventional
form has been maintained. Miniaturization represents the first step, chronologically, in the
attempt to make electronic equipment smaller.
The next step was subminiaturization with
still smaller forms of conventional components.

In most cases the conventional form of the component has been maintained but the size and weight
are reduced to the point at which thin wire leads
provide adequate support for mounting. The
cordwood technique, in which cylindrical axial-lead components are stacked like cordwood with
their leads fed through matched holes in parallel
printed circuit boards in front of and behind the
stack, 'is a good example. Various forms of
printed or etched circuitry, both rigid and
flexible, are used as the combination wiring harness and support.
Microminiaturization is the name given to
the ultimate size reduction of the individual
component. It differs from subminiaturization in
that the conventional shape and form factor are
generally lost, leads are often left off, and a
supporting board or matrix is always necessary.
Microminiaturization still permits the circuit
designer uninhibited freedom of choice in the
selection of his individual components.
Thin-film Integrated Circuitry represented
a major advance by dispensing with separate
mechanical supports for each component and combining multiple thin-film components on a single
glass or ceramic substrate. Overlapping or
touching films form internal connections. At
present a hybrid form of thin-film Integrated
Circuitry is necessary since none of the many
companies working in this field has perfected a
way to fabricate workable thin-film diodes and
transistors on a glass or ceramic substrate.
Current thin-film integrated circuit technology
involves deposition of thin-film passive elements
followed by the applique of any required active
semiconductor elements.
Semiconductor Integrated Circuitry means the
combination of thin-film and semiconductor circuit
elements on and within a single crystal semiconductor substrate. Connections are made by
deposited thin-film conductors, and by physical
juxtaposition. Of all the existing forms of
Integrated Circuitry, the Semiconductor version
permits the widest variety of active and passive
elements and the greatest potential flexibility.
Active elements can be formed within or on the
substrate where needed, and either thin-film or
semiconductor techniques can be used to form the
passive circuit elements. When semiconductor
technologists have perfected a means of depositing thin-film semiconductor active elements on
glass or ceramic substrates, the passive substrate
approach may well prove more flexible than the
active substrate approach. This comes about
because part of each active element is unalterably connected to a common semiconductor crystal
in the active case. Although careful placement
of active elements and tailoring of the shape
and thickness of the active substrate between may
allow quite complicated circuits to be built into
a single semiconductor substrate, more complex
circuits could undoubtedly be achieved if both
active and passive elements could be deposited on
an insulating substrate.
Morphological Integrated Circuitry, or what
some people call the functional block, represents
the combination of solid state materials to perform a desired circuit function, although neither
individual components nor precise electrical

65
2.1

circuits are necessarily identifiable. The power
supply mentioned earlier is a Morphological Integrated Circuit. Another example is the familiar
standard frequency quartz crystal, which is a
homogeneous slab of material although it acts as
a combination of resistance, capacitance, and
inductance. The lack of a physical counterpart
in conventional circuitry makes this type of
Integrated Circuit the most difficult to design.
However, there is a long list of unexplored
physical effects awaiting our attention, and the
possibilities are unlimited.
In the rest of this paper we shall limit our
consideration to Integrated Circuitry, as defined
above, since it represents the most significant
departure from conventional technology. The
following sections first review some of the
reasons for developing Integrated Circuitry and
then present a catalogue of the basic applicable
fabrication techniques.
Philosophy of Integrated Circuitry Development
Having briefly examined what it is and the
evolutionary stages through which it is developing, let us consider some of the motivations for
the development of Integrated Circuitry. Although
there may be disagreement as to the order of
importance, it is not hard to identify the following areas of concern as having motivated the electronics industry in this field: reliability,
size and weight, speed, power consumption, accessibility and cost.
Reliability
Everyone talks about reliability, and it is
certainly true that without a definite minimum
reliability neither microsystem nor any other
kind of electronics can long endure, but is there
any significant reason to expect a better reliability from Integrated Circuitry than from conventional component electronic circuitry? Yes,
say its proponents, for at least two reasons:
first, because the number of discrete point
connections is greatly reduced and connections
have always been one of the sources of failure;
and second, because the overall reliability of the
system is no longer the same complicated function
of the individual reliabilities of each individual
component. This latter claim stems from the
common mode of fabrication of Integrated Circuits
where usually all of a given type of electrical
element, say capacitor, are created at the same
time. This common fabrication may seem to have
more bearing on process yield, and hence on cost,
than on reliability but there is a connection with
reliability buried back in the reduction of total
process steps and the consequent greater attention
that must be paid to each process step.
It is undeniably true that our electronic
systems, be they computer, communications, or
weapons control, are getting progressively more
complex. With increased numbers of components
all having to function together, we seem to be
approaching an asymptotic barrier where additional complexity can only be achieved at the cost of
decreased reliability. While we are still a long
way from the machine with infinite complexity and

zero reliability, our modern systems are already
uncomfortably close to the complexity-reliability barrier.
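The barrier can be made concrete with a simple series-reliability calculation (an illustration added here, not taken from the paper): if every one of n components must work for the system to work, the system reliability is the product of the individual reliabilities and it collapses quickly as n grows.

```python
# Illustrative series-reliability calculation behind the
# complexity-reliability barrier; the numbers are assumed, not measured.

def series_reliability(per_component: float, n: int) -> float:
    """Probability that all n independent components work at once."""
    return per_component ** n

for n in (100, 1_000, 10_000):
    print(n, round(series_reliability(0.999, n), 6))
# 100 -> ~0.905, 1000 -> ~0.368, 10000 -> ~0.000045
```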
Size and Weight
At least three sources of concern can be
grouped under the heading of size and weight. For
missile and other airborne applications the size
and, often more important, the weight of an electronic system has a cost in propulsion machinery
that provides a strong motivation for reducing
both. While reliability is of very great importance in some of these applications, reducing
size and weight to particular values can mean the
difference between the possible and the impossible.
A second size and weight motivation is
found in the realm of portable products where the
size of a potential market, both industrial and
military, often depends on whether a system can
be created and packaged within certain limits.
A third motivation for size reduction is the
economic, where an equipment must be housed or
protected or otherwise maintained in certain conditions. If the cost of providing a required
environment is high enough and is proportional to
the volume required, considerable effort can be
justified in reducing the volume. A final motivation is shared with the next area of concern.

Speed
For a long time increasing the speed of
electronic systems was a matter of improving the
components. Recently the speed of certain systems has reached a point where propagation delays
in the wiring, rather than the characteristics of
the devices, prohibit faster operation. This
point is often called the speed-size ceiling and
it has important bearing on computer systems. If
a particular system has reached its speed-size
ceiling, it is impossible to expand it functionally without decreasing its speed of operation,
and vice-versa. The only way through a given
speed-size ceiling is to decrease the physical
size of a given system and thus shorten the
propagation delay through it. Microsystem electronics, and particularly Integrated Circuitry,
shows promise of substantial improvement in
speed-complexity product by significant reduction
of propagation delay within and between integrated circuits.
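A rough calculation, with assumed and purely illustrative numbers rather than figures from the paper, shows why shrinking a system raises its speed-size ceiling: the longest wiring path bounds the achievable clock rate.

```python
# Illustrative estimate of the speed-size ceiling: if one round trip over the
# longest wiring path must fit in a clock period, the physical span of the
# system bounds the clock rate. Numbers below are assumptions.

C = 3.0e8                 # speed of light, m/s
PROP_FRACTION = 0.6       # assume signals propagate at ~0.6 c on typical wiring

def max_clock_rate(longest_path_m: float) -> float:
    """Upper bound on clock rate for a given longest wiring path."""
    round_trip_s = 2 * longest_path_m / (C * PROP_FRACTION)
    return 1.0 / round_trip_s

for meters in (3.0, 0.3, 0.03):        # cabinet, board, integrated circuit
    print(f"{meters:5.2f} m -> about {max_clock_rate(meters)/1e6:8.1f} MHz")
```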
Power Consumption
Information, in an electronic system, must
be related to some form of energy if it is to be
processed. In general the amount of energy
contained in a given piece of information is very
small in comparison to the energy expended in
processing it. In part this is because of the
inefficiency of the processing equipment, but
often a much larger drain is deliberately introduced in order to assure continued operation in
the event of drift or change in certain component
characteristics. Where many different processes
are involved in fabricating the components for an
equipment, many different and often uncorrelated
drifts and changes may be expected. To assure

66
2.1
proper operation under these conditions may require ten or one-hundred times the power needed
in the absence of change. The advantage of Integrated Circuitry in this respect is based on the
reduction in number of different processes involved in fabricating the equipment. If all resistors
have the same temperature coefficient and aging
characteristic, for example, variations in bias
and operating point of associated active elements
will be much reduced.
As we shall show in the catalogue of integrated circuit techniques which forms the major
part of this paper, not all resistors useful to
Integrated Circuitry are made by the same process.
However, many of the resistors in a given integrated circuit will not only be made by the same
type of process but will also have been made at
the same time from the same material and under a
single set of conditions.
Accessibility
There are a number of facets to the problem
of accessibility of Integrated Circuitry. First,
it is apparent that those parts of electronics
systems that must interact with a human operator
must be matched to him in physical size. Knobs
must be such that fingers can grasp them, dials
of a size that the eye can read them, and so
forth. Input and output equipment must be
matched to the environment from which it receives
and to which it gives information. These are
problems of accessibility where limits of useful
size reduction can be determined and beyond which
it is neither reasonable nor desirable to go.
The second set of problems revolve around
the need for replacing defective parts of the
system upon failure. It is no longer feasible to
think of "repairing" a
failure. When some part
of an integrated circuit as we have defined it
stops working, it cannot be operated upon in the
field. The service man must treat the entire
circuit, perhaps even a group of related integrated circuits, as a single "component" which he
replaces by a new "component" from an inventory
of known working spares. But we cannot afford to
throw the radio away just because the dial cord
is broken, and hence we have the problems of
suitable modular subdivision and of accessibility
for replacement. The problem further breaks down
into one of size and one of interconnection.
The problem of size in accessibility for
replacement is much like the first problem of
matching to a human operator. The technician
must be able to get a grip on the faulty module
to remove it.
The problem of interconnection is much
deeper and more fundamental. It reaches across
and can nullify the motivations of reliability,
size and weight, and speed as well as accessibility. Many present day electronic systems use
almost as much volume for interconnecting wiring
behind the panel as they use for components out
in front. Unless the connecting means from
circuit modules to wiring harness is more reliable than the Integrated Circuitry itself,
system reliability will be hurt. If the wiring
harness is physically long, the propagation delay

in getting the signal from one part of the system
to another reduces the maximum speed of operation.
Finally, unless the interconnecting means is
flexible and cleverly arranged, it may be almost
impossible to get at the connections to replace a
module, since the motivations of reliability and
size tend to rule out the use of plugs and sockets.
The final set of problems of accessibility
are concerned with the removal of heat. The
potential saving in power through an allowable
decrease in operating margins has already been
mentioned, but even so a significant part of
integrated circuit design is thermal in nature.

Cost
The cost picture for Integrated Circuitry
is a complicated one. Initially some of the
specialized applications for which there are no
other possibilities will probably support Integrated Circuitry regardless of cost. Any sort of
general acceptance, however, will require a cost
competitive with other forms of circuitry. Since
individual components can no longer be selected
after manufacture, both control of process and
process yield will have to be high. An aid in
this respect is the fact that the same process
run usually forms all of a given type of circuit
element on a particular integrated circuit substrate and thus if one part is good all tend to
be good. This simultaneous fabrication of all
diffused resistance regions or all deposited
capacitors at the same time is one of the advantages by which its proponents hope to lower
the cost of Integrated Circuitry.
Catalogue of Integrated Circuit Techniques

Our survey of microsystem electronics may
logically end with a catalogue of existing and
proposed means for obtaining the elementary
circuit functions. Since the assembly of complex
circuit functions called for in a designable
Integrated Circuit Technology must rest fundamentally on the precision and process control
attainable in the basic circuit functions, major
continuing effort has been put on the study of
means for obtaining greater control over these
individual processes.
The ten subdivisions which appear below are
the ten basic elements needed by almost every
circuit regardless of techniques used to build it.
Method of Catalogue
There are a number of ways in which Integrated Circuit Technology may be catalogued. We
may catalogue it by process, by materials used,
by physical form or arrangement, by the basic
circuit parameters, by the energy forms and transformations, by generic circuit function and,
finally, by specific circuit function in a representative system. For simplicity we have chosen
to subdivide by basic circuit parameters:
1. Insulation
2. Conduction
3. Resistance
4. Capacitance
5. Inductance
6. Special Networks
7. Active Elements and Substrates
8. Encapsulation
9. Mechanics
10. Design.

The above catalogue is by basic circuit parameters rather than one of the other means of
classification because for a survey such a breakdown seems to be most meaningful and concrete.
In developing these basic elements we sometimes use the bulk properties of the body of the
substrate, sometimes operate at or close to the
surface by alloying or diffusion, and sometimes
build the element on top of the substrate by
epitaxial growth of semiconductor materials,
evaporation, plating, or other means of deposition
of thin films of material. The mechanical, thermal, electrical, and chemical interactions resulting from these new approaches to electronic circuits have introduced both new problems and
indeed a new science.
Insulation
Electrical insulation is required in all but
the most trivial Integrated Circuits. Extrinsic
techniques of insulation include operations external to the substrate. The development of
extrinsic electrical insulation includes anodization, vapor phase deposition of dielectrics by
pyrolysis, evaporation, and plasma deposition
techniques as well as the conventional mechanical
coating. Intrinsic insulation includes any
method of isolating fields or current flow within
a semiconductor or other substrate. Among these
are thermal oxidation, fabrication of an isolating
layer of intrinsic semiconductor material, and
creation of one or more back-biased junctions.
Anodization is the formation of an insulating
oxide over certain elements, usually metals, by
electrolytic action. The most commonly anodized
materials are tantalum, aluminum, titanium, and
niobium. Anodization is a particularly useful
form of insulation where protection of a conductor
is required since the base metal can form the
conductor and the anodized surface layer can form
the insulator. Since anodized films of tantalum
can be controlled to a high degree and possess a
dielectric constant of approximately 25, the
process finds wide application in formation of
capacitor dielectrics. The high dielectric constant becomes a disadvantage as the frequency of
the energy to be insulated increases.
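A parallel-plate estimate, with assumed film dimensions rather than figures from the paper, illustrates why such a thin, high-permittivity anodized film is attractive as a capacitor dielectric.

```python
# Illustrative parallel-plate estimate (assumed film thickness and plate area)
# of the capacitance obtainable from a thin anodized tantalum-oxide film with
# a dielectric constant near 25.

EPSILON_0 = 8.854e-12     # F/m, permittivity of free space

def capacitance(rel_permittivity: float, area_m2: float, thickness_m: float) -> float:
    """Parallel-plate capacitance C = eps0 * er * A / d."""
    return EPSILON_0 * rel_permittivity * area_m2 / thickness_m

# Assume a 1 mm x 1 mm plate and a 0.1-micron anodized film with er ~ 25.
c = capacitance(25.0, 1e-3 * 1e-3, 0.1e-6)
print(f"about {c * 1e9:.2f} nF per square millimetre")   # roughly 2.2 nF
```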
Pyrolysis as used here is the thermal decomposition of a volatile chemical compound into
nonvolatile and volatile byproducts. It is generally carried out in an inert carrier gas and
this fact distinguishes it from vapor plating,
which generally uses an active carrier gas such
as hydrogen or steam. Silica, silica-based glasses,
and silicone polymers have been deposited successfully.

Evaporation is the deposition in high vacuum of insulation thermally liberated from a
parent source. Silica films of low optical absorption have been produced by electron bombardment of the parent oxide. The technique has
important possibilities in the area of direct or
mechanically masked insulation deposition.
Plasma Deposition involves the spraying of
highly excited atomic particles of the insulator.
Heat, commonly obtained by electric arc or hydrogen flame, is the source of excitation and almost
any elementary material can be applied to almost
any surface by the method. The disadvantages are
the grossness of the spray process and the high
temperatures involved.
Mechanical Coating covers the spraying,
painting, or other physical application of organic
and inorganic insulators and finds certain applications in the grosser insulation of Integrated
Circuitry.
By means of extrinsic techniques broad-area
dielectric film insulation has been developed
with field strengths in excess of 10 volts/cm.
These insulating films can be reproducibly obtained to thicknesses in excess of 10 microns.
Thermal Oxidation is here used to denote the
formation of a self-oxide upon the exposed surfaces of a semiconductor. The intrinsic insulation of silicon has been developed to a high
degree by this method. Oxidation of parent material can be used to insulate broad areas and
also PN junctions.
Intrinsic Layering refers to the method of separating two regions of conductive semiconductor by a region of near intrinsic semiconductor material which differs enough in resistivity from the adjacent regions to serve as an insulator. The epitaxial growth of silicon is a promising method of providing layers of intrinsic material.
Back-biased Junctions may be formed within
semiconductor substrates to provide a type of
insulation. With reasonable values of reverse
bias the capacitance across a graded junction can be reduced to the point of insignificance for low-frequency applications. The division of semiconducting substrates into two or more regions
isolated by reverse-biased junctions opens the
way to an eventual increase in the number or
complexity of the functions that might be built
into a single substrate. A disadvantage of the
technique is the collection of any minority
carriers in the area.
Conduction
Conduction is used here to indicate electron
current on or in substantially ohmic material of
negligible resistance. One of the goals of Integrated Circuitry is to achieve integration of
circuit functions on and within substrates so that
ohmic connections from one location on the substrate to another are drastically reduced or
eliminated. In cases where this is impossible or
impractical, extrinsic conducting paths are formed by evaporation of metals, by painting or plating of conducting stripes, by sputtering, by
pyrolysis or by mechanical coating. Intrinsic
conduction can be accomplished by development of
degenerate regions within the semiconducting or
substitute substrate. We can do this by creating
alloyed, epitaxial, or melt grown regions during
or following crystal growth.
Evaporation of conductors covers the vaporization of metals at high temperatures in vacuums
of 10- mm Hg or better. The metallic vapor moves
in substantially straight lines onto a substrate
which may be mechanically masked to limit the
deposition to desired regions. The substrate must
be very clean and must generally be raised to an
elevated temperature in order to assure intimate
contact of the particles with the substrate after
impact before solidifying. Alternative means of
producing conductive patterns involve coating an
entire surface and subsequently removing unwanted
conductive material by one of a number of techniques. Evaporation may be of a single metal or
of an alloy. In the latter case the deposition
may be made simultaneously from a common source
or sequentially from separate sources, after which
the substrate may be heated to produce an alloy on
the surface of the substrate. If the alloy penetrates the substrate, as in the case of a semiconducting substrate, the result is classed as
intrinsic.
In the study of conductors by deposited techniques, resistivities within one order of magnitude of those of the bulk metals have been
achieved. Where mechanical or thermal considerations make adhesion more important than achieving
the lowest ohmic resistance the introduction of a
chrome-gold alloy has been useful.
Sputtering differs from the evaporation
process discussed above in the conditions under
which the conductive material reaches the substrate. Sputtering is the result of a glow discharge between an inert anode and a bombarded
cathode of the desired conducting material.
Because the presence of gas at 10⁻² to 10⁻⁴ mm Hg
is necessary for the generation of the ionized
bombarding molecules, the sputtering process is
inherently harder to keep clean. Even so the
process has certain advantages over vacuum
evaporation in the relatively low temperatures
needed or generated in the system, and thus the
lower chance of contamination from the evaporative source. It finds one of its most important
applications in the production of thin films of
tantalum for resistors or capacitors.
Pyrolysis as used for deposition of conducting films is the same process discussed previously
for insulators. It has the advantage of flexibility of materials and conditions of deposition
but the disadvantage that the deposited material
cannot be masked by mechanical shields as satisfactorily as the vacuum evaporated materials.
Pyrolysis finds application where an entire
surface can be coated as for electrostatic or
magnetic shielding, or where the unwanted material

can be removed selectively after deposition.
Pyrolysis may also be carried out on selective
regions under certain circumstances by employing
a catalyst.
Plating of conductors includes electroplating, chemical or electroless plating, and
vapor plating. Electroless plating tends to cover
everything as does vapor plating. Selective
removal of unwanted material can be combined with
electroplating to build up conducting paths. Both
electroplating and electroless plating suffer from
danger of contamination from the wet chemistry
involved. Vapor plating is capable of achieving
high-purity deposition.
Mechanical Coating includes conducting glass
pastes which can be painted onto gross terminal
pads to bridge irregularities that vapor deposition cannot manage. It also includes various
solders that may find temporary use in making
conductive paths to external terminals. The
conductive glasses have the advantages of bonding
well to many substrate materials and of reasonable
match in thermal expansion coefficient.
Degenerate Regions are regions within a semiconductor substrate in which the conductivity is very large and in which carrier transport is essentially ohmic. These regions are generally produced by alloying near saturation concentrations of a doping impurity into a specific pattern. Because of this method of formation the paths are generally at the surface of the substrate. By control of the intrinsic techniques mentioned, resistivities of 10⁻³ to 10⁻⁴ ohm-cm can be obtained and in general may be considered to be feasible for internal conduction.
Connections from one Integrated Circuit to the outside world or to another substrate are treated in the Section on Mechanics, as is conduction of heat away from dissipative elements within the Integrated Circuit.
Resistance
Resistance may be provided in Integrated
Circuits in at least four ways. The three
intrinsic ways are by bulk resistivity, by transverse conduction in thin diffused back-biased
regions within a semiconductor substrate, or by
an epitaxial layer of opposite impurity backbiased with respect to the parent substrate. The
extrinsic way is by deposited thin-film resistors on top of the substrate. Each method has
particular advantages for certain applications
and a complete Integrated Circuit capability
requires mastery and evaluation of each. Both
methods may be sensitive to their ambients.
Anodized Tantalum, Titanium, or Aluminum
films provide an attractive method for obtaining
highly precise resistance values. The initial
deposit of these metal films is relatively thick
and is easily formed by the vapor plating and
vacuum deposition techniques previously cited.
After deposition, the metal film can be trimmed
to a precise value by anodizing the outer surface
of the metal film to its oxide which is an insulating semiconductor. Since the oxide thickness is a direct function of anodizing voltage, a high degree of control of the remaining metal film thickness can be obtained. Resistance films in the 10 to 500 ohms per square range can be obtained with ±1/4 per cent reproducibility. Accurate layout of these resistance films is achieved by the application of photoresist techniques to define areas of anodization. The unanodized film can be utilized to form conduction elements.
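The arithmetic connecting these sheet-resistance figures to element values is simply that a film resistor's resistance is its sheet resistance multiplied by the number of squares (length divided by width) in the resistive path. A minimal sketch in Python, with purely illustrative geometry; only the 10 to 500 ohms per square range and the ±1/4 per cent trimming figure come from the text above:

    # Resistance of a thin-film resistor from its sheet resistance.
    # Geometry is hypothetical; only the sheet-resistance range and the
    # +/- 1/4 per cent trimming tolerance are taken from the text.
    def film_resistance(sheet_ohms_per_square, length, width):
        squares = length / width              # number of squares in the path
        return sheet_ohms_per_square * squares

    r_nominal = film_resistance(100.0, length=200.0, width=5.0)   # 4000 ohms
    r_tolerance = r_nominal * 0.0025                              # about +/- 10 ohms
    print(r_nominal, r_tolerance)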
Tin Oxide films have reached sheet resistance
values of over 5000 ohms per square. With conventional resistance patterns, values of up to 1.0
megohm can be achieved. The films are produced by
hydrolysis in a technique that has been brought
to a high degree of development.
Indium Oxide films have been produced by a
two-step metallizing-oxidizing process involving
a vacuum deposition of pure indium in a low pressure pure O₂ atmosphere followed by a low temperature (below 200°C) thermal oxidation for
several hours. This low temperature technique
has application in instances where higher temperature resistance fabrication would damage
other temperature sensitive thin-film functions.
Nichrome films can be evaporated directly on clean substrates by volatilizing the alloy from a tungsten heater. These films show excellent adhesion when substrate surfaces are heated to 300°C. Nichrome resistance films can be reproducibly deposited and show relatively high stability on standing. They possess an average temperature coefficient of resistance of 6 × 10⁻⁵ ohms per ohm per degree centigrade over the temperature range -50°C to 150°C. However, these
films have low resistivities which limit their
application to 500 ohms per square. In addition,
the surfaces of unencapsulated Nichrome films
are prone to a certain amount of corrosion and
oxidation which limit compatibility with other
thin-film circuit element fabrication. Specific
geometries of Nichrome resistance elements are
usually achieved by the use of mechanical masks
during evaporation. These films are quite amenable to forming ohmic contacts with other metal
films.
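For perspective, the quoted coefficient implies only a small resistance excursion over the full operating range. A minimal sketch of the arithmetic in Python, assuming the coefficient may be treated as constant from -50°C to 150°C:

    # Fractional resistance change of a Nichrome film, using the
    # 6e-5 ohm/ohm/deg C coefficient quoted above (assumed constant).
    tcr = 6e-5                        # ohms per ohm per degree centigrade
    t_low, t_high = -50.0, 150.0      # temperature range, deg C
    fractional_change = tcr * (t_high - t_low)
    print(fractional_change)          # 0.012, i.e. about 1.2 per cent across the range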
Bulk Resistivity makes use of the gross
geometry and parent material of the semiconductor
substrate. It is easily controlled but is suitable only for relatively low values of resistance
based upon maximum usable resistivity of ~500
ohm-cm. Since bulk resistivity of 500 ohm-cm is
not compatible with present semiconductor base or
collector technology, a more realistic upper
limit is 100 ohm-cm with the most compatible
value probably lying between 10 and 30 ohm-cm.
Bulk resistivity has the additional handicap of
a complicated temperature coefficient. This
temperature dependence can occasionally be turned
to advantage in circuit applications for temperature compensation of specific types of circuit.
Diffused Back-biased Regions cover the class
of resistors formed by transverse conduction within a thin diffused back-biased layer of semi-

conductor. They cover a much wider range of values and have additional advantages in their
possible range of temperature coefficients. By
control of the diffusion profile it is possible
to achieve positive, negative, or substantially
zero temperature coefficient at room temperature.
The difficulty in using diffused resistors in
practical circuits lies in their great sensitivity to the back-biasing voltage, including the self-generated component of back bias caused by the
voltage drop in the resistor. Where very high
resistance of non-critical value is required,
diffused resistors may have great value. Their
other attractive application is in circuits
needing an electrically alterable resistance.
Epitaxially Grown Resistive Layers can form
resistive regions with useful characteristics.
The method is similar to the diffused back-biased
resistance described above but the epitaxial
process promises better control of the junction
characteristics. The epitaxial resistor requires
a compatible masking technique during layer
growth if mesa techniques and wet chemistry are
to be avoided.
Capacitance
A number of methods have been developed for
providing Integrated Circuit capacitance. Capacitance may be provided intrinsically by reverse-biased semiconductor junctions, by self-biased junctions, or extrinsically by deposited thin-film
capacitors using gold or some other conducting
film as a counterelectrode.
Anodized Tantalum, Titanium, Aluminum, or
Niobium can be used to form the lower conducting
layer and dielectric of deposited thin-film
capacitors. After the desired thickness of
dielectric has been formed, a counterelectrode
of some conducting material is deposited to
complete the capacitor. The films are generally deposited by evaporation or sputtering. Subsequent anodization can be controlled to a high
degree and pinholes cleared before deposition of
gold as the counterelectrode. Ratings of 5.0
volt microfarads per square centimeter at 50% of
breakdown voltage are now state of the art.
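The volt-microfarad rating can be read as a charge-storage density: dividing it by the chosen working voltage gives the capacitance obtainable per unit area. A minimal sketch in Python; the 10-volt working point below is an assumed example, not a figure from the text:

    # Capacitance per unit area implied by a 5.0 volt-microfarad/cm^2 rating.
    rating_v_uf_per_cm2 = 5.0        # from the text: state of the art at 50% of breakdown
    working_voltage = 10.0           # assumed working voltage, volts
    capacitance_uf_per_cm2 = rating_v_uf_per_cm2 / working_voltage
    print(capacitance_uf_per_cm2)    # 0.5 microfarad per square centimeter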
Titanium, Aluminum, and Niobium can also
be anodized with useful characteristics. Aluminum oxide has a dielectric constant less than
25% that of tantalum but has almost twice the
working voltage for the same forming voltage.
Unless the counterelectrode is of the same material as the anodized material, this type of
capacitor is polar.
Deposited Metal Oxide Glasses can be sandwiched between deposited conductors to form an
alternative thin-film capacitor. Here the dielectric constant is much lower than either of
the anodized dielectrics mentioned above but the
use of low melting silicate type glasses avoids
some of the problems of the anodized capacitor
process. Deposited metal oxide glass dielectric
capacitors can be fabricated at lower temperatures
than are required in the deposition of tantalum.

Deposited Ferroelectrics offer attractive
possibilities as dielectrics for thin-film capacitors. Chief among the attractions is a dielectric constant for barium titanate three orders
of magnitude greater than that for the silicate
type glasses. Such a dielectric might also have
important contributions stemming from its nonlinearity and polarizable nature. The chief disadvantages appear to be a limited operating range
of temperatures and instability or deterioration
of electrical properties. Work with deposited
ferroelectrics is in the early exploratory stage.
Diffused Back-biased Junction capacitors
depend upon the depletion layer as dielectric and,
for step junctions at low voltages, can provide
on the order of 1 volt-microfarad/sq.cm. However,
since the width of the depletion layer varies with
the magnitude of the reverse bias, such capacitors
are electrically alterable with the magnitude of
their effective capacitance depending on their
bias. This voltage dependence can be an advantage
or disadvantage depending on the use to which the
capacitor must be put.
Reverse-biased junctions act as capacitors
whose value depends not only on the junction area
but also on the width of the depletion region at
the junction. This width depends upon the impurity concentration grauient on each side of the
junction and upon the reverse voltage applied
across the junction. Since the normal diffusion
process produces a graded junction, capacitors
formed by diffused junctions tend to have lower
values of capacitance at the same bias than
either the alloyed or the epitaxially grown
junctions discussed below.
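The voltage dependence mentioned here follows the usual depletion-layer relations: the capacitance of an abrupt (step) junction varies as the inverse square root of the total junction voltage, while that of a graded junction varies as the inverse cube root. A minimal sketch of the law in Python; the zero-bias capacitance and built-in potential used below are assumed values for illustration only:

    # Depletion-layer capacitance versus reverse bias:
    # C(V) = C0 / (1 + V / V_bi) ** n, with n = 1/2 for a step junction
    # and n = 1/3 for a graded junction.  C0 and V_bi are assumed values.
    def junction_capacitance(c0_uf_per_cm2, v_reverse, v_built_in=0.7, n=0.5):
        return c0_uf_per_cm2 / (1.0 + v_reverse / v_built_in) ** n

    for v in (0.0, 1.0, 5.0, 10.0):
        print(v, round(junction_capacitance(1.0, v), 3))   # step-junction case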
If current flows parallel to the junction
outside the depletion region, the bias on the
junction will not be everywhere the same. In
particular, if current flows through the bulk
resistance of the substrate beneath a shallow
diffused but otherwise unbiased junction, the
floating junction will conduct to equalize potential at one end and will thus back-bias the
rest of the junction. The result can be useful
in circuits where a small capacitance is needed
to bridge a load or coupling resistor. Note that
this configuration is really a passive network of
distributed resistance and capacitance.
Alloyed Back-biased Junction capacitors tend
to have a much steeper impurity gradient across
the junction on one side while maintaining practically the impurity concentration of the parent
substrate on the other. The approximation to a
step junction is much better than in the diffused
case and much higher values of capacitance at low
reverse bias are possible. Because of the relatively low impurity concentration of the substrate, the total depletion layer is very nearly
as wide at large values of reverse bias as in the
diffused-junction case.
Epitaxial Back-biased Junction capacitors
hold promise of the ultimate attainable in
reverse-biased junction capacitors. In addition
to being more amenable to control than the alloy
process, they are able more closely to approximate

the step junction from high impurity concentration
of one type to high impurity concentration of the
other. This means that they should be capable of
having the maximum capacitance, at low values of
reverse bias, of any junction capacitor. It also
means that their voltage dependence should be more
predictable.
Thermal Oxidation on the surface of a semiconducting substrate can provide the dielectric
for a hybrid type of capacitor in which one plate
is the semiconductor substrate and the other plate
is a thin metallic film. Such hybrids can have
occasional applications as special networks.
Where precise and voltage-stable capacitors
of less than a microfarad are needed, deposited
thin films of tantalum, titanium, aluminum, or niobium can be anodized with good control. High-voltage or nonpolar capacitors with small values
of capacitance can be fabricated reliably using
silicate glass type dielectrics sandwiched between
metal film electrodes. Voltage variable capacitors
or non-critical coupling capacitors can be formed
in semiconducting substrates by back-biased
junctions.
Inductance
From an Integrated Circuit point of view,
inductance is without doubt the most difficult to
obtain of all the basic circuit parameters. The
major difficulty is the requirement of a volume
for the storage of magnetic flux which does not
lend itself readily to Integrated Circuitry.
Furthermore, the coupling of energy to the volume
poses the requirement of a coil that is not easily
achieved by deposited film techniques. Both a square-loop and a linear response are required for a general systems capability.
Deposited Nickel-Iron Films are among the
extrinsic means available for storing energy
resulting from the flow of current. Deposited
nickel-iron films of 82%-18% composition and
1000 Å to 4000 Å thickness have been used for storing energy for logical matrices and as small-valued low-frequency inductors. A requirement is the presence of a magnetic field during the deposition process to orient the magnetic anisotropy. Unfortunately this limits the magnetic film configuration to simple forms. If the driving magnetic field is applied in the direction of the "easy" magnetization of the domains, a square-loop B-H curve results which is useful for memory
and for magnetic logic applications. When the
magnetic field intensity is applied normally to
the direction of "easy" magnetization, a more
linear B-H curve results which is useful for
linear systems and impedance transformation.
Deposited Ferrites have possibilities as a second extrinsic technique for obtaining inductance. Glass, which is a mixture of metal oxides, is currently being deposited by the pyrolytic decomposition of suitable metallic-organic esters. The reaction temperature required to form the ferrite material from mixed oxides is in the order of 300°C. Since ferrites do not have the
same magnetic anisotropy as thin nickel-iron
films, no magnetic field is needed upon deposition
and the form factor is not limited as it is with
metallic films. Magnesium-manganese ferrite material provides a square-loop B-H curve and is, therefore, suitable for magnetic logic elements. Manganese-zinc ferrite provides a high µQ linear material usable to about 500 kc. Nickel-zinc ferrite provides a high µQ linear material suitable for the frequency range 0.5 to 100 mc. By proper masking methods it is possible to form thin-film solenoids which surround such deposited ferrite
materials and provide a means for coupling energy
into and out of the material. Finally ferrite
materials possess variable permeability which is
a function of the applied dc magnetic field, and
this can provide a control element for ac magnetic
flux. Such variable inductors can serve as electronic tuning elements or other control elements.
Air Core Geometries are suitable for r.f. coils and other high-frequency small-valued
inductors. It is possible to deposit "air core"
pancake-type windings of thin-film conductors.
When associated with thin-film insulators of low
dielectric constant the pancake-type winding can
be formed in multilayers to increase the total
inductance of the element.
Inversion of Capacitance by Active Element networks is among the intrinsic techniques for obtaining inductance. Field effect semiconductor devices can provide an impedance inversion function. By this means a low-Q capacitor can be made to appear as a high-Q inductor for circuit applications where resonance is not required.
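The idea can be made concrete with the ideal impedance inverter (gyrator): a capacitor C terminating one port of a gyrator of gyration resistance R appears at the other port as an inductance L = R^2 C. The sketch below states only that idealized relation in Python; a practical field-effect realization would fall short of the ideal, and the component values are illustrative assumptions:

    # Equivalent inductance of an ideal impedance inverter (gyrator)
    # terminated in a capacitor: L = R_gyration**2 * C.
    def gyrator_inductance(r_gyration_ohms, c_farads):
        return r_gyration_ohms ** 2 * c_farads

    # A 1000-ohm gyration resistance and a 1000-pF capacitor would
    # appear as a 1-millihenry inductance.
    print(gyrator_inductance(1.0e3, 1000e-12))    # 1e-3 henry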
Ferrite Substrates provide the possibility of
using a single or multi-aperture ferrite material
both as an inductive core and as a substrate for
other Integrated Circuit elements. The coupling
to the ferrite can be achieved by thin-film
conductors. These elements, because of the dependence of their permeability on the applied
field, can also serve as a means for controlling
magnetic flux.
The Inductance Diode is another intrinsic
approach to the problem of providing inductance
for Integrated Circuits. Although this device is
still in the early experimental stages a number
of workers in the field have succeeded in measuring inductive behavior of germanium diodes. The
maximum effect is found with alloyed or near-step
junctions in which the injection efficiency is
close to unity. Chief difficulty in measuring or
applying the observed inductive behavior is the
instability of the inductive effect, both with
current through the device and with temperature.
The problems surrounding the incorporation
of inductance in Integrated Circuitry are severe
and first efforts at Integrated Circuit design
will probably try to accomplish the desired equipment functions without the use of inductance.
Special Networks
A few basic circuit parameters are of a hybrid
nature and do not fall logically into any of the

pure parameters discussed above.
Deposited Distributed R-C networks are
among the extrinsic examples of functions that
cannot easily be duplicated by lumped constant
circuits. A simple example of this is an anodized tantalum resistor upon which has been deposited a counterelectrode of gold or some other
metal. Each part of the resistor has a capacitive
relationship through the anodized dielectric to a
unipotential conducting plane. The result is a
series resistance with distributed capacitance
acting all along its length.
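The distinction from a lumped network can be put quantitatively: for a uniform distributed R-C line of total resistance R and total capacitance C, open-circuited at the far end, the driving-point impedance is Z(s) = sqrt(R/sC) coth(sqrt(sRC)), which no finite R-C ladder reproduces exactly. A minimal numerical sketch in Python, with assumed element values:

    import cmath

    # Driving-point impedance of an open-ended uniform distributed R-C line.
    # R and C are the total resistance and capacitance of the structure.
    def urc_input_impedance(r_total, c_total, freq_hz):
        s = 1j * 2.0 * cmath.pi * freq_hz
        gamma = cmath.sqrt(s * r_total * c_total)       # propagation factor
        z0 = cmath.sqrt(r_total / (s * c_total))        # characteristic impedance
        return z0 / cmath.tanh(gamma)                   # coth = 1 / tanh

    # Assumed values: 10 kilohms total resistance, 1000 pF total capacitance.
    print(abs(urc_input_impedance(10e3, 1000e-12, 100e3)))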
Deposited L-C networks are in the same class
of difficulty as ordinary thin-film lumped inductance. About the only simple example is the pancake air core coil deposited as the counterelectrode of a deposited and anodized tantalum
capacitor.
Transformer-like thin-film configurations
have been proposed but both distributed L-C and
thin-film passive impedance transformation for
Integrated Circuitry are in the early experimental
state.
Diffused Back-biased R-C networks are similar
to the thin-film extrinsic kind except that they
are voltage sensitive as to their capacitance and
sometimes even their resistance. If the currentcarrying diffused layer is thin enough, or if the
back-biased junction is between the parent substrate and a thin current carrying epitaxial
layer, variation of the reverse-biasing voltage will affect both the series resistance of the thin layer and the shunt capacitance to the substrate. Such a configuration can form a tuning
unit in a phase-shift oscillator or feedback
amplifier.
Bulk Resonance effects such as the piezoelectric resonance of specially cut quartz crystals are classed as intrinsic special networks even though they normally occur in other than semiconducting substrates. In the case cited,
the mechanical motion of the crystal would probably be disastrously damped by using it as the
substrate for other Integrated Circuits, but it is
conceivable that thin-film circuits could be laid
out entirely along nodal lines.
Active Impedance Transform Networks cover a wide and largely undeveloped field. Their scope extends from the impedance transformation characteristics of ordinary non-integrated devices such as transistors and field effect devices, to vague and esoteric proposals for active delay
networks and hybrids of active and passive
phenomena.
By definition, the functions included in this
category do not in general exist in a one-to-one
correspondence outside the solid state. For this
reason they are perhaps closer than some of the
other basic circuit functions to being Integrated
Circuits themselves. The members of this category
are expected to increase as new integrated phenomena are developed.

Active Elements and Substrates
The active semiconductor elements useful in
Integrated Circuitry include the logic diode, the
breakdown or zener diodes, various multilayer diodes, the mesa transistor structure, the flat or
oxide-masked mesa, the unipolar transistors,
various other field effect devices, and compound
transistor-like elements. The list of non-semiconductor active elements includes ferromagnetic
and ferroelectric elements and many others.
Vapor Deposition of Semiconductor Active
Elements on a passive substrate is undoubtedly the
most important extrinsic technique in this category.
To date, no one has reported on the deposition
of large-area nondegenerate single-crystal thin
films of semiconducting material on an insulating
substrate. When this technique is fully developed
it will be possible not only to build complex
active functions but to build them on either
active or passive substrates as the situation
dictates. One of the most important consequences
following on the eventual achievement of the thinfilm deposition of active semiconductor elements
will be the removal of the topological-electrical
limit to the number of active elements that can be
built into one semiconductor substrate. Another
will be a decrease in the minimum capacitive
coupling attainable in an Integrated Circuit.
Vapor Deposition of Nonsemiconductor Active
Elements on a passive substrate can make useful
many previously overlooked solid state effects.
Both ferroelectric and ferromagnetic materials
are unique in that certain of their basic properties, such as dielectric constant or permeability, can be readily changed by the application
to the material of an electric or magnetic field
of the proper magnitude. This property has been
utilized in microcircuits for the purpose of
circuit tuning. When the problems of the deposition of coherent films of these materials are
overcome, an entirely new generation of active
circuit elements will be possible, utilizing the
nonlinear properties of ferroelectrics and ferromagnetics to actively tune circuits, act as bandpass filters, and so forth.
Applique of Active Elements has been the universally used extrinsic expedient since thin-film semiconductor active elements do not yet exist on insulating substrates. The active elements are attached to passive substrates to make a hybrid structure. They may either be conventional or special microminiature elements in cans
with leads brought through the passive substrate
and soldered, or they may be unencapsulated or
self-encapsulated devices mechanically and electrically affixed on the passive two-dimensional
substrate.
Conventional Active Element Design can in
general be transferred directly to Integrated
Circuit application. Intrinsic techniques for
creating active elements within semiconductor substrates include doped or rate-grown junctions in

crystals as pulled from the melt, alloyed junctions, diffused junctions, vapor-grown junctions
as epitaxially oriented overgrowth and hybrid
junctions where one side of the junction is the
result of one process and the other of another.
Epitaxial Techniques of growing silicon
layers on silicon parent stock are among the
most promising tools in the area of threedimensional Integrated Circuits. Vapor phase
epitaxially oriented overgrowth was first developed in Europe and has since been intensively
studied and applied by American materials and
components manufacturers.
The epitaxial process permits a number of
desirable improvements over conventional active
element technolOgy. The previous requirement
with active semiconductor substrates that the
active elements be fabricated within the substrate becomes within or on the substrate. The
substrate can now be a lower resistivity than
could be tolerated when it had also to form the
collector junction of our active elements. Largearea step junctions are now much more nearly a
reality and the only diffusion effects are those
occurring at the junction while the epitaxial
layer or layers are being grown.
There are many facets of the process that
need exploration and development but the prospective rewards are great - both from the standpoint of new and hitherto unfeasible active and
passive elements and from the standpoint of
logical extension to automated, low cost, and
highly flexible Integrated Circuit assembly
processes.
Form Controlled Growth. In the extension of
their studies of semiconductor crystal growth a
number of companies are developing methods of
ribbon crystal growth. The basis of all this
work is the ability to control the shape and
structure of a growing semiconductor crystal as
it is drawn from the melt. In some cases the
ribbon is a dendrite pulled from a supercooled
melt, in other cases different means of obtaining
ribbon shape are used.
Ferrite Substrates. The utilization of
ferrite substrates offers a unique method of
introducing an inductive element into the circuit.
The requirements of such a substrate are those
which must be met by any substrate, such as
surface smoothness, compatibility of thermal expansion and thermal conductivity with other
active elements and physical strength. The use
of a magnetic substrate imposes additional requirements. Magnetostrictive effects might present the problem of maintaining a bond between
the substrate and deposited films, as well as
changing the electrical properties of the films
through their electrostrictive properties.
In order to take fuli advantage of a magnetic
substrate, the magnetic properties of the material
must be matched to the particular requirements.
For inductive purposes a low loss, high permeability material would be required, suitable for
use at the frequency of interest. The temperature coefficient of permeability might be
important where large fluctuations of the ambient
temperature are anticipated.
The geometry of the substrate would depend
upon the method used to couple the magnetic field
into the circuit. One method consists of laying
down the coil as a thin film on the surface.
Another consists of providing holes in the substrate
through which, or around which, wires can be
wound.
Encapsulation
The encapsulation problem is a critical one.
Achievement of desired performance, control, and
reliability of semiconductor active elements, and
to a lesser extent passive components, is dependent upon reliable and well-founded solutions
to the protection of surfaces from the migration
of contaminants in the ambient. These include
moisture, radiation, and many other forms of contamination.
Hermetic Seal has been the most widely used and until recently the only satisfactory means of protecting semiconductor devices from contamination. In the initial work with Integrated Circuitry the hermetic seal will undoubtedly
continue to be used. A number of companies have
made great strides, however, in developing other
means of encapsulation and the hermetic seal
should be considered only as a temporary expedient.
Low Melting Inorganic Glasses have shown remarkable success in improving device performance and reliability. The glass may act as an
ion getter in cleaning up surface contaminants
and it protects against 100% humidity for over
10,000 hours. Protective films can be deposited
from the vapor phase over large surface areas.
Their application is limited, however, by the
facts that they may be soft at room temperature,
that sulphur in the glass reacts detrimentally
with certain thin-film materials, and that thermal expansion in mating with lead materials may
cause difficulties.
Inorganic Film Pyrolysis offers a number of
advantages: it is free from pinholes, does not
affect the device, has high dielectric strength,
has high chemical and physical durability, has
low volume permeability to moisture, and requires only low device temperature during encapsulation. Although the work with pyrolytic glass is not
widespread, there are indications that this
approach will have wide applicability to the
physical and chemical protection of Integrated
Circuit Functions of the future.
Anodization has already been discussed under
Insulation, Resistance, and Capacitance. It uses
the parent metal in the oxide formation.
Accelerated Thermal Oxidation is a moderate
temperature reaction carried out at atmospheric
pressure which can form a completely protective
glass type layer by conversion of in situ material.
A characteristic of this process is that it applies
only to material capable of being converted to
glass, and hence is most applicable to silicon

surfaces. In the self-encapsulation of a silicon
device, therefore, the process leaves substantially untouched the leads or contacts. This feature
has advantages for straight device fabrication
but is of restricted applicability where thin
films and semiconductors are combined in Integrated Circuits.
Surface Stabilization is an attractive
possibility involving the deliberate addition of
something to the surface of an active element
which ties up all the possible loose ends and
leaves the surface indifferent to subsequent
contamination.
Mechanics
The mechanics of Integrated Circuitry are as
much a technique as the realization of any circuit
element. They include form factor, masking,
thermal conduction, access, and interconnection
of the Integrated Circuits with each other and
with the outside world.
The Form Factor includes the size and shape
of the substrate and must take into account such
things as heat removal, replaceability, accessibility for test, position of leads, and so
forth. The initial form factor must allow for
hermetic seal of each Integrated Circuit. A
second step will probably be individual surface
passivation or self-encapsulation with a number
of related circuits in one can for protection.
Ultimately, surface passivation will probably
also constitute physical protection and the
individual circuits will be entirely self-encapsulated.
Masking is another area falling within the
field of Mechanics. Masking is chemical insulation during the fabrication process. Chemical
insulation as used here refers to means of
restricting the action of chemical etches and to
masking portions of the substrate during evaporation, alloying, and/or diffusion. Metallic
masks are a standard part of the art for evaporation as are the photoresist techniques and wax
coating for etch protection. Recently added as
standard art are self-oxidation and vapor deposition of glass which give promise of simplifying
mask technology by an order of magnitude and
reducing complicated micro layout to a photographic reduction of standard art work.
The Interconnection of two or more Integrated Circuits is one of the most important parts
of the program. It may well be the most important
consideration facing all of Integrated Circuitry
at this time. Interconnection covers not only
the communication of the Integrated Circuitry
with associated equipment but also the communication and supply of power between substrates. As
such it includes some of the most important and
perplexing problems in the field. If size, weight,
and reliability must remain tied to our present
method of plugs and interconnecting wiring, much
of the impetus of Integrated Circuits is lost.
Present efforts make use of ohmic connection, in some cases in the form of strips of
conductor sandwiched between isolating ground
planes. Such low impedance strip lines have a
number of advantages from the circuit standpoint
but the difficulties of removal and replacement
of a particular circuit are formidable. Ultimately, other means of coupling (in addition to ohmic
connection) will be used. Magnetic, capacitive,
photon, and ultrasonic coupling are possibilities
as means of conveying information and power.
Designability
The development of an Integrated Circuitry
depends not only on being able to fabricate and
interconnect the basic circuit parameters discussed above but also on knowing what techniques
are compatible with which, what range of parameter values are realizable, what reliability individual circuit functions may be expected to have
and what control is possible in the constituent
processes. It depends on the design of processes
that fit into automated manufacturing methods.
It depends on building into the manufacturing
methods, from initial conception onwards, sufficient flexibility that the output of the product
line can be changed easily and quickly to yield
a different Integrated Circuit.
Finally, the widespread development of Integrated Circuitry depends on being able to accomplish all the necessary functions and operations
economically. In the last analysis, it is cost
that will determine the acceptance and use of
Integrated Circuitry. If a particular fabrication
process cannot be made economically competitive,
it is the wrong process.
Conclusion
Microsystem electronics has struggled through
a variety of stages on the way to maturity. Undoubtedly it has a long way yet to go. But the
need, significance, and accomplishments to date
are such that it can no longer be ignored by the
computer industry. For a while it will be used
mainly by those for whom nothing else will do,
primarily because of size and weight. After
reliability is proven there will probably be
another period when the main use will be in military applications. Developments during this
period will determine whether Integrated Circuitry
spreads throughout our electronics industry or
remains with a few specialized applications. The
determining factor will be cost. If the ultimate
cost can be brought down to that of conventional
computer circuitry, Integrated Circuitry will take
over large segments of the electronic computer
industry.

Such an acceptance of Integrated Circuitry
will force another integration. During the proving period computer manufacturers will probably be
willing to design their equipment around available
integrated circuits, but the time will come when
the computer design engineer will demand greater
freedom of choice in selecting the particular integrated circuits from which he builds his machine. When this happens, and it surely will
happen, our industry will go through some interesting gyrations; computer equipment companies will
be pushed into the semiconductor and solid state
materials and components business, and the semiconductor products manufacturer will be forced into
the systems engineering and equipment fabrication
business. The end point will be remarkably similar for the two, even though there will always be
differences in accent and degree. When dynamic
equilibrium is reached, not only the electronic
circuitry but the people who create and use the
new circuitry will be integrated to an extent
unknown today.
There will always be specialists, as there
always have been, but the new specialists will
have a broader base which will frequently extend
into more than a single scientific or technological
discipline. The new specialist will understand
and speak the languages of the associated disciplines and will contribute his point of view in
matters that previously were considered the exclusive concern of others. The chemist and the
metallurgist will take their rightful places beside the solid state physicist, the circuit designer, the systems engineer, and the statistician.
The successful integration of computer circuitry will depend upon demonstrating adequate
control of designable, compatible, and above all
reliable techniques.
Acknowledgment
In a survey paper such as this, the work and
techniques described are the result of effort by
many people in many companies. In particular I
would like to mention the help and cooperation of
my colleague, James R. Black of Motorola's Solid
State Electronics Department, in supplying information on thin-film integrated circuit techniques.
Finally, to all technologists whose contributions
are included in these pages, I wish to express
appreciation for sharing their discoveries and
experience. The rapid growth of microsystem
electronics depends upon just such sharing and
cooperation.

2.2
TESTING OF MICROLOGIC ELEMENTS

Robert H. Norman and Richard C. Anderson
Fairchild Semiconductor Corporation
Palo Alto, California
Summary
The acceptance tests of integrated logic-function elements require a departure from the
notion that the parameters of the individual components of a circuit must be known. A test program
based on external characteristics only is made
necessary by the complexity of the circuit; it is
contended, in addition, that such a test has more
valid significance than does measurement of constituent parameters. It is more significant, for
example, when testing a compatible set of digital
functional blocks, to make a measurement which
gives a direct estimate of fan-out than it is to
measure the gains, leakage currents, and resistor
values in the output stages.
This paper will discuss the validity of this
concept and describe a program of tests based on
the concept.

Introduction

In the production testing of integrated logic-function circuits, we face two familiar problems: first, the test must insure that the device operates properly over a wide range of environments, yet it is desirable that the test be performed at room temperature. Second, the test must not only reject those which are initially inadequate, but it must reject those which are predictable future failures. In addition to these usual requirements, in testing functional blocks a third problem soon becomes evident: the integrated logic circuit contains some active and passive constituents whose individual parameters cannot be measured, due to the circuit in which they are incorporated. Further, the device contains several internal circuit nodes which are not accessible for measurements once the device is packaged and complete. In short, in each circuit some device parameters would be unmeasurable if we could get to them . . . and we can't.

This, then, is the new problem posed by the functional-block circuit: the complexity of the device dictates that no meaningful reliability tests and evaluation can be performed unless they be tests of external characteristics. We can no longer be concerned, for test and evaluation purposes, with the parameters of every active and passive constituent of the circuit--with the gains of the transistors, for example--or even, for that matter, with the distributions of those parameters. We must be able to test a completed integrated circuit by examining only its external characteristics in such a way that we can accurately determine that its present operation is satisfactory and, as important, that we can expect continued operation over wide environmental conditions. If this cannot be done, there is little point in continuing with an integrated circuit program.

We will support the assertion that this can indeed be done by discussing a test program which is based upon the supposition that the assertion is correct. This will not, of course, constitute proof; it will give, we hope, an intuitive confidence that the probability of success is high. A theoretical proof of the adequacy of an "external characteristics only" test would be both difficult and unacceptable. The only ultimate verification of the adequacy of such a test procedure will be its success in operation after millions of device-hours.

Micrologic

The remarks above are general, readily apparent comments concerning the entire genus of integrated function-block devices; the tests and methods now to be discussed pertain to a particular species, the micrologic element.

Micrologic is a compatible and sufficient set of integrated semiconductor logic-function elements. Each micrologic element consists of from one to five DCTL NOR gates; the circuit components--resistors and planar transistors--are diffused into a single slab of silicon, and metal intraconnections are deposited on top of the slab. The device is then packaged in an eight-lead TO-5 or TO-18 package. The logic and schematic diagrams of a typical element, the flip-flop, are shown in Figure 1.
Effects of Parameter Distribution and Drift
Before the micrologic development program
was undertaken, some assurance was required that
the problem under discussion could be solved.
Since the circuit configuration chosen can tolerate a wide variation of parameters, and since
these parameters might not initially be measurable, assurance was required that those parameters
far from the expected value would not drift with
time, causing subsequent failures. Intuitively,
the inherent stability and reliability resulting
from the planar process gave this assurance.
To investigate these questions, flip-flops
were made up of conventional components, using
the circuit configuration proposed for the future
integrated "F" element; the transistors used were reject devices. Approximately half were VEBO and ICBO rejects and half were random rejects. The VEBO rejects gave a 20% yield of good flip-flops, the ICBO and random rejects gave an 80% yield, indicating that the DCTL circuit configuration is relatively insensitive to wide parameter variations.

One hundred fifty of these circuits were then placed on 150°C storage test, together with 25 similar circuits made up of good transistors. After 7000 hours at 150°C ten of these flip-flops have failed to operate under load. Seven of these failures were caused by leads and three by EB shorts. There were no other modes of failure. Environmental tests indicate that these failure modes will not be a problem with the deposited-metal connection method developed for micrologic elements. The results to date in this life test give us, therefore, a reasonable degree of assurance that transistor parameters which are outside specifications will not drift with time sufficiently to cause operational failure in the micrologic DCTL circuit.

This first-step program of building simulated micrologic elements thus indicated that, indeed, individual-constituent parameters are not the primary criteria for estimates of operability, and that circuits which are initially good, although they may have "poor" constituent parameters, will stay good.

Figure 1. Schematic and Logic Diagrams of the Micrologic "F" Element.

Micrologic Test Program

The test program for micrologic elements is illustrated by Figure 2, which shows the flow of completed elements through the three types of measurement phases:

1. The LOGIC SORT classifies elements as to type and rejects catastrophic failures. It is a go, no-go test with no data recording.

2. The PRODUCTION TEST is the phase that incorporates the test principle we have been discussing. It is a sequence of DC measurements of the overall input-output characteristics of the element with no attempt made to measure internal parameters. It is designed to determine the present and expected operability of the element. A go, no-go indication is given for each element, and all measured data are recorded.

3. The LAB TEST is a measurement of all of the internal device parameters, such as β, rb, and Rc, that are accessible. These parameters are measured and recorded over the specified temperature range.

Figure 2. Flow Diagram of the Micrologic Test Program.

It is relatively easy to set up a production
test that evaluates present element operability;
to devise a test that maximizes future operability
over a long time span, and under wide temperature
variations is a more difficult matter. Only with
time can the accuracy and significance of such a
test be verified. For these reasons the life-test
production-measurement loop has been set up, as
shown in gray in Figure 2. Sample quantities of
the elements which are accepted by the Production
Tester are fed into this loop; the elements of the
sample are run through the lab test and all
parameters are recorded. These elements are then
placed on operating and storage life tests. At
given intervals, they are again run through a
production test, samples are again lab-tested,
and the group is returned to the life test.
It is this loop that generates the feedback
information shown in the figure. If the production test indicates a drift with time, the sample
sent through the lab test is increased in size,
and the parameter causing the drift is studied.
If the lab tests, made over the temperature range,
indicate that the room-temperature production test
requires modification, then the production-test
parameters, or criteria, or even test methods are
altered.
The methods, parameters, and criteria of the
production test, then, are initially set up
according to our best first guess. The recirculating life-test, lab-test loop generates the
feedback information that tightens or relaxes the
production test.
As the rate of predictable failures in the
loop decreases, our confidence in the validity of
the production test measurements increases.
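Stated as a procedure, the flow of Figure 2 amounts to the rough sketch below (Python; the predicates logic_sort, production_test, and sample_for_life_test are stand-ins for the measurement phases described above, not actual test specifications):

    # Schematic flow of the micrologic test program of Figure 2.
    def process_element(element, logic_sort, production_test, sample_for_life_test):
        element_type = logic_sort(element)       # classify as F, G, S, ... or reject
        if element_type is None:
            return "reject"
        if not production_test(element):         # go, no-go on external characteristics
            return "reject"
        if sample_for_life_test(element):        # sampled units enter the lab-test /
            return "life-test loop"              # life-test feedback loop
        return "accept"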

Let us now consider in greater detail each of
the phases of the test program.
Logic Sort. The Logic Sorter, shown in Figure 3, uses a four-stage counter to generate binary inputs which are applied to six "standard" elements and to the switching network. As the switch steps through the test sequence, the device under test is tested first as an F, then as a G, then as an S element, the correct inputs being supplied by the switch, and the outputs being logically compared to the outputs of the corresponding "standard" element.

Figure 3. Block Diagram of the Logic Sorter.
The need for a logic sort test arises from
the fact that micrologic elements may come from
the assembly process with no classification as to
type. Since the only differences among the processes for the various elements are in the masks
used, no attempt at classification need be made
from the time the wafer is diced. When an element
of unknown type is logic-sorted, the tester steps
through its sequence, and, when a correct comparison with a "standard" is found, the element is
sorted as that type. If the sequence ends without
a comparison, the element is rejected. The sorter
rejects any device that does not give a logically correct output when loaded by a fan-out of two at room temperature. Thus it acts both as a convenient sorter and as a measure to prevent elements
with major flaws--such as short-circuits--from
reaching the Production Tester, where they might
cause overloading.
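In procedural terms the sorter behaves roughly as sketched below (Python; element_response, standard_outputs, and input_sequence are hypothetical stand-ins for the counter-driven hardware comparison against the six "standard" elements):

    # Classify an element of unknown type by comparing its outputs, over the
    # whole input sequence, with those of known-good standard elements.
    def logic_sort(element_response, standard_outputs, input_sequence):
        for element_type in ("F", "G", "S"):     # types tried in turn by the switch
            matches = all(
                element_response(element_type, inputs) ==
                standard_outputs[element_type][inputs]
                for inputs in input_sequence)
            if matches:
                return element_type              # sorted as the first matching type
        return None                              # sequence ends without a comparison: reject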
Production Test. As mentioned above, it is
the Production Tester that embodies the principle
of rejecting incipient failures from a series of
measurements of external characteristics. The
tester automatically steps through a different

sequence of tests for each element, setting up
worst-combinations of applied conditions, recording the resultant D.C. measurements, and providing
go, no-go indication and recording.
The establishment of a production test is
essentially the definition of the criteria of what
constitutes a satisfactory device. That is, the
setting of acceptance limits implies that these
limits are the valid and significant criteria by
which one can evaluate the usability as well as
the reliability of the device. For these reasons,
we need first to discuss the factors that govern
our selection of the general criteria of usefulness and reliability and then to discuss the DCTL
circuit characteristics and limitations that
determine the specific criteria; then we can
return to discuss the production test conditions
and acceptance limits.
General Criteria


The general criterion of acceptance of a
"good" micrologic element is based on the following definition: if any given system operates
correctly, utilizing a given element, then that
element is considered to be good. This definition
is, of course, a very general one, usable only
under certain conditions, namely that "correct
operation" of the system can be clearly defined
and that the set of all circuit environments of
an element in any given system is a limited one,
permitting description of all possible circuit
configurations or of the most stringent configurations.
The nature of the micrologic family is such
that these conditions are met. "Correct operation" of the arbitrary system can be definitely
specified because it is a binary system; "correct
operation" is, therefore, synonymous with error-free operation.
The set of all possible circuit environments is known because micrologic elements alone are used in any micrologic system, and their use in the system is constrained by the logic design rules. These design rules specify that no more than six transistor collectors may be connected to a given node (i.e., fan-in), and no more than five micrologic loads may be driven by a given node (i.e., fan-out). The most severe input-output conditions for any element can, therefore, be readily derived. In fact, by analyzing several micrologic digital systems, one can determine the frequency-of-occurrence distributions of
the many combinations of fan-in and fan-out. A
study is now underway to determine these distributions in a typical small general purpose computer and in several logic subassemblies.
In summary, a micrologic system consists
solely of micrologic elements, and the set of all
possible combinations of these elements is limited
by the design rules, permitting accurate description of the limits of the circuit environment that
will be encountered by any given element. These
factors permit the use, for micrologic, of this
general criterion of acceptance: the element is
good if any arbitrary system utilizing it operates
correctly.

Direct-Coupled Transistor Logic

The basic circuit, and the basic logic block, used in micrologic is the DCTL NOR gate, shown schematically and in logic symbol in Figure 4. The circuit is a simple one, consisting of a transistor for each input and a single collector resistor. The output of the gate is a "one" (a positive potential) if none of the inputs is a one.

Figure 4. The DCTL NOR Circuit and Logic Symbol (the output C is the complement of A+B+ ... +M).

DCTL was selected for use in micrologic because of its simplicity, low power consumption, high speed, and insensitivity to wide variation of certain parameters. One limitation of DCTL is the requirement for a transistor type which has a narrow distribution of base current for a given value of "on" base voltage. This requirement is necessitated by the "current hogging" characteristic discussed below. One phase of the micrologic program was the development of such a suitable DCTL transistor.

Current Hogging. This euphonious title applies to the situation shown in Figure 5. Here transistor Q1 fans out to N loads, one of which is a simple inverter, and (N-1) of which are multiple-input gates, all of whose transistors are in saturation. The simple inverter has the VBE/IB characteristic marked Q2 on the curves, but the other (N-1) loads have the input characteristics of Q3, because their collectors are clamped by the parallel transistors. For a given node voltage VBE, therefore, the inverter base will draw current IB(Q2), while each of the other (N-1) bases draws a far larger current, IB(Q3). There is then a good possibility that the heavy current drawn by the (N-1) loads will not permit the voltage VBE to rise high enough to fully turn on Q2.

If, however, the base input characteristics are closely uniform and if the base input resistance is increased moderately, then the disparity in input currents is greatly reduced, as shown by the dotted curves. When these input characteristics are incorporated in the DCTL transistor, the current-hogging problem is minimized, and the advantages of DCTL are made available.

Figure 5. The "Current-Hogging" Condition.
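The severity of the effect comes from the steepness of the base characteristic: if the bases are modeled by an exponential VBE-IB law (an assumption made here only for illustration), a spread of a few tens of millivolts lets the hungrier bases take nearly all of the available drive. A minimal sketch in Python with assumed numbers:

    import math

    # Division of the available drive among parallel bases whose characteristics
    # differ by a small V_BE offset (exponential model; illustrative numbers only).
    def base_current_shares(vbe_offsets_mv, kt_over_q_mv=26.0):
        weights = [math.exp(v / kt_over_q_mv) for v in vbe_offsets_mv]
        total = sum(weights)
        return [w / total for w in weights]

    # One inverter base plus four saturated-gate bases drawing current 30 mV "earlier":
    shares = base_current_shares([0.0, 30.0, 30.0, 30.0, 30.0])
    print([round(s, 3) for s in shares])   # the inverter base gets well under 10 per cent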
Specification of Threshold Levels. In the design of a logic circuit some specification must be made of the signal levels representing 1's and 0's. In DCTL circuits, a one is represented by a positive potential, the level of which is determined largely by the bases being driven. A zero is represented by a near-ground potential, the value determined by the saturation voltage of the output transistors and the number of such transistors in parallel.
The degree of saturation in the "on" condition and the permitted amount of collector current in the "off" condition are, within limitations, up to the designer, and there are many combinations of "one" and "zero" voltages that might appear to be usable. Just what are the limitations?
In DCTL, we are concerned, of course, with
a distributed system, consisting entirely of
closely similar gates. When we determine the
limiting values that the signal levels may take,
the solution must hold over every logic path in
any system.

Figure 5. The "Current-Hogging" Condition.

The problem to be solved can be considered in this manner: assuming an infinite cascade of DCTL logic stages, what is the lowest "turn-on" voltage and the highest "turn-off" voltage that may appear at any base, under the condition that all following stages in the cascade are alternately on and off? Considering Figure 6, what is the highest voltage that we may apply to the base of Q0 and still have the condition that Q0 and all even-numbered transistors are cut off, and that Q1 and all odd-numbered transistors are on? Similarly, what is the lowest base voltage applied to Q0 that insures that Q0 and all even-numbered devices are "on" sufficiently to hold the odd-numbered devices cut off?

Figure 6. Cascaded DCTL Inverters.

This problem can be graphically analyzed by considering the circuit of Figure 7 together with the curves of Figure 8. The curves indicate VCE as a function of VBE for a representative group of DCTL transistors with collector resistors; the upper and lower limits are the 3σ points of the distribution. The two-transistor feedback loop of Figure 7 simulates the "infinite" cascade; transistor A has a VCE characteristic corresponding to the upper 3σ curve, and B is represented by the lower curve.

Figure 7. Two-transistor Feedback Loop.

Assume initially that A is cut off, B is conducting, and that the feedback loop is open; a trial turn-on voltage, VBE(A), applied to A will result in VCE(A) at the collector of A and at the base of B. For VBE(B) = VCE(A), the output of B is seen to be VCE(B), which is lower than the initial assumed applied voltage. The applied voltage was not sufficient, then, to switch the stable states had the feedback loop been connected, and it would not have been sufficient to drive a long cascade.

In Figure 9 let us assume a higher turn-on voltage VBE(A) for the same circuit.

Figure 8. Curves Showing 3σ Limits of the Distribution of VCE as a Function of VBE for a Representative Group of DCTL Transistors with their Collector Resistors. The First Trial Solution for the Threshold Value of VBE is also Shown.

Figure 9. Curves of VCE vs. VBE with Second Trial Solution (Dashed Lines) and Threshold Operating Point (Dotted Lines).

Repeating the same steps (by proceeding through the numbered points in the figure),

we find that the resultant feedback voltage is
higher than the assumed turn-on voltage, so that-had the loop been connected--the voltage VBE(A)
was more than sufficient to insure a reversal of
stable states.

This solution is indicated by

dashed lines in Figure 9. By repetition of this
graphic procedure for intermediate values of

turn-on voltage, we can find the threshold value, VBE(on), which is just sufficient to turn on A enough that B is sufficiently cut off to cause its collector to rise to VBE(on). This threshold condition is shown by the dotted lines in the figure.

Since A has the poorest turn-on characteristic, we see that this is a worst-case condition within the 3σ distribution and that all other transistors are turned on harder than A for the same minimum-required VBE(on).

Selection of this threshold value of turn-on voltage also implicitly determines the maximum voltage which may appear at the base of an off transistor. By selecting VBE(on) we have defined the "on" condition as one in which VCE is VCE(on) or lower, as shown in the "on" VCE region in Figure 9. Since this voltage is equal to the base voltage of the following stage, we see that the base voltage of an off transistor must be VBE(off) or lower.

These two values, then, VBE(on) and VBE(off), define the threshold levels for the on and off transistors in the feedback loop or in the cascade. These same values would be obtained if we assumed an initially-applied voltage to turn off B.

All those readers with experience in logic circuitry are now entitled to rise up in wrath, shouting, "What, no noise rejection?" and "What happened to fan-out!" together with mutterings of "What did I tell you about DCTL!" And this would be justified. The graphic solution above is a preliminary one; it is for the purpose of giving a basic understanding of the method of analysis of cascades of DCTL networks by consideration of the device parameter distributions. It does not consider loading effects or the necessity for noise rejection. These we will discuss next.

The Graphical Solution Considering Loading. Consider again the two-transistor feedback loop and the VCE vs. VBE curves of Figures 9 and 10. This time the output node of B has a current load. We again assume the trial turn-on voltage, VBE(A), which is higher than the threshold value. Following the numbered points in the figure, we see that the assumed applied voltage results in the appearance of VCE(B) at the collector of B.

Now these VCE curves were plotted for an unloaded collector node; it is quite apparent that, in the circuit shown, the base A will not allow VCE(B) to rise to the value shown on the graph, but will clamp it to some lower value, VBE(on)A. There is, however, useful information to be gained from the value of VCE(B); it indicates the collector current drawn by transistor B with VBE(B) applied to its base. This current is ICEX = ΔV/Rc (see Figure 9).

Figure 10. (a) Two-transistor Loop with Load. (b) Equivalent Circuit for B.

To a first approximation, then, we can consider transistor B to be a current generator, drawing current ICEX from its output node, as indicated in Figure 10b. In order that the assumed voltage VBE(A) be sustained while a load current is drawn from the B output node, the following is true:

Iavailable = (Vcc - VBE(A))/Rc - ICEX,    (1)

where Iavailable is the current available from node B to drive the base A as well as other unspecified current loads at a voltage VBE(A). This current is easily calculated.

If this available load-driving current is plotted as a function of VBE, we get a plot similar to the one of Figure 11, where the distribution of IB as a function of VBE is also shown. Iavailable in this plot is the current available from node B at a given voltage VBE, with the same voltage VBE applied at the base of A. It is seen that as the operating VBE point is reduced from a high value, the current available gradually rises, since more current can be supplied through Rc to a lower voltage. As the VBE(on) operating point is approached, however, the "leakage" current drawn by the "off" transistor, B, becomes dominant, and Iavailable falls extremely rapidly.
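The graphical procedure above can also be carried out numerically. The following is a minimal sketch, not taken from the paper: the upper and lower 3σ transfer curves are stand-in interpolated tables, the supply and resistor values are assumed, and the threshold is found as the smallest trial voltage that the open two-transistor loop returns undiminished; Iavailable then follows equation (1).

# Hypothetical sketch of the two-transistor graphical analysis (all values assumed).
# UPPER/LOWER are stand-ins for the 3-sigma limits of measured V_CE(V_BE) curves.

import bisect

VCC, RC = 3.0, 640.0   # assumed supply voltage and collector resistance

# (V_BE, V_CE) sample points standing in for the curves of Figure 8.
UPPER = [(0.4, 2.9), (0.6, 2.5), (0.7, 1.2), (0.8, 0.3), (0.9, 0.15)]
LOWER = [(0.4, 2.7), (0.6, 1.8), (0.7, 0.5), (0.8, 0.15), (0.9, 0.1)]

def vce(curve, vbe):
    """Linear interpolation of a V_CE(V_BE) transfer curve, clamped to 0..VCC."""
    xs = [p[0] for p in curve]
    i = min(max(bisect.bisect_left(xs, vbe), 1), len(curve) - 1)
    (x0, y0), (x1, y1) = curve[i - 1], curve[i]
    return max(0.0, min(VCC, y0 + (y1 - y0) * (vbe - x0) / (x1 - x0)))

def loop_response(vbe_a):
    """Trial V_BE applied to A (upper-limit device); returns V_CE of B (lower limit)."""
    return vce(LOWER, vce(UPPER, vbe_a))

def threshold(lo=0.4, hi=0.9):
    """Smallest trial turn-on voltage that the open loop returns undiminished."""
    for _ in range(50):
        mid = 0.5 * (lo + hi)
        lo, hi = (lo, mid) if loop_response(mid) >= mid else (mid, hi)
    return hi

def i_available(vbe, i_cex):
    """Equation (1): current left over at node B to drive external loads."""
    return (VCC - vbe) / RC - i_cex

vbe_on = threshold()
print(f"threshold V_BE(on) is roughly {vbe_on:.3f} V for these assumed curves")
print(f"I_available there, with an assumed 0.1 mA leakage: {i_available(vbe_on, 1e-4)*1e3:.2f} mA")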

acceptable "on" value of V
is established there.
BE
This value is defined as Yon; it, in turn,
specifies a maximum acceptable turn-off value of

To recapitulate, Yon' the minimum allowed
base "on" voltage, is the voltage required to turn
on all transistors sufficiently well that all
collectors are

Voff

or lower; this voltage is, in

turn, low enough that all transistors to which it
is applied have VCE equal to or greater than Yon
0:

\

\

I

/"

~ o7~------r---~~\--;~~'"/~I'~--~~
...J

8
I

V·

I \{

\

05

V)(

05
04

"1"

1

/I~ :

while supplying current to drive the test04

I'---~

..

~

specified fan-out.
Note that, whereas Yon is the voltage required

03

2~------~--~/~/_'~:1~-4'~~~~~--~02

//
05

06

07

to turn on a "poor" inverter, the loads

! :

08

~

in

equation (2) are those required by transistors with

09

clamped collectors.

VBE - BASE VOLTAGE - VOLTS

Figure 11. Collector Voltage, Base Current,
and Current Available as Functions of Base Voltage
for a Group of DCTL Transistors.

We have, therefore, based our

calculations on the most stringent DCTL fan-out
configuration.
In summary, we have graphically analyzed the

Knowing Iavailable and lBE as functions of
' we can quickly determine fan-out as a function

V
BE
of V ' Clearly, for any operating point of V
BE
BE
the number of transistor bases that can be driven
by a node is equal to the total current available
divided by the current required by each base.

fan-out as functions of V • By establishing a
BE
minimum turn-on voltage and a maximum turn-off
voltage which may be allowed to appear at a
transistor base, we have insured that the calcu-

Fan-out, N, can therefore be expressed by
equation (2):1
N = lavailable

distributions of VCE and IB as functions of V
in
BE
order to determine the available current and the

lated value of fan-out will be achieved at every
node.
(2)

where IB is the upper 3 cr value of base current at
the given V • Since this function has a maximum
BE
in the neighborhood of the maximum value of
lavailable' the specified value of the minimum
lA more accurate value for fan-out is
obtained by using the upper 3 cr- limit of the
distribution of N(l ) , as in equation (3). Here
B
N is an implicit function of Iavailable'

The methods of calculation of current available and fan-out discussed above are largely of
academic interest. They are included here to
give a better understanding of the tests we are
about to discuss and to illustrate the evaluation
procedures used during the development of the
micrologic DCTL transistor. They are useful for
evaluation purposes, but they are not used to set
test acceptance limits. The acceptance of an
element is based upon measurements which give
direct indication of sufficient current available
rather than upon calculated values.
The Production Tester

(3)

where N is equal to fan-out, iB is the mean value
of base current, and ~B is the standard deviation of the base current distribution. This gives
a higher value for N; the use, therefore, of the
simpler equation (2) results in a conservative
estimate of fan-out.
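The following is a minimal numerical sketch of the two fan-out estimates, not taken from the paper. The device statistics are invented, and since equation (3) itself is not reproduced in the text, the "implicit" relation used below (taking the total load of N bases as N times the mean plus three times the standard deviation of the N-base sum) is only one reading consistent with the footnote's description.

# Hypothetical fan-out estimates from assumed device statistics (values invented).
# Equation (2): N = I_available / I_B, with I_B the upper 3-sigma base current.
# The implicit refinement below assumes the N-base load is N*mean + 3*sqrt(N)*sigma;
# this particular form is an assumption, not the paper's equation (3).

import math

I_AVAILABLE = 4.0e-3     # amperes available from the driving node (assumed)
IB_MEAN     = 0.35e-3    # mean base current at the operating V_BE (assumed)
IB_SIGMA    = 0.05e-3    # standard deviation of base current (assumed)

def fanout_eq2(i_avail, ib_mean, ib_sigma):
    """Conservative fan-out: every base taken at its individual 3-sigma worst case."""
    return i_avail / (ib_mean + 3.0 * ib_sigma)

def fanout_implicit(i_avail, ib_mean, ib_sigma):
    """Largest N whose summed 3-sigma load still fits within the available current."""
    n = 1
    while n * ib_mean + 3.0 * math.sqrt(n) * ib_sigma <= i_avail:
        n += 1
    return n - 1

print("equation (2) fan-out :", int(fanout_eq2(I_AVAILABLE, IB_MEAN, IB_SIGMA)))
print("implicit-form fan-out:", fanout_implicit(I_AVAILABLE, IB_MEAN, IB_SIGMA))

With these assumed numbers the conservative estimate gives 8 and the implicit form gives 10, reproducing the footnote's point that the simpler equation (2) is the more conservative of the two.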

The Production Tester

As can be seen from the DCTL circuit considerations and from the foregoing graphical analysis, an adequate external production test of micrologic elements must consist of three basic types of measurements: input current, output current available, and output VCE(sat).

Definitions. The terms to be used in describing these tests are defined as follows:

1) V̄on is the highest turn-on base voltage which will appear in any system. It is equal to Vcc applied through the lowest value of collector resistance.

2) Von is the lowest turn-on base voltage allowed to appear in any system. It is derived in the same manner as the Von that we discussed earlier.

3) Voff is the highest value of turn-off base voltage allowed to appear in any system. It is set high enough to permit an ICEX comparable to that which will occur at high temperature.

4) V̄off is the collector voltage applied during measurement of IB. It is the lower limit of the voltage appearing at the collectors of three paralleled saturated transistors.

5) Vsat is the highest acceptable VCE(sat) for a transistor driven by Von at room temperature. It is appreciably lower than Voff.

With these definitions we can define the three types of tests performed by the production tester.

Input Current. With applied voltages of V̄on at the base and V̄off at the collector, the base current drawn by any transistor may not exceed a maximum value, defined as ĪB. See Figure 12. This measurement is performed only on bases which are connected to micrologic element input terminals.

Figure 12. Input Current.

Current Available. This is a measurement of element output characteristics. With Voff applied to the base of an output transistor, and with current IK drawn from the collector node, the collector node voltage must exceed Von. See Figure 13. If--as is often the case--the base is not accessible, then a combination of inputs is applied such that the highest possible turn-off voltage appears at the base.

Figure 13. Current Available.

VCE(sat). This is the other output measurement. With Von applied to the base of an output transistor, the collector must pull down below Vsat. If the base is not directly accessible, then the worst combination of inputs is applied such that the lowest possible internally-generated turn-on voltage appears at the base.

The loading conditions and the acceptance limits for these tests are set above those required to meet the specified fan-out of the elements. The margin of excess is one which has been empirically determined to insure operation with noise rejection over the specified temperature range.

Once the general test methods and criteria are established, the only remaining trick is to determine the worst combination of conditions to apply at the other pins when performing a measurement at any given pin. Each element presents its own problems and requires a separate solution; the simplest element, "G," requires seven test steps; the most complex element, "S," requires 16.

It might be helpful to discuss some of the general considerations involved in the devising of the tests. The tests of input characteristics are rather straightforward, since only transistor bases are connected to the input pins; the only problem is to insure that the most severe collector-clamping conditions prevail. In the case of the "G" element, where the collector node is available and could conceivably have additional transistors connected in parallel, V̄off is applied directly. See Figure 14. In the case of the "S" element, the input-gate collector nodes are not accessible, and the worst possible clamping for input current at pins 6 and 8, for example, occurs when pin 7 is driven by Von.

Figure 14. Logic Diagrams of the "G" and "S" Elements.

For current-available measurements at output terminals, a greater number of variables must be considered. All transistors whose collectors are connected to the given terminal must be turned off as weakly as possible in order to draw as much leakage current as will ever be experienced; all transistors whose bases are connected to the given terminal must have the worst possible collector-clamping in order that the bases will draw the maximum current. Then, with as much current as possible being internally drawn from the node, the external load current is drawn, and the node voltage checked to insure that it is higher than Von.
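The three measurement types lend themselves to a simple data-driven check. The sketch below is hypothetical: the pin names, limits, and the step shown are invented for illustration and are not the micrologic test sets described here; only the acceptance rules follow the definitions above.

# Hypothetical, data-driven sketch of the three production-test measurement types.
# Pin assignments, limits, and the example step are invented for illustration.

from dataclasses import dataclass

@dataclass
class TestStep:
    apply: dict          # pin -> forcing condition, e.g. {"pin6": "Von", "pin4": "Voff"}
    sink_ma: float       # load current I_K drawn from the measured node (mA)
    measure_pin: str     # node whose voltage or current is observed
    kind: str            # "input_current" | "current_available" | "vce_sat"
    limit: float         # I_B_max (mA), Von (V), or Vsat (V) depending on kind

def passes(step: TestStep, measured: float) -> bool:
    """Acceptance rule for each measurement type, per the definitions above."""
    if step.kind == "input_current":      # base current may not exceed the maximum
        return measured <= step.limit
    if step.kind == "current_available":  # node voltage must stay above Von under load
        return measured > step.limit
    if step.kind == "vce_sat":            # collector must pull down below Vsat
        return measured < step.limit
    raise ValueError(step.kind)

# Example step: check current available at an output pin while sinking I_K.
step = TestStep(apply={"pin4": "Voff", "pin8": "Voff"}, sink_ma=2.0,
                measure_pin="pin2", kind="current_available", limit=0.8)
print(passes(step, measured=1.1))   # True: node held above Von while loaded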
If a single transistor drives an output node, the test of VCE(sat) is made by turning it weakly on with Von and measuring VCE. If several transistors in parallel drive a node, then one is turned weakly on with Von and the others are turned off with Voff; the test is then repeated until each transistor has demonstrated its ability to hold the node voltage sufficiently low.

Test Example

As an example of the application of these general considerations, let's look at a specific micrologic production test, the test of the "F" element. As summarized in Figure 15 and Table I, the first two steps of the "F" test are measurements of the input base currents with the specified Von applied at the base and with the collectors clamped. The currents are measured, the values recorded, and comparison made against the maximum acceptable value, ĪB.

TABLE I. Production-test sequence for the "F" element. For each of the ten test steps the table gives the voltages applied at the element pins (Von, Voff, and a sink current IK where required), the measurement and comparison made (input current against ĪB, node voltage against Vsat or Von), and the transistor checked (Q1 through Q4). Tests 1 and 2 check the input currents; the remaining tests check VCE(sat) and current available at the output pins, with the internal transistors alternately clamped and not clamped.
We explain phenomena by reducing them to other phenomena that seem to us, somehow, simpler and more orderly. How did Mendel, for example, explain the relative frequencies of his different kinds of peas in successive generations? He postulated (without any direct observational evidence) underlying dominant and recessive factors passed on from parents to their progeny, whose interaction determined the physical type of the progeny. Only many years later was any direct evidence obtained of microscopic structures in the cell--the chromosomes--that could provide the biological substrate for Mendel's factors.
Again, Morgan's studies of fruit fly populations led him to postulate even tinier components of the chromosomes--the genes. These had to await the electron microscope before they could be shown, by direct observation, to exist; and even today, we are still far from an explanation of these biological structures at the next, biochemical level.

The goal, then, in simulating complex human behavior is the same as the
Finally, when we use computers to state and test information processing theories of thinking, we do not postulate any crude analogy between computer and brain. We use the computer because it is capable of simulating the elementary information processes that these theories postulate as the bases for thinking. We do not assert that there is any resemblance between the electronic means that realize these processes in the computer and the neurological means that realize the corresponding processes in the brain. We do assert that, at a grosser level, the computer can be organized to imitate the brain.
Information Processing Languages

There has been a strong, and not accidental, interaction between work on the computer simulation of human thinking and research on computer programming. The kinds of processes that computers are called upon to perform when they are simulating thinking tend to be quite different from the processes they perform when they are carrying out numerical analyses. A superficial difference is that the former processes involve little or no use of arithmetic operations. A more fundamental difference is that memory must be organized in quite distinct ways in the two situations.
Within the past five years there have been a number of reports to these conferences on the general characteristics and specific structure of information processing languages specially designed to facilitate non-numerical simulation.15,16 I shall not go over this familiar ground again, except to point out that when such languages are used to build psychological theories the languages themselves contain implicit postulates--although rather weak ones--about the way in which the central nervous system organizes its work.
One of the common characteristics of all of these languages is their organization of memory in lists and list structures. By this means there can be associated with any symbol in memory a "next" symbol--the symbol that follows it on the list to which they both belong. By the use of a slightly more complicated device, the description list, there can be associated with any symbol in memory a list of its attributes and their values. If the symbol, for example, represents an apple, we can store on its description list the fact that its color is red, its printed name is APPLE, and its spoken name, APUL. The incorporation of these two forms of association--the serial order of simple lists and the partial ordering of description lists--in information processing languages permits one to represent many of the associative properties of human memory in a quite simple and direct way. We can use simple lists to simulate serial memory--e.g., remembering the alphabet--and description lists to simulate paired associations--e.g., the association between an object as recognized visually and its name.
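As a rough modern illustration of these two memory organizations, and not of any particular list-processing language of the period, the following sketch represents a simple list and a description list with ordinary Python structures; the attributes and values echo the apple example above, everything else is invented.

# Rough stand-in for the two list-processing memory structures described above.

# A simple list captures serial order: each symbol has a "next" symbol.
alphabet = ["A", "B", "C", "D"]          # remembering the alphabet
def next_symbol(lst, symbol):
    """Return the symbol that follows `symbol` on the list, if any."""
    i = lst.index(symbol)
    return lst[i + 1] if i + 1 < len(lst) else None

# A description list associates a symbol with attribute-value pairs.
apple = {
    "color": "red",
    "printed-name": "APPLE",
    "spoken-name": "APUL",
}

print(next_symbol(alphabet, "B"))        # -> "C"   (serial association)
print(apple["spoken-name"])              # -> "APUL" (paired association)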

A characteristic of the list
processing languages, which they share
with most other compiling and interpretive languages, is that they organize
behavior in hierarchical fashion.
Routines use subroutines, which have
their own subroutines, and so on. This
characteristic of the languages again
facilitates the construction of programs
to simulate human behavior, which
appears to be organized in a highly
similar hierarchical manner. The fact that most investigators have found it easier to write simulation programs in interpretive list languages than in machine language derives, in all likelihood, from the fact that the former

languages have already taken the first
steps in the direction of organizing the
computer processes to mirror the organization of the human mind.
Heuristic Problem Solving Programs
The Program of Selfridge and Dinneen
The work of Selfridge and Dinneen on pattern recognition,7 which I earlier assigned to the second category of simulation programs--simulation of sensory-perceptual processes--really marks a transition to information processing simulations. The Selfridge-Dinneen program specified a set of processes to enable a computer to learn to discriminate among classes of patterns presented on a two-dimensional "retina." The patterns could represent, for example, English letters like "A" and "O" of varying shape, size, and orientation.

In the Selfridge-Dinneen program, recognition was accomplished by using various operators to transform the retinal stimuli--in general to simplify and "stylize" them--and then searching for characteristics of the transformed stimuli that grouped the various exemplars of a given alphabetic letter together, but separated the exemplars of different letters. Although the program made use of the arithmetic instructions of the computer, the operations were basically topological and non-numerical in nature. Appropriate organization rather than rapid arithmetic was at the heart of the program.
The Selfridge-Dinneen program foreshadowed subsequent work in this area in another important respect also. The characteristics used to distinguish patterns were heuristic. They amounted to rules of thumb, selected by the computer over a series of learning trials on the sole basis that they usually worked--that is, made the desired discriminations. In more traditional uses of computers it is usually required that the programs be algorithms--that they be systematic procedures which guarantee solution of the problem to a desired degree of accuracy. The heuristics generated by the pattern recognizing program provided no such guarantees. Since there are vast ranges of tasks, handled every day by human beings, for which no algorithms in the sense just indicated are known to exist, the admission of heuristics as program components opened the way to simulating the less systematic, but often effective, processes that characterize much garden-variety, everyday human thinking.


Subsequent work has tended to confirm this initial hunch, and to demonstrate that heuristics, or rules of thumb, form the integral core of human problem-solving processes. As we begin to understand the nature of the heuristics that people use in thinking, the mystery begins to dissolve from such (heretofore) vaguely understood processes as "intuition" and "judgment."

Some Other Problem-Solving Programs

In the period 1956 to 1958 there came into existence a number of other computer programs that accomplished complex tasks with a "humanoid" flavor: composing music,17 playing checkers,18 discovering proofs for theorems in logic19 and geometry,20 designing electric motors and transformers,21 playing chess,22,23,24 and balancing an assembly line.25 The primary goal in constructing most of these programs was to enable the computer to perform an interesting or significant task. Detailed simulation of the ways in which humans perform the same task was only a secondary objective--or was not considered at all.
Nevertheless, it was discovered that often the best program for doing the job was a program that incorporated some of the heuristics that humans used in doing such jobs. Thus, the music composition program of Hiller and Isaacson made use of some of the rules of classical counterpoint; the motor design programs and line balancing program were generally organized in much the same ways as the procedures of experienced engineers, and so on. Hence, to a greater or lesser degree, all of these programs have taught us something about the ways in which people handle such tasks--especially about some of the kinds of heuristics they use.
Among these programs Samuel's
checker program and the Los Alamos chess
program place the least emphasis on
heuristics, and hence provide valuable
yardsticks for comparison with heuristic
programs handling the same, or similar
tasks. These two programs make essential
use of the computer's capabilities for
extremely rapid arithmetic, for their
basic strategy is to look at all possible
(legal) continuations of the game for
several moves ahead, and then to choose
that move which appears most favorable
(in a minimax sense) in terms of the
possible outcomes. In contrast,
Bernstein's and the NSS chess programs
examine a small, highly selective subset
of all possible continuations of the

game and choose a move that appears good
in the light of this selective analysis.
Thus, the Los Alamos program, looking two moves ahead, will typically examine a little less than a million possible continuations, Bernstein's program approximately 2,500, and the NSS program almost never more than one hundred and more usually only a handful. All three programs play roughly the same quality of chess (mediocre) with roughly the same amount of computing time. The effort saved by the heuristic programs in looking at fewer continuations is expended in selecting more carefully those to be examined and subjecting them to more thorough examination. Thus, the more systematic, arithmetic programs provide benchmarks against which the progress in developing heuristics can be measured.
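The contrast between the two styles of search can be seen in a toy example. The sketch below is not the Los Alamos, Bernstein, or NSS program; the game tree, scores, and selection rule are invented only to show how sharply a selective search cuts down the number of continuations examined.

# Toy contrast between exhaustive minimax and a selective ("heuristic") search.

import random

BRANCHING, DEPTH = 6, 3
examined = {"full": 0, "selective": 0}

def score(path):
    """Stand-in static evaluation of a line of play (deterministic per path)."""
    return random.Random(hash(path)).uniform(-1, 1)

def minimax(path, depth, maximizing, select=None, counter="full"):
    examined[counter] += 1
    if depth == 0:
        return score(path)
    moves = list(range(BRANCHING))
    if select:                        # heuristic pruning: keep a few plausible moves
        moves = select(path, moves)
    values = [minimax(path + (m,), depth - 1, not maximizing, select, counter)
              for m in moves]
    return max(values) if maximizing else min(values)

def plausible(path, moves):
    """Keep only the two moves that look best to a shallow evaluation."""
    return sorted(moves, key=lambda m: score(path + (m,)), reverse=True)[:2]

minimax((), DEPTH, True)                                        # full-width search
minimax((), DEPTH, True, select=plausible, counter="selective") # selective search
print("positions examined, full width:", examined["full"])
print("positions examined, selective :", examined["selective"])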
The General Problem Solver
All of the programs we have mentioned fell short of human simulation in one very fundamental respect--apart from failures of detail. They were all special-purpose programs. They enabled the computer to perform one kind of complex task, and one kind only. Only in a few cases (the Checker Player18 and the Logic Theorist26) did they enable the computer to improve its performance through learning. Yet we know that the human mind is (a) a general-purpose mechanism and (b) a learning mechanism. A person who is brought into a relatively novel task situation may not handle the situation with skill but, unless it is inordinately difficult, will not find himself at a complete loss. Whether he succeeds in solving the problem that is posed him, or not, he is able, at least, to think about it.
We must conclude that if a computer program is to simulate the program that a human brings to a problem situation, it must contain two components: (a) a general-purpose thinking and learning program that makes no direct reference to any particular task or subject matter; and (b) heuristics that embody the specific techniques and procedures which make possible the skilled and efficient performance of particular classes of tasks. The program must incorporate both general intelligence and special skills.

The General Problem Solver (GPS) was the first computer program aimed at describing the problem solving techniques used by humans that are independent of the subject matter of the problem.27

Since GPS has been described elsewhere, I shall say only a word about its structure. It is a program for achieving the goal of transforming a particular symbolic object (representing the "given" problem situation) into a different symbolic object (the "desired" situation or goal situation). It does this by discovering differences between pairs of objects, and by searching for operators that are relevant to reducing these differences. In the form in which it has thus far been realized on a computer, GPS is not a learning program, hence still falls far short of simulating all aspects of what we would call general intelligence.
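The difference-reducing idea just described can be sketched in a few lines. The following is not the GPS program or its IPL-V realization; the symbolic objects, the differences, and the operator table are invented solely to show the shape of the method.

# Minimal sketch of means-ends analysis in the spirit of the description above:
# find differences between the current and desired objects, then apply an
# operator known to be relevant to reducing that kind of difference.

CURRENT = {"shape": "square", "color": "blue"}
DESIRED = {"shape": "circle", "color": "red"}

# Operator table: which attribute-difference each operator is relevant to,
# and how it transforms the object (all hypothetical).
OPERATORS = {
    "reshape": ("shape", lambda obj, goal: {**obj, "shape": goal["shape"]}),
    "repaint": ("color", lambda obj, goal: {**obj, "color": goal["color"]}),
}

def differences(obj, goal):
    """Attributes on which the two symbolic objects disagree."""
    return [k for k in goal if obj.get(k) != goal[k]]

def solve(obj, goal):
    trace = []
    while (diffs := differences(obj, goal)):
        target = diffs[0]
        # search for an operator relevant to reducing this difference
        name, (_, apply_op) = next((n, v) for n, v in OPERATORS.items()
                                   if v[0] == target)
        obj = apply_op(obj, goal)
        trace.append(name)
    return obj, trace

final, plan = solve(CURRENT, DESIRED)
print(plan)    # e.g. ['reshape', 'repaint']
print(final)   # matches DESIRED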

In its current computer realization, GPS has solved some simple problems of finding proofs for theorems in symbolic logic (substantially the same task as that handled by the special-purpose Logic Theorist). It has solved the well-known puzzle of Missionaries and Cannibals--finding a plan for transporting three missionaries and three cannibals across a river without any of the missionaries being eaten. Hand simulation has demonstrated that it can handle trigonometric and algebraic identities. On the basis of other investigations that have not fully reached the programming stage, it appears highly likely that GPS will be able to solve certain tactical problems in chess (e.g., to find a move leading to a fork of a pair of enemy pieces), do formal differentiation and integration, and write codes for simple computer programs in IPL V. Several possibilities for incorporating learning processes in GPS, one of them using GPS in the learning mechanism itself,28 have also been explored.
The adequacy of GPS as a simulation
of human problem solving has been
examined, primarily in the task domain
of symbolic logic, by comparing the
computer trace with the thinking-aloud
protocols of college students solving
identical problems.29 The evidence to
date suggests that GPS does indeed capture
the principal problem-solving methods
used by the human subjects. The
detailed comparison of its behavior with
the protocols has cast considerable light
on the processes of abstraction and on
the nature and uses of imagery in
problem solving.
Recent Advances in the
Simulation of Thinking
The remaining papers to be presented
in this session will describe a number of

heuristic programs that have been written
in the past two years, and which extend
very substantially the range of human
mental processes that have been simulated with these techniques. I shall
not anticipate the content of these
programs, beyond indicating what their
relation is to those I have already
mentioned.
Areas of Psychological Experimentation

The simulations mentioned so far all fall in the area that psychologists call "higher mental processes." As I indicated earlier, these processes have tended to be underemphasized in American experimental psychology until quite recently because we did not have tools for investigating them in an objective and rigorous way. If computer simulation has shown itself to be a powerful tool of research in an area as difficult as the study of higher mental processes, we might expect this tool to prove even more powerful if applied to the simpler phenomena with which experimental psychologists have been largely concerned. The papers of this session report some of the first evidence that this expectation is justified.

What are the kinds of tasks and processes that have been most thoroughly studied by psychologists? Perception--the interaction of sensory organs and central nervous system in the discrimination and recognition of stimuli--has been the subject of extensive investigation. A second, very active, research area has been learning, and particularly the rote learning of serial material and of stimulus-response pairs. A third area has been simple choice behavior, especially choice among a small number (usually two) of alternatives with systematic or intermittent reward. Animal and human maze learning experiments have been used to study both rote learning and simple choice behavior. Finally, there is a rather varied assortment of work that is usually classified under the heading of "concept formation" or "concept attainment."

No one supposes that the topics
I have mentioned--perception, rote
learning, simple choice behavior, maze
learning, and concept formation--are
mutually exclusive and exhaustive
categories. They are simply pigeon
holes that psychologists have found
convenient for classifying experiments.
It is almost certain that the mechanisms
required to perform tasks in one of
these areas are called into play in some


of the others. Hence, we would have
reason to hope that as heuristic programs
are constructed to handle one or another
of these tasks, the mechanisms employed
in the several programs will begin to
show distinct resemblances--and resemblances also to the mechanisms used in
problem-solving simulations. Such
resemblances and common mechanisms are
already beginning to appear.
Long-Range Goals of Simulation
The long-term research strategy
would again be gradually to replace a
multitude of special-purpose programs
with a more general program aimed at
simulating the whole man--or at least
the cognitive aspects of his behavior.
Although enormous gaps of ignorance still
separate us from that goal, the goal
itself no longer seems entirely Utopian
to the active researchers in the field.
Perhaps the largest single gap at
present--and one that is not filled by
any of the work to be reported today--is
in programs to explain long-range human
memory phenomena. I will venture the
personal prediction that filling this
gap will soon become crucial to progress
in the whole field of information
retrieval.
Another important gap that also has significant practical implications lies in the area of simulation of natural language processes. Here, interest in language translation and in the improvement of computer programming languages has already led to exciting progress--as illustrated, for example, by the work of Chomsky30 and Yngve.31
Heuristic Programs in New Areas
The areas of rote learning, simple
choice behavior, and concept attainment
are represented in the programs to be
described by Messrs. Feigenbaum, Feldman,
and Hunt, respectively.
Rote Learning. The Elementary Perceiver and Memorizer (EPAM) is a theory to explain how human subjects store in memory symbolic materials that are inherently meaningless. The typical learning materials are "nonsense syllables"--spoken or printed syllables that do not correspond to English words. By studying rote learning, we hope to understand, for example, how humans learn to associate names with objects, and learn to read by associating printed words with their oral counterparts.

Binary Choice. In the so-called partial reinforcement or binary choice experiment, the subject is instructed to guess which of two events will occur next. In variants of the experiment, the actual event sequence may be patterned, or it may be a random sequence. The binary choice experiment has been one of the principal situations used to test the stochastic learning models that have been developed in psychology over the last decade.14,32 Mr. Feldman's Binary Choice program offers an alternative theory to explain these phenomena, hence provides an interesting example for comparing and contrasting heuristic programs with more traditional mathematical models.
Concept Formation. In the simplest form of the concept formation task, a rat is given a choice of two gates, one of which is labelled, say, with a large triangle, the other with a small circle. If the experimenter's aim is to test the rat's attainment of the concept "triangle," he places a reward behind the gate labelled with the triangle. On succeeding trials, the symbols change in shape, size, or color, but the gate labelled with a triangle always leads to the reward. Within the past year, several computer programs have been written that simulate slightly more complex concept learning behavior in humans. One of these programs, the Concept Learner, will be described by Hovland and Hunt.

Conclusion
I have tried to outline the development over the past decade of the use
of computers to construct and test nonnumerical information-processing
explanations for human thinking and
learning. Such programs, which are
beginning to be validated by behavioral
evidence, are providing embryonic
theories for these phenomena in terms
of underlying information processes.
Hopefully, the elementary information
processes that are postulated in the
theories will, in turn, find their
explanation in neurological processes
and mechanisms. The papers in this
session describe a few of the programs
of this kind that have been constructed
to date, and provide some basis for
judging the prospects for this approach
to understanding the human mind.


References

1. Walter, Grey, The Living Brain, Van Nostrand, 1953.
2. Ashby, W. R., Design for a Brain, Wiley, 1952.
3. Fatehchand, R., "Machine Recognition of Spoken Words," in F. L. Alt, ed., Advances in Computers, Academic Press, 1960, pp. 193-231.
4. Forgie, J. W. and C. D. Forgie, "Results Obtained from a Vowel Recognition Computer Program," Jour. of the Acoustical Society of America, 31:1480-1489 (November, 1959).
5. Gold, B., "Machine Recognition of Hand-Sent Morse Code," IRE Trans. on Information Theory, IT-5:17-24 (March, 1959).
6. Blair, C. R., "On Computer Transcription of Manual Morse," Jour. of the Association for Computing Machinery, 6:429-442 (July, 1959).
7. Selfridge, O. G., "Pattern Recognition and Modern Computers," and Dinneen, G. P., "Programming Pattern Recognition," Proc. of the 1955 Western Joint Computer Conference, pp. 91-100.
8. Farley, B. G. and W. A. Clark, "Simulation of Self-organizing Systems by Digital Computer," IRE Trans. of the Professional Group on Information Theory, PGIT-4:76-84 (September, 1954).
9. Clark, W. A. and B. G. Farley, "Generalization of Pattern Recognition in a Self-organizing System," Proc. of the 1955 Western Joint Computer Conference, pp. 86-91.
10. Rochester, N., J. H. Holland, L. H. Haibt and W. L. Duda, "Tests on a Cell Assembly Theory of the Action of the Brain Using a Large Digital Computer," IRE Trans. on Information Theory, IT-2, No. 3 (September, 1956).
11. Hebb, D. O., Organization of Behavior, Wiley, 1949.
12. Rosenblatt, F., "The Perceptron: A Probabilistic Model for Information Storage and Organization in the Brain," Psychological Review, 65:386-408 (November, 1958).
13. Johnson, D. M., The Psychology of Thought and Judgment, Harper.
14. Bush, R. R. and F. Mosteller, Stochastic Models for Learning, Wiley, 1955.
15. Newell, A. and J. C. Shaw, "Programming the Logic Theory Machine," Proceedings of the 1957 Western Joint Computer Conference, pp. 230-240.
16. Shaw, J. C., A. Newell, H. A. Simon, and T. O. Ellis, "A Command Structure for Complex Information Processing," Proc. of the 1958 Western Joint Computer Conference, pp. 119-128.
17. Hiller, L. A. and L. M. Isaacson, Experimental Music, McGraw-Hill, 1959.
18. Samuel, A. L., "Some Studies in Machine Learning Using the Game of Checkers," IBM J. of Research and Development, 3:210-229 (July, 1959).
19. Newell, A. and H. A. Simon, "The Logic Theory Machine," IRE Trans. on Information Theory, IT-2, No. 3, pp. 61-79 (September, 1956).
20. Gelernter, H. and N. Rochester, "Intelligent Behavior in Problem-Solving Machines," IBM J. of Research and Development, 2:336-345 (October, 1958).
21. Goodwin, G. L., "Digital Computers Tap Out Designs for Large Motors Fast," Power, April, 1958.
22. Kister, J., P. Stein, S. Ulam, W. Walden and M. Wells, "Experiments in Chess," Jour. of the Association for Computing Machinery, 4:174-177 (April, 1957).
23. Bernstein, A., M. de V. Roberts, T. Arbuckle and M. H. Belsky, "A Chess-Playing Program for the IBM 704," Proc. of the 1958 Western Joint Computer Conference, pp. 157-159.
24. Newell, A., J. C. Shaw and H. A. Simon, "Chess Playing Programs and the Problem of Complexity," IBM J. of Research and Development, 2:320-335 (October, 1958).
25. Tonge, F. M., A Heuristic Program for an Assembly Line Balancing Problem, Prentice-Hall, forthcoming.
26. Newell, A., J. C. Shaw and H. A. Simon, "Empirical Explorations of the Logic Theory Machine," Proc. of the 1957 Western Joint Computer Conference, pp. 218-230.
27. Newell, A., J. C. Shaw and H. A. Simon, "A General Problem-Solving Program for a Computer," Computers and Automation, 8:10-17 (July, 1959).
28. Newell, A., J. C. Shaw and H. A. Simon, "A Variety of Intelligent Learning in a General Problem Solver," in M. C. Yovits and S. Cameron, eds., Self-Organizing Systems, Pergamon, 1960, pp. 153-189.
29. Newell, A. and H. A. Simon, "The Simulation of Human Thinking," in Current Trends in Psychology, 1959, U. of Pittsburgh Press, 1960.
30. Chomsky, A. N., Syntactic Structures, Mouton, 1957.
31. Yngve, V., "A Model and an Hypothesis for Language Structure," Proc. of the American Philosophical Society, 104:444-466 (October, 1960).
32. Suppes, P. and R. C. Atkinson, Markov Learning Models for Multiperson Interactions, Stanford U. Press, 1960.


THE SIMULATION OF VERBAL LEARNING BEHAVIOR*
E. A. Feigenbaum
University of California
Berkeley, California
and
The RAND Corporation
Santa Monica, California
Summary

An information processing model of elementary human symbolic learning is given a precise statement as a computer program, called Elementary Perceiver and Memorizer (EPAM). The program simulates the behavior of subjects in experiments involving the rote learning of nonsense syllables. A discrimination net which grows is the basis of EPAM's associative memory. Fundamental information processes include processes for discrimination, discrimination learning, memorization, association using cues, and response retrieval with cues. Many well-known phenomena of rote learning are to be found in EPAM's experimental behavior, including some rather complex forgetting phenomena. EPAM is programmed in Information Processing Language V.

H. A. Simon has described some current research in the simulation of human higher mental processes and has discussed some of the techniques and problems which have emerged from this research. The purpose of this paper is to place these general issues in the context of a particular problem by describing in detail a simulation of elementary human symbolic learning processes.

The information processing model of mental functions employed is realized by a computer program called Elementary Perceiver and Memorizer (EPAM). The EPAM program is the precise statement of an information processing theory of verbal learning that provides an alternative to other verbal learning theories which have been proposed.** It is the result of an attempt to state quite precisely a parsimonious and plausible mechanism sufficient to account for the rote learning of nonsense syllables. The critical evaluation of EPAM must ultimately depend not upon the interest which it may have as a learning machine, but upon its ability to explain and predict the phenomena of verbal learning.

I should like to preface my discussion of the simulation of verbal learning with some brief remarks about the class of information processing models of which EPAM is a member.

a. These are models of mental processes, not brain hardware. They are psychological models of mental function. No physiological or neurological assumptions are made, nor is any attempt made to explain information processes in terms of more elementary neural processes.

b. These models conceive of the brain as an information processor with sense organs as input channels, effector organs as output devices, and with internal programs for testing, comparing, analyzing, rearranging, and storing information.

c. The central processing mechanism is assumed to be serial; i.e., capable of doing only one (or a very few) things at a time.

d. These models use as a basic unit the information symbol; i.e., a pattern of bits which is assumed to be the brain's internal representation of environmental data.

e. These models are essentially deterministic, not probabilistic. Random variables play no fundamental role in them.

*I am deeply indebted to Herbert A. Simon for his past and present collaboration in this research. This research has been supported by the Computer Sciences Department, The RAND Corporation, and the Ford Foundation. I wish to express appreciation for the help and critical comments of Julian Feldman, Allen Newell, J. C. Shaw and Fred Tonge.

**Examples of quantitative (or quasi-quantitative) theories of verbal learning are those of Hull, et al. [1], Gibson [2], and Atkinson [3].


THE BASIC EXPERIMENT
Early in the history of psychology,
the psychologist invented an experiment
to simplify the study of human verbal
learning. This "simple" experiment is
the rote memorization of nonsense
syllables in associate-pairs or serial
lists.
The items to be memorized are generally three-letter words having consonant letters on each end and a vowel in the middle. Nonsense syllables are chosen in such a way that the three-letter combinations have no ordinary English meaning. For example, CAT is not a nonsense syllable, but XUM is.*
In one basic variation, the rote
memory experiment is performed as follows:
a.

A set of nonsense syllables is
chosen and the syllables are
paired, making, let us say, 12
pairs.

b.

A subject is seated in front of a viewing apparatus and the syllables are shown to him, one pair at a time.

c.

First, the left-hand member of the pair (stimulus item) is shown. The subject tries to say the second member of the pair (response item).

d.

After a short interval, the response item is exposed so that both stimulus and response items are simultaneously in view.

e.

After a few seconds, the cycle repeats itself with a new pair of syllables. This continues until all pairs have been presented (a trial).

f.

Trials are repeated, usually until the subject is able to give the correct response to each stimulus. There is a relatively short time interval between trials.

*People will defy an experimenter's most rigorous attempt to keep the nonsense syllables association-free. Lists of nonsense syllables have been prepared, ordering syllables on the basis of their so-called "association value," in order to permit the experimenter to control "meaningfulness."

g.

For successive trials the
syllables are reordered randomly.
This style of carryirig out the
experiment is called pairedassociates presentation.

The other basic variant of the experiment is called serial-anticipation presentation. The nonsense syllables (say, 10 or 12 items) are arranged in a serial list, the order of which is not changed on successive trials. When he is shown the nth syllable, the subject is to respond with the (n+1)st syllable. A few seconds later, the (n+1)st syllable is shown and the subject is to respond with the (n+2)nd syllable, and so on. The experiment terminates when the subject is able to correctly anticipate all of the syllables.
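The paired-associates procedure just described is itself mechanical enough to state as a short program. The sketch below is hypothetical and is not EPAM or its IPL-V realization: the learner interface, the trivial memorizing subject, and the syllable pairs are invented, and only the trial structure follows the description above.

# Toy harness for the paired-associates experiment described above.

import random

PAIRS = [("DAX", "JIR"), ("PIB", "JUK"), ("XUM", "CEF")]   # hypothetical items

class RoteLearner:
    """Trivial stand-in subject: memorizes each association when it is shown."""
    def __init__(self):
        self.memory = {}
    def respond(self, stimulus):
        return self.memory.get(stimulus)          # None models a failure to respond
    def learn(self, stimulus, response):
        self.memory[stimulus] = response

def run_experiment(subject, pairs, max_trials=20, rng=random.Random(0)):
    for trial in range(1, max_trials + 1):
        rng.shuffle(pairs)                        # reorder on successive trials
        correct = 0
        for stimulus, response in pairs:
            if subject.respond(stimulus) == response:
                correct += 1
            subject.learn(stimulus, response)     # both items then shown together
        if correct == len(pairs):                 # criterion: all responses correct
            return trial
    return None

print("trials to criterion:", run_experiment(RoteLearner(), list(PAIRS)))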
Numerous variations on this experimental theme have been performed.*
The phenomena of rote learning are well
studied, stable, and reproducible.
For example, in the typical behavioral
output of a subject, one finds:
a.

Failures to respond to a stimulus
are more numerous than overt
errors.

b.

Overt errors are generally
attributable to confusion by
the subject between similar
stimuli or similar responses.

c.

Associations which are given correctly over a number of trials sometimes are then forgotten, only to reappear and later disappear again. This phenomenon has been called oscillation.**

d.

If a list x of syllables or
syllable pairs is learned to
the criterion; then a list y
is similarly learned; and
finally retention of list x is
tested; the subject's ability
to give the correct x responses
is degraded by the interpolated
learning. The degradation is
called retroactive inhibition.
The overt errors made in the

*For an extended treatment of this subject, see Hovland, C. I., "Human Learning and Retention." [4]
**By Hull [5]. Actually he called it "oscillation at the threshold of recall," reflecting his theoretical point of view.


retest trial are generally
intrusions from the list y. The
phenomenon disappears rapidly.
Usually after the first retest
trial, list x has been relearned
back to criterion.
e.

As one makes the stimulus
syllables more and more similar,
learning takes more trials.

The Information Processing Model
This section describes the processes
and structures of EPAM.
EPAM is not a model for a particular subject. In this respect it is to be contrasted with the binary choice models of particular subjects which Mr. Feldman is presenting in this session. The fact is that individual differences play only a small part in the results of the basic experiment described above.

It is asserted that there are certain elementary information processes which an individual must perform if he is to discriminate, memorize and associate verbal items, and that these information processes participate in all the cognitive activity of all individuals.* It is clear that EPAM does not yet embody a complete set of such processes. It is equally clear that the processes EPAM has now are essential and basic.
*Some information processing models are conceived as models of the mental function of particular subjects; e.g., Feldman's Binary Choice Model [6]. Others treat the general subject as EPAM does. Still others are mixed in conception, asserting that certain of the processes of the model are common for all subjects while other processes may vary from subject to subject; e.g., the General Problem Solver of Newell, Shaw and Simon [7]. Alternatively, information processing models may also be categorized according to how much of the processing is "hard core" (i.e., necessary and invariant) as opposed to "strategic" (i.e., the result of strategy choice by control processes). I suggest the obvious: that models of strategies for information processing will tend to be models of the general subject. As exemplars, Lindsay's Reading Machine [8], a "hard core" model, treats the general subject; Wickelgren's model of the "conservative focusing" strategy in concept attainment (Wickelgren [9]; Bruner, Goodnow, and Austin [10]), a pure strategy model, can predict only the behavior of particular subjects.

Overview: Performance and Learning

Conceptually, EPAM can be broken down into two subsystems, a performance system and a learning system. In the performance mode, EPAM produces responses to stimulus items. In the learning mode, EPAM learns to discriminate and associate items.
The performance system is the simpler of the two. It is sketched in Fig. 1. When a stimulus is noticed, a perceptual process encodes it, producing an internal representation (an input code). A discriminator sorts the input code in a discrimination net (a tree of tests and branches) to find a stored image of the stimulus. A response cue associated with the image is found, and fed to the discriminator. The discriminator sorts the cue in the net and finds the response image, the stored form of the response. The response image is then decoded by a response generator letter by letter in another discrimination net into a form suitable for output. The response is then produced as output.
The processes of the learning system
are more complex. The discrimination
learning process builds discriminations
by growing the net of tests and branches.
The association process builds associations between images by storing response
cues with stimulus images. These
processes will be described fully in due
course.
The succeeding sections on the
information processing model give a
detailed description of the processes and
structures of both systems.
Input to EPAM: Internal Representations of External Data

The following are the assumptions about the symbolic input process when a nonsense syllable is presented to the learner. A perceptual system receives the raw external information and codes it into internal symbols. These internal symbols contain descriptive information about features of the external stimulus. For unfamiliar 3-letter nonsense symbols, it is assumed that the coding is done in terms of the individual letters, for these letters are familiar and are well-learned units for the adult subject.*
*The basic perception mechanism I have in mind is much the same as that of Selfridge [11] and Dinneen, whose computer program scanned letters and perceived simple topological features of these letters.


The end result of the perception process is an internal representation of the nonsense syllable--a list of internal symbols (i.e., a list of lists of bits) containing descriptive information about the letters of the nonsense syllable. Using Minsky's terminology [12], this is the "character" of the nonsense syllable.

I have not actually programmed this perception process. For purposes of this simulation, I have assigned coded representations for the various letters of the alphabet based on 15 different geometrical features of letters. For purposes of exploring and testing the model, at present all that is really needed of the input codes is:

a.

that the dimensions of a letter
code be related in some reasonable way to features of real
letters.

b.

that the letter codes be highly
redundant, that is, include
many more dimensions than is
necessary to discriminate the
letters of the alphabet.

To summarize, the internal representation of a nonsense syllable is a list
of lists of bits, each sublist of bits
being a highly redundant code for a
letter of the syllable.
Given a sequence of such inputs, the essence of the learner's problem is twofold: first, to discriminate each code from the others already learned, so that differential response can be made; second, to associate information about a "response" syllable with the information about a "stimulus" syllable so that the response can be retrieved if the stimulus is presented.
Discriminating and Memorizing: Growing Trees of Images

I shall deal with structure first and reserve my discussion of process for a moment.
Discrimination net. The primary information structure in EPAM is the discrimination net. It embodies in its structure at any moment all of the discrimination learning that has taken place up to a given time. As an information structure it is no more than a familiar friend: a sorting tree or decoding network. Fig. 2 shows a small net. At the terminals of the net are lists called image lists, in which symbolic information can be stored. At the nodes of the net are stored programs, called tests, which examine characteristics of an input code and signal branch-left or branch-right. On each image list will be found a list of symbols called the image. An image is a partial or total copy of an input code. I shall use these names in the following description of net processes.

Net Interpreter. The discrimination net is examined and altered by a number of processes, most important of which is the net interpreter. The net interpreter sorts an input code in the net and produces the image list associated with that input code. This retrieval process is the essence of a purely associative memory: the stimulus information itself leads to the retrieval of the information associated with that stimulus. The net interpreter is a very simple process. It finds the test in the topmost node of the tree and executes this program. The resulting signal tells it to branch left or branch right to find the succeeding test. It executes this, tests its branches again, and repeats the cycle until a terminal is found. The name of the image list is produced, and the process terminates. This is the discriminator of the performance system which sorts items in a static net.
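The net structure and the interpreter can be sketched compactly. The representation below is invented for illustration and is far simpler than EPAM's own IPL-V structures: a binary tree of test functions with image lists at the terminals, and a sort routine that plays the role of the net interpreter.

# Compact sketch of a discrimination net and its interpreter as described above.

class Terminal:
    def __init__(self):
        self.image = None          # partial or total copy of an input code
        self.cue = None            # response cue stored with a stimulus image

class Node:
    def __init__(self, test, left=None, right=None):
        self.test = test           # function: input code -> True (right) / False (left)
        self.left = left or Terminal()
        self.right = right or Terminal()

def sort(net, code):
    """Net interpreter: sort an input code down to its terminal (image list)."""
    node = net
    while isinstance(node, Node):
        node = node.right if node.test(code) else node.left
    return node

# Example: one test looking at a single (hypothetical) feature of the first
# letter's code, enough to separate DAX-like items from JIR-like items.
first_letter_feature = lambda code: code[0][3] == 1
net = Node(first_letter_feature)

dax_code = [[0, 1, 0, 1, 1], [1, 0, 0, 1, 0], [0, 0, 1, 1, 0]]   # assumed letter codes
terminal = sort(net, dax_code)
terminal.image = dax_code[:1]      # store a partial stimulus image
print(terminal is sort(net, dax_code))   # True: the same item sorts to the same terminal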

Discrimination Learning. The discrimination learning process of the learning system grows the net. Initially, we give the learning system no discrimination net but only a set of simple processes for growing nets and storing new images at the terminals.
To understand how the discrimination
and memorization processes work, let us
examine in detail a concrete example
from the learning of nonsense syllables.
Suppose that the first stimulus-response
associate-pair on a list has been learned.
(Ignore for the moment the question of
how the association link is actually
formed.) Suppose that the first syllable
pair was DAX-JIR. The discrimination net at this point has the simple two-branch structure shown in Fig. 3.
Because the syllables differ in their
first letter, Test 1 will probably be a
test of some characteristic on which the
letters D and J differ. No more tests
are necessary at this point.
Notice that the image of JIR which
is stored is a full image. Full response
images must be stored--to provide the
information for producing the response;
but only partial stimulus images need
be stored--to provide the information
for recognizing the stimulus. How much
stimulus image information is required


the learning system determines for itself
as it grows its discrimination net, and
makes errors which it diagnoses as inadequate discrimination.

To pursue our simple example, suppose that the next syllable pair to be learned is PIB-JUK. There are no storage terminals in the net, as it stands, for the two new items. In other words, the net does not have the discriminative capability to contain more than two items. The input code for PIB is sorted by the net interpreter. Assume that Test 1 sorts it down the plus branch of Fig. 3. As there are differences between the incumbent image (with first-letter D) and the new code (with first-letter P), an attempt to store an image of PIB at this terminal would destroy the information previously stored there.

Clearly what is needed is the ability to discriminate further. A match for differences between the incumbent image and the challenging code is performed. When a difference is found, a new test is created to discriminate upon this difference. The new test is placed in the net at the point of failure to discriminate, an image of the new item is created, and both images--incumbent and new--are stored in terminals along their appropriate branches of the new test, and the conflict is resolved.*

*With the processes just described, the discrimination net would be grown each time a new item was to be added to the memory. But from an information processing standpoint, the matching and net-growing processes are the most time-consuming in the system. In general, with little additional effort, more than one difference can be detected, and more than one discriminating test can be added to the net. Each redundant test placed in the net gives one empty image list. At some future time, if an item is sorted to this empty image list, an image can be stored without growing the net. There is a happy medium between small nets which must be grown all the time and large nets replete with redundant tests and a wasteful surplus of empty image lists. Experimentation with this "structural parameter" has been done and it has been found that for this study one or two redundant tests per growth represents the happy medium. However, I would not care to speak of the generality of this particular result.

The net as it now stands is shown in Fig. 4. Test 2 is seen to discriminate on some difference between the letters P and D.

The input code for JUK is now sorted by the net interpreter. Since Test 1 cannot detect the difference between the input codes for JUK and JIR (under our previous assumption), JUK is sorted to the terminal containing the image of JIR. The match for differences takes place. Of course, there are no first-letter differences. But there are differences between the incumbent image and the new code in the second and third letters.

Noticing Order. In which letter should the matching process next scan for differences? In a serial machine like EPAM, this scanning must take place in some order. This order need not be arbitrarily determined and fixed. It can be made variable and adaptive. To this end EPAM has a noticing order for letters of syllables, which prescribes at any moment a letter-scanning sequence for the matching process. Because it is observed that subjects generally consider end-letters before middle-letters, the noticing order is initialized as follows: first-letter, third-letter, second-letter. When a particular letter being scanned yields a difference, this letter is promoted up one position on the noticing order. Hence, letter positions relatively rich in differences quickly get priority in the scanning. In our example, because no first-letter differences were found between the image of JIR and the code for JUK, the third letters are scanned and a difference is found (between R and K). A test is created to capitalize on this third-letter difference and the net is grown as before. The result is shown in Fig. 5. The noticing order is updated; third-letter, promoted up one, is at the head.

Learning of subsequent items proceeds in the same way, and we shall not pursue the example further.
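The growth step and the noticing order can be sketched in the same illustrative Python (again, the names and letter codes are hypothetical; the original is an IPL-V program): scan letter positions in noticing-order priority, take the first differing feature as the basis of a new test, and promote the letter position that yielded the difference.

def find_difference(incumbent, code, noticing_order):
    """Return (letter_position, feature_index) of the first difference found,
    scanning letter positions in noticing-order priority."""
    for pos in noticing_order:
        for feat, (a, b) in enumerate(zip(incumbent[pos], code[pos])):
            if a != b:
                return pos, feat
    return None

def promote(noticing_order, pos):
    """Move the letter position that yielded a difference up one place."""
    i = noticing_order.index(pos)
    if i > 0:
        noticing_order[i - 1], noticing_order[i] = noticing_order[i], noticing_order[i - 1]

# Toy example in the spirit of Fig. 5: JIR's image and JUK's code agree on the
# first letter, so the difference is found in the third letter (R vs. K).
noticing_order = [0, 2, 1]                      # first-, third-, second-letter
jir_image = [(1, 0, 0), (0, 1, 0), (1, 1, 1)]   # hypothetical letter codes
juk_code  = [(1, 0, 0), (0, 0, 1), (0, 1, 1)]
pos, feat = find_difference(jir_image, juk_code, noticing_order)
promote(noticing_order, pos)                    # third letter moves to the head
# A new test on (pos, feat) would now be placed at the point of failure, with
# the incumbent image and an image of JUK stored on its two branches.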

Associating Images: Retrieval Using Cues

The discrimination net and its interpreter associate codes of external objects with internal image lists and images. But the basic rote learning experiment requires that stimulus information somehow lead to response

information and a response. The discrimination net concept can be used for the
association of internal images with each
other (i.e., response with stimulus) with
very little addition to the basic
mechanism.
An association between a stimulus
image and a response image is accomplished
by storing with the stimulus image some of
the coded information about the response.
This information is called the cue. A cue
is of the same form as an input code, but
generally contains far less information
than an input code. A cue to an associated image can be sorted in the discrimination net by the net interpreter to retrieve the associated image. If, for example, in the net of Fig. 3 we had stored with the stimulus image the letter J as a cue to the response JIR, then sorting this cue would have correctly retrieved the response image. An EPAM internal association is built by storing with the
stimulus image information sufficient to
retrieve the response image from the net
at the moment of association.
The association process determines
how much information is sufficient by
trial and error. The noticing order for
letters is consulted, and the first-priority letter is added to the cue. The
cue is then sorted by the net interpreter
and a response image is produced. It
might be the wrong response image; for if
a test seeks information which the cue does
not contain, the interpreter branches left
or right randomly (with equal probabilities) at this test.* During association,
the selection of the wrong response is
immediately detectable (by a matching
process) because the response input code
is available. The next-priority letter is
added to the cue and the process repeats
until the correct response image is retrieved. The association is then considered complete.
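A sketch of this trial-and-error cue building, under the same hypothetical Python conventions as the earlier sketches, follows; sort_in_net stands in for the net interpreter, and the toy net needs only the first letter to retrieve the right image.

def build_cue(response_code, noticing_order, sort_in_net, correct_image):
    """Grow the cue one letter at a time until it retrieves the right image.
    A cue may lack letters a test asks about, in which case the interpreter
    branches at random; a wrong image is detectable by matching."""
    cue = {}                                   # partial copy of the response code
    for pos in noticing_order:
        cue[pos] = response_code[pos]          # add the next-priority letter
        if sort_in_net(cue) is correct_image:
            return cue                         # association considered complete
    return cue                                 # full code used as the cue

# Toy usage: a "net" that needs the first letter to tell JIR from another image.
jir_image = ['J', 'I', 'R']
other     = ['X', 'Y', 'Z']
toy_sort = lambda cue: jir_image if cue.get(0) == 'J' else other
print(build_cue(['J', 'I', 'R'], [0, 2, 1], toy_sort, jir_image))   # {0: 'J'}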
Note two important possibilities.
First, by the process just described, a
cue which is really not adequate to
guarantee retrieval of the response image
may by happenstance give the correct
response image selection during association. This "luck" usually gives rise to
response errors at a later time.
*This is the only use of a random
variable in EPAM. We do not like it. We
use it only because we have not yet discovered a plausible and satisfying adaptive mechanism for making the decision.
The random mechanism does, however, give
better results than the go-one-way-all-the-time mechanism which has also been
used.

Second, suppose that the association
building process does its job thoroughly.
The cue which it builds is sufficient to
retrieve the response image at one particular time, the time at which the two items were associated. If, at some
future time, the net is grown to encompass
new images being added to the memory,
then a cue which previously was sufficient
to correctly retrieve a response image
may no longer be sufficient to retrieve
that response image. In EPAM, association
links are "dated," and ever vulnerable
to interruption by further learning.
Responses may be "unlearned" or "forgotten"
temporarily, not because the response information has been destroyed in the memory,
but because the information has been temporarily lost in a growing network. If an
'association failure of this type can be
detected through feedback from the
environmental or experimental situation,
then the trouble is easily remedied by
adding additional response information
to the cue. If not, then the response
may be more or less permanently lost in
the net. The significance of this
phenomenon will perhaps be more easily
appreciated in the discussion of results
of the EPAM simulation.
Responding: Internal and External

A conceptual distinction is made
between the process by which EPAM selects
an internal response image and the process by which it converts this image
into an output to the environment.
Response retrieval. A stimulus item
is presented. This stimulus input code
is sorted in the discrimination net to
retrieve the image list, in which the cue
is found. The cue is sorted in the net
to retrieve another image list containing
the proposed response image. If there is
no cue, or if on either sorting pass an
empty image list is selected, no response
is made.
Response generation. For purposes
of response generation, there is a fixed
discrimination net (decoding net),
assumed already learned, which transforms
letter codes of internal images into
output form. The response image is decoded letter by letter by the net
interpreter in the decoding net for
letters.
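The two responding steps can be sketched as follows (hypothetical Python, not the IPL-V routines): retrieval sorts the stimulus code, takes the cue found on its image list, and sorts the cue; generation decodes the retrieved image letter by letter with a fixed decoding net.

def retrieve_response(stimulus_code, sort_in_net):
    """Return the proposed response image, or None if no response is made."""
    image_list = sort_in_net(stimulus_code)
    if not image_list or 'cue' not in image_list:
        return None                    # empty terminal or no cue: no response
    response_list = sort_in_net(image_list['cue'])
    return response_list.get('image') if response_list else None

def generate_response(response_image, decode_letter):
    """Decode each letter code of the image using a fixed decoding net."""
    return ''.join(decode_letter(letter) for letter in response_image)

# Toy usage: terminals are keyed by the code sorted to them.
terminals = {'DAX': {'cue': 'JIR-cue'},
             'JIR-cue': {'image': [('J',), ('I',), ('R',)]}}
toy_sort = lambda code: terminals.get(code)
print(generate_response(retrieve_response('DAX', toy_sort), lambda l: l[0]))   # JIR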
The Organization of the Learning Task
The learning of nonsense symbols by
the processes heretofore described takes
time. EPAM is a serial machine. Therefore, the individual items must be dealt
with in some sequence. This sequence
is not arbitrarily prescribed. It is the
result of higher order executive processes whose function is to control EPAM's
focus of attention. These macroprocesses,
as they are called, will not be described
or discussed here. A full exposition of
them is available in a paper by
Feigenbaum and Simon. [13]
Stating the Model Precisely:
Computer Program for EPAM
The EPAM model has been realized
as a program in Information Processing
Language V [14] and is currently being
run both on the Berkeley 704 and the RAND
7090. Descriptive information on the
computer realization, and also the
complete IPL-V program and data structures
for EPAM (as it stood in October, 1959)
are given in an earlier work by the
author [15].

IPL-V, a list processing language, was well suited as a language for the EPAM model for these key reasons:

a. The IPL-V basic processes deal explicitly and directly with list structures. The various information structures in EPAM (e.g., discrimination net, image list) are handled most easily as list structures. Indeed, the discrimination net is, virtually by definition, a list structure of a simple type.

b. It is useful in some places, and necessary in others, to store with some symbols information descriptive of these symbols. IPL-V's description list and description list processes are a good answer to this need.

c. The facility with which hierarchies of subroutine control can be written in IPL-V makes easy and uncomplicated the programming of the kind of complex control sequence which EPAM uses.

Empirical Explorations with EPAM

The procedure for exploring the behavior of EPAM is straightforward. We have written an "Experimenter" program and we give to this program the particular conditions of that experiment as input at the beginning of an experiment. The Experimenter routine then puts EPAM qua subject through its paces in that particular experiment. The complete record of stimuli presented and responses made is printed out, as is the final net. Any other information about the processing or the state of the EPAM memory can also be printed out.

A number of simulations of the basic paired-associate and serial-anticipation experiments have been run. Simulations of other classical experiments in the rote learning of nonsense syllables have also been run. The complete results of these simulation experiments and a comparison between EPAM's behavior and the reported behavior of human subjects will be the subject of a later report. However, some brief examples here will give an indication of results expected and met.

a. Stimulus and response generalization. These are psychological terms used to describe the following phenomenon. If X and X' are similar stimuli, and Y is the correct response to the presentation of X; then if Y is given in response to the presentation of X', this is called stimulus generalization. Likewise, if Y and Y' are similar responses, and Y' is given in response to the presentation of X, this is called response generalization. Generalization is common to the behavior of all subjects, and is found in the behavior of EPAM. It is a consequence of the responding process and the structure of the discrimination net. For those "stimuli" are similar in the EPAM memory whose input codes are sorted to the same terminal; and one "response" is similar to another if the one is stored in the same local area of the net as the other (and hence response error may occur when response cue information is insufficient).

b. Oscillation and Retroactive Inhibition. We have described these phenomena in an earlier section. Oscillation and retroactive inhibition appear in EPAM's behavior as consequences of simple mechanisms for discrimination, discrimination learning, and association. They were in no sense "designed into" the behavior. The appearance of rather complex phenomena such as these gives one a little more confidence in the credibility of the basic assumptions of the model.


These two phenomena are discussed
together here because in EPAM
they have the same origin. As
items are learned over time,
the discrimination net grows to
encompass the new alternatives.
Growing the net means adding
new tests, which in turn means
that more information will be
examined in all objects being
sorted. An important class of
sorted objects is the set of
cues. Cue information sufficient
at one moment for a firm association may be insufficient at a
later moment. As described
above, this may lead to response
failure. The failure is caused
entirely by the ordinary process
of learning new items. In the
case of oscillation, the new
items are items within a single
list being learned. In the case
of retroactive inhibition, the
new items are items of the second
list being learned in the same
discrimination net. In both
cases the reason for the response
failure is the same. According
to this explanation, the phenomena are first cousins (an hypothesis which has not been widely
considered by psychologists).
In the EPAM model, the term
interference is no longer merely
descriptive--it has a precise
and operational meaning. The
process by which later learning
interferes with earlier learning
is completely specified.
c.

Forgetting. The usual explanations of forgetting use in one
way or another the simple and
appealing idea that stored information is physically destroyed
in the brain over time (e.g., the
decay of a "memory trace," or
the overwriting of old information by new information, as
in a computer memory). Such
explanations have never dealt
adequately with the commonplace
observation that all of us can
remember, under certain conditions, detailed and seemingly
unimportant information after
very long time periods have
elapsed. An alternative explanation, not so easily visualized,
is that forgetting occurs not
because of information destruction but because learned
material gets lost and inaccessible in a large and growing
association network.

EPAM forgets seemingly well-learned responses. This forgetting occurs as a direct
consequence of later learning
by the learning processes.
Furthermore, forgetting is only
temporary: lost associations
can be reconstructed by storing
more cue information. EPAM
provides a mechanism for explaining the forgetting phenomenon
in the absence of any information loss. As far as we
know, it is the first concrete
demonstration of this type of
forgetting in a learning machine.
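The phenomenon can be made concrete with a small sketch (toy data and hypothetical Python, not the EPAM program itself): a cue that once sufficed sorts ambiguously after a new test has been added below the old one, because the cue contains no letter for the new test to examine.

import random

def sort_cue(net, cue):
    """Follow tests down the net; branch at random when the cue lacks the letter."""
    while isinstance(net, dict):
        letter = cue.get(net['position'])
        if letter is None:
            net = random.choice([net['+'], net['-']])   # EPAM's one random choice
        else:
            net = net['+'] if letter == net['value'] else net['-']
    return net                                          # an image name

# Before further learning: a single first-letter test retrieves JIR from the cue "J".
small_net = {'position': 0, 'value': 'J', '+': 'JIR', '-': 'DAX'}
cue = {0: 'J'}
assert sort_cue(small_net, cue) == 'JIR'

# After learning JUK, a third-letter test sits below the old one; the stored cue
# has no third letter, so retrieval of JIR now succeeds only by luck.
grown_net = {'position': 0, 'value': 'J',
             '+': {'position': 2, 'value': 'R', '+': 'JIR', '-': 'JUK'},
             '-': 'DAX'}
print(sort_cue(grown_net, cue))    # sometimes JIR, sometimes JUK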
Conclusion: A Look Ahead

Verification of an information
processing theory is obtained by simulating many different experiments and
by comparing in detail specific
qualitative and quantitative features of
real behavior with the behavior of the
simulation. To date, Mr. Simon and I
have run a number of simulated experiments.
As we explore verbal learning further,
more of these will be necessary.
We have been experimenting with a variety of "sense modes" for EPAM, corresponding to "visual" input and "written" output, "auditory" input and "oral" output, "muscular" inputs and outputs. To each mode corresponds a perceptual input coding scheme, and a discrimination net. Associations-across-nets, as well as the familiar associations-within-nets, are now possible. Internal transformations between representations in different modes are possible. Thus, EPAM can "sound" in the "mind's ear" what it "sees" in the "mind's eye," just
as all of us do so easily. We have been
teaching EPAM to read-by-association,
much as one teaches a small child beginning
reading. We have only begun to explore
this new addition.

The EPAM model has pointed up a
failure shared by all existing theories
of rote learning (including the present
EPAM). It is the problem of whether
association takes place between symbols
or between tokens of these symbols.
For example, EPAM cannot learn a serial
list in which the same item occurs
twice. It cannot distinguish between
the first and second occurrence of
the item. To resolve the problem we
have formulated (and are testing)
processes for building, storing, and
responding from chains of token associations.


References

1. Hull, C. L., C. I. Hovland, R. T. Ross, M. Hall, D. T. Perkins and F. B. Fitch, Mathematico-deductive Theory of Rote Learning, New Haven, Connecticut, Yale University Press, 1940.

2. Gibson, E. J., "A Systematic Application of the Concepts of Generalization and Differentiation to Verbal Learning," Psychol. Rev., Vol. 47, 1940, pp. 196-229.

3. Atkinson, R. C., "An Analysis of Rote Serial Learning in Terms of a Statistical Model," Indiana University, Doctoral Dissertation, 1954.

4. Stevens, S. S., ed., Handbook of Experimental Psychology, New York, Wiley, 1951.

5. Hull, C. L., "The Influence of Caffeine and Other Factors on Certain Phenomena of Rote Learning," J. Gen. Psychol., Vol. 13, 1935, pp. 249-273.

6. Feldman, J., "An Analysis of Predictive Behavior in a Two Choice Situation," Carnegie Institute of Technology, Doctoral Dissertation, 1959.

7. Newell, A., J. C. Shaw, and H. A. Simon, "Report on a General Problem Solving Program," Information Processing: Proceedings of the International Conference on Information Processing, UNESCO, Paris, June, 1959, pp. 256-264.

8. Lindsay, R., "The Reading Machine Problem," CIP Working Paper #33, Carnegie Institute of Technology (Graduate School of Industrial Administration), Pittsburgh, 1960.

9. Wickelgren, W., "A Simulation Program for Conservative Focusing," unpublished manuscript, University of California, Berkeley, January, 1961.

10. Bruner, J. S., J. J. Goodnow, and G. A. Austin, A Study of Thinking, New York, Wiley, 1956.

11. Selfridge, O. G., "Pattern Recognition and Modern Computers," Proceedings of the 1955 Western Joint Computer Conference, IRE, March, 1955.

12. Minsky, M., "Steps Toward Artificial Intelligence," Proceedings of the IRE, Vol. 49, No. 1, January, 1961, pp. 8-30.

13. Feigenbaum, E. and H. A. Simon, "A Theory of the Serial Position Effect," CIP Working Paper #14, Carnegie Institute of Technology (Graduate School of Industrial Administration), Pittsburgh, 1959.

14. Newell, A., F. M. Tonge, E. A. Feigenbaum, G. H. Mealy, N. Saber, B. F. Green, and A. K. Wolf, Information Processing Language V Manual (Sections I and II), Santa Monica, California, The RAND Corporation, P-1897 and P-1918.

15. Feigenbaum, E., An Information Processing Theory of Verbal Learning, Santa Monica, California, The RAND Corporation, P-1817, October, 1959.


[Flow chart: raw stimulus; perceive features of stimulus; EPAM stimulus input code; discriminate stimulus to find stimulus image; find associated cue; discriminate cue to find response image; generate response to environment using decoding net; response output.]

Fig. 1 - EPAM performance process for producing the response associated with a stimulus


[Legend: discriminating test at a node; image at a terminal; image and cue at a terminal; empty terminal.]

Fig. 2 - A typical EPAM discrimination net

[Stimulus DAX; response JIR.]

Fig. 3 - Discrimination net after the learning of the first two items. Cues are not shown. Condition: no redundant tests added. Test 1 is a first-letter test.


[Stimulus PIB; response JUK.]

Fig. 4 - Discrimination net of Fig. 3 after the learning of the stimulus item, PIB. Test 2 is a first-letter test.

[Stimulus PIB; response JUK.]

Fig. 5 - Discrimination net of Fig. 4 after the learning of the response item, JUK. Test 3 is a third-letter test.

SIMULATION OF BEHAVIOR
IN THE BINARY CHOICE EXPERIMENT
Julian Feldman
University of California, Berkeley
Summary
A modern, high-speed digital computer has
been used to simulate the behavior of individual
human subjects in a classical psychological experiment where the subject is asked to predict a
series of binary events. The representation of
models of human behavior in the form of computer
programs has permitted the construction and study
of more realistic hypothesis-testing models of
behavior in this experiment rather than the oversimplified conditioning models previously proposed.
A model for one subject is described in detail,
and the problem of comparing the behavior of the
model to the behavior of the subject is also discussed.
Introduction
Modern, high-speed digital computers have
been used to simulate large, complex systems in
order to facilitate the study of these systems.
One of these systems that has been studied with
the aid of computer simulation is man. The present paper describes another addition to the growing list of efforts to study human thinking processes by simulating these processes on a computer. The research summarized here has been
concerned with simulating the behavior of individual subjects in the binary choice experiment. 3
The first section of this paper contains a description of the experiment. An overview of the
model is given in the second section. The model
for a particular subject is described in some
detail in the third section.
The Binary Choice Experiment
In the binary choice experiment, the subject
is asked to predict which of two events, El or E2,
will occur on each of a series of trials. After
the subject makes a prediction, he is told which
event actually occurred. The sequence of events
is usually determined by some random mechanism,
e.g., a table of random numbers. One and only one
event occurs on each trial. The events may be
flashes of light or symbols on a deck of cards.
The subject is usually asked to make as many
correct predictions as he can.
In the research reported here, the experiment
described in the preceding paragraph was modified
by asking the subject to "think aloud"--to give his reasons for making a prediction as well as the prediction itself. The subject's remarks were recorded. The subject was instructed to "think aloud" in order to obtain more information on the processing that the subject was doing. This technique has been used in some of the classical investigations of problem-solving behavior2,5 and in other computer simulation studies of thinking.1,7 A comparison of the behavior of subjects in the binary choice experiment who did "think aloud" with the behavior of subjects who did not "think aloud" did not reveal any major differences.3 The events in the present experiment were the symbols "plus" and "check." "Check" occurred on 142 of 200 trials and "plus" on the remaining 58 trials. The symbols were recorded on a numbered deck of 3 inch x 5 inch cards. After the subject made his prediction for trial t, he was shown card t which contained a "plus" or "check." While the subject was predicting the event of trial t, he could only see the event of trial t-1. A transcription of the tape recording of the remarks of subject DH and the experimenter, the author, in an hour-long binary choice experiment is presented in the Appendix. In the Appendix and the rest of this paper, the symbols "plus" and "check" are represented by "P" and "C" respectively. The transcription will be referred to as
a protocol.
The Basic Model
To simulate the behavior of an individual
subject in the binary choice experiment, a model
of the subject's behavior must be formulated as a
computer program. If the program is then allowed
to predict the same event series as the subject
has predicted, the behavior of the program--the
predictions and the reasons--can be compared to
the behavior of the subject. If the program's
behavior is a reasonable facsimile of the subject's behavior, the program is at least a sufficient explanation of the subject's behavior. The
level of explanation is really determined by the
subject's statements. No attempt is made to go
beyond these to more basic processes, e.g., neurological or chemical, of human behavior. Thus, the
model is an attempt to specify the relationship
between the reasons or hypotheses that the subject
offers for his predictions and the preceding hypotheses, predictions, and events. The subject is
depicted as actively proposing hypotheses about
the structure of the event series. These hypotheses are tested by using them to predict events.
If the prediction of the event is correct, the
hypothesis is usually retained. If the prediction
of the event is wrong, a new hypothesis is generally proposed.
The Model for DH
The model for each subject is based on a detailed examination of the protocol and some conjectures about human behavior. Perhaps the best thing to do at this point is to describe in some detail a model for the subject DH, whose protocol
appears in the Appendix.

The Hypotheses

This model proposes two types of hypotheses about the event series. The first type of hypothesis is a pattern of events. The model has a repertoire of nine patterns:

progression of C's
progression of P's
single alternation
2 C's and 1 P
1 C and 2 P's
2 P's and 2 C's
3 P's and 3 C's
4 P's and 4 C's
4 P's and 3 C's
The model can propose that the event series
is behaving according to one of these patterns and
use the pattern-hypothesis to predict the event of
a given trial, t. The predictions of the first two
patterns--progression of C's and progression of
P's--for trial t are independent of the events
preceding trial t. The predictions of the other
patterns (the alternation patterns) are dependent
on these preceding events. Thus, if the subject
proposes the pattern "single alternation" for
trial t and the event of trial t-l was a C, the
prediction for trial t is a P. In order to facilitate the determination of the prediction of an
alternation pattern for trial t, the patterns are
coded as sorting nets. For example, the pattern
"2 C's and 1 P" is represented in the following
fashion:
Is event t-l a C?
No--Predict C for trial t.
Yes-Is event t-2 a C?
No--Predict C for trial t.
Yes-Predict P for trial t.
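A sketch of such a coded pattern, in illustrative Python rather than the author's program (the function names are hypothetical), is:

def predict_2c1p(events):
    """Pattern '2 C's and 1 P': events is the series so far, events[-1] = trial t-1."""
    if events[-1] != 'C':          # Is event t-1 a C?  No: predict C for trial t
        return 'C'
    if events[-2] != 'C':          # Is event t-2 a C?  No: predict C for trial t
        return 'C'
    return 'P'                     # two C's have just occurred: predict P

# Pure progressions ignore the preceding events; alternation patterns consult them.
predict_prog_c = lambda events: 'C'
predict_single_alt = lambda events: 'P' if events[-1] == 'C' else 'C'

print(predict_2c1p(['P', 'C', 'C']))   # -> 'P'
print(predict_2c1p(['C', 'C', 'P']))   # -> 'C'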
The second type of hypothesis that the model
can propose is an anti-pattern or guess-opposite
hypothesis. For example, the model can propose
that the event of trial t will be the opposite of
that predicted by a given pattern. This type of
hypothesis is the model's representation of the
notion of "gambler's fallacy"--the reason people
predict "tails" after a coin falls "heads" seven
times in a row.
The most general form of hypothesis has two components: a pattern component and a guess-opposite component. The prediction of the hypothesis is obtained by finding the prediction of the pattern component. If the hypothesis has a guess-opposite component, then the prediction of the hypothesis is the opposite of the pattern prediction. If the hypothesis does not have a guess-opposite component, then the prediction of the hypothesis is the prediction of the pattern component. Thus, while the prediction of the pattern-hypothesis "progression of C's" is always a C, the prediction of the hypothesis "guess-opposite-progression-of-C's" is always a P.

The Basic Cycle

The basic cycle of the model is as follows: The model uses an hypothesis to predict the event of trial t. The event is then presented. The model in Phase One "explains" the event of trial t with an explanation-hypothesis. In Phase Two a prediction-hypothesis for trial t+1 is formed. The model uses this prediction-hypothesis to predict trial t+1. The event of trial t+1 is presented, and the cycle continues.
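Under the same hypothetical Python conventions, the general hypothesis and the basic cycle can be sketched as follows; phase_one and phase_two stand in for the processes described in the next two subsections.

OPPOSITE = {'P': 'C', 'C': 'P'}

def predict(hypothesis, events):
    """A hypothesis is (pattern_function, guess_opposite_flag)."""
    pattern, guess_opposite = hypothesis
    p = pattern(events)
    return OPPOSITE[p] if guess_opposite else p

def run_trials(event_series, initial_hypothesis, phase_one, phase_two):
    """Basic cycle: predict trial t, see the event, form an explanation-
    hypothesis (Phase One), then a prediction-hypothesis for t+1 (Phase Two)."""
    hypothesis, record = initial_hypothesis, []
    for t, event in enumerate(event_series):
        history = list(event_series[:t])
        record.append((predict(hypothesis, history), event))
        explanation = phase_one(hypothesis, history, event)
        hypothesis = phase_two(hypothesis, explanation, history, event)
    return record

# Toy usage: a fixed "progression of C's" hypothesis that is never revised.
prog_c = (lambda events: 'C', False)
keep = lambda hyp, *args: hyp
print(run_trials(['C', 'C', 'P', 'C'], prog_c, keep, lambda h, e, *a: e))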

Phase One
The basic motivation for this phase of the
model is that the model must "explain" each event.
An acceptable explanation is an hypothesis that
could have predicted the event. The processing of
Phase One is represented in the flow chart of
Fig. 1.
The processing to determine the explanation-hypothesis for trial t begins by testing whether
the pattern component of the prediction-hypothesis
for trial t could have predicted the event of
trial t correctly. If the pattern component could
have predicted correctly, the pattern component is
the explanation-hypothesis. If the pattern component could not have predicted correctly, the
pattern-change mechanism is evoked. Thus if the
prediction-hypothesis for trial t contained only a
pattern component and the hypothesis predicted
correctly, the explanation-hypothesis for trial t
is the prediction-hypothesis for trial t. If the
prediction-hypothesis for trial t contained a
guess-opposite component and the hypothesis predicted correctly, the pattern-change mechanism is
evoked because the pattern component could not
have predicted the event correctly by itself. If
the prediction-hypothesis for trial t was a guess-opposite-hypothesis and it predicted incorrectly,
the pattern component of the prediction-hypothesis
becomes the explanation-hypothesis for trial t.
The motivation here is really quite simple
although the explanation may sound involved.
If, in this binary situation, the hypothesis that
a pattern will change leads to an incorrect prediction, the pattern must have persisted; and the
pattern is an acceptable explanation of the event.
If the hypothesis that a pattern will change leads
to a correct prediction, the pattern obviously did
not persist; and the possibility of a new pattern
is considered.
The pattern-change mechanism is evoked on
trial t if the pattern component of the prediction-hypothesis for trial t is unable to predict
the event of trial t. The pattern-change mechanism consists of two parts. The first part evokes
a subset of the nine patterns listed above. The
second part of the pattern-change mechanism
selects a single pattern out of the evoked set.
A pattern is evoked, i.e., considered as a possible explanation of the event of trial t, if the
pattern can predict the events of trials t and
t-l. The pattern of the prediction-hypothesis for
trial t, i.e., the pattern that cannot predict
event t, is included in the evoked set if it can

predict events t-l, t-2, and t-3. Of the patterns
that are evoked, the pattern that has been
selected most often on prior trials is selected as
the pattern component of the explanation-hypothesis. If the pattern component of the prediction-hypothesis for trial t is selected, then the
explanation-hypothesis is an anti-pattern hypothesis which is the model's interpretation of the
subject's hypothesis "you have thrown me off the
pattern" (cf. trial 9 of the protocol in the
Appendix). The model interprets event t as an
attempt to "throw me off" when the following three
conditions are met: (1) the pattern is unable to
predict the event of trial t; (2) the pattern is
able to predict at least the three consecutive
events of trials t-l, t-2, and t-3; and (3) the
pattern is also the most frequently selected of
those patterns that are evoked.
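A sketch of the pattern-change mechanism, in the same hypothetical Python (the guard for an empty evoked set is a simplification not discussed in the text), is:

def could_predict(pattern, events, trial):
    """True if `pattern` predicts the event of `trial` from the events before it."""
    return pattern(events[:trial]) == events[trial]

def change_pattern(failed, patterns, events, t, times_selected):
    """Return the explanation-hypothesis (pattern, anti-pattern flag) for trial t."""
    evoked = [p for p in patterns
              if could_predict(p, events, t) and could_predict(p, events, t - 1)]
    if all(could_predict(failed, events, t - k) for k in (1, 2, 3)):
        evoked.append(failed)               # the failed pattern stays eligible
    if not evoked:                          # simplification: nothing evoked
        return (failed, False)
    chosen = max(evoked, key=lambda p: times_selected.get(p, 0))
    times_selected[chosen] = times_selected.get(chosen, 0) + 1
    if chosen is failed:                    # "you have thrown me off the pattern"
        return (chosen, True)               # anti-pattern explanation-hypothesis
    return (chosen, False)                  # plain pattern explanation-hypothesis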
Phase Two
While Phase One is concerned mainly with the
processing of the pattern-component of the hypothesis, Phase Two is concerned with the processing
of the guess-opposite component. Phase Two is
represented in the flow chart of Fig. 2.
If the prediction-hypothesis for trial t contained a guess-opposite component, the guess-opposite component is processed in a fashion quite
analogous to the processing of the pattern component in Phase One. If the anti-pattern prediction-hypothesis for trial t predicted the event of
trial t correctly, the guess-opposite component is
retained, and the prediction-hypothesis for trial
t+1 is guess-opposite-the-pattern-of-the-explanation-hypothesis. If the anti-pattern prediction-hypothesis for trial t predicted the event of
trial t incorrectly, the guess-opposite component
is considered for retention in a fashion analogous
to the "throw-me-off" consideration for patterns.
If the prediction-hypotheses for trials t-l and
t-2 had guess-opposite components and these hypotheses predicted correctly, then the guess-opposite component is retained for the prediction-hypothesis of trial t+l. If these conditions are
not fulfilled, the guess-opposite component is
dropped; and the prediction-hypothesis for trial
t+l is the explanation-hypothesis for trial t.
If the prediction-hypothesis for trial t did
not contain a guess-opposite component, the model
considers whether or not the guess-opposite component should be introduced on trial t+1. The
model makes this decision on the basis of its
past experience. It determines the number of consecutive events including and preceding the event
of trial t that can be predicted by the explanation-hypothesis for trial t. This number will be
called Nl. Then the model searches its memory
backwards from the last trial included in Nl to
find a trial for which the explanation-hypothesis
was the same as the explanation-hypothesis for
trial t. Then the model determines the number of
contiguous events including, preceding, and following this prior occurrence of the explanationhypothesis of trial t that can be predicted by
this hypothesis. This number will be called N2.

If N2 = N1, the model decides that the explanation-hypothesis for trial t will not be the prediction-hypothesis for trial t+1. The prediction-hypothesis for trial t+1 becomes guess-opposite-the-explanation-hypothesis for trial t. If N2 > N1,
the model decides that the explanation-hypothesis
for trial t will be the prediction-hypothesis for
trial t+1. If N1 > N2, the model decides that this
prior occurrence of the explanation-hypothesis for
trial t is really not pertinent and continues to
search its memory for an occurrence of the explanation-hypothesis where N2 ≥ N1. If no such
occurrence can be found, the prediction-hypothesis
for trial t+1 is the explanation-hypothesis for
trial t.
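The N1/N2 decision can be sketched as follows (hypothetical Python; predicts(events, trial) answers whether the explanation-hypothesis could have predicted the event of a given trial):

def run_length_ending_at(predicts, events, t):
    """N1: consecutive events, ending with trial t, predictable by the hypothesis."""
    n = 0
    while t - n >= 0 and predicts(events, t - n):
        n += 1
    return n

def run_length_around(predicts, events, t):
    """N2: contiguous predictable events including, preceding and following trial t."""
    n = run_length_ending_at(predicts, events, t)
    after = t + 1
    while after < len(events) and predicts(events, after):
        n += 1
        after += 1
    return n

def introduce_guess_opposite(predicts, explanation_history, events, t):
    """True if the prediction-hypothesis for t+1 should guess opposite."""
    n1 = run_length_ending_at(predicts, events, t)
    for prior in range(t - n1, -1, -1):                 # search memory backwards
        if explanation_history[prior] != explanation_history[t]:
            continue
        n2 = run_length_around(predicts, events, prior)
        if n2 == n1:
            return True                                 # same run length: guess opposite
        if n2 > n1:
            return False                                # longer run last time: keep it
        # n1 > n2: this occurrence is not pertinent; keep searching
    return False                                        # no such occurrence: keep it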
Predicting with the Models
Models of individual behavior like the one
described for DH can be used to predict the same
series of binary events that the subject was
asked to predict. The predictions and hypotheses
of the model--the model's protocol--can then be
compared to the subject's protocol. The model
does not speak idiomatic English, and so the comparison is made between the machine's protocol
and a suitably coded version of the subject's
protocol.
The model's protocol can be generated by
presenting the model with the events in the same
way the subject was presented with the events in
the binary choice experiment; or the computer can
take the experimenter's role, too, if suitable
precautions are taken to prevent the model from
peeking. However, this straightforward method of
simulating the subject's behavior raises difficulties. These difficulties are identical to those
of getting a chess or checker program to play a
book game.6,8 Because the decision of the chess
or checker program at move m depends on its decisions at the preceding moves, m-1, m-2, ..., such
a program, when it is playing a book game, must be
"set back on the track" if its move deviates from
the book move. The program and the book must have
the same history if the program is to have a fair
chance to make the same decision as that made in
the book game. This "setting-back-on-the-track"
may involve resetting a large number of parameters
as well as changing the move itself. Elsewhere, I
have called this "setting-back-on-the-track" technique conditional prediction. 4 The prediction of
the model is conditional on the preceding decisions of the model being the same as those of the
subject it is trying to predict.
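A sketch of this monitoring scheme, in hypothetical Python (the actual controls are part of the author's program and are summarized in Figs. 3 and 4), is:

def run_conditionally(model_decide, subject_decisions, impose):
    """model_decide(t) returns the model's decision at point t; impose(t, d)
    resets the model's state so that the subject's decision d becomes part of
    its history (the model is set back on the track)."""
    agreements, differences = 0, 0
    for t, subject_choice in enumerate(subject_decisions):
        if model_decide(t) == subject_choice:
            agreements += 1
        else:
            differences += 1
            impose(t, subject_choice)     # record the difference and impose the decision
    return agreements, differences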
The application of the conditional prediction
technique to binary choice models such as the one
described above for subject DH involves (1) comparing the program's behavior and the subject's
behavior at every possible point, (2) recording
the differences between the behaviors, and (3)
imposing the subject's decision on the model
where necessary. A type of monitor system is
imposed on the program to perform these functions.
The model for DH with the conditional prediction
system controls is represented in Figs. 3 and 4.
An example will help clarify these figures. In

Fig. 3, after each decision by the model to keep
the pattern of the prediction-hypothesis for
trial t for the explanation-hypothesis for trial
t (8), this decision is compared to the subject's
decision (1). If the model's decision was different from that of the subject, control is transferred to the pattern-change mechanism (3 trials).
If the model's decision was the same as that of
the subject, control is transferred to another
part of the monitor (117 trials). Figs. 3 and 4
only contain the results for 195 trials because
the model began at trial 6.

Deficiencies of the Models

The model for DH and the similar models that have been constructed to simulate the behavior of two other subjects in the binary choice experiment3 are deficient in several respects. First of all, the comparison of the behavior of the model to that behavior of the subject from which the model was developed is, of course, not a very good test of the model. This type of comparison only yields some indication of the adequacy of the model and its components. Comparison of the behavior of the model to sequences of behavior of the subject not used in constructing the model awaits correction of some of the deficiencies mentioned below.

The segment of the model which has the highest number of errors relative to the number of times it is used is the guess-opposite segment (see Fig. 4). The subject certainly exhibits this type of behavior, but the model does not very often predict "guess opposite" when the subject does.

The pattern-change segment has a better error record, but it raises another issue. This segment is actually a selection device. A pattern is selected from the list of patterns that the subject uses. A more elegant pattern-change mechanism would generate a pattern out of the preceding sequence of events and some basic concepts. One of these concepts might be that patterns with equal numbers of P's and C's are preferred to alternation patterns with unequal numbers of P's and C's, all other things being equal.

The models have no mechanisms for making perceptual errors--"seeing" one symbol when another has occurred. Examination of the protocol of DH (Appendix) indicates that he does sometimes think that a C is a P (e.g., trial 196).

The models do not have a sufficiently rich repertoire of hypotheses. Subjects entertain more types of hypotheses about the event series than the two types, pattern and anti-pattern, used in the model for DH. Some subjects entertain more sophisticated hypotheses. For example, one subject was able to detect the fact that a series of events was randomized in blocks of ten trials, i.e., the series had 7 P's and 3 C's in each block of ten trials.

Some evidence also exists that when suitably motivated by money, some subjects in a binary choice experiment will predict the most frequent event on each trial. Models for these subjects require statements of the conditions under which subjects abandon testing other hypotheses or at least abandon testing hypotheses by using them to predict events. Hypotheses could still be considered and tested without using them to predict events.

Contributions of the Models

The consequences of computer simulation for the study of human behavior have been discussed at some length in several places, and I have made a limited statement of my views on this matter in another place.4 It will suffice then to discuss some of the implications of the work reported here for our understanding of behavior. The computer models of binary choice behavior are relatively simple computer programs; however, they are relatively complex psychological models. A widely accepted view of binary choice behavior has been the idea of verbal conditioning embodied in the stochastic learning model. In its simplest form, this model says the subject's probability of predicting El or E2 in the binary choice experiment is an exponentially-weighted moving average over preceding events. The verbal conditioning model is hardly consistent with the hypothesis-testing behavior exhibited by DH and a dozen other subjects for whom I have protocols. Protocols of group behavior in the binary choice experiment made available to me by David G. Hays are also consistent with the general idea of hypothesis-testing. Other inadequacies of the verbal conditioning model and evidence for hypothesis-testing models have been discussed elsewhere.3

Conclusions

The computer has provided the exponents of hypothesis-testing models of behavior with the means for studying and testing these complex models. Oversimplified explanations of human behavior can no longer be justified on the grounds that the means for studying complex models do not exist. Hopefully, the use of computers to simulate human behavior can extend man's intellect by helping him study his own behavior.

Acknowledgment

The author is indebted to Allen Newell for advice and suggestions made during the course of the research summarized in this paper.

References

1. Clarkson, G.P.E., and Simon, H.A. Simulation of individual and group behavior. American Economic Review, 1960, 50, 920-932.

2. Duncker, K. On problem solving. Psychological Monographs, 1945, 58, No. 270.

3. Feldman, J. An analysis of predictive behavior in a two-choice situation. Unpublished doctoral dissertation, Carnegie Institute of
Technology, 1959.

4. Feldman, J. Computer simulation of cognitive processes. In H. Borko (ed.), Computer applications in the behavioral sciences. Englewood Cliffs: Prentice-Hall, forthcoming.

5. Heidbreder, E. An experimental study of thinking. Archives of Psychology, 1924, 11, No. 73.

6. Newell, A., Shaw, J.C., and Simon, H.A. Report on the play of Chess Player 1-5 of a book game of Morphy vs. Duke Karl of Brunswick and Count Isouard. CIP Working Paper No. 21, Graduate School of Industrial Administration, Carnegie Institute of Technology, 1959.

7. Newell, A., and Simon, H.A. The simulation of human thought. RAND Corporation Paper P-1734, 1959.

8. Samuel, A.L. Some studies in machine learning, using the game of checkers. IBM Journal of Research and Development, 1959, 3, 210-230.

Appendix: Protocol of Subject DH*

*The statements in parentheses are those of the experimenter.

(All right, now I'll read the instructions to you. I'm going to show you a series of symbols. They will either be a P symbol or a C symbol. Before each word I'll give the signal NOW. When you hear the signal NOW, tell me what symbol you expect will occur on the next trial and why you selected that symbol. That's the purpose of the tape recorder. Take your time. After you have given me your guess, I will show you the correct symbol. Your goal is to anticipate each word as accurately as you can. Please ... Well, do you have any questions?) Primarily, I just guess whether it'll be a P or a C. (That's it.) But this explaining why I think so. It can be little more than--I think it'll be this, I guess, I have a feeling. How more involved can it be than that? (Well, whatever reasons you have. If those are the only reasons that occur to you as you go thru this those will be the only reasons. Maybe they won't. OK, we'll try a few and then if you have any questions ...)

(Now what do you expect the first symbol will be?) P. (OK, the 1st symbol is a C.)

(OK, now what do you expect the 2d symbol will be?) It'll be a P. (Why?) It's pictured in my mind. (OK, the 2d symbol is a C.)

I'll say a C. (Why?) Primarily this time because I'm trying to outguess you. (OK, the 3d symbol is a C.)

(What do you say for the 4th symbol?) I'll say C again. (Why?) This time I feel it'll be a C. (The 4th symbol is a C. When you give your answer, if you say, "I think the 5th one will be something," it'll be easier to check the tape against the answer sheet.)

(What do you think the 5th one will be?) The 5th one will be a P. (Why is that?) I feel it'll be a P, that's all. (The 5th one is a C.)

(What do you think the 6th one will be?) The 6th one will be a C because you've been giving me C's all along, and I don't think this progression will end. (The 6th one is a C.)

(What do you think the 7th one will be?) The 7th one will be a C because I don't think the progression will be broken. (OK, the 7th one was a C.)

The 8th one will be a C for the same reason. You won't break the progression. (OK, the 8th one is a P.)

(What do you think the 9th one will be?) The 9th one will be a C. (Why is that?) I think that you just gave me the P to throw me off and you'll continue the progression. (The 9th one is a C. Oh, one thing, can you see these cards?) Yes. (Can you see me writing?) No, I can't. (OK.) I'm not looking. (Well, you can look at these cards. I want you to see I'm not picking these out of my head. This set has been predetermined.)

All right. This one will be a P. The 10th one will be a P. (Why is that?) I feel that the progression will start to mix up now. (The 10th one is a C.)

(What do you think the 11th one will be?) The 11th one will be a C. You're continuing the progression. (The 11th one is a C.)

(What do you think the 12th one will be?) The 12th one will be a C because you're continuing the progression. (The 12th one is a P.)

The 13th one will be a C. The 12th one was a P. You were trying to throw me off. The progression will continue. (The 13th one is a P.)

The 14th one will be a P. You're beginning a new progression with P's. (The 14th one is a P.)

The 15th one will be a P. You're still continuing the progression. (The 15th one is a P.)

(What about the 16th one?) The 16th one will be a C. ... to throw me off now. (The 16th one is a C.)

The 17th one will be a C. You're going to see if I'll revert to the progression of P's. (The 17th one is a C.)

The 18th one will be a P. You're going to break this progression of C's. (The 18th one is a C.)
The 19th one will be a P. You're going to get off
this progression of C's. (The 19th one is a P.)

The 39th is a C. You'll continue the progression.
(The 39th is a C.)

The 20th one will be a P. You're going to try to
throw me off trying to make me think that all-think you're going back to the other progression
which I'm confused about now. I don't remember
what the last one was--C, I believe. (The 20th
one is a P.)

The 40th is a C. You'll continue the progression.
(The 40th is a C.)

The 21st one will be a C. You won't continue with
the progression of P's. (The 21st one is a P.)
The 22d one is a C. You're doing this so that I
might think the P progressi.on will continue. (The
22d one is a C.)
The 23d one will be a C. You're trying to make me
think that the next one will be a P--going back to
the old progression. (The 23d one is a C.)
The 24th one will be a C. You're going to continue the progression of C's. (The 24th one was a C.)

The 41st is a C. You'll continue the progression. (The 41st is a C.)

The 42d is a C. You'll continue the progression. (The 42d is a C.)

The 43d is a C. You'll still continue the progression. (The 43d is a C.)

The 44th is a C. You'll still continue the progression. (The 44th is a C.)

The 45th is a C. You'll still continue the progression. (The 45th is a C.)

The 46th is a C. You'll still continue the progression. (The 46th is a C.)

The 25th one is a C. You're still going to continue the progression of C's. (The 25th one is a C.)

The 48th will be a C. You'll go back to the old progression. (The 48th is a C.)

The 26th one is still a C. You'll continue the
progression. (The 26th one is a C.)
The 27th one is a P. You'll break the progression
now. (The 27th one is a C.)
The 28th one will be a P. You're going to break
the progression now. (The 28th one is a C.)
The 29th one is a C. You're continuing the progression. (The 29th one is a C.)
The 30th is a C. You'll still continue the progression. (The 30th is a C.)
The 31st is a C. You'll continue the progression.
(The 31st is a C.)
The 32d is a C. You'll still continue the progression. (The 32d is a P.)
The 33d is a C. You gave me a P last time to throw me off. (The 33d is a C.)
The 34th is a C. You'll continue the progression.
(The 34th is a C.)
The 35th is a P. You're going to throw me off the progression. (The 35th is a C.)

The 36th is a C. You'll continue the progression. (The 36th is a C.)

The 47th will be a P. You'll now break the progression. (The 47th is a C.)

The 49th is a C. You'll continue the progression.
(The 49th is a C.)
The 50th is a C. You'll continue the progression.
(The 50th is a P.)
The 51st will be a C. You gave me the P to throw
me off. (The 51st is a P.)
The 52d is a p. You've begun a progression of
P's. (The 52d is a C.)
The 53d is a P. You gave me a C to throw me off.
(The 53d is a C.)
The 54th is a C. You'll continue the progression
of C's. (The 54th is a C.)
The 55th is a C. You'll still continue the progression. (The 55th is a C.)
The 56th is a C. You'll continue the p"rogression.
(The 56th is a P.)
57 is a P. The P will throw me off the progression thinking you had tried to throw me off the C progression with your last P. (57 you said was a P?) P. (57 was a C.)

58 is a C. You began a progression of C's. (58 is a P.)

The 37th is a C. You'll continue the progression.
(The 37th is a C.)

59 is a C. You're still trying to throw me off with the C's. (59 is a P.)

The 38th is a C. You'll continue the progression.
(The 38th is a C.)

60 will be a P. You're beginning a progression
of P's. (60 is a C.)

61 is a P. You're zigzagging between P's and C's. (61 is a P.)

84 will be a C. The P's were given to throw me
off. (84 is a P.)

62 is a C. You'll continue the oscillation. (62 is a C.)

85 will be a P. You've begun a new alternating sequence. (85 is a P.)

63 is a C--rather 63 is a P because of the oscillation pattern. (63 is a P.)

86 will .be a C. You're following with a C and 2
P's. Another C will come. (86 is a C.)

64 is a C because of the oscillation pattern.
is a C.)

(64

87 will be a P.
(87 is a C.)

65 is a P because of the oscillation pattern.
is a C.)

(65

88 will be a P. You've begun a sequence of 2 C's
and a P. (88 is a C.)

66 is a C. You've begun a progression of C's.
(66 is a P.)
67 will be a C.

(67 is

89 is a C. You've begun a new progression of C's.
(89 is a C.)
90 is a C.
is a C.)

You'll continue the progression.

a C.)

68 is a C. You're having a different type of oscillation--2 C's between a P. (68 is a P.)

91 is a C.

The progression continues.

(91 is a

C.)

69 is a C. You're oscillating with C's and P's.
(69 is a C.)

The progression continues.

(92 is a

P .)

70 will be a P.
is a P.)

You're oscillating again.

You'll fOllow the same sequence.

It's the alternate symbol.

(70

71 will be a C because of the oscillation
sequence. (71 is a C.)

92 is a C.

(90

93 is a P. The P's given to me previously to make
me think that the progression was being broken and
that you would revert to it after the P. The next
one will be a P. (93 is a C.)

72 will be a P because of the oscillation
sequence. (72 is a C.)

94 will be a C. You've gone back to the C progression. (94 you say now is a C.) 94 is a C.
(OK, 94 is a C.)

73 will be a C. You've begun a new progression of
C's. (73 is a C.)

95 is a C. You've begun a progression of C's.
(95 is a P.}

74 is a C. You're continuing the progression.
(74 is a C.)

96 will be a C. You're alternating now with C's
and P' s • (96 is a P.)

75 is a C. You're still continuing with the progression. (75 is a C.)

97 is a C. You've begun a progression of a C and
2 P' s • (97 is a P.)

76 is still a C. You're continuing with the progression. (76 is a C.)

98 is a P. You've begun a progression of P's.
(98 is a C.)

77 is a C. You're still continuing with the progression. (77 is a C.)

99 is
and 3
(sic)
a C.

78 is a C.
is a P.)

The progression is continuing.

79 is a C. The P is to throw me off.
gression continues. (79 is a C.)

(78

The pro-

80 is a C.
is a C.)

The progression will continue.

81 is a C.

The progression continues.

(80

(81 is a

a C. You've begun a progression of 3 P's
C's. You've already had the 3 pts. 98
will be a C. (That was ••• 99 is going to be
You said. 99 is a C.)

(What's 1001) 100 will be a C.
progression. (100 is a C.)

It follows the

101 will still be a C. Continue the progression
of 3 P's and 3 C's. (101 is a C.)
102 will be a C. You've begun a progression of
C's. (102 is a C.)

P .)

82 will be a C. You're alternating now with C's
and P's. (82 is a P.)
83 is a P. You've begun a progression of P's.
(83 is a C.)

103 is a C. You'll continue the progressi.on of
C's. (103 is a C.)
104 is a C. You'll continue with the progression. (104 is a C.)

105 will be a C. You'll continue the progression. (105 is a C.)

127 will be a C. You gave me the P to throw me
off. (127 is a P.)

106 will be a P. You'll break the progression
now. (106 was a C.)

128 will be a P. You've begun a progression of
P's. (128 is a C.)

107 will be a C. You'll continue the progression. (107 was a P.)

129 will be a C. You've begun a progression of 2
P's and 2 C's. (129 was a C.)

108 will be a C.

130 will be a C.

off.

C' s.

You gave me the P to throw me
The progression will continue. (108 is a

You've begun a progression of
(130 is a P.)

C.)

109 will be a C.

sion.

You'll continue the progres(109 was a P.)

110 will be a C. You"re alternating with C's
and P's. (110 is a C.)

131 will be a P. You're continuing the progression of 2 P's and 2 C's. (131 is a C.)
132 will be a P. You're alternating the signs
now. (132 is a C.)

You'll continue the alternation.

133 will be a C.
(133 is a C.)

You've begun a sequence of C's.

111 will be a P.
(111 was a P.)

You'll continue the alternation.

134 will be a C.
(134 is a C.)

You're continuing the sequence.

112 will be a C.
(112 was a P.)

113 will be a C. You've begun a progression of a
C and 2 P's. (113 is a P.)

135 is a C. You're continuing with the progression. (135 is a P.)

114 will be a P. You've begun a progression of
p, s • (114 is a P.)

136 will be a P.

You've begun ••• you're trying to
throw me off now with a 2d P. Think there would
be only one P. (136 is a C.)

115 will be a P.
(115 is a C.)

You'll continue the progression.

137 is a C. You're going to continue with the
progression of C's. (137 is a C.)

116 will be a p.
(116 is a C.)

The C was given to throw me off.

138 is a C. You'll continue the progression.
(138 is a C.)

117 is a C.

and

4

C's.

You've begun

a

progression of 4 P's

(117 is a P.)

118 will be a P. The progression has changed
from 4 P's and 4 C's to 4 P's and 3 C's. (118 is

139 is a C. You'll continue the progression.
(139 is a P.)
140 is a C. The P was given to throw me off.
(140 is a P.)

a C.)

119 will be a P. You're alternating with C's and
P' s • (119 is a C.)

141 is a C. You gave me the 2 C's (sic) for the
same reason as the previous time you had given me
the 2 C's 'er 2 P's... (141 is a C.)

120 will be a C. You're continuing the progression. (120 is a P.)

142 is a C. You'll continue with the progression.
(142 is a C.)

121 will be a P. You have a progression'of 2 C's
and 2 P's. (121 is a P.)

143 is a C. You'll continue with the progression.
(143 is a C.)

122 will be a C. You'll continue this progression
of 2 and 2. (122 is a C.)

144 is a C. You'll continue with the progression.
(144 is a C.)

123 will be a C. You're continuing the progression. (Of what?) Of 2 C's and 2 P's. (123 is a

145 is a P.
is a C.)

You'll break the progression.

(145

C.)
124 will be a C.

C's.

You've begun a progression of
(124 is a C.)

124 (sic) will be a C. You're continuing the progression. (125 is a C.)
126 will be a C. You're continuing the progression. (126 is a P.)

146 is a C. You'll continue the progression.
(146 is a C.)

147 is a C. You'll continue the progression.
(147 is a C.)
148 is a C. You'll continue the progression.
(148 is a C.)

149 is a C. You'll continue the progression. (149 is a C.)
150 is a C. You'll still continue the progression. (150 is a C.)
151 is a C. You'll still continue the progression. (151 is a C.)
152 will be a P. You'll break the progression. (152 is a C.)
153 is a C. You'll continue the progression. (153 is a P.)
154 is a C. You've broken the progression and you'll revert to it now. (154 is a C.)
155 is a C. You'll continue the progression. (155 is a P.)
156 is a C. You're alternating with P's and C's. (156 is a C.)
157 is a C. The alternation of P's and C's was to throw me off the progression of C's. The C progression will continue. (157 is a P.)
158 is a C. You're still going back to the C sequence. (158 is a C.)
159 is a C. You're still going to continue this sequence. (159 is a P.)
160 is a C. You have an alternating sequence of P's and C's. (160 is a C.)
161 will be a P. You'll continue to alternate. (161 is a P.)
162 will be a C. You'll continue this oscillation. (162 is a P.)
163 will be a C. You'll continue the alternation. (163 is a C.)
164 will be a P. You'll continue the alternation. (164 is a C.)
165 will be a P. You'll go back to the alternation. (165 is a C.)
166 will be a C. You've begun a sequence of C's. (166 is a C.)
167 will be a C. You've begun a sequence of C's. (167 is a P.)
168 will be a P. You've begun a sequence of 2 C's and 2 P's. (168 is a C.)
169 is a C. The previous P's were given to throw me off. You'll continue the sequence of C's. (169 is a C.)
170 will be a C. You'll continue the sequence. (170 is a P.)
171 will be a P. You'll begin a sequence of P's. (171 is a P.)
172 will be a C. You'll revert to the C's. (172 is a C.)
173 will be a C. You're alternating with 2 P's and 2 C's. (173 is a P.)
174 will be a C. The alternation is a C and a P. (174 is a C.)
175 will be a P. You'll continue this alternation. (175 is a C.)
176 will be a C. You've begun a sequence of C's. (176 is a P.)
177 will be a C. You'll continue with the progression of C's. (177 is a P.)
178 will be a C. You've begun a progression of 2 P's and 2 C's. (What did you say 178 was?) A C. (178 is a C.)
179 will be a C. You'll continue with another C to complete the sequence of 2 P's and 2 C's. (179 is a C.)
180 will be a P. You'll continue this sequence. (180 is a C.)
181 is a C. You've begun a sequence of C's. (181 is a C.)
182 is a C. You'll continue the sequence. (182 is a C.)
183 is a C. You'll continue the sequence. (183 is a P.)
184 will be a C. The P was given to throw me off. (184 is a C.)
185 is a C. You'll continue the sequence of C's. (185 is a C.)
186 will be a C. You'll continue the sequence. (186 is a C.)
187 will be a C. You'll continue the sequence. (187 is a C.)
188 is a C. You'll continue the sequence. (188 is a C.)
189 is a C. You'll continue the sequence. (189 is a P.)
190 will be a C. The P was given to throw me off. (190 is a C.)
191 will be a C. The double P (sic) was given to throw me off a little more. (191 is a P.)
192 is a C. You've ... been giving me a sequence of 2 P's and 2 C's. (192 is a C.)

192 (sic) is a P. You're continuing the sequence of 2 P's and 2 C's. (193 is a C.)
194 is a C. You've begun a sequence of C's. (194 is a C.)
195 is a C. You'll continue the sequence. (195 is a P.)
196 will be a P. You have a sequence here of inserting 2 P's. (196 is a C.)
197 is a C. The P was given to throw me off. (197 is a C.)
198 will be a C. You'll continue the sequence. (198 is a C.)
199 is a C. You'll continue the sequence. (199 is a C.)
200 will be a C. You'll continue the sequence. (200 is a C.)

A. COULD THE PATTERN COMPONENT OF THE PREDICTION-HYPOTHESIS FOR TRIAL T HAVE PREDICTED THE EVENT OF TRIAL T CORRECTLY.
B. YES--EXPLANATION-HYPOTHESIS FOR TRIAL T IS THE PATTERN COMPONENT OF THE PREDICTION-HYPOTHESIS FOR TRIAL T.
DID THE PREDICTION-HYPOTHESES FOR TRIALS T-1 AND T-2 CONTAIN GUESS-OPPOSITE COMPONENTS AND WERE THE PREDICTIONS OF THE EVENTS OF THESE TRIALS CORRECT.
L. YES--PREDICTION-HYPOTHESIS FOR TRIAL T+1 IS GUESS-OPPOSITE THE EXPLANATION-HYPOTHESIS FOR TRIAL T.
M. NO--PREDICTION-HYPOTHESIS FOR TRIAL T+1 IS THE EXPLANATION-HYPOTHESIS FOR TRIAL T.
WILL THE EXPLANATION-HYPOTHESIS FOR TRIAL T CONTINUE. (SEE TEXT FOR AN EXPLANATION OF THIS TEST)
O. YES--PREDICTION-HYPOTHESIS FOR TRIAL T+1 IS THE EXPLANATION-HYPOTHESIS FOR TRIAL T.
P. NO--PREDICTION-HYPOTHESIS FOR TRIAL T+1 IS GUESS-OPPOSITE THE EXPLANATION-HYPOTHESIS FOR TRIAL T.
... who attempted to solve a "concept learning" problem which had three logically correct answers: a disjunction, a conjunction, and a relation. (This problem has been described previously9 and some data on its difficulty was available.) All three conditions of presentation were given to each subject. The model we have just presented gave the best overall "postdiction" of responses of any model we could devise. In fitting it we altered the order and identity of symbols on reference lists, but otherwise kept the model constant. Since each subject solved three problems, we were able to make some tests of our transfer procedures and thus do not rely too heavily upon pre-specified orders. The results of our match were generally encouraging. However, they cannot be taken as validating evidence since the protocols were used to develop the program.
Some more encouraging evidence came when the artificial subject attempted a series of problems used by Shepard, Hovland and Jenkins21. This was a completely separate study. Human subjects were asked to find categorizing rules for each of the six possible types of splits of eight instances, each describable by one of two values on three dimensions, into two sets of four each. Human subjects could solve ...

A somewhat similar, unpublished, experiment was performed by Hunt and H. H. Wells. Here the five commonly used logical connectives between two elements provided the answer. A "truth table" was constructed and presented to subjects in geometric form. For example, the connective "p and q" might be represented by "red and star." The five problems were presented in five orders, each subject solving all five problems in one of the orders. Simulation and analysis of this experiment had not been completed at the time of this writing; however, we have some preliminary results. There is good general agreement between our simulation routines and some protocols. Both the computer model and the subjects are sensitive to the order in which problems are presented, but their reactions are not as similar as we would like. A new transfer procedure is needed. In an experiment which is not directly related to simulation, Wells is studying the manner in which human subjects learn methods of solution for disjunctive problems. We hope that his experiments will provide some clues about the nature of the transfer procedures we should include in our model.

We do not claim to have presented a complete explanation of concept learning. Certainly others will agree with us. In programming the model we made many decisions with little theoretical or empirical justification. Some of these are certain to be wrong. But which ones?

We shall probably have to change our routines for memory and recognition. Some of the known phenomena of memory cannot be reproduced by a simple occupancy model. For instance, the effect of stimulus similarity upon memory cannot be represented. Our model has an all-or-none aspect in its interference features. An intervening instance either completely eliminates the record of a previous instance or does not affect it at all. This does not seem to be the final answer to the problem of memory in concept learning.
Two alternative memory systems have been considered. One system retains and extends the limited occupancy model. Instead of storing one "codeword" (actually, a list structure), representing all known information about an instance, on a single occupancy list, several codewords would be stored in several occupancy lists. Each of these codewords would represent a particular type of information about some part of the instance in question. Storage of each codeword would be independent on each occupancy list. Codewords referring to the same instance would reference each other's locations. When information from memory was required a picture of each instance would be reconstructed from the cross-referencing system. However, since intervening instances would be storing codewords independently on each occupancy list, some of the codewords might be replaced. The extent of this replacement would depend upon the similarity between the instance to be recalled and the stimuli which followed its presentation. This system would be sensitive to stimulus similarity effects.
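For concreteness, a minimal sketch (in present-day Python, not the authors' IPL-V) of how such a cross-referenced occupancy memory might be organized; the attribute names and list capacities below are illustrative assumptions, not details of the model.

```python
from collections import deque

# Sketch: several fixed-length occupancy lists, one per type of information.
# Each codeword is tagged with the instance it came from; recall reconstructs
# whatever codewords survive displacement by later instances.
class CrossReferencedMemory:
    def __init__(self, attribute_names, capacity=3):
        # one occupancy list per attribute; the oldest codeword is pushed out
        self.lists = {name: deque(maxlen=capacity) for name in attribute_names}

    def store(self, instance_id, description):
        # description: dict of attribute -> value, e.g. {"color": "red"}
        for name, value in description.items():
            if name in self.lists:
                self.lists[name].append((instance_id, value))

    def recall(self, instance_id):
        # rebuild a (possibly incomplete) picture from surviving codewords
        picture = {}
        for name, occupancy_list in self.lists.items():
            for stored_id, value in occupancy_list:
                if stored_id == instance_id:
                    picture[name] = value
        return picture

memory = CrossReferencedMemory(["color", "form", "number"])
memory.store(1, {"color": "red", "form": "star", "number": 2})
memory.store(2, {"color": "green", "form": "circle"})  # may displace older codewords
print(memory.recall(1))   # whatever part of instance 1 is still in memory
```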
Alternately, we could use an associationist memory system. Instead of trying to remember units of information directly we would build "associations" between names and stimulus features. This is the logic of the technique used by many learning theorists in psychology. Machinery to realize such a memory has been extensively investigated by Rosenblatt17,18. There is also some similarity between this approach and the classification mechanisms based upon Selfridge's "Pandemonium" scheme19. To adopt such a memory system would require changing the entire logic of our model. Association schemes generally contain, in themselves, a mechanism for concept learning. It also seems that they require some sort of gradient of generalization. Recent experiments20,21 indicate that, in concept learning, the tendency to code stimuli symbolically plays a greater role than generalization based upon stimulus similarity. For these reasons we have, tentatively, rejected an associationist memory mechanism.
In the present model we subject the formal description of an instance to two transformations. When an instance is presented the dimensions of the formal description are sampled to determine what information is to be placed in memory. At some later time, that part of the formal description which is in memory is re-transformed to provide a working description. The two procedures could be combined if the description routine currently at the head of the description routine reference list were to be applied directly to an instance before it entered memory.

Such a procedure would have advantages in saving storage space. Instead of having to have two separate locations, for working and permanent description, in the internal memory, only one description need be stored. But we pay for saving this space by losing information. By definition, any working description can be derived from the formal description. All working descriptions cannot be derived from each other. For instance, if we know that an instance contained two figures of the same color, we do not know what that color is. As a result, our artificial subject's ability to utilize a particular description routine at time t would depend very much upon the description routines used previously.
The role of "set" at time of presentation as a determinant of later memory characteristics needs more extensive investigation. Some experiments12,13 suggest that "set" is a function of how memory is searched rather than how items enter into memory. Also, there exists a rather contradictory literature on "latent learning", a term used to describe experiments in which animals, trained to respond to cue A in the presence of cue B, which is irrelevant to the animal's current need, learn more rapidly a later response to cue B. From present experimental results it is not obvious how stimulus recognition and answer development procedures should be connected in a concept learning simulation.
Procedures for representing transfer may not be represented adequately in the present model. Transfer is defined as the effect of previous problem solving experience upon solution of the problem with which the subject is faced at the time of the test. We decided to work first with a simple method of representing transfer, in which the subject tries whatever worked last time. A principal result of the simulation of the Hunt and Wells work on logical connectives has been a demonstration that a new transfer procedure is needed.
In the tradition of classical learning theory, we could attach a modifiable numerical index to each routine on a reference list. This index could be used to determine the probability that a routine would be selected. This method of representing learning is probably the most common. The principal objection to it is that it implies the existence of "counters in the head" and, essentially, makes our program a digital simulation of an analog system.

The alternative to association indices is a new method of ordinal rearrangement of routines on a reference list. The problem with ordinal rearrangements is that they do not permit us to specify a variable distance between routines on a list. Suppose we consider each concept learning problem as a contest between routines on the same reference list. The one that finds a place on a successful execution list is victorious. How many times must the routine in position n "win" before it gains the next highest position? Should it jump positions? As we have indicated, some research relevant to this topic is being conducted.
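The two alternatives can be contrasted with a small sketch (again in present-day Python rather than IPL-V); the routine names, weights, and "wins needed" threshold are illustrative assumptions only.

```python
import random

# (1) Association indices: each routine carries a modifiable numerical weight
#     that determines its probability of selection.
def select_by_index(routines, weights):
    return random.choices(routines, weights=weights, k=1)[0]

# (2) Ordinal rearrangement: a routine that "wins" (appears on a successful
#     execution list) enough times is promoted one position on the list.
def promote_after_wins(reference_list, win_counts, winner, wins_needed=2):
    win_counts[winner] = win_counts.get(winner, 0) + 1
    position = reference_list.index(winner)
    if position > 0 and win_counts[winner] >= wins_needed:
        reference_list[position - 1], reference_list[position] = (
            reference_list[position], reference_list[position - 1])
        win_counts[winner] = 0          # start counting again after promotion
    return reference_list

routines = ["common-element", "conjunction", "disjunction"]
print(select_by_index(routines, weights=[0.5, 0.3, 0.2]))
print(promote_after_wins(routines[:], {}, "conjunction", wins_needed=1))
```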
Conceivably, we may have to change our entire method of transfer. At present our model records answers, with associated information about useful routines. We could attach to routines information about problems on which they had been useful. We would then have to develop some way for the artificial subject to extract, rapidly, key features of a problem while the answer is being developed. Routines would be examined to see what, in the light of past experience, was their probable usefulness on this sort of problem.

Closely related to the problem of transfer is the problem of response selection during learning. Our present model rearranges its order of response selection after a problem is solved. During a problem, response selection is controlled by time parameters which are independent of program control. No use is made of intermediate computations in selecting the next item to be placed on an execution list. In an alternate model this might be the controlling factor. The means-end analysis of the Logic Theorist15 uses intermediate calculations heavily.
Amarel1 has proposed a computer model for an area very similar to ours in which intermediate computations exert control on answer development.

Our simulation work, and analysis of experimental data, has convinced us that some method of making the choice of one item on an execution list dependent upon the product of execution of previously selected routines is desirable. What is not clear is the optimum amount of dependency. Bartlett2 has presented an analogue, in an entirely different context, which may clarify the problem. He compared problem solving and thinking to motor skills responses, such as serving in tennis. There are certain points at which a chain of responses can be altered; in between these points a series of acts will be executed without interruption. Our problem, experimentally, is to identify the responses and choice points.

We feel that the principal use of our model, so far, has not been in the generating of an explanation of concept learning so much as it has been in indicating the type of new experimental data needed. We have had to be very specific in our thoughts as we programmed this model. As a result, we have reached some conclusions about the kind of experiments that need to be done. It may well be that the typical concept learning experiment confuses three processes: memory, recognition, and symbolic problem solving. It is not clear whether or not these should be treated as part of a unitary "concept learning" act. They can be programmed separately. In addition, we have become concerned with questions of transfer, the effect of the subject's current hypothesis upon his later retention of information, and the effect of time pressure upon information processing. A real awareness of these problems has been a major outcome of programming a concept learning model.
Comparisons with Related Work
Viewed formally, our problem is closely related to models of pattern recognition. Programming either a pattern recognizer or a concept learner involves the development of a mechanism which operates on a specified stimulus universe to map stimuli from predetermined subsets into particular responses. Because of this mathematical identity, at least one critic10 has suggested that problems of this sort should be treated together, without "psychologizing" or "neurologizing." While this may be useful in developing theorems about a canonical form of categorization, it may not be an appropriate strategy for simulation studies. In particular, our approach is quite different from that of the pattern recognition studies with which we are familiar.

The most striking difference is in the manner in which we pre-code the stimuli. Pattern recognizers usually accept stimuli coded into projections on a grid. The result is a string of bits, each bit representing the presence or absence of illumination of some part of the grid. The same representation could be used for a temporal pattern. Each bit would stand for the presence or absence of some stimulus feature. We presuppose the existence of a dimension and value coding6 and deal with perceptual aspects which are readily verbalizable. A pattern recognizer develops its own code. Any coding scheme developed by a pattern recognizer will be specific to the stimuli used (visual vs. auditory, etc.). Since we are interested in the manipulation of coded elements we avoid this problem by fiat in our programming and by explicit instructions to our subjects in our experimental work.

Our model is also different from most pattern recognizers in the processes it uses. Pattern recognizers, at least as developed by Selfridge and his co-workers19, and by Rosenblatt17,18, are basically parallel processing devices which utilize a large number of redundant, error prone tests. Our program is a serial processor which tries to develop a single, perhaps complex, error free classification test. We do not see any incompatibility in the two approaches. Pattern recognizers are inspired by the capability of biological systems to amplify upon their sensory inputs. Our program deals with the simulation of a symbolic process. That the two problems are formally similar does not mean that they are realized in the same way by problem solvers.

In principle, there would be no objection to utilizing a pattern recognizer to provide the input to the concept learner. The combined system could develop its own dimensions and values and then operate upon them. In practice, such a scheme is undoubtedly premature. But it is a long range goal.
The concept learning problem has been attacked directly in two previously mentioned studies by Kochen11 and Amarel1. Kochen restricted his program to solution of concepts based upon a conjunctive rule involving stimuli specified by strings of bits. His program consisted of executing algorithms upon the information about the universe of objects which was available, a priori, in memory. The program also contained features for making random guesses about the correct concept. These guesses could be weighed for "confidence", using an index which satisfied Polya's16 postulates for plausible reasoning. One of Kochen's findings, based on Monte Carlo runs of his system, was that changes in the value of the confidence index could be used to estimate the probability that an answer was correct before a proof of the answer was available.

Amarel1 proposed a machine that could generate routines to map arguments to values in symbolic logic. The key feature of his proposal, one we might well adopt, is his use of intermediate results to "monitor" future answer development.

Neither Kochen nor Amarel were directly concerned with simulation of human performance. This difference in goals, and features of programming, are the major differences between our work and theirs.
Superficially, our program is similar to the list processing programs written by the Carnegie Institute of Technology-RAND Corporation group headed by Newell, Shaw, and Simon, and by McCarthy14 and his associates at M.I.T. In particular, the work of Feigenbaum at Carnegie is related to ours. He developed a program to simulate paired-associates learning. As part of his program he included a routine for selective recognition of stimulus features. As more experience with the stimulus universe was provided, more features were read into the system to enable it to make finer discriminations. The logic of Feigenbaum's recognizing system, and in particular its capability for dropping stimulus features which are not useful in discrimination, could be incorporated into our program.

Our present program, although running now, is in no sense complete. Almost every new simulation has indicated ways in which it could be improved. We intend to continue to investigate concept learning by use of an information processing model. But we do wish to add a word of caution.

Neither our model, nor any other, has generated
a large number of new experiments. This is
a traditional test of the utility of a scientific model, and it is going to have to be met
by us and by others interested in this field.
We do not feel that the utility of computer
programming models in psychology has been
proven or disproven. The jury is still out.
We, of course, hope that a favorable verdict
will be returned.
References

1. Amarel, S. An approach to automatic theory formation. Paper presented at the Illinois Symposium on the Principles of Self Organization, 1960.
2. Bartlett, F. C. Thinking. New York: Basic Books, 1958.
3. Church, A. Introduction to mathematical logic, vol. I. Princeton, N.J.: Princeton U. Press, 1956.
4. Feigenbaum, E. An information processing theory of verbal learning. RAND Corp. publication P-1817, 1959.
5. Green, B. F. IPL-V, the Newell-Shaw-Simon programming language. Behavioral Science, 1960, 5, #1.
6. Hovland, C. I. A "communication analysis" of concept learning. Psychol. Rev., 1952, 59, 461-472.
7. Hovland, C. I. and Hunt, E. B. The computer simulation of concept attainment. Behavioral Science, 1960, 5, 265-267.
8. Hunt, E. B. An experimental analysis and computer simulation of the role of memory in concept learning. Unpubl. Ph.D. dissertation, Yale U., 1960.
9. Hunt, E. B. and Hovland, C. I. Order of consideration of different types of concepts. J. exp. Psychol., 1960, 59, 220-225.
10. Keller, H. Finite automata, pattern recognition, and perceptrons. AEC Computing Center and Applied Mathematics Center, Report NYO-2884, 1960.
11. Kochen, M. Experimental study of hypothesis formulation by computer. IBM Research report RC-294. IBM Research Center, Yorktown Heights, New York, 1960.
12. Lawrence, D. H. and Coles, G. R. Accuracy of recognition with alternatives before and after the stimulus. J. exp. Psychol., 1954, 47, 208-214.
13. Lawrence, D. H. and LaBerge, D. L. The relationship between recognition accuracy and order of reporting stimulus dimensions. J. exp. Psychol., 1956, 51, 12-18.
14. McCarthy, J. Recursive functions of symbolic expressions and their computation by machine. Communications of the Association for Computing Machinery, April, 1960.
15. Newell, A. and Shaw, J. C. Programming the logic theory machine. RAND Corp. publication P-954, 1957.
16. Polya, G. Mathematics and plausible reasoning. Princeton: Princeton U. Press, 1954.
17. Rosenblatt, F. The perceptron, a probabilistic model for information organization and storage in the brain. Psychol. Rev., 1958, 65, 368-408.
18. Rosenblatt, F. Perceptual generalization over transformation groups. In Self organizing systems, London: Pergamon Press, 1959.
19. Selfridge, O. and Neisser, U. Pattern recognition. Scientific American, 1960, 203, 60-79.
20. Shepard, R. N. and Chang, J. J. Stimulus generalization in the learning of classifications. Bell Telephone Lab. mimeographed report, 1961.
21. Shepard, R. N., Hovland, C. I. and Jenkins, H. M. Learning and memorization of classifications. Psychol. Monogr., 1961, in press.

Acknowledgment

The work reported in this paper was supported by a grant for the study of concept learning, from the Ford Foundation to Carl I. Hovland. The computational work involved was supported, for the most part, by the Computation Center, Massachusetts Institute of Technology. Dr. Bert F. Green, Jr., and Alice Wolf, of Lincoln Laboratory, M.I.T., made available the 709 version of the IPL-V interpreter and instructed the first author in its use. The aid received is gratefully acknowledged.

Figure 1. Program Control Chart.

Figure 2. Answer Developing Procedure.
PARALLELISM IN COMPUTER ORGANIZATION
RANDOM NUMBER GENERATION
IN THE FIXED-PLUS-VARIABLE COMPUTER SYSTEM
M. Aoki*, G. Estrin*, and T. Tang**
Department of Engineering
University of California*
National Cash Register Co.**
Los Angeles, California
Summary

The Fixed-Plus-Variable Structure Computer System utilizes an inventory of modules which can be interconnected as special purpose configurations operating simultaneously with other parts of the system. Since the structure considered makes no permanent commitment of hardware to relatively rarely used operations it permits reconsideration of designs previously discarded as uneconomic. Problem formulations utilizing random numbers generally require large numbers of trials to achieve high confidence results. Despite the fact that in most problems random number generation does not require a large percentage of the total computing time, it may be desirable to eliminate even that time by use of special purpose random number generators.

Special purpose circuits can be designed to generate pseudo-random numbers in parallel with other activities such that these numbers are available on demand in the same sense as any other operand stored for use in the computation.

This paper discusses a number of different methods of generating pseudo-random numbers, the time required in existing programs, and the hardware implications of different parallel and serial designs. The criteria for choice of one method over another in the context of particular problems and the Fixed-Plus-Variable Computer System are evaluated.
1. Introduction

The Fixed-Plus-Variable Structure Computer System is organized such that there is associated with a general purpose computer an inventory of modules which may be reorganized as needed to reduce the overall computation time required for solution of particular problems. The special purpose configurations established in the variable structure part of the system may carry on their operations simultaneously with those of the fixed part of the system and may have their configurations altered during the course of a given problem.

In general, of course, any of the operations which may be executed by the special purpose configuration may either be programmed in the fixed structure general purpose part of the system or may be added to the list of permanently available commands. The latter course will not in general be taken for relatively rarely used operations, and both methods imply sequential operation on all commercially available systems. Since in basic concept the (F + V) system makes no permanent commitment of its variable structure inventory, it is reasonable to consider its application to many problems where special purpose techniques would previously have been discarded as uneconomic.

This paper considers the class of problems which makes use of long sequences of random numbers for simulation of complex processes using Monte Carlo methods. In most such problems the operations which must be done on the random numbers are more time consuming than the random generations themselves and deserve most of the attention of special purpose equipment. However, it will be found that the large number of operations generally will make it worth while to utilize some fraction of the variable structure inventory to essentially remove the time for random number generation from the overall computation time.

The preparation of this paper was sponsored, in part, by the Office of Naval Research and the Atomic Energy Commission. Reproduction in whole or in part is permitted for any purpose of the United States Government.
2. Methods for Generating and Testing Random Numbers

2.1 Introduction

The successful use of the Monte Carlo method in digital computers depends on* a good supply of long sequences of random numbers. There are, broadly speaking, four ways of supplying such sequences.

* During the course of the calculations associated with any Monte Carlo problem, it is also necessary to produce random variables according to a variety of different probability distributions. A common method of doing this is to incorporate into the computing machine a sequence of uniformly distributed random numbers. The desired random variables are then obtained by transformations based on them. There are three methods which can be adapted to serve this purpose. These are based on direct functional transformations, the principle of compound probabilities, and the procedure of rejecting part of the sampled values according to an appropriate test or rule. The first is called the "direct method", the second the "composition method" and the third the "rejection method".

1. Tables of random numbers may be recorded on paper tape, punched cards or magnetic tape. The tapes or cards are fed into the computer at suitable stages of the calculation. This method has found little favor in those problems which require very long sequences of random numbers since the time taken to read in the tables may soon become excessive. For example, the IBM 711 card reader operates at the rate of 250 cards per minute, or 6,000 words per minute. If a problem requires one million random numbers, it will take the card reader 166.66 minutes to transfer the data into the computer. If magnetic tape is used, over 100 seconds would be required on the fastest available magnetic tape system.

2. Random numbers may be generated by physical processes such as radioactivity or discharges in gases. The chief objection is a rather paradoxical one; the number sequences cannot be repeated and so it is very difficult to check the calculation, because it is not always possible to distinguish between variations in results due to random fluctuations and those due to changes in the program or even to the faulty running of the computer.

3. Pseudo-random numbers may be generated by arithmetical processes on general purpose high speed digital computers. The most common methods are the mid-square process and additive and multiplicative congruence methods.

Whenever the time required for generation of pseudo-random numbers in a problem is significant, a special purpose computer or wired-in instruction may be considered utilizing any of the methods in (3). Such a special purpose configuration would essentially deliver a pseudo-random number on demand and might operate in parallel with any other processes going on.

There are three criteria which should be satisfied by any method of generation of pseudo-random numbers.

1. The numbers generated should satisfy the tests of randomness prescribed by the user.

2. The rate of generation of pseudo-random numbers should compare favorably with the rate at which they may be used in typical computations.

3. The recursion relation should produce a sufficiently long sequence of pseudo-random numbers. All sequences generated by arithmetic processes are either cyclic or enter a cycle if prolonged far enough. A cyclic sequence will not be a satisfactory source of pseudo-random numbers unless the period is so great that it is of no consequence in practical computations.
2.2 The Testing of Sequences for Randomness

The tests of Kendall and Smith10 have been used frequently on sequences of decimal digits. They may be adapted in various ways for testing sequences of binary digits generated by pseudo-random number generators. These tests are the frequency test, the serial test, the poker test and the runs test. In all these tests the deviations of the observed counts from those expected from a perfectly random sequence are studied. Chi-squared tests are usually used to give a measure of the permissible deviations from expectations.

In practical applications of various tests of significance, the 5%, 1% and 0.1% levels of significance are often used. Some empirical results related to the four tests below can be found in papers by Green and Rotenberg.
1. The frequency test. The primary requirement for randomness is that the numbers, considered as fixed point, positive fractions, be uniformly distributed over their range (0 ≤ X ≤ 1). To test this, the binary numbers can be separated into octal digits, and a tally of the frequency of occurrence of each octal numeral can be made and tested, since a random process should produce uniformity in every octal digit position.
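As a rough illustration of the frequency test (a sketch, not code from the paper), the octal digits of each generated word can be tallied and compared with the uniform expectation by a chi-squared statistic; the example generator and sample size below are arbitrary assumptions.

```python
# Sketch of the frequency test: tally octal digits of 20-bit words and compute
# a chi-squared statistic against the uniform expectation (7 degrees of freedom).
def octal_digits(x):
    x >>= 2                      # keep the 18 high-order bits: six full octal digits
    return [(x >> (3 * k)) & 0o7 for k in range(6)]

def frequency_test(numbers):
    counts = [0] * 8
    for x in numbers:
        for d in octal_digits(x):
            counts[d] += 1
    expected = sum(counts) / 8.0
    chi_squared = sum((c - expected) ** 2 / expected for c in counts)
    return counts, chi_squared

# Example with an assumed multiplicative-congruence generator.
seq, x = [], 1
for _ in range(1000):
    x = ((2**11 + 1) * x + 1) % 2**20
    seq.append(x)
print(frequency_test(seq))
```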
2. Serial correlations. Another vital requirement is that the successive generated numbers be independent of one another.

The correlation coefficient ρ of two one-dimensional random variables X and Y is defined by

ρ = μ11 / √(μ20 μ02),   where   μij = E[(X - E(X))^i (Y - E(Y))^j].

If the two random variables are independent we have μ11 = 0 and thus ρ = 0. Thus two independent random variables are always uncorrelated, although the converse is not always true.

One can compute from the generated sequence of pseudo-random numbers the ρ between successive terms and between two numbers k positions apart, k > 1, by defining X and Y appropriately. If the computed ρ is far from zero one rejects the hypothesis that the terms of the sequence of pseudo-random numbers are independent, although one has no guarantee of their independence even if the computed ρ is close to zero.
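A corresponding sketch of the serial-correlation test, estimating ρ between terms k positions apart of an assumed example sequence:

```python
# Sketch of the serial-correlation test: estimate the correlation coefficient
# between terms k positions apart in a generated sequence (lag-k correlation).
def serial_correlation(seq, k=1):
    x, y = seq[:-k], seq[k:]
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    mu11 = sum((a - mx) * (b - my) for a, b in zip(x, y)) / n
    mu20 = sum((a - mx) ** 2 for a in x) / n
    mu02 = sum((b - my) ** 2 for b in y) / n
    return mu11 / (mu20 * mu02) ** 0.5

seq, x = [], 1
for _ in range(10000):
    x = ((2**11 + 1) * x + 1) % 2**20     # assumed example generator
    seq.append(x)
print(serial_correlation(seq, k=1), serial_correlation(seq, k=2))
```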
3. The poker test. The mutual independence of the various digit positions can also be checked by the poker test. For each generated number, a designated five octal digits are treated as a poker hand, without regard to suit, and the hand is tallied as five of a kind, four of a kind, full house, three of a kind, two pairs, one pair, or bust. Table 1 shows the expected frequency of poker hands.

Table 1. Expected frequency of poker hands (per 1,000 hands).

Class        Symbol   Expected frequency
Busts        abcde    302.4
Pairs        aabcd    504.0
Two pairs    aabbc    108.0
Threes       aaabc     72.0
Full house   aaabb      9.0
Fours        aaaab      4.5
Fives        aaaaa      0.1
Total                1,000.0
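The poker test can be sketched as follows; the choice of the five low-order octal digits and the example generator are assumptions made only for illustration.

```python
from collections import Counter

# Sketch of the poker test: classify five designated octal digits of each
# generated number as a poker hand and tally the hand types.
def poker_hand(digits):                     # digits: five octal digits
    pattern = tuple(sorted(Counter(digits).values(), reverse=True))
    return {(5,): "fives", (4, 1): "fours", (3, 2): "full house",
            (3, 1, 1): "threes", (2, 2, 1): "two pairs",
            (2, 1, 1, 1): "pairs", (1, 1, 1, 1, 1): "busts"}[pattern]

def poker_test(numbers):
    tally = Counter()
    for x in numbers:
        digits = [(x >> (3 * k)) & 0o7 for k in range(5)]   # five low octal digits
        tally[poker_hand(digits)] += 1
    return tally                 # compare with the expected frequencies in Table 1

seq, x = [], 1
for _ in range(1000):
    x = ((2**11 + 1) * x + 1) % 2**20       # assumed example generator
    seq.append(x)
print(poker_test(seq))
```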

Good pseudo-random numbers should not deviate significantly from these expected frequencies.

4. Runs test. Another aspect of serial dependency can be checked by finding the distribution of runs above and below some constant and runs up and down as described below. If the observations are randomly drawn from the same population we do not expect very long runs of either type, and the occurrence of such runs will, therefore, usually be taken as indicating non-randomness.13,14

a. Runs above and below the median. Consider a random arrangement of n elements consisting of n1 a's and n2 b's, n = n1 + n2, such as the following arrangement of 11 a's and 14 b's:

babaaaababbbaababbbbbabba
An uninterrupted sequence of elements of the same kind is called a run, and the length of a run is given by the number of elements defining the run. The sequence above begins with a run of one b, then follows a run of one a, and so on.

Assuming that all possible arrangements occur with the same probability, we may find the distribution function of the number of runs of a's of length i by means of combinatorial theory. For small values of n the distribution of the mean number of runs of a's of length i, R1i, and of R2i, similarly defined for runs of b's, may be tabulated by writing down all possible arrangements of a's and b's, as shown in Table 2 for n = 6 and n1 = n2 = 3. The number of different arrangements is C(6,3) = 20. The expected number of runs of both kinds of elements of length k or more is approximately given by

Rk ≈ (n + 2)/2^k,   (2.2)

from which we get the expected total number of runs, R, as

R = (n + 2)/2.   (2.3)

The theory above may be applied to samples from a population with a continuous distribution function by classifying sample values larger than the sample median as a's and values smaller than the median as b's. In some cases the population median is known or a hypothetical value is tested. It will be noted that in this case the number of a's will be a random variable, having a binomial distribution with p = n1/n = 0.5.

b. Runs up and down.14 Consider a sequence of n different observations, X1, X2, ..., Xn, and the sequence of signs (+ or -) of the (n - 1) differences Xi+1 - Xi. A sequence of successive + signs is called a run up and a sequence of successive - signs is called a run down. The length of a run is given by the number of equal signs defining the run. The total number of runs is denoted by R, the number of runs of length i by ri, and the number of runs of length k or more by Rk, Rk = sum of ri for i ≥ k.

Table 2. The C(6,3) = 20 equally likely arrangements of three a's and three b's (n = 6, n1 = n2 = 3): aaabbb, aabbba, abbbaa, aababb, aabbab, abaabb, abbaab, ababba, abbaba, ababab, bababa, baabab, babaab, baabba, babbaa, bbaaba, bbabaa, baaabb, bbaaab, bbbaaa. For each arrangement the table tallies the total number of runs and the number of runs of a's and of b's of each length. Over the 20 arrangements the mean total number of runs is 4 and the mean number of runs of a's is 2, in agreement with equations (2.2) and (2.3).

For example, the sequence of 15 elements 39, 42, 38, 53, 51, 30, 40, 28, 43, 46, 52, 55, 29, 25, 34 leads to the following sequence of signs of differences between successive elements:

+ - + - - + - + + + + - - +

The sequence is thus characterized by 4 runs up of length 1, one run up of length 4, 2 runs down of length 1, and 2 runs down of length 2, giving a total of 9 runs.

Assuming that all n! possible arrangements of the n numbers occur with the same probability, we have the following mean number of runs of length i,

ri = [2n(i^2 + 3i + 1) - 2(i^3 + 3i^2 - i - 4)] / (i + 3)!,   i ≤ n - 2,   (2.4)

and the mean total number of runs,

r = (2n - 1)/3.   (2.5)

Table 3 shows the expected number of runs up and down of length k or more in random arrangements of n different numbers. The values Rk computed from generated pseudo-random numbers should not again differ too much from the theoretical values.
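A sketch of the runs-up-and-down tally, applied to the 15-element example above and compared with the first rows of Table 3:

```python
# Sketch of the runs-up-and-down test: count runs of equal sign among the
# successive differences and compare counts with the expectations of Table 3.
def runs_up_and_down(seq):
    signs = [1 if b > a else -1 for a, b in zip(seq, seq[1:])]
    run_lengths, length = [], 1
    for prev, cur in zip(signs, signs[1:]):
        if cur == prev:
            length += 1
        else:
            run_lengths.append(length)
            length = 1
    run_lengths.append(length)
    return run_lengths

def expected_runs_at_least(n, k):
    # first rows of Table 3: expected runs of length k or more among n numbers
    table = {1: (2 * n - 1) / 3.0, 2: (3 * n - 5) / 12.0, 3: (4 * n - 11) / 60.0}
    return table[k]

sample = [39, 42, 38, 53, 51, 30, 40, 28, 43, 46, 52, 55, 29, 25, 34]
lengths = runs_up_and_down(sample)
print(len(lengths), lengths)                  # 9 runs, as in the example above
print(expected_runs_at_least(len(sample), 1)) # about 9.67 runs expected
```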
5. Discussion. In a particular application a block of digits of a definite length may be required to be random when considered in isolation. The remark of Kendall and Smith10 is relevant here, namely "if a series S is locally random in a Domain, it does not follow that any part of S is locally random in that Domain." They conclude that a set of random numbers which is adequate for all requirements is impossible, and the only solution is to carry out tests on blocks of numbers and give the results of these tests, so that the prospective user can choose from the tables these blocks which are suitable for his problem.

Table 3. Expected number of runs up and down of length k or more among n different numbers.

k    Expected number of runs of length k or more
1    (1/3)(2n - 1)
2    (1/12)(3n - 5)
3    (1/60)(4n - 11)
4    (1/360)(5n - 19)
5    (1/2520)(6n - 29)
6    (1/20160)(7n - 41)
7    (1/181440)(8n - 55)

2.3 Methods of Generation of Pseudo-Random Sequences

Methods of generation of pseudo-random numbers by high speed computers are usually divided into two classes.

a. Those for which there is, or appears to be, no way of determining the cycle structure other than the brute force procedure of investigating the sequence by actual computation. The best known method of this type is the middle-of-square method.

b. Those for which mathematical analysis can determine the cycle structure, and even suggest suitable parameters in the recursion relation to give the longest period and most satisfactory output, such as the congruential methods, additive and multiplicative.

1. The Middle of Square Method.15 Von Neumann and Metropolis first suggested the middle of squares method. This process can be exemplified in a special case as follows. Take a 4 digit number X0, e.g., X0 = 2061. Square it to obtain 04247721. Define X1 = 2477, the middle four digits of X0 squared, as the pseudo-random number. Next X1 squared = 06135529 and X2 = 1355 is the second pseudo-random number. Similarly X3 = 8360, X4 = 8896, etc.

For the results of some tests on numbers generated this way see Mauchly and Votaw and Rafferty. Unsatisfactory results have been observed if the number has less than eight digits, and the sequence may develop unsatisfactory properties if extended beyond 700 or so eight-digit numbers.
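A minimal sketch of the middle-of-square process, using the four-digit example from the text:

```python
# Sketch of the middle-of-square process: square a 4-digit number and keep the
# middle four digits of the 8-digit square as the next pseudo-random number.
def middle_of_square(x0=2061, digits=4):
    x = x0
    while True:
        square = str(x * x).zfill(2 * digits)      # e.g. 2061**2 -> "04247721"
        x = int(square[digits // 2 : digits // 2 + digits])
        yield x

gen = middle_of_square()
print([next(gen) for _ in range(4)])    # 2477, 1355, 8360, 8896 as in the text
```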
2. The Congruential methods.

a. The additive process. Starting with an initial set of n random numbers, X1, X2, ..., Xn, in the range 0 ≤ X < 1, the procedure generates additional numbers, Xj, successively according to the following rule:

Xj = (Xj-1 + Xj-n) Mod 1,   j > n.

An equivalent statement of the process that is convenient for binary machines is

Xj = (Xj-1 + Xj-n) Mod 2^r,   (2.7)

where 0 ≤ Xj < 2^r and r is the maximum number of bits used to encode each fixed point number. The reason that equation (2.7) is very convenient for binary machines is that the modulus additions are performed by simply adding and disregarding overflow.

To illustrate the behavior of this type of sequence, consider the following simple example: if n = 2, X0 = 0, X1 = 1 and r = 3, Equation (2.7) becomes

X2 = (X1 + X0) Mod 2^3,
X3 = (X2 + X1) Mod 2^3,   (2.8)
X4 = (X3 + X2) Mod 2^3.

By the same procedure, the following sequence is obtained:

0, 1, 1, 2, 3, 5, 0, 5, 5, 2, 7, 1.

Then the sequence will repeat itself.

Length of Period. As just mentioned, the additive process is periodic; it will eventually repeat itself by generating the original n numbers. The period depends mainly on n and r. In Green, Smith and Klem's paper, they state that the period T is

T = Kn x 2^(r-1),   (2.9)

where Kn is a constant depending on n and is given in Table 4.
Table 4

n   Kn       n   Kn       n    Kn
2   3        7   127      12   3255
3   7        8   63       13   2905
4   15       9   73       14   11811
5   21       10  889      15   32767
6   63       11  2047     16   255

For n = 15 and r = 35 the period is

T = 32767 x 2^34,

which is approximately equal to 5 x 10^14.

Four statistical tests were made by Green, Smith and Klem of the apparent randomness of numbers generated by the additive process of equation (2.7) with various n and r.


The additive process passes the frequency, poker, and serial correlation tests, but it fails the runs test for n less than 16. It passes the runs test for n = 16, and presumably for n greater than 16. If alternate numbers are discarded, then the additive process passes the runs test for n = 6, and presumably for all larger n.

One wishing to use the additive process to generate random numbers has three alternatives. He may decide that he needs only approximate randomness, as in setting up experimental designs, and may therefore ignore the runs difficulty. He will then choose n according to his convenience. The runs test is very sensitive, and failure to pass this test does not mean that the numbers are badly awry. However, if one is working on a Monte Carlo problem, or some similar problem with stringent randomness requirements, then he must either choose an n of at least 16, or must plan to discard alternate numbers.
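A minimal sketch of the additive process of equation (2.7); the word length r and the lag n are parameters, and the small example from the text (n = 2, r = 3) is reproduced:

```python
# Sketch of the additive congruential process of equation (2.7):
#   X[j] = (X[j-1] + X[j-n]) mod 2**r
# Overflow beyond r bits is simply discarded, as on a binary machine.
def additive_generator(seed, r):
    n = len(seed)
    history = list(seed)                 # holds X[j-n] ... X[j-1]
    mask = (1 << r) - 1
    while True:
        x = (history[-1] + history[-n]) & mask
        history.append(x)
        history.pop(0)
        yield x

# The small example from the text: n = 2, r = 3, X0 = 0, X1 = 1.
gen = additive_generator([0, 1], r=3)
print([next(gen) for _ in range(12)])    # 1, 2, 3, 5, 0, 5, 5, 2, 7, 1, 0, 1
```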
b. The multiplicative method. Although the additive process for generating random numbers has been found very convenient for use in digital computers, there exists an even simpler process which was first suggested by Lehmer and later by Greenberger and Rotenberg:

Xi+1 = (2^a + 1) Xi + C,   Mod 2^p.   (2.10)
The advantages of this process over the additive process are as follows:

(a) The process does not require an initial set of random numbers. This means that no memory storage is needed.

(b) Multiplication by a power of the base can be accomplished by shifting, which is comparable in speed to addition.

(c) This process requires essentially three additions and it can be done in one logical step by a special purpose digital computer.

(d) This process also permits construction of a special serial computer with very few components.
Several empirical tests were made by Rotenberg of the apparent randomness of the numbers generated by the process of Equation (2.10) with a = 7, C = 1 and p = 35. These tests are: the frequency test, the runs up and down test, and the runs above and below the mean test. The results show that the numbers are uniformly distributed and that there is no serial correlation in the sequence.
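A minimal sketch of the multiplicative congruence process of equation (2.10), shown here with the parameters of Rotenberg's tests (a = 7, C = 1, p = 35); the starting value is arbitrary:

```python
# Sketch of the multiplicative congruence process of equation (2.10):
#   X[i+1] = ((2**a + 1) * X[i] + C) mod 2**p
def multiplicative_generator(x0, a=7, c=1, p=35):
    x = x0
    mask = (1 << p) - 1
    while True:
        # (2**a + 1) * x is a shift plus an add; the modulus is overflow discard
        x = ((x << a) + x + c) & mask
        yield x

gen = multiplicative_generator(x0=1)
print([next(gen) for _ in range(5)])
```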
The serial correlation coefficient ρ between two consecutive numbers of this sequence is shown by Coveyou to be approximately

ρ ≈ [1 - 6(C/P)(1 - C/P)] / (2^a + 1),   (2.11)

where P is the modulus of Xi, i.e., P = 2^p. For a = 7 and C = 1,

ρ ≈ 1/(2^7 + 1) ≈ 0.008.

By taking a = 9, this correlation coefficient can be reduced to approximately 0.002.

It can be shown that the sequence of Equation (2.10) can generate the full period of 2^p numbers if a ≥ 2 and C is odd. See appendix I for the proof.
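The full-period condition can be checked by brute force for a deliberately small modulus (p = 8 below is an arbitrary choice for illustration, not a value from the paper):

```python
# Brute-force check that X[i+1] = ((2**a + 1)X[i] + C) mod 2**p visits all
# 2**p residues when a >= 2 and C is odd (small p chosen for illustration).
def visits(a, c, p, x0=0):
    mask = (1 << p) - 1
    x, seen = x0, set()
    for _ in range(1 << p):
        seen.add(x)
        x = ((x << a) + x + c) & mask
    return len(seen)            # number of distinct values reached in 2**p steps

print(visits(a=3, c=1, p=8))    # 256: full period, C odd
print(visits(a=3, c=2, p=8))    # fewer than 256: C even breaks the condition
```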
3. Designs of a Special Purpose Random Number Generator

As a result of the increased interest in the use of Monte Carlo methods of computation in high speed digital computers a number of subroutines have been written. For example, an IBM 709 program for the additive process

xj = (xj-1 + xj-n) Mod 2^35   (3.1)

requires 11 instructions and twenty-two 709 machine cycles, i.e., 264 microseconds, to generate one number. See appendix II for the code.
An IBM 709 program for the multiplicative process

xj = (2^a + 1) xj-1 + C,   Mod 2^35   (3.2)

requires 7 instructions and fourteen 709 machine cycles, i.e., 168 microseconds, to generate one pseudo-random number. The actual code is listed in appendix II.

A Monte Carlo problem which demands one million random numbers is not considered to be impracticable. Notice that this does not mean that the problem has undergone one million computational trials. There are usually many random variables involved in one trial run. To generate one million random numbers by the multiplicative method requires 168 seconds on the 709.

The random number generation time can be reduced if a fast machine is used. The 7090 would take 33.6 microseconds to generate one number by using the multiplicative process, which is five times faster than the 709. However, the percentage of the generation time consumed in solving the problem is not going to be changed. For example, in the matrix inversion problem it is 17% of the total problem running time no matter what computer is being used. And in some other problems, it could be even more significant than this 17%.
In the paper "Organization of Computer Systen:
The Fired Plus VaFiable Structure Co~puterlf by
Estrin, it is pointed out that If • • • The fastest
single compo!;gnt switching speeds being discussed
are about 10 second. For any significant p:1rallel
information transfers ~ost r searchers observ~ a
3
loss of a factor of 10 or 10 bE~n~ing ~7siC parallel computer operations to 10
or 10 second •• "
One cannot expect to do very much better with the
populations of switching elements without ~ change
in the method of organizing them ••• The pri-

162
4.1
mary goal of the Fixed Plus Variable Structure Computer is:
"1. To permit computations which are beyond 111e
capabilities of present systems by providing an inventory of high speed substructures ana rules for
interconnecting them such that the entire system
may be temporarily distorted into a problemorjented special purpose computer •••••• "
Special purpose circuits can be designed to generate random numbers in parallel with other activities in the computer such that these numbers are available on demand in the same sense as any other operand stored for use in the computation. The special purpose generator can be some fraction of the variable structure inventory in the (F + V) system to essentially remove the time for random number generation from the overall computation time.

In the following sections, the design of a serial and a parallel random number generator will be discussed. The principal criterion for the serial generator is the use of a small number of components while maintaining a reasonable speed; for the parallel generator, the achievement of high speed with a reasonable number of components.

The serial generator would increase the performance in solving many Monte Carlo problems. It is possible to generate one random number every eight microseconds continuously.

The parallel generator may achieve an access time of less than 0.14 microseconds. Although the parallel generator does not appear to have immediate usefulness it may be compatible with future super high speed computers and particular problems formulated for solution by Monte Carlo techniques.
4. Parallel and Serial Designs of a Special Purpose Random Number Generator Utilizing a Multiplicative Congruence Method

4.1 Functional Description

This pseudo-random number generator is essentially a special purpose computer, which will produce long sequences of uniformly distributed random numbers. The algorithm used is based on the multiplicative congruential method,

xj = (2^a + 1) xj-1 + C,   Mod 2^p.

As indicated in section 3, the choice of the parameters a, C, and p is governed by the period desired and the minimum acceptable value of the serial correlation coefficient. In this example the constants are chosen in such form as to illustrate the computer design and may be modified according to the above criteria. The constants of the above equation are chosen as follows:*

a = 11,   C = 1,   p = 20.

The length of the period is equal to 2^20, which is approximately one million. For a = 11, the serial correlation coefficient is approximately equal to 2^-11.
The generator can be set to any initial starting number. Since the process does not require storage of the previous number, the generator will go on to generate the next random number as soon as the previous one has been transferred.

1. The Serial Generator. The functional block diagram is shown in Figure 1. The complete functional flow includes two cycles:

Cycle 1:  (2^11 + 1) xj is formed.
Cycle 2:  xj+1 = (2^11 + 1) xj + 1 is completed.

The R register is a 20-bit register in which R1 initially contains the least significant bit and which stores the random number xj. In every logical sequential step, the 20-bit register is shifted right one place. The output of the adder is shifted into the most significant bit of the R register, R20, and the least significant bit of the R register is shifted into the I register.

The T register controls the cycles during the flow. T = 0 corresponds to Cycle 1. T = 1 corresponds to Cycle 2.

The N register counts from 0 to 19 and provides the required sequence of 20 control steps.

The S register controls the start signal. It also provides appropriate control signals to all other registers in the generator. The S register is set by an initial "Demand" signal from a supervisory control observing the needs of the rest of the (F + V) system and is reset at the end of Cycle 2.
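A behavioral sketch of the two-cycle scheme just described, at the word level rather than the gate level of the R, T, N and S registers; it simply reproduces xj+1 = (2^11 + 1)xj + 1 modulo 2^20:

```python
# Behavioral sketch of the two-cycle serial generator (word-level, not a
# gate-level model of the registers):
#   Cycle 1 forms (2**11 + 1) * x_j by a shift and an add;
#   Cycle 2 completes x_{j+1} = (2**11 + 1) * x_j + 1, modulo 2**20.
MASK20 = (1 << 20) - 1

def serial_generator(x=1):
    while True:
        partial = ((x << 11) + x) & MASK20   # Cycle 1: (2**11 + 1) * x_j
        x = (partial + 1) & MASK20           # Cycle 2: add the forced carry C = 1
        yield x

gen = serial_generator()
print([next(gen) for _ in range(4)])
```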
a. Input equations. The input equations may be written by collecting together and combining the timing and truth table logic as follows:

S  = (Demand)S' + (N1 + N2 + N3 + N4 + N5)S + T
N1 = S[N1']
N2 = S[N1 N2' + N1' N2]
N3 = S[N1 N2 N3' N5' + (N1' + N2') N3]
N4 = S[N1 N2 N3 N4' + (N1' + N2' + N3') N4]
N5 = S[N1 N2 N3 N4 N5' + (N1' + N2') N5]
T  = T' N5 N2 N1 + T (N5' + N2' + N1')

* If C is not 1, then C must be added into the proper position of a parallel adder or at an appropriate time in a serial adder rather than the simple forced carry into the least significant position.

Rn  = Rn+1   (n = 2, 3, ..., 19)
R20 = In,

where In is the adder output,

In = I1 I2' Cn-1' + I1' I2 Cn-1' + I1' I2' Cn-1 + I1 I2 Cn-1,
I1 = R1,
I2 = R10 T'(N5 + N4 N3 + N4 N2 N1),
Cn = I1 I2 + Cn-1 (I1 + I2) + T' N5 N2 N1.

b. Estimation of running time for the full cycles. If five megacycle flip-flops are used, the delay through each flip-flop is 200 millimicroseconds. It takes forty clock times to complete a cycle. The total delay, therefore, is eight microseconds.
2. The Parallel Generator. The functional block diagram is shown in Figure 2. The R register stores the random number xj, which is 20 bits long. The two inputs to the adder are xj and 2^11 xj. The constant C = 1 is added to the sum of xj and 2^11 xj by setting the initial carry. As soon as the Start signal is received, the adder outputs will transfer xj+1 into the R register to replace xj. Since xj, 2^11 xj and C are all available at the same time, the process can be accomplished in one logical step. For cases where C is not 1, more complete stages of the adder may be required to permit injection of C. The access time of this special computer will mainly depend upon the speed of the adder.

A parallel adder must be able to generate parallel carry functions. Since carry Ck is an explicit function of Ck-1, the parallel carry functions can only be obtained by a method of substitution. In applying this method, one will soon find out that the carry functions contain a great number of terms. This may make it electronically impossible to mechanize and in any event the response of the carry functions becomes so slow that it actually loses the effectiveness of a parallel adder.

If an adder of the type proposed by Weinberger and Smith is used, the logical configuration of Figure 3 is obtained. By studying the adder equations, one finds that the maximum fan-in is four. The maximum fan-out is five. This results in a maximum of six inverter delay times and one flip-flop delay time. If one uses as a basic building block a current mode, diode gate unsaturated inverter with a gain-bandwidth product of approximately 400 megacycles, then one may use inverters having a response time of less than 15 millimicroseconds per stage. The total delay through the adder is less than 90 millimicroseconds. If 20 megacycle flip-flops are used, the delay through each flip-flop is 50 millimicroseconds. Under such conditions it is implied that the random number generator could generate a random number every 140 millimicroseconds.

4.2 (F + V) Structure

Given any sets of functions which are to be solved, it is always possible to design a special purpose computer which can perform the operation faster and more efficiently than a general purpose computer can. It is clear, however, that the speed and simplicity of the special purpose computer has been achieved at a complete sacrifice of flexibility. The lack of flexibility, in general, means that the control unit for a special purpose computer can be simplified since the sequence of operations performed is fixed. The data to be operated on can be arranged so that it becomes available as required without the necessity for addresses or even a memory cycle.

A general purpose computer is usually more expensive than any one special purpose computer. However, aside from the fields of automatic control and business data processing, one will hardly find the extended use of a special purpose computer. A general purpose computer is mostly used as an arithmetic tool, which can be programmed to solve thousands of functions separately and continuously. It is possible but more likely impractical to have a great number of fixed special purpose computers working simultaneously together. Either it is a tremendous waste from the dollar point of view or the overall performance will probably be disappointing due to the difficulty of data synchronization among machines.

In order to take advantage of the better technology of both types of computers, that is, the speed and simple organization of the special purpose computer and the flexibility of the general purpose computer, and still be within the realism of electronic circuitry knowhow, the proposed "Fixed Plus Variable Structure Computer" is advantageous. The variable structure inventory in the (F + V) system would be expandable and could grow with experiences and needs. The procedures which lead to the design of special purpose modules from the variable structure inventory can be set up in standard forms such as the SHARE program for a general purpose computer system.

The special purpose random number generator will be considered as one of the special modules in the (F + V) system. It should be noticed that the particular serial random number generator discussed in the previous section consists of only five conventional switching elements. They are:

1. A shift register
2. A three input serial full adder
3. A counter
4. Three flip-flops
5. Three "and" gates and three "or" gates

The parallel generator consists of the following two parts:

1. A register
2. A high speed parallel adder

164
4.1
It is also suggested in the (F + V) system
that if the random number generator is used frequently enough, it should become part of the instructionset of a future general purpose computer or
even possibly become one of the mOdules in the V
inventory. ~ adding this special purpose random
number generator to any presently exisitng general
purpose computer, for solving Monte Carlo problems,
it becomes a minimum (F + V) system at the expense
of a fraction of the original cost but achieves a
significant gain in speed.
5.
5.1

Examples of Applications

Introduction

In this section, three problems using Monte
Carlo technique, i.e., gamma ray diffusion problem,
matrix inversion problem and design of electronic
curcuits, are considered. The role which is played
by sequences of pseudo-random numbers will be emphasized as well as the possibility of simultaneous
generations of other elementary functions such as
, logarithmic and exponential functions in the (F+V)
structure computer.
5.2

. 24 25
Gamma ray diffuslon '

In the Monte Carlo approach to the Gamma ray
diffusion problem, a beam of monoenergetic Gamma
rays is incident at a given angle on a plane parallel barrier of finite thickness in one dimension
and of infinite length in the other two dimensions.
The paths of the Gamma rays are simulated by
appropriate random walks. A set of three random
numbers is generated per random walk between successive events such as collision or absorption in
order to obtain energy and angular distribution of
transmitted and reflected Gamma rays.
In this problem it turns out that simUltaneous
computations of elementary functions In x and eX
can reduce the total computation time by 40~. The
generation of pseudo-random ~~m~5r36r3~u~8e397.4%
of the total computing time. ' , , , ,
5.3

An important consideration in the design of a
transistor resistor logic system is the delay in
propagating signals through various levels of these
circuits. The propagation delay is an implicit
non-linear function of many variables. In investigation of its statistical properties, two immediate difficulties are encountered. First, the distributions of the variables are not completely
known. While it is possible to make reasonable assumptions regarding resistor values, such as a uniform denSity function, the transistor parameters
present a different picture. Their denSity functions are not generally known in a form describable by any simple function such as the normal distribution. One approach to this problem is to derive approximations to these functions from empirical measurement. Even with the distribution of
all the variables known, the distribution of the
resultant delay cannot be readily determined and
the techniques of the Monte Carlo method become
very powerful aids in the solution of such complex
problems. This is particularly true since the development of models for the transient and steady
state analysis of transistor SW~9ching circuits
such as ~ose of Ebers and Moll
and Sparks and
l3eaufoy.
The flow chart below shows a typical computational flow for a conventional sequential machine
in applying the Monte Carlo technique to evaluate
the propagation delay.

Random Selection of
Power Supply voltages,
Transistor Parameters
andValues of Impedance
Steady State
Circuit Current
Computation

Matrix Inversion

Matrix inversion by the Monte Carlo method was
first suggested by Von Neumann an~6Ulam and later
described by Forsythe and Liebler • The method
provides a simple computational approach to the
statistical estimation of the elements of the inverse of a given matrix. Here, random walks are
again introduced in generating the matrix inverse
and the random walks are terminated when the statistical variation of the result is less than a
given tolerance. The percentage of time spent in
generating random numbe 2g ~?r2Shis type of program
is approximately 17.3~. ' ,
5.4 Electronic Circuit Design31 ,33,34,35
The Monte Carlo technique is finding wider
application in designing electronic circuits and is
illustrated by the problem of estimating propagation
delay time of transistor-resistor logic circuits. 31

repeat

The method of approach is: establish a mathematical model of the circuit; generate a set of
numerical values for all these variables according
to their respective distributions;* evaluate the
function using this set of values; and repeat the
above operations using another independent set of
data.

* One can obtain these values by actually measuring the parameters of a pa.rticula.r transistor randomly selected. Resistor values may be assumed to
have a uniform distribution.

165
4.1
Calculation of Rise Time, To
The time required for the collector current to
reach 90% of its final value is
1b1
To =

La

0.91 c

I b1--_

Pn

where four random numbers are generated to select
't
a

minority carrier lifetime in the active
region

1b1

step turn on current

I

collector current
normal common emitter current gain

R

the effective impedance looking out from the
base terminal
input capacitance
forward

bias voltage on base

reverse bias voltage on base
In estimating R, it may be more desirable to randomly select individual resistor values in the
circuit and then compute R, rather than picking R
randomly from some estimated distribution for R.
Turn on

c

(3n

Calculation of Storage Time -T
1
The time required to switch the transistor
from the saturation region to the edge of the active region is

+

In expression (5.2), T is the response of the transistor to a step gf base current.
The second term compensates the delay due to the
input capacitance.
Storage Time

where two new random numbers are needed to obtain
minority carrier lifetime in the saturation
region
1b2

step turn off current

Calculation of Decay Time T2
The time required for the collector current to
decay to zero 1s

Calculation of the Input Delay time TOD
The time required to charge or discharge the
input capacitance is

e

L
Re.

l.n -1

L
RC

in

where at least four additional random numbers are
used to select

T = T +L ln
s
s
1

e

T
on
1:'s
T
on
1:' s

-1

(5.6)

In expression (5.6) T is the time required
to remove excess carriers from the base with a
step of base current applied. The second term in
expression (5.6) is due to the fact that the base
of the transistor is discharged by a signal with a
finite rise time, Ton.

A simulation progr~r has been prepared by Y.
C. Ho and W. J. Dunnett. The entire program has
approximately 4,000 instructions and takes an average of three seconds per run on the IBM 709
Computer. In order to verify and establish the
accuracy of the mathematical model a statistical
experiment was also performed. In this experiment
measured data was compared with predicted statistical data provided by the computer program and in
each case, the predicted mean was quite close to
the measured mean. Moreover, the computed deviation in every case was more conservative than its
measured value.
One notices that in computing the propagation
delay, logarithmic anrl exponential functions are
used four times and three times respectively, in
addition to the required generation of a set of ten
random numbers per transistor per run.
The special purpose random number generator
does away with most of the time required to generate random numbers which become available essen-

166

4.1
tially in one access time to the high speed memory.
If in the same sense as the random number generation, a special purpose configuration isxestablished in V for computation of ln x or e concurrently with the program being executed in F, then
one may expect to achieve significant reduction in
total computation time.

"Empirical Tests of an Additive Random Number
Generator", Journal of the Assoc. Compo
Machinery, pp. 527-537, September, 1959.
9.

Rotenberg~

Consideration of the set of equations 5.1-5.6
shows that the seven natural logrithm and exponential functions must be evaluated in sequence.
Thus while F is calculating the operands to be
used in these functions, V might be generating the
random numbers and functions required in the following steps.

10.

Kendall, M. G. and Smith, B. B., "Randomness
and Random Sampling Numbers", Journal of the
Royal Statistical Society, Vol. 101, pp. 147166, 1938.

11.

Cramer, H., .Mathematical Methods of Statistics,
Princeton University Press, 1946.

A block diagram of the (F + V) structure computer illustrating a form for achieving the above
is shown in Figure 4.

12.

Rand Corp., A Million Random Digits With
100.000 Normal DeViates., Free Press, 1955.

13.

Dixon, W. J. and Massey, F. J., Introduction
to Statistical Analysis, McGraw-Hill, 1951.

14.

Hald, A., Statistical Theory with Engineering
Applications, Wiley, New York, 1952.

15.

Taussky, O. and Todd, J., "Generation of
Pseudo Random Numbers", Symposium on Monte
Carlo Methods, Wiley, 1956.

16.

Mauchly, J. W., Pseudo-Random Numbers, Presented to American Statistical Assoc.,
December 29, 1949.

17.

Votan, D. F. and Rafferty, J. A., "High Speed
Sampling", !!!!2. 5 (1951) pp. 1-8.

18.

Garner, Harvey L., "The Residue Number System",
Proceedings of Electronic Computers 8 June,
1959.

19.

Lehmer, D. H., "Mathematical Methods in Large
Scale Computing Units," Proc. of a 2nd Symp.
on Large-Scale Digital Calculating Machinery,
pp. 141-146, 1949.
Greenberger, M., Decision Unit Models and
Simulation of the United States Econo
Chapter III, Preliminary Draft 1958 •

6.

Conclusion

In all three examples considered in Section
5, serial generation of random numbers appeared
quite satisfactory, since, using inexpensive
transistors it is possible to generate one random
number in about 8 }Ns. No case has been found in
which it is necessary to employ the parallel method
of random number generation.
If it becomes necessary to generate random
numbers more rapidly than 8fs per random number,
one can either decide to employ more expensive,
faster components in a serial random number generator or utilize the fully parallel configuration.
References
1.

2.

3.

4.

5.

6.

7.

Estrin, G, I1Organization of Computer Systems The Fixed Plus Variable Structure Computer,"
Proceedings of Western Joint Computer Confer~, May, 1960.
Donsker, M. D. and !{ac, M., I1The Monte Carlo
Method and Its Applications", Proceedings of
Computation Seminar, IBM Corp., December, 1949.

20.

McCracken, D. D., "The Monte Carlo Method",
Scientific American, Vol. 192, Page 90, May,
1955.

21.

Brown, G. W. "Monte Carlo Methods", Modern
Mathematics for the Engineer, Edited by E.
Beckenbach, McGraw-Hill Co., New York, 1956.

Coveyou, R. R., "Serial Correlation in the
Generation of Pseudo-Random Numbers," J. Assoc.
Comp., pp. 72-74, January, 1960.

22.

Weinberger, A. and Smith, J., "A One-Microsecond Adder Using One Microsecond Circuitry,"
Proceedings of Electronic Computers, IRE
EC-5, June, 1956.

23.

Richards, R. K., Arithmatic Operations in
Digital Computers, D. Van Nostrand, 1955.

24.

Berger, M. J., "An Application of the Monte
Carlo Method to a Problem in Gamma Ray
Diffusion," Symposium on Monte Carlo Methods,
Wiley, 1956.

25.

Beach, L. A. and Theus, R. B., "Stochastic

Lytle, E. J., itA Description of the Generation
and Testing of a Set of Random Normal Deviates",
Symposium on Monte Carlo Methods, Wiley, 1956.
Butler, J. W., "Machine Sampling from Given
Probability Distributions", Symposium on Monte
Carlo Methods, Wiley, 1956.
IBM 709 Data Processing System Reference
1958.

~,

8.

A., "A New Pseudo-Random Number
Generator , Journal of the Assoc. for Compo
Machinery, pp. 75-77,January, 1960.

Green, B. F., Smith J. E. K., and Klem, L.,

167
4.1
Appendix I

Calculations of Gamma Ray Diffusion,
Symposium on Monte Carlo Methods, Wi1ey,1956.
26.

27.

28.

29.

30.
31.

32.

33.

34.

35.

Forsythe, G. E. and Leib1er, R. A., "Matrix
Inversion by a Monte Carlo Method," MTAC IV
31: pp. 127-129, 1950.
-Tang, T. "A Special Purpose Random Number
Generator" Master Thesis, U.C.L.A., January,
1960.

Period of Multiplicative Congruence Method
From the equation
Xi +1 = (2

Beaufoy, R. and Sparks, J. J. "The Junction
Transistor as a Charge Controlled Device,"
A.T.E.Journa1, October, 1957.
Ho, Y. C. and W. J. Dunnett, "Monte Carlo
Analysis of Transistor Resistor Logic Circuit, It
IRE Convention Record, March, 1960.
Edmonds, A. R., "The Generation of PseudoRandom Numbers On Electronic Digital
Computers," The Computer Journal, pp. 181-185,
January, 1960.
Hellerman, L. and Racite, M. P" "Reliability
Techniques for Electronic Circuit Design,"
Transaction on Reliability and Quality Control,
IRE September, 1958.
Parker, J. B., "The Accumulation of Chance
Effects and the Gausslal) freq.uency Distribution,"
Phil Mag., 38:681-682 (1947)
Silberstein, L. "The Accumulation of Chance
Effects and the Gaussian Frequency Distribution," Phil Mag. , 35: 395-404 (1944).

36.

IBM Share Program F114 Distribution No. 027,
Western Data Processing Center, U.C.L.A.

37.

IBM Share Program LAQ1 Distribution No. 525,
Western Data Processing Center, U.C.L.A.

38.

IBM Share Program AS09 Distribution No. 224,
Western Data Processing Center, U.C.L.A.

39.

IBM Share Program AS03 Distribution No. 224,
Western Data Processing Center, U.C.L.A.

P

+ 1) Xi + C, Mod 2

and
a

Ralston, A. and Wi1f, H. S. Mathematical
Method for Digital Computers, John Wiley and
Sons, Inc., New York, 1960.
Ebers, J. J. and Moll, J. L. "Large-Signal
Behavior of Junction Transistors," Proceedings
of IRE, December, 1954.

a

(2 + 1)

x.1+1

a
(2 + 1)

L (2a +1)

+ C
Xi +

c] + C

a
a
(2 + 1)2 Xi + (2 + 1) C + C
and
a
a
Xi+n = (2 + 1)n Xi -+ [(2 + 1 )n-1+

(2a + 1) n-2 + •

] C

(2a+1)n X. + C (2 a + 1)n _ 1
1

The sequence repeats itself when

Xi +n

= Xi

From (2) and (3) we have
a

(2 +1 )n-1

a2

[

a
]
2 Xi + C = 0

Mod 2P

If C is odd and a> 2 the sum of the terms in
the bracket is odd. Then the first term must be
divisible by 2P in order to satisfy (3). Thus it
is required to find the smallest n satisfying the
congruence
(2 a + 1)n_1

a
2

In Rotenberg's paper 9 , he points out that
from the Number Theory, ~ (2P ) is a solution of
(4) and the smallest n must be of the form 2r.

]
]
If a = 2, all terms in the bracket after the
first "l tt will be even and thus the Whole bracket
is odd. Then to satisfy (4) the factor 2r must
be divisible by 2P • Thus the minimum value for r
1s P, or n = 2P •

168

4.1
Appendix II
Examples of Computer Routines to Generate
Random Numbers
The IBM 709 MIdi tive Congruence Method 14
Location

Operation

Address Tag

00

SXD

Rand 1, 1

01

LXD

Rand 2, 1

02

CLA

Rand 3, 1

04

TIX

05, 1, 0

05

ADD

Rand 3,

Comments
The contents
of index reg.
1 is stored
in Rand 1
No. of initial
numbers is
loaded into
index reg. 1
x. 1 is added
ta-acc.
Transfer on
index (10)
x.
is added
t

06

STO

Rand 3,

07

SXD

Rand 2,

08

LXD

09

TOV

10

TRA

a-nx j _ 1

x. is stored
J

The content of
index reg. 1
is stored in
Rand 2
Rand 2, 1
Index reg. 1
is restored
10
overflow condition reset
Normal Return

Rand 1 Temporary Storage
Rand 2 No. of initial numbers
Rand 3 Address of the last initial random numbEr
(Random numbers are stored in consecutive
memory locations)
The two SXD and LXD (00, 01, 07, 08) instructions could be omitted if the index register 1 were
available for use by the subroutine, thus saving
four in~tructions per generated number.
The IBM 709 Multiplicative Congruence Method
Location

Address

Comments

CLA

Rand

x. 1 is added to
aac.
C is added to x _
j 1
x. 1+ C is stored

Operation

0

ADD

Rand 2

2

STO

Rand

3

ALS

Rand 3

4

ADD

Rand

2 x _ is. formed
j 1
Xj is formed

5

STO

Rand

x. is stored

6

TOV

Normal Return

Rand

Random number x. 1

Rand 2 C
Rand 3 a

J-

~-

J

CARRY

s

T

~~
~
N'19

I

N

ADD COMPLETE

(T=O)(N=19)

ADDER

REG~

0-19

s =I

Figure 1

-ADO COMPLETE

FUNCTIONAL BIDCK DIAGRAM FOR A SERIAL PSEUOO-RANroM NUMBER GENERA.'roR

~I-'

•

Ol

I-' (0

170
4.1

COMPLETION
TO S.C.
START
FROM S.C.

R REGISTER

I--~

RANDOM NUMBER TO
DATA REGISTER

PARALLEL ADDER

Figure 2

FUNCTIONAL BlOCK DIAGRAM FOR A PARALLEL PSElJ.OO-RANJX)M NUMBER GENERA.IDR

171
4.1

X7

I------------------~

Xa ~----------------~

Xg

~------------------~------~

X,o

~----------------__1

TIME

FiGClre:3

SYNEOLIC

I

Fum

TIME 2

TIME 3

TIME 4

CR4..."tT FOR PARALLEL ADDER INDICATING TINllm DURING ADDITION

t/:o.~

RANDOM NUMBER
GENERATOR

••

s.c.

"DEMANDII FROM

s.c.

..-. COMPLETION TO

"TRANSFERII FROM

JL

a:
w

s. C.

---.. RESULT TO F

l-

(/)

(!)

w
a:

~ 1-+ ARGUMENT
o

In x

OR

eX

GENERATOR

...

DEMAND FROM

~

COMPLETION

s.c.

TO

Figure 4 (F + v) STRUCTURE CONFIGURATION

s.C.

FROM F

•

....;j

~

to.;)

173
4.2
THE CELLS CAN

SYSTEM T.M.

A LEUCOCYTE PATTERN ANALYZER
K. Preston, Jr.
The Perkin-Elmer Corporation
Norwalk, Connecticut
Summary
Medical workers suspect that the incidence
of the binucleate lymphocyte in the peripheral
blood stream is an index of radiation damage. l
The fact that the incidence of this type of white
cell is of the order of one per each million
blood cells makes practical use of this index by
the human observer essentially impossible. The
Atomic Energy Commission has therefore arranged
for the construction of a blood cell scanning
system with which to determine the feasibility of
the semi-aatematic identification of the binucleate lymphocyte in glass-mounted smears of human
blood.
The CELLSCAN system consists of a closed circuit television microscope coupled to a special
purpose digital computer. The system produces a
quantized image of the leucocyte whose pattern is
to be analyzed. The computer program causes the
quantized image to be operated upon serially in
such a way that groupings of contiguous binary
"l's" are reduced to single l's. Sampling the
number of isolated l's at various instants during
the reduction process produces a histogram of the
leucocyte's various constituent parts. The
CELLS CAN system is intended to determine whether
the-resultant histogram of the binucleate lymphocyte is unique in the histogram population of
all leucocytes.
Introduction
The use of radioactive materials or of
radiation producing equipment continues to increase throughout the world. Users are confronted with the problem of measuring the effect of
radiation on workers in radiation environments.
To date no good index has been found which
measures the physiological changes produced in
humans by low level radiation. One hypothesis
that has been advanced is that the appearance of
a rare type of white blood cell in the blood
stream may provide such an-index. The rate of
occurrence of this white cell is extremely low.
I~ order to obtain sufficient statistical data to
prove the above hypotheses, it is essential that
automatic equipment be developed which is capable
of scanning and anaiyzing blood samples.
It is the purpose of this paper to describe
an experimental blood cell scanning system which
has been constructed for the Atomic Energy Commission. The equipment which has been assembled
is designed to scan individual white cells rather
than blood samples containing a multitude of
cells. Experimental results will be used to es-

tablish the feasibility of using electro-optical
scanning with data processing pattern recognizing techniques to identify certain types of
human blood cells.
Hematological Background
Figure 1 is a table showing some of the
classes of cells which are found in the human
blood stream, i.e. the peripheral blood. Blood
cells are divided into two major classes: The
red cells (erythrocytes) and the white cells
(leucocytes). It is the white cell population
with which we are concerned here. The two major
classes of white cells are lymphocytes and
granulocytes. There are about two granulocytes
for each lymphocyte. The sub-class of lymphocytes whose incidence may be an index of low
level radiation damage consists of lymphocytes
having two nuclei. These cells are called
binucleate lymphocytes. For brevity we shall
refer to them as "bilobes".
Some quantitative information on the incidence of bilobes has been obtained in experiments with animals. Figure 2 shows a graph prepared by Dr. Marylou Ingram at the University of
Rochester. 1 It shows the number of bilobes per
thousand lymphocytes in the peripheral blood of
dogs who were exposed to low level radiation
from the University of Rochester cyclotron. As
can be seen the normal incidence of bilobes in
the dogs tested is about one per 30,000 lymphocytes. There is an immediate increase in the
number of bilobes upon exposure by about an order
of magnitude. This increased incidence continues
in the peripheral blood for approximately one
month during which time its amplitude gradually
diminishes.
The normal incidence of bilo~es in man is
somewhat greater than that given in Figure 2 for
dogs. It is of the order of one per ten-thousand
white cells which is equivalent to about one for
each one million total blood cells. In order to
measure the index of bilobes for a particular
human being, the technician employed to process
a blood sample must count and catalogue cells
for several hours. This makes routine measurements of this index both practically and economically unattractive as long as human technicians
are required to perform the measurements. These
measurements are also subject to error to
technician fatigue. For this reason a study program has been instituted in order to determine
whether it is within the present state-of-the-art

174

4.2
to evolve a blood cell scanning and data processing system which will identify bilobes in typical
samples of peripheral human blood with reasonable
error rates. This system is called the CELLS CAN
system.
Figure 3 shows a portion of a typical blood
sample. The sample is prepared by squeezing a
drop of human blood between two glass microscope
covers lips. When the slips are separated the
surface of each is coated with a monocellular
layer of blood. The layer of blood is then
treated with certain biological dyes which selectively stain the different cells and cell constituents. The chemical reaction between the dye
and the cells imparts characteristic colors to
the cells which assist the hematologist in cell
identification. Figure 3, however, is purposely
black and white as the CELLSCAN system uses a
monochromatic input.
Different types of white cells appear in
Figure 3 interspersed with red cells. The red
cells are the regular disc-shaped objects. The
white cells are the irregularly shaped darker
objects. Each white cell consists of nuclei within the cell body or "cytoplasm". A bilobe is
shown as well as two granulocytes. Granulocytes
are characterized by the multitude of dark grains
or "granules" within their cytoplasm. It should
be noted that one granulocyte has two nuclei.
Therefore, in order to identify the bilobe, the
CELLSCAN system must differentiate between
granulocytes and lymphocytes as well as between
binucleate lymphocytes and lymphocytes with other
than two nuclei.
The technique used by the CELLSCAN system to
differentiate between classes of white cells is
to count and size the constituents of a given
cell. By "constituents" we mean the nuclei and
the granules. The output of the system is thus
a histogram from which cell type may be determined.
A digital approach to this problem of particle
counting and sizing has been chosen as the best
method for dealing with irregularly shaped
particles.
The "Shrink" Algorithm
The equipment which has been designed to
identify bilobes employs a television microscanner to deliver a binary video image to a
special purpose computer. The scanner is so designed that, whenever white cell nuclei or granules appear in the field of view, the digital
output is "1". All other areas of the field of
view are "0". A hypothetical binary image is
shown in Figure 4a. Since the computer is
digital, this image is quantized into a Cartesian
array of elements or "bits". Dark elements in
Figure 4 correspond to binary "l's".
The function performed by the special purpose computer is to count and size the image
areas which are comprised of contiguous "l'su.
The output of the computer is an image histogram

for the particular cell which is viewed by the
scanner. In order to produce this histogram the
computer first stores the entire binary image.
It then operates sequentially on each image bit
causing binary "l's" on the periphery of any
contiguous group of "l's" to be changed to "O's".
This operation has been called the "shrink" operation and was originally suggested by M. Golay.2
In general, the computer is required to make
several sequential examinations of the image before the image histogram is completed. Each
examination will be called a "pass". Figure 4b
indicates how the shrink operation modifies the
original image shown in Figure 4a after two
complete passes. Figure 4c indicates the re-·
sidual image after sufficient passes have been
made to reduce the original image to a single
isolated binary "I". The number of passes required to reduce the original image to an isolated
"1" is proportional to the maximum chord of the
original image. Thus the image histogram is computed by counting the number of isolated ones
present in memory after each pass. This information is then converted into a plot of the
total number of groups of "l's" in the original
image which fall into each of several maximum
chord ranges.
To program the computer to perform the shrink
operation an algorithm has been devised by L.
Scott and R. M. Landsman. 2 The algebraic expressions which define the shrink algorithm are
shown in Figure 5. The image bit operated upon
is designated as X and its neighbors as A, B, C,
••• ,H. Since the computer sequences from left
to right and top to bottom, bits A, B, C, and H
have previously been processed. Their processed values are designated Ap ' Bp ' Cp ' and Hp.
Three functions are derived from the values
of these 8 neighbors. The function f(ISO) indicates whether the bit under examination is an
isolated one while the function f(TAZ) indicates
whether the neighbors contain three adjacent
"O's". A further function is required whose
value indicates whether the X bit is a link between subgroups of a given grouping of contiguous "l's". This indication is provided by the
value of f(TUP) which is "1" when there are three
or more unlike neighbor pairs. This is necessary
so as to prevent the computer program from producing two isolated "l's" when operating upon
a dumbbell shaped original image.
The complete shrink algorithm is f(X ) and
is given in the last line of Figure 5. ~en this
algorithm is applied to a stored image, results
as previously described in Figure 4 are obtained.
It causes isolated "l's" to be retained rather
than being converted to "0". This permits periodic sampling of the number of isolated "l's" in
the computer memory. This periodic sampling takes
place at intervals determined by the scale required for the image histogram.

175
4.2
Technical Details
The first model of the CELLS CAN System has
been built in order to evaluate the feasibility
of recognizing bi10bes using the data processing
technique described above. A block diagram is
shown in Figure 6. The first model is not intended to be a machine capable of scanning complete blood samples. The micro-scanner is manually centered on the blood cell to be analyzed.
For reasons of economy the scanning and data processing rates are slow. About 10 minutes are
required to process one image. The periodic computer sampling of isolated "l's" is converted to
an image histogram by manual calculation.
The Scanner
The scanner standard employs a Leitz Ortholux microscope coupled to a Dage Data-Vision
scanner which has been modified so as to enhance
its signal to noise ratio. The output of the
video amplifier is digitized by means of a video
quantizing circuit.
The Data-Vision equipment scans at a rate of
60 horizontal scans per second which is about 300
times slower than standard closed circuit television systems. The period of the vertical scan
is 5 seconds. This produces a field of 300 horizontal scans for each image. This slow scanning
rate was chosen so that the output data rate of
the scanner could be matched to the data processing rate of the computer. It permits image
data recording directly upon magnetic tape at
2000 bits per second. Thus a relatively inexpensive audio tape machine can be employed.
White cells are typically 10 to 20 microns
in diameter. In order to restrict the field of
view to a single cell an area of blood sample 30
microns square is imaged by the micro-scanner.
Thus, with 300 scanning lines, the elementary
sample area or scanner "resolution" is one-tenth
of a micron square. This resolution is required
since the separation between nuclei in a bi10be
may be of this order. It implies, however, that
computer storage of almost 100,000 bits per field
of view is required. In order to reduce this
memory requirement a processing technique developed by W. K. Taylor is used in the scanner. 3
Circuitry is used which stretches white-polarity
video signals and equally shrinks dark-polarity
video signals. This causes the separation between bi10be nuclei to appear greater. However,
it simultaneously makes granules appear smaller.
In the CELLSCAN system a five fold stretching is
permissible. Although granules having maximum
horizontal chords less than one-half micron disappear from the image, sufficient granules remain
to make cell identification possible. In this
fashion the horizontal resolution is decreased
to one-half a micron and the computer memory capacity requirement becomes about 20,000 bits.
The Computer
A block diagram of the CELLS CAN system com-

puter is shown in Figure 7. The computer memory
is a continuously moving loop of magnetic tape
which is capable of storing 19,200 bits corresponding to a 64 x 300 field of view. Non-returnto-zero recording is used with automatic erase
after reading. One track is used for storing
"O's"; another, for "l's". Internal computer
timing signals are obtained from the data stored
on the tape. At the beginning of the data processing run the computer memory is erased. Then
under control of synchronizing signals from the
scanner the memory is filled with one field of
view. The image stored on the magnetic tape is
serially transferred to memory registers and
operated upon using the shrink algorithm under
the command of controls on the computer console.
The computer applies the shrink operation to the
image for a pre-set number of passes variable
geometrically from 4 to 128. After this number
of passes has occurred, the computer automatically reverts to a mode of operation wherein the
tape continues to be read and the image is rewritten unchanged. At this time the computer may
be ordered to count the number of isolated "l's"
in the residual image. By alternating between the
command to shrink and the command to count isolated "1' s", the Ote rator is able to complete the
image histogram for the field of view which has
been stored.
Another requirement of the computer is to
complement the image stored. This routine is
required in order to eliminate noise in the
original image. Consider for example, the problem of image inclusions as in applying the shrink
operation to the letter "0". Due to f(TUP) which
indicates an X bit which is a link between two
neighbors, this configuration of binary "l's"
cannot be reduced to a single binary "1" by the
shrink operation. This indicates that any inclusions which occur in a group of contiguous
binary "l's" will prevent this group from being
reduced to an isolated" 1". In or der to eliminate inclusions in the image received from the
scanner the CELLSCAN computer first complements
this image. In the complemented image inclusions
appear as isolated "l's" or small groups of contiguous of "l's". These are eliminated by applying a modified shrink operation wherein f(ISO)
= 0 so that isolated "l's" are not retained. Now
the image is again complemented in order to recr~ate the original image in an inclusion free
form.
Part of the computer is a test pattern generator which may take the place of the scanner as
a data source. It is used during periods allocated to cOmputer maintenance and debugging. The
action ot the computer is monitored by a video
monitor which displays the original or residual
image contained in computer memory. Furthermore,
the action of the scanner is observed by a second
monitor which is capable of displaying either the
analogue video signal "or the quantized signal.

176

4.2
Results
An example of an analogue signal from the
scanner is shown in Figure 8, which shows a
typical bilobe.
Figure 8 shows the analogue
signal before pulse stretching. It was obtained
using a glass mounted blood sample prepared by
the University of Rochester using peroxidase stain
counterstained with Wright stain. An oil immersed
90 power apochromatic objective lens having a numerical aperture of 1.32 was used with a 1.40
numerical aperture oil immersed condenser. The
virtual image formed by the objective was imaged
on the photoconductive surface of a General
Electrodynamic Corp. 7325 vidicon tube by means
of a Leitz widefield periplant eyepiece. The
light source was a tungsten spiral filament filtered by a 5300 Angstrom narrow band filter.
The analogue signal was quantized with the
result shown in Figure 9. Inclusions in the
nuclei due to non-uniformities in transmissivity
can be seen. After elimination of inclusions by
complementing the image and processing in the
modified shrink mode this image would shrink,
when recomplemented, to two isolated ones after
about 50 to 100 passes.
Other cell types have been imaged and quantized using the CELLSCAN system. At this writing
statistical information is being gathered on cells
from many blood samples in order to ascertain
what error rates can be expected in scanning
bilobes in the presence of all other cell types.
General Considerations
In the above discussion of automatic blood
cell scanning, we have described a machine which
instruments a particular data processing routine.
This special purpose machine is being used to
ascertain whether the particular routine suggested is sufficiently general to identify binucleate lymphocytes. As in all discussions of
special purpose pattern identifiers, the question
arises as to how one is to determine the optimum
approach to the problem at hand. Few treatments
of the general theory of pattern recognition
exist. Some that are known to the author are
listed in the bibliography.4,5,6 One of the more
general treatises in this field is the one written by A. Gill. 6 He treats the general problem
of pattern recognition by assuming a pattern set
S, consisting of M subsets or individual patterns.
He assumes that each of the M sub-sets contains N
features. If we confine ourselves to a binary
system, then the minimum value of N is (10g2M),
where the parentheses indicate the largest integral value of log2M. Furthermore, the maximum
value of N is equal to (M-l). In the latter case
the M x N matrix characterizing the S is the
unitary matrix, i.e., a matrix whose diagonal
values are all binary "l'sll and whose other elements are "0".
The more efficient the method of defining
each Mi, the lower will be the value of N. Gill

defines the efficiency of a noiseless pattern
scanning system as equal to the information content of S divided by the value of N. Defining
"e" as efficiency we thus have:

~ Pi 10g2 Pi

<

i

e

S.

M-l

~

Pi 10g2 Pi
(log2M)

where Pi is probability of finding the Mi in S.
Let us consider S as the population of white
blood cells in peripheral human blood. The probability of occurrence of mononucleate lymphocytes,
binucleate lymphocytes, and granulocytes respectively, are shown below:

PML

3 x 10- 1

PBL

3 x 10- 5

PG

7 x 10- 1

From the above figures, we can compute the
information content of the source as equal to
0.70. Since we are dealing here with three patterns, Gill's approach would indicate both a maximum efficiency equal to 0.35. However, in the
special purpose system which has been designed
it should be noted that a value of N equal to
19,200 has been used. This indicates a system
efficiency of 3.7 x 10- 5 • This quantity is completely outside the range of efficiencies delineated by Gill. This would seem to indicate
that a highly inefficient approach has been
adopted in the present pattern analyzing system.
Contemplating this problem further, we should
again note that, using the Gill approach, we
should be able to obtain a value of N ,equal to 2.
This implies that the existence or presence of
only two features need be recognized by the
scanner. For example, these two features could
be:
A

The existence of two nuclei.

B

The existence of many granules.

The ideal scanner would recognize the bilobed
lymphocyte as being characterized by AB. All
other lymphocytes would be characterized by AB.
Granulocytes would be characterized by (AB + AB)
or, merely, B. The problem now is how to design
a scanner whose output can be directly translated
into the existence or non-existence of the two
features mentioned above. Gill's theory that such
a scanner should exist does not give the designer
a clue as to how such a scanner can be constructed.
It appears clear to the author that existing contributions to the theory of pattern recognition .
require further extension in order to define
methods whereby the salient features of a multiplicity of patterns may be determined. An alternative is to term the special purpose computer

177
4.2
which accompanies the scanner as part of the scanner, rather than part of the recognition logic.
This semantic manipulation, however, is of little
value in solving the practical engineering problem.
Future Development
Some of the practical problems which must be
solved in building a practical blood cell pattern
analyzer may now be listed. As has been mentioned,
the present system is a research tool for use in
proving the value of certain data gathering and
processing concepts. It operates at low speed
and would, in fact, require over a month of continuous operation to process a single blood
sample. A useful system is one which can process
a blood sample (containing approximately 10,000
white cells) in about 15 minutes time. This rate
of data processing allows about 100 milliseconds
to process each cell. Taking into account present limitations of video scanning systems, about
30% of this time should be allocated to scanning
and storing the cell pattern. This leaves approximately 70 milli-seconds during which to perform
the shrink operation on the pattern stored for a
sufficient number of passes to prepare an image
histogram.
Let us assume that we extend the present
serial mode machine directly and further assume
that about 100 passes would be required for each
cell identification. Noting that about 20,000
bits must be stored in each image, this implies
that 2,000,000 bit operations must be performed
in 70 milli-seconds, i.e., an allowance of 35
nano seconds per bit. In order to do logical
operations at this rate we must work at the frontier of the state-of-the-art, using, let us say,
a microwave memory as a data storage medium and
high speed logic circuitry to process the data.
Such a technique would require that the machine
retrieve one bit of data from the delay line
store, cause the operation of a logical sequence
incorporating about 10 propagation times, and
read out the results of this operation all in a
35 nano second period. Such a feat may well be
beyond present day capabilities. 7
Another approach would be to substitute a
combined parallel-serial mode machine for the
serial mode machine described above. For example, one might contemplate storing the image
in a 300 word sequential access memory, having
64 bits per word. Parallel logic would be provided which would examine 64 bits of the image
simultaneously. This would require a 64 fold increase in the number of logical gates which would
be used to instrument the shrink logic. Some reduction in this figure is possible by cross connecting the f(TUP) logics. The computer would
operate by storing three 64 bit words in its
memory register and simultaneously storing 64
previously processed bits in an auxiliary register.
A further 64 bit register would be required to
store the output of the 64 shrink logic circuits.
Because of the characteristics of the shrink

algorithm, which at the moment is implemented by
about 50 logic gates, it can be shown that a 4 to
5 fold increase in total machine complexity is
indicated. The memory would perform a read-write
cycle for every 64 bits of image processed. Again
assuming 100 passes per image, this would imply
30,000 read-write cycles in 70 milli-seconds which
is about 2 micro-seconds per read-write cycle.
These specifications for a parallel-serial mode
machine seem more reasonable than those corresponding to the serial mode machine. They do,
however, imply a large increase in components
required due to the adoption of parallel logic
techniques.
Conclusion
In automatizing the analysis of blood cell
patterns the cQmputer engineer must devise
machine techniques which can reproduce the human
visual recognition process. It appears characteristic of such machines that vast amounts of
input data must be operated upon in order to
deliver a fairly elementary output. This is in
contrast to present scientific and business computers where input and output data rates are commensurate. For example, in the present CELLSCAN
system an input of approximately one quarter of
a billion bits would be gathered in scanning a
blood sample. This input is reduced to the
quantity of bilobes in the sample which can be
represented by a five-bit word. Even if the present system efficiency is improved to the maximum
predicted by Gill, the ratio of input to output
data ra es would be of the order of ten-thousand
to one.
Therefore, it is found that today's
applications of data processing technology in the
field of pattern recognition require us to take
full advantage of present circuit art. It is
here that use can be made of microwave logic
techniques. Furthermore, a better theoretical
grasp of pattern recognition problems is required
so as to guide the engineer towards the most efficient use of available logic circuitry and data
stores.

6

Acknowledgements
The author would like to acknowledge the
support provided the CELLSCAN project by Dr.
Marylou Ingram, of the University of Rochester,
who has been the prime mover behind the medical
investigation of the incidence and significance
of binucleate lymphocytes. The project itself
has been undertaken on a U. S. Atomic Energy
Commission contract.
References
1.

Ingram, Dr. Marylou, "The Occurrence and
Significance of Binucleate Lymphocytes in
Peripheral Blood After Small Radiation Exposures", Internationa1.Journa1 of Radiation
Biology, Special Supplement, Immediate and
Low Level Effects of Ionizing Radiations
(1959).

178

4.2
2.

United States patent application number
854254, filed Oct. 8, 1959 by the PerkinElmer Corp.

3.

Taylor, W. K., "An Automatic System for Obtaining Particle Size Distributions with the
Aid of the Flying Spot Microscope", Brit.
J. Appl. Phys., SUppa No.3, 173 (1954).

4.

Kirsch, R. A., et al, "Experiments in Processing Pictorial Information with a Digital
Computer", Proc.EJCC, 221 (1957).

5.

Unger, S. H., "A Computer Oriented Toward
Spacial Problems", Proc. IRE 46, 1744 (1958).

6.

Gill, A., "Theoretical Aspects of Minimal
Scan Pattern Recognition", Electronics
Research Lab., University of Calif., Series
60, Issue 233, March 23, 1959.

7.

Turnbull, J. R., "lOO-Mc Nonsynchronous
Computer Circuitry, Technical Digest, 1961
Internat'l Solid State Circuits Conf.,
Feb. 1961.

179

4.2

PERIPHERAL BLOOD

I
WHITE CELLS
(LEUCOCYTES)

RED CELLS
(ERYTHROCYTES)

I
LYMPHOCYTES

Figure 1.

GRANULOCYTES

COMPOSITION OF PERIPHERAL BLOOD

(/)

l.LJ

EXPOSURE

~

t;

0.8

~ 0.6

~::e

o

...J

~

EXPOSURE

~

o

CJ)

EXPOSURE

I

0.4

-0
en 0 0.2

o
; 0.0

~

, I'

24

0

TIME IN WEEKS

Figure 2.

INCIDENCE OF BILOBES IN DOGS

'I

28

,Ir--,--,

33

180
4.2

Figure

3.

PERIPHERAL BLOOD SMEAR

II

a. ORIGINAL

FIELD

b.

AFTER 2ND PASS

Figure

4.

THE "SHRINK" PROCESS

c.

AFTER 10 TH PASS

181

4.2

A B C

H X D
G F E
.f(ISO)

= ApBpCpDEFGHp.

~+(TAZ):: ABC+BCD+CDE+DEF+EFG+FGH+GHA+HAB.

of (TUP)

::

(AB + AB){(BC + Be) [(ei5+cD) + ... + (GH+GH)]}
+ .. " + ( E F+ EF) ( FG + FG) (G H+ GH).

FINALLY

~

f(X p ) =

x [.f(ISO)+.f(TUP)+-F(TAZ)]

Figure 5.

THE "SHRINK" ALGORITHM

MICROSCOPE

VIDICON
CAMERA

SCANNER
CONTROL

.-

_

SCANNER-COMPUTER
LINK

~
-

COMPUTER

~
VIDEO
MONITOR

Figure 6.

DATA DISPLAY
MONITOR

THE CELLSCAN SYSTEM

~

......
(X)

~

MAGNETIC TAPE STORE
( 19,200 BITS)

~
QUANTITIZED
VIDEO INPUT

--

EXTERNAL
SYNCHRONIZING
SIGNALS

TEST PATTERN
GENERATOR

-

CONTROL
~

--

--

,

DELAY REGISTERS
( 194 BITS)

SHRINK LOGIC

+
ISOLATED ONES
COUNTER

Figure 7.

..

COMPUTER BLOCK DIAGRAM

OUTPUTS TO
MON rTOR

~

183
4.2

Figure 8.

Figure

9.

ANALOG VIDEO IMAGE OF BILOBE

QUANTIZED IMAGE OF BILOBE

185
4.3
APPLICATION OF COMPUTERS TO CIRCUIT DESIGN FOR UNIVAC LARC
Gilbert Kaskey
Associate Division Director. Systems Design and Application
Remington Rand Univac
Philadelphia. Pennsylvania
Noah S. Prywes
Consultant to Remington Rand Univac
Assistant Professor. Moore School. University of Pennsylvania
Philadelphia. Pennsylvania
Herman Lukoff
Chief Engineer. Remington Rand Univac
Philadelphia. Pennsylvania
The design of circuits for computers has become in recent years a complex undertaking. The
problem is two-fold. On one hand optimization of
cost and speed is the prime objective. On the
other hand. complexity is increased through factors such as component charact~ristics and life
expectancy. manufacturing techniques. and the
suppression of noise in very large systems. The
complexity makes the use of computers as an aid to
design almost imperative; this was the case in the
desi.gn of circuits for Univac Larc.
Several applications of computers in circuit
design are reported here and demonstrated by case
histories for Univac Larc. The paper consists of
two parts. In Part I. a general description of
the problems and solutions is given. References
to available reports or publications are given
where a more detailed description can be found.
The problem areas can be divided into three categories: evaluation of components and life test;
design of circuits; protection against noise.
Part II consists of a detailed description of the
statistical techniques used in the circuit design.
Part I
A.

Evaluation of Components and Life Tests

Evaluation of Components. The components
evaluated included transistors. diodes. ferractors. ferrite cores. resistors. capacitors. etc.
As an example. the evaluation of Philco surface
barrier transistors will be reported here. This
transistor was selected during the first half of
1956 after a study of many other candidates in
respect to rise time. storage time. gain. current
level at optimum performance and cost. The evaluation program required the testing of a large
number of transistors to determine such parameters
as beta. rise time. storage time. and breakdown
voltage. These values were measured initially and
at various times throughout life test.
In the attempt to mechanize the test data
analysis. a complete library of Univac statistical
routines has been developed. These statistical
routines analyzed and evaluated the available
data. The statistical analysis of empirical data
is greatly simplified if the variate under analy-

sis is normally distributed. Since this is rarely the case in practice. the distributions were
transformed to ones that have Gaussian properties.
Close cooperation with manufacturers was
maintained to insure that the transistors received were the best that could be produced in
the manufacturing process used. Experimental
designs were made to aid in the determination of
the effect of various changes in production techniques. e.g •• resistivity and etching time. on
the several transistor parameters. A UNIVAC@
routine was then used to perform the analysis of
variance necessary for the identification of the
statistically significant variables. The results.
a joint effort between the manufacturer and user.
helped establish production control procedures
which virtually insured that component lots would
5
meet the required specification.
Evaluation of Life Test. Because of the
long life expectancy of transistors it was difficult to ascertain failure characteristics by life
test in a reasonable time period. In other words.
it was not possible to detect significant degradation of transistor parameters over thou~ands of
hours of life test. Since it was nevertheless
extremely important to be able to make a prediction as to the life expectancy of the transistor.
an attempt was made to run accelerated life tests
at elevated temperatures. under severe humidity
conditions and under vibration. with the purpose
of producing gradual deterioration in a reasonable
time. It was hoped that the correlation of such
results with deterioration under normal usage conditions would result in a reasonably accurate prediction of life expectancy. The tests under elevated temperature conditions were the only ones
that proved useful in this respect.
0

0

Transistors were placed on test at 55 c. 65 c.
85°c. and lOOoc. The transistors involved
were first tested for homogeneity by a study of
the distribution of breakdown vOltage and ~.
These parameters also appeared to be the major
cause of transistor failure in the circuits and
therefore are the subject of the investigation.
By studying the behavior of these homogeneous sets
over time, we hoped to obtain as estimate of transistor behavior at 25 0 c.
75 0 c,

186
4.3

To illustrate the statistical methods used,
the determination of transistor life using degradation of breakdown voltage as a criterion will be
discussed. The circuit design indicated that the
breakdown voltage degradation to 3 volts implied a
transistor failure. Theoretical studies had suggested the dependence of the breakdown voltage on
temperature and time as follows:
= A - Be -a/2T v't
(1)

vv:p

where Vp is the breakdown voltage, T is the absolute temperature and t is the age of the transistor in hours.
A least-squares fit was then made to the
data, using fixed values of T, to determine the
"best" values for A, B, and a. The results of
this analysis are given in Table I.
Table 1. Punch-Through Voltage
Accelerated Life Test
Group
No.

Temperature
(oc)

10

55
65
75
85

11

12
13

Least-Squares Equation
(t in hours)
10.6
9.57
11.9
10.9

-

0.0002t
0.0004t
0.0028t
0.0058t

(2)

where m is a slope of the linear least-square
equation and T is the corresponding absolute temperature as shown in figure 1-1.
This equation was used to estimate the slope
at 25 0 c. and the value was found to be approxi-5
.
mately 2 x 10 volts per ho~r. The results of a
fit by eye made prior to the regression analysis
made on the Univac System indicated an average
life of 211,000 hours as indicated in figure 1-2.
In addition to the prediction of accelerated
life tests, life tests under conditions similar to
operatinG conditions were conducted. Improved
predictions were only possible as more data became
available.
B.

1)

More efficient circuit optimization in terms
of the predetermined functionals. that is,
cost. by utilizing the speed of computers.

2) The generation of component specifications
which computer programs correlate with the
capabilities of the designed circuit.
Circuit design and optimization start with
given circuit schematic configurations and a
performance requirement. In the case of computer
circuits, the latter can be stated. for initance.
in terms of fan in (number of logical inputs),
logical operation, fan out (number of circuits to
which the output is connected), and the delay per
circuit. The objectives of the design are to
single out one of several suggested circuit configurations and determine component parameters so
that cost is minimized.
The process can be roughly divided into
three steps:

Because of the assumed exponential relationship, the least-squares straight line was obtained
for the logarithm of the slope as a function of
reciprocal temperature. The equation thus obtained
was
m = -1.3 x 1015e-(14278/T)

LARC attempting to mechanize all steps previously
performed manually. Our objective has been to
completely automate the design steps required in
going from proposed circuit schematic configurations to the development of an optimized circuit.
The process will consist primarily of computer
programs using a detailed mathematical model.
There are two advantages to such a process:

Circuit Design

Mechanization of the various steps involved
in circuit design for Univac Larc has served as
the basis for research whose objective is the
complete mechanization of design and fault diagnosis of transistor circuits. Research has been
in progress since the initial design phase of

1) Generation of d-c circuit equations.
2) D-c circuit design.
3) Optimization of the Circuit for Reduced

Dela~

Generation of d-c circuit equations. A code
was devised for transferring the circuit information in a schematic diagram into the computer.
This code is completely reversible; that is, the
original circuit diagram can be derived uniquely
from the computer code.
The nodes of the circuit are assigned a number mlm2. Each branch is uniquely determined by
its endpoints (that is, the branch with nodes mlm2
and nln 2 as endpoints is called branch
mlm2nln2PlP2). An additional set of numbers PlP2
is necessary to differentiate two or more branches
which have common endpoint notes. After listing a
number which identifies a branch, the components
of the particular branch are listed. When this
has been done for all branches. the circuit has
been completely described. Each component is
associated with a letter of the alphabet (for
example, resister R. emitter E, battery B, and
diode D). In the format used each letter is
followed by a tWO-digit number to differentiate
components which are of the same type.
The component closest to node mlm2 is listed
immediately after the description of the branch
mlm2nln2PIP2' followed in order by the remaining

187
4.3

components in the branch; thus, the component next
to node n n is the last in the series. As an example, th~ 10ll0Wing is the format for the circuit
shown in figure 1-3.
00
00
00

01
01
02
02
03
03

00

ROI

000

DOO
ROO

GOO
BOO

500
R02

04

00
01
00
00
00
00

04
04

00
00

R03
GOI

000
BOI
000
000
B02
000

01
04
04
02
04
03

QOO
EOO

This method of recording the information for
given circuit configuration gives a unique representation so that each component and its location is specifically described.

&

The two basic methods available for the generation of the circuit equations (that is, the
loop and the nodal-branch techniques) have been
considered. The loop method has the advantage of
yielding fewer equations, since not all of the
possible variables are included. If the,additional variables are eliminated from the nodal-branch
equations, the reduced set is identical with the
set derived by using the loop method.
The nOdal-branch derivation has a single advantage which is extremely important from the
standpoint of a computer solution: the equations
are derived very systematically. Thus the process
of mechanization, which will result in a set of
redundant equations, is easily implemented. On
the other hand, when using the loop equation
method, there are frequently too many loops to
allow extracting those which make up the system of
redundant equations.
It was decided, therefore, to concentrate on
the more systematic nodal-branch method, and then
eliminate any irrelevant variables.
If there are n nodes in the circuit, n-1 independent nodal equations can be generated. (It can be shown that if the n-th equation is generated, the result can be derived from the other n-1 equations.) Referring to the circuit in figure 1-3, the n-1 equations generated are:

I010 + I040 + I041 = 0
-I010 + I120 + I140 = 0
-I120 + I230 + I240 = 0
-I230 + I340 + I341 = 0

The branch equations represent the total voltage drop across each branch. For the sample circuit in figure 1-3 the equations are as follows:

V4 - V1 = VB01 + VR02
V3 - V2 = VQ00
V4 - V2 = VE00
V4 - V3 = VR03 + VB02

These four nodal and nine branch equations, then, represent a complete set of redundant equations which fully describe the system.
The current-voltage relation of diodes and transistors is nonlinear. In order to simplify computation, these nonlinear curves have been approximated and replaced by linear segments in the regions of operation which are of interest. Thus, whenever the voltage-current relationship of a diode is considered, the following condition is employed:

VD = D0 + RD ID

where VD and ID are the voltage and current respectively through the diode. (See figure 1-4.) The constants D0 and RD are unknown quantities to be determined in the calculations. In effect, a variable (VD), which changes with input conditions, has been replaced by two quantities (D0, RD) which remain constant through varied input conditions.
In a similar manner the following substitutions can be made for the transistor:

VS = S0 + RS IS
VQ = K1 + K2 IQ

where S0, RS, K1 and K2 are constants determining the two straight line approximations; IS and IQ are the currents through S (base) and Q (collector) respectively; and VS and VQ are the voltage drops across the base and collector. (See figure 1-4.) As in the case of the diode, unknown quantities (VS, VQ), which change with input conditions, are replaced by quantities (K1, K2, S0, RS) which remain constant through varied input conditions.
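A minimal sketch of how the systematic nodal derivation can be mechanized, assuming only that each branch record carries its two endpoint nodes and a parallel index: summing a signed unit coefficient for every branch incident on a node gives one current equation per node, and dropping any one node leaves the n-1 independent equations. The Python below is illustrative, not the original program.

# Hedged sketch: generate nodal (Kirchhoff current law) equations from a branch
# list.  Each branch is (from_node, to_node, parallel_index); current is taken
# as positive when flowing from the first node to the second.

from collections import defaultdict

def nodal_equations(branches):
    """Return {node: [(sign, branch_label), ...]}; each entry sums to zero."""
    eqs = defaultdict(list)
    for m, n, p in branches:
        label = f"I{m}{n}{p}"           # e.g. I010 = current in branch 0-1, index 0
        eqs[m].append((+1, label))      # current leaves node m
        eqs[n].append((-1, label))      # and enters node n
    return eqs

# Branch list (endpoints and parallel index only) consistent with the four
# nodal equations quoted above; the circuit details are otherwise invented.
branches = [(0, 1, 0), (0, 4, 0), (0, 4, 1), (1, 2, 0), (1, 4, 0),
            (2, 3, 0), (2, 4, 0), (3, 4, 0), (3, 4, 1)]

equations = nodal_equations(branches)
# Dropping one node (here node 4) leaves the n-1 independent equations.
for node in sorted(equations):
    if node == 4:
        continue
    terms = " + ".join(f"{'-' if s < 0 else ''}{lab}" for s, lab in equations[node])
    print(f"node {node}: {terms} = 0")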

D-c circuit design. In the case of UNIVAC LARC the optimization of cost consisted mainly of reducing the required Beta of transistors used. There are two criteria for calculating the required Beta: worst case design and statistical design.
In the case of worst case design, the parameters (such as resistances, supply voltages, etc.) are multiplied by a factor which represents the maximum tolerances allowed so that the Beta of the transistor involved becomes minimum.
In statistical circuit design, Monte Carlo or analytical6,7 methods are used to obtain a distribution of the required Beta of the transistors as a function of the distributions of the circuit parameters (resistances, supply voltages, etc.).
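A hedged sketch of the Monte Carlo form of statistical design: component values are drawn from assumed tolerance distributions and the Beta required of the transistor is computed for each sample, giving a distribution rather than a single worst-case figure. The simple saturated-switch model (required Beta = collector current divided by available base drive) and all numerical values are illustrative assumptions, not the Larc circuit.

# Hedged sketch: Monte Carlo estimate of the required-Beta distribution for a
# simple saturated transistor switch.  beta_required = Ic / Ib, with
# Ic = (Vcc - Vce_sat) / Rc and Ib = (Vin - Vbe) / Rb.  All values illustrative.

import random

def required_beta(rc, rb, vcc, vin, vce_sat=0.1, vbe=0.3):
    ic = (vcc - vce_sat) / rc           # collector current to be supported
    ib = (vin - vbe) / rb               # available base drive
    return ic / ib

def monte_carlo(n=10000, seed=1):
    random.seed(seed)
    betas = []
    for _ in range(n):
        rc = random.gauss(1000.0, 1000.0 * 0.05)     # 5% resistor spread (assumed)
        rb = random.gauss(4700.0, 4700.0 * 0.05)
        vcc = random.gauss(3.0, 3.0 * 0.02)          # 2% supply spread (assumed)
        vin = random.gauss(2.6, 2.6 * 0.02)
        betas.append(required_beta(rc, rb, vcc, vin))
    return betas

betas = sorted(monte_carlo())
print("median required Beta :", round(betas[len(betas) // 2], 1))
print("99th percentile      :", round(betas[int(0.99 * len(betas))], 1))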
Optimization of the Circuit for Reduced Delay. Examination of the above circuit equations shows that the number of unknown variables exceeds the number of equations. Therefore there is no unique solution. Generally delay decreases with increase in Beta, although the delay would depend on many other parameters as well. The purpose of the optimization is then to determine a unique circuit having the lowest Beta requirement such that the maximum delay allowed in the circuit specifications is not exceeded.
The transient behavior of the circuit can be determined experimentally, analytically, or through statistical studies.
In the experimental transient study the unknowns in the circuit equations are divided into so-called dependent variables and independent variables. The number of the dependent variables is equal to the number of equations. The determination of optimum values for the independent variables implies a unique solution of the circuit equations which represents the optimized circuit. The problem then is to vary the independent variables and determine experimentally the values corresponding to minimum delay. This can be an iterative process where one of the independent variables is varied while the others are kept constant.8 Figure 1-5 illustrates such an experiment where R3 is varied for a transistor Beta of 9 and the on-base current equals 1.15 ma. The optimum value of R3 is found to be approximately 750 ohms.
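A rough sketch of the iterative, one-variable-at-a-time search just described; the delay function used here is an invented stand-in for the measured or computed circuit delay, so only the procedure, not the numbers, is meaningful.

# Hedged sketch: one-variable-at-a-time minimization of delay, as in the
# experiment where R3 is varied with the other components held constant.
# delay() below is an arbitrary stand-in for the measured/computed delay.

def delay(params):
    r3, r4 = params["R3"], params["R4"]
    # Toy bowl-shaped surface with a minimum near R3 = 750 ohms (illustrative only).
    return 60.0 + 2e-5 * (r3 - 750.0) ** 2 + 1e-5 * (r4 - 1200.0) ** 2

def coordinate_search(params, grids, passes=3):
    """Vary each parameter over its grid in turn, keeping the best value."""
    best = dict(params)
    for _ in range(passes):
        for name, grid in grids.items():
            candidates = [dict(best, **{name: v}) for v in grid]
            best = min(candidates, key=delay)
    return best

start = {"R3": 400.0, "R4": 2000.0}
grids = {"R3": range(300, 2100, 50), "R4": range(800, 2600, 100)}
best = coordinate_search(start, grids)
print(best, "delay =", round(delay(best), 2))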
A large number of circuits have to be computed in the process of optimization using circuit equations modified for worst case design. These circuit equations are found to be nonlinear. Solution of the system of equations by computer is of significant advantage over manual computation, especially when the system of equations is nonlinear.
The theoretical relationship between circuit delay and the several circuit parameters has been found to be extremely unreliable for prediction purposes and was therefore abandoned.

The third approach involves the statistical determination of the relationship between the circuit parameters and the circuit delay. Specifically, the determination of the regression of circuit delay on the transistor parameters has been established. This method is the key to the circuit design procedure used and therefore it is described in considerable detail in Part II of this paper.
A functional relationship, developed statistically, serves two closely related purposes. It
is not only necessary for any work in statistical
circuit design but offers the following advantages:
1) Changes in production control, which, experience indicates, occur frequently, may improve
the parameters of the selected transistors in
some respects and degrade them in others.
Also, with the rapid developments in transistor production, newer, better and less expensive transistors become available. A correlation between transistor parameters and
circuit performances allows the transistor
manufacturer the freedom of changing production controls to improve one parameter at the
expense of others so that transistors improve
in yield and cost. Also a freedom is maintained to purchase transistors from many
sources.
2) The circuit that has been designed for a
particular specification may be useful in
other applications in which a less expensive
transistor would satisfy a functional specification calling, for example, for slower
speed or less gain. The regression of circuit performance on the several component parameters allows such a change to be made without additional design or experimental check.
C. Reduction of Noise and Delay in Backboard Wiring

The transmission delay increases with the
lengths of the wire and distributed capacitances
representing connectors, wires, etc. The noise
pick up (that is, voltages and currents induced
in a wire by pulses in other wires in its proximity) increases with the lengths of the wires but
decreases with the total distributed capacitance
on the wire. The assignment of elements on the
backboard is made to reduce delay and noise pick
up. In Univac Larc, the logical designer decided
where groups of circuit elements were to be placed
on the backboard, based on his familiarity with
general information flow path among organs of the
computer. The circuit elements were then assigned
to specific printed circuit packages. Preliminary wiring procedures were then run on Univac I to
determine whether bad cases, representing wires
exceeding length or capacitance, existed. This
was done on the basis of wire length calculation
between terminals, and calculation of total capacitance represented by connectors and wire lengths.
When bad cases existed, the logical elements were
moved (by decision of the logical designer) in an
attempt to reduce and/or eliminate the bad case

conditions.9 Iterations of this procedure,
partly manual and partly automatic, are continued
until wire lengths and capacitances are reduced to
a tolerable level.

Research on computer placement of circuit elements on the backboard has continued after completion of the layout of Larc-Univac. A suitable algorithm has since been developed for performance of this task.10 The algorithm is capable of minimizing the longest wires on the backboard or the total length of all wires combined.
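The placement algorithm itself is the subject of reference 10; purely as an illustration of the objectives involved (and not a reproduction of that algorithm), the sketch below scores an assignment of elements to backboard positions by total and longest wire length and accepts pairwise interchanges that reduce the chosen score. Positions, nets, and names are invented.

# Hedged sketch (not the algorithm of reference 10): greedy pairwise
# interchange of element positions to reduce either the total wire length or
# the longest single wire.  Positions, nets, and names are invented.

from itertools import combinations

positions = {0: (0, 0), 1: (1, 0), 2: (2, 0), 3: (0, 1), 4: (1, 1), 5: (2, 1)}
nets = [("A", "B"), ("A", "C"), ("B", "D"), ("C", "E"), ("D", "F"), ("E", "F")]
placement = {"A": 0, "B": 5, "C": 1, "D": 2, "E": 4, "F": 3}   # element -> slot

def wire_lengths(placement):
    def dist(p, q):
        (x1, y1), (x2, y2) = positions[p], positions[q]
        return abs(x1 - x2) + abs(y1 - y2)          # Manhattan wiring estimate
    return [dist(placement[a], placement[b]) for a, b in nets]

def improve(placement, objective=sum):             # objective=max minimizes the longest wire
    best = dict(placement)
    improved = True
    while improved:
        improved = False
        for a, b in combinations(best, 2):
            trial = dict(best)
            trial[a], trial[b] = trial[b], trial[a]
            if objective(wire_lengths(trial)) < objective(wire_lengths(best)):
                best, improved = trial, True
    return best

best = improve(placement)                           # minimize total length
print(best, "total =", sum(wire_lengths(best)), "longest =", max(wire_lengths(best)))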


Part II
The engineer who has as his assignment the
design of a transistor circuit to perform according to a predetermined functional specification
has a choice of two courses in designing the circuit and specifying the transistor.
One approach, in general use, is to determine
by measurements the worst parameters of a selected
type of transistor, to employ these parameters as
the limiting criteria in a worst-case design, and
to choose the other components in the circuit to
optimize speed or gain, for example, in a nonrigorous way.
In the second approach, discussed in this paper, the engineer designs a circuit for a typical
transistor which performs to the given functional
specification. The other circuit components are
selected to optimize the operation with this transistor. The dependence of the functional operation of the circuit (for instance, its gain or delay) upon the parameters of the transistor is determined over a wide range of variations of these
parameters through statistical studies. Transistor parameters are then determined for each range
corresponding to the functional specifications.
Information on the dependence or correlation
of parameters is valuable to both the transistor
manufacturer and the circuit designer. One transistor parameter can be improved at the expense of
another so that the transistor improves in production yield, cost, etc., without harmful effect on the operation of the circuit. A change in production control, rather than being harmful, can be helpful. Various types of transistors are candidates for use in the circuit without additional design or experimental work, and the same circuit can be used to satisfy a number of specifications, changing only the type of transistor. The circuit designer can use the same information for statistical circuit design, as opposed to the worst case design, thus effecting additional savings.
The subject approach will be illustrated by
a case history of a circuit design. To relate a
given circuit specification to the transistor
parameters involved, a considerable amount of computation is necessary, which has been carried out
on a UNIVAC I data-processing system.

A. Description of the Circuit

The functional specifications of the circuit to be designed were as follows:

Input voltage: Pulse from -2.9v to -0.3v.
Input current: Pulse from 0 ma to 4.5 ma.
Output voltage: Pulse from -2.9v to 0v.
Output current: Pulse from 0 ma to 52.0 ma.
Maximum output capacitance: 1000 μμf.
Maximum load: 32 standard circuits.
Driving circuit: Flip-flop whose output voltage has exponential rise time to 70% in 40 mμs.

Delay:                  Minimum    Maximum
High-speed range        22 mμs     165 mμs
Medium-speed range      33 mμs     205 mμs
Low-speed range         44 mμs     245 mμs

The circuit configuration chosen is shown in figure 2-1. Delay is measured from the beginning of the clock pulse driving the flip-flop to the beginning of the output of the loading circuits. The delay-measuring circuit is shown in figure 2-2.

Since the minimum delay is not critical in this configuration, design effort continued with the input and loading for maximum delay, as shown in figure 2-3. Measurements were made when the
transistor was turning off, since maximum delay
occurs at that time. A typical transistor was
selected to give a delay in the medium range. The
values of the other components in the circuit that
would minimize delay were then determined experimentally. The Surface Barrier Transistor (SBT)
in the circuit (figure 2-1) has a relatively small
effect on the delay of the entire circuit; therefore, the determination of worst parameters for
the SBT was feasible. This paper will deal with
the regression between the parameters of the second transistor and the performance of the circuit as a whole.
Like the choice of the circuit configuration
(figure 2-1), final determination of the values of
components other than transistors was based upon
other experimental work not relevant to the work
discussed here.
B. Parameters of the Transistors

Four parameters and circuit delays were measured for each of 360 transistors. The six parameters normally specified by the manufacturer are Breakdown voltage, Leakage current, Current gain (β), Rise time (T), Storage time (S), and Peak base-to-emitter voltage (V). The first two, which affect mainly the dc operation of the circuit, have no significant effect on circuit delay. The remaining four parameters, assumed a priori to affect circuit delay significantly, were measured in each transistor. Current gain (β) was measured at


a constant collector current of approximately 100 milliamperes and a collector voltage of -0.6. Because of dc considerations, gain must exceed 30 under these conditions. The circuits shown in figures 2-4, 2-5, and 2-6 were used to measure T, V, and S, respectively. Total circuit delay (δ) was assumed to be a function of V, β, T, and S. In addition, δ was measured for each transistor, with output loading which corresponds to maximum delay (figure 2-3).
The transistors tested were in three groups: 236 transistors of type GT762 (taken from two production runs), 99 transistors of type CK762, and 25 transistors of type TA1830.
No theoretical relationship among the measured parameters was assumed. Each measurement was performed twice. Transistors whose values did not check within the accuracy limits of the measuring device were eliminated from further consideration, but the number of these was negligible.
C. Statistical Studies

Interdependence studies between the parameters and delay values were undertaken first. The tools of regression analysis1 were used to ascertain whether, in general, circuit delay can be predicted from known parameters of a transistor. The second step was the establishing of a functional relationship between circuit delay and the several known parameters. This function formed the basis for the successful determination of transistor parameters for the delay ranges.
Studies in Regression Analysis. To investigate whether there is any direct relationship between circuit delay (δ) and any of the four transistor parameters listed, the measured value of circuit delay for 236 transistors, assuming these to be a representative sample of the population of all GT762 transistors, was plotted against each of the parameters. It was assumed that the regression of δ on each of the parameters V, β, T, and S is linear. The scatter diagrams of δ versus each of the transistor parameters and the fitting of linear regression equations are given in figures 2-7 thru 2-10. The mean δ values are connected in a line of best fit, shown as a light line; the line of regression, shown as a heavy line, is defined as the linear function of the form y = mx + b which fits the means of arrays best, in the least squares sense. The fitted linear regression equations are given below:

δ = -33.4V + 194
δ = -0.0812β + 197
δ = 0.577T + 146.5
δ = 12.45S + 171

By using t tests2, it was found that the regression between δ and V is not statistically significant but the regressions of δ on the other parameters are highly significant, i.e., at the 1% level. There is a definite indirect relationship, then, among β, T, and S. The indirect relationships are obtained by using the last three of the above equations. Though the estimated coefficient of regression between δ and V is greater than the corresponding coefficient between δ and any other parameter, it is not statistically significant since the estimated variance of this coefficient is very high. Further investigation made to determine whether any direct relationship between the parameters V, β, T and S existed, indicated a strong direct relationship between both V and T and also between T and β.
Delay as a Function of Sixteen Expressions. A UNIVAC program was used to find the linear fit and regression coefficients between δ and the following 16 parameter expressions:

V, β, T, S, 1/β, T/β, T²/β, T/β², V/β, VT, TS, S/β, T², 1/β², 1/S, VS

It was assumed that the coefficients of regression between δ and other parameter expressions were not significant. The program revealed highest positive regression between δ and the terms T, T², T²/β, VT, S/β, and TS. These six parameter combinations were chosen for a function with linear constants as follows:

δ = K1T + K2T² + K3VT + K4T²/β + K5S/β + K6TS + K7          (1)

A second program was subsequently written to apply the least-squares fit criterion to the 234 sets of transistor data for the given equation. The normalized equations (seven equations, seven unknowns) of the fit were solved by the Crout Method3. A third program was developed to test the curve fit; that is, to compare the calculated δ with the observed and to determine the individual term contributions. These programs revealed that the K5S/β and K6TS terms contributed little to the value and could be dropped, thus simplifying the function to the following:

δ = K1T + K2T² + K3VT + K4T²/β + K7                          (2)

where K1 = 1.666, K2 = 0.001, K3 = -2.717, K4 = -0.175, and K7 = 129.135.

A relatively simple evaluation of the normality of the distribution of errors based on this regression equation is indicated in figure 2-11. The cumulative distribution of errors would appear perfectly linear in the representation of a normally distributed population.
Discussion of Accuracy of Prediction Using the Function. A lot of 99 type GT762 transistors from a later shipment was measured to determine the applicability of the derived functions, equations (1) and (2). The distribution of errors between the equation prediction (2) and the observed values of delay appeared normal, with a mean of 8%. A change in the constant term or inclusion of dependence on S would correct the function as applied to this particular group and shift the mean to zero.
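As a sketch of how equation (2) can be applied, and how a least-squares fit of the same form can be reproduced today, the following fragment predicts δ from measured V, β, and T using the published constants and then refits the four-term model to synthetic data by ordinary least squares (the 1961 programs solved the equivalent normal equations by the Crout method); the sample measurements are invented for illustration.

# Hedged sketch: apply and refit the reduced delay function of equation (2),
# delta = K1*T + K2*T**2 + K3*V*T + K4*T**2/beta + K7, using the published
# constants.  The sample measurements below are invented.

import numpy as np

K1, K2, K3, K4, K7 = 1.666, 0.001, -2.717, -0.175, 129.135

def predicted_delay(V, beta, T):
    return K1 * T + K2 * T**2 + K3 * V * T + K4 * T**2 / beta + K7

print(round(predicted_delay(V=0.27, beta=150, T=80.0), 1), "millimicroseconds")

# Refitting the same four-term model to data amounts to solving the
# least-squares normal equations; numpy's lstsq does the equivalent.
rng = np.random.default_rng(0)
V = rng.uniform(0.20, 0.35, size=200)
beta = rng.uniform(60, 300, size=200)
T = rng.uniform(20, 140, size=200)
delta = predicted_delay(V, beta, T) + rng.normal(0, 3, size=200)   # synthetic "observed" delays

X = np.column_stack([T, T**2, V * T, T**2 / beta, np.ones_like(T)])
coeffs, *_ = np.linalg.lstsq(X, delta, rcond=None)
print("refitted K1, K2, K3, K4, K7:", np.round(coeffs, 3))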
D. Range Determination

Since the circuit under design specifies use in one of three delay ranges, rather than a specific delay, a method for classifying transistors into the three ranges according to known parameters would serve the purpose. The method capitalizes on the relationship established in the search for a predictive function.
The 236 units first investigated were plotted on a T-ordinate, β-abscissa graph, and labeled with their observed δ values. Arbitrary δ ranges were found to separate themselves fairly well into various regions of the plot; rough borders were sketched between regions following the best range separations. These T(β) curves descended exponentially at low β values, and leveled off horizontally as β increased, suggesting the functional relationship:

T = K1 + K2e^(-K3β)                                          (4)

The δ-labeled points were separated into the three designated groups: 0-155 mμs, 156-195 mμs, and 196-234 mμs; and the two borders were added. A program was devised to fit the border β, T data to the suggested functional expression, yielding the constants K1, K2, K3 for each curve. The smooth exponential decay curves were drawn in to separate the data. The results, for the first lot of 234 type GT762 transistors, are described as follows:
1) In the high range (196 ≤ δ ≤ 235), eight units out of 52 occurred which did not belong. Their values were 180, 180, 186, 192, 192, 192, 192, 194 mμs. Thus there were only two outside of the tolerance criterion, ±10 mμs. This tolerance was selected arbitrarily by adding the measurement tolerances of T and S, each ±5 mμs.
2) In the medium range (156 ≤ δ ≤ 195) nine units out of 179 occurred which did not belong. One unit was below (152), and eight units were above (196, 196, 196, 198, 198, 198, 200, 200). None of these was outside the ±10 mμs tolerance region.
3) The low range (δ ≤ 155) contained only two units, both of which were correctly placed.
4) The two border equations are as follows:

T = 30.97 + 60.58e^(-0.0157β) at δ = 155                     (5)
T = 62.18 + 112.41e^(-0.0166β) at δ = 195                    (6)

The results suggest a very accurate separation.
The lowest region, where there was insufficient
data available, was checked on another set of
transistors. The results are discussed below.
Figure 2-12 is a graph that can be used to sort transistors by β and T measurements. Once the measurements for each transistor are made, the β-T point on the graph establishes the delay range of the unit.
of the unit.
Discussion of Accuracy of Prediction Using Ranges Determined. With the method just indicated, using the transistor delay-range chart with the originally derived borders (figure 2-12), the new lot of 99 type GT762 transistors was plotted.
There were 28 transistors in the high range (196 ≤ δ ≤ 235).
In the medium range (156 ≤ δ ≤ 195), 69 units occurred, of which 11 did not belong. Nevertheless, of these 11, ten units were acceptable under the tolerance limits, indicating only one misplaced.
In the low delay range (δ ≤ 155), only two transistors occurred, both of which were correctly placed.
A linear shift in the borders of the δ ranges would take care of the errors of misplacement. These results are strongly indicative that new lots of the transistor have some property changes that can affect our application, unless additional parameters such as storage time (S) are considered.
An excellent prediction for the TA1830 data
was achieved by the transistor delay-range chart
(figure 2-12). Of the 25 units tested, 22 fell
within the predicted range and 3 were borderline.
The borderline cases were so close that, within
tolerance limits, they could be placed in the
correct categories.
In contrast to the broad range of delay values in the original 236 type GT762 transistors, these RCA TA1830 units were mostly confined to the lowest delay range.
Acknowledgements
The authors gratefully acknowledge the
assistance of the many groups and departments of
Remington Rand Univac who contributed to the
success of this paper. Special acknowledgement
is given to P. Krishnaiah and P. Steinberg.


References

1. Ezekiel, Mordecai. Methods of Correlation Analysis. 2nd ed. New York: Wiley, 1941.
2. Johnson, P. O. Statistical Methods in Research, Chap. V. Prentice Hall, Inc., 1944.
3. Hildebrand, F. B. Introduction to Numerical Analysis, p. 429. McGraw Hill, 1956.
4. Gray, Harry J., Jr. An Application of Piecewise Approximations to Reliability and Statistical Design. Proceedings of the IRE, July 1959.
5. Remington Rand Univac, Division of Sperry Rand Corporation. Statistical Techniques in Transistor Evaluation, Final Report. Dept. of the Navy, Bureau of Ships NObs 72382. Applied Math Department: Remington Rand Univac, Philadelphia, Pa., April 1959.
6. Senner, A. H., and Meredith, B. Designing Reliability into Electronic Circuits. Proc. Nat'l Electronics Conf., Vol. 10, pp. 137-145, Oct. 1954.
7. Gray, H. J., Jr. An Application of Piecewise Approximations to Reliability and Statistical Design. Proc. of the IRE, Vol. 47, No. 7, pp. 1226-1231, July 1957.
8. Remington Rand Univac, Division of Sperry Rand Corporation. Univac Larc Highspeed Circuitry Case History in Circuit Optimization. Prywes, N. S., Lukoff, N., and Schwartz, J. Remington Rand Univac, Philadelphia, Pa.
9. Remington Rand Univac, Division of Sperry Rand Corporation. The Univac Prepared Engineering Document Program. Williams, T. Remington Rand Univac, Philadelphia, Pa.
10. Remington Rand Univac, Division of Sperry Rand Corporation. The Backboard Wiring Problem: A Placement Algorithm. Steinberg, L. Remington Rand Univac, Philadelphia, Pa.

Figure 1-1. PUNCH-THROUGH VOLTAGE SLOPE PREDICTION
Figure 1-2. PUNCH-THROUGH VOLTAGE ACCELERATED LIFE TEST
Figure 1-3. SAMPLE CIRCUIT FOR FAULT DIAGNOSIS
Figure 1-4. VOLTAGE-CURRENT CHARACTERISTICS OF A DIODE, AND BASE AND COLLECTOR OF A TRANSISTOR
Figure 1-5. DELAY AS A FUNCTION OF 1/R3
Figure 2-1. SCHEMATIC DIAGRAM OF CIRCUIT
Figure 2-2. DELAY MEASUREMENT
Figure 2-3. INPUT AND LOADING FOR MAXIMUM DELAY
Figure 2-4. MEASUREMENT OF RISE TIME
Figure 2-5. MEASUREMENT OF PEAK BASE VOLTAGE
Figure 2-6. MEASUREMENT OF STORAGE TIME
Figure 2-7. SCATTER DIAGRAM AND REGRESSION LINE FOR CIRCUIT DELAY AS A FUNCTION OF BASE-TO-EMITTER VOLTAGE (δ = -33.4V + 194, r = -0.0495)
Figure 2-8. SCATTER DIAGRAM AND REGRESSION LINE FOR CIRCUIT DELAY AS A FUNCTION OF BETA (δ = -0.0812β + 197, r = -0.393)
Figure 2-9. SCATTER DIAGRAM AND REGRESSION LINE FOR CIRCUIT DELAY AS A FUNCTION OF TRANSISTOR DELAY (δ = 0.577T + 146.5, r = 0.7895)
Figure 2-10. SCATTER DIAGRAM AND REGRESSION LINE FOR CIRCUIT DELAY AS A FUNCTION OF STORAGE FACTOR (δ = 12.45S + 171)
Figure 2-11. CIRCUIT DELAY PREDICTION, 234 GT762 TRANSISTORS: TEST FOR NORMALITY OF ERROR DISTRIBUTION
Figure 2-12. CIRCUIT DELAY RANGES

WIDE TEMPERATURE RANGE COINCIDENT CURRENT CORE MEMORIES
R. S. Weisz and M. Rosenberg
Ampex Computer Products Company
Culver City, California

A desired extension of man's intellect is to
extremes of temperature where the human brain
does not function effectively. Both military
and industrial applications call for digital
computers which can operate from polar cold to
equatorial heat. In the space age, these limits
must be broadened still further. Unfortunately,
core memories are generally somewhat less tolerant of temperature changes than are human beings.
Chiefly at fault is the ferrite core itself,
because of its high temperature coefficient of
coercivity, 0.5 to 0.7% per degree Centigrade.
Nevertheless, since ferrite cores have proven
to be highly reliable, reasonably fast, light in
weight, and modest in power requirements, some
means of overcoming the high temperature coefficient is needed to advance the art of computing.
A number of such schemes have been proposed.
They include: (1) Compensating the drive circuits for the decrease in required current with
rising temperature; (2) putting the memory in a
thermostatically controlled chamber above the
highest ambient anticipated; (3) using multiapertured ferrite cores to provide a wide margin
of current tolerance; (4) using a "word select"
or linear select mode of address; (5) using
metal alloys which have inherently lower temperature coefficients. Each of these methods may
have some merit; certainly all have serious
limitations.
Clearly, what one would like is a ferrite
core having temperature independent properties.
We have recently developed such a core. Aside
from a low temperature coefficient of coercivity, the new core has a better disturb ratio than that of standard ferrite memory cores over the range -55° to +100°C. Valuable for other purposes as well, a high disturb ratio also aids
in decreasing temperature sensitivity. An array
of 1092 bits has been built from these cores
using a simple toroidal geometry. It has been
successfully operated over the range -55° to
+125°C in a coincident current mode without
temperature or current compensation.
In the remainder of this paper, we shall discuss in further detail the problem of wide
temperature memory operation, the prior art on
the subject, the new ferrite's characteristics,
and the new memory's operation.

Problem of Wide Temperature Operation
To match commonly called for military specifications, we shall discuss memory operation over
the range -55° to +125°C. It is important to
note that there already exist components other
than memory cores which can be operated over
this same range. For example, transistors,
diodes, resistors and capacitors, all necessary
in memory circuits, are available.
A fundamental problem involved in operating
a memory core over a wide temperature range will
be discussed in reference to Figure 1, which is
a superposition of hysteresis loops taken at
-55°, +25°, and +100°C for a conventional memory core. It is readily seen that a current
which is sufficient to switch the core at -55°
is more than twice the required current at 25°C.
Therefore, with the usual coincident current
scheme, if half such a current is applied at
room temperature, it will completely switch
what are intended to be half-selected (unswitched)
cores. Conversely, at low temperatures of
operation, it is impossible to switch a core
using the smaller current required at higher
temperatures.
In any usable memory core, the ratio of the
threshold to the full drive (or disturb ratio,
Rd, as it is commonly called) is actually greater than 0.5. The resulting current tolerance can be used to provide a margin against
drift in the drive circuits or changes in temperature. Conventional type memory cores are
known with Rd as high as 0.67. If no current
drift need be provided for, such a core can be
operated over a range of approximately 50°C.
A more realistic view is to allow for a drift
of ±10% in drive current leaving an operating
range of only 20°C.
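As a rough check on the figures quoted above, the sketch below computes the temperature spread over which a single fixed (and possibly drifting) drive current both switches the coldest core and leaves half-selected cores undisturbed at the hottest temperature, given the disturb ratio and the temperature coefficient of coercivity; the linear coercivity model is an assumption made for the illustration.

# Hedged sketch: temperature range over which a fixed coincident-current drive
# works, assuming the required switching current falls linearly with
# temperature at `coeff` per degree C and the disturb threshold is Rd times
# the required current.  Conditions: full drive switches the coldest core,
# half drive never disturbs the hottest core.

def operating_range(rd, coeff=0.005, drift=0.0, step=0.1):
    """Return the largest spread (deg C) satisfying both margin conditions."""
    spread = 0.0
    while True:
        ratio = 1.0 / (1.0 - coeff * (spread + step))   # I_req(cold) / I_req(hot)
        # Worst case: drive low by `drift` when switching, high by `drift` when disturbing.
        if (1.0 - drift) < ratio * 0.5 * (1.0 + drift) / rd:
            return round(spread, 1)
        spread += step

# Compare with the ~50 deg C and ~20 deg C figures quoted in the text.
print("Rd = 0.67, no drift :", operating_range(0.67), "deg C")
print("Rd = 0.67, +/-10%   :", operating_range(0.67, drift=0.10), "deg C")
print("Rd = 0.76, no drift :", operating_range(0.76), "deg C")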
One of the first methods suggested for extending the temperature range of operation to
the desired 180°C was to provide compensation
in the drive circuitry to decrease the current
with increasing temperature so as to match the
decreasing coercive force. Over a large temperature range a close match is difficult, but
perhaps possible. A more serious problem,
however, is also present. Those cores which
are being switched during a temperature change
will obviously settle on the appropriate new
hysteresis loop as shown in Figure 1. But what
happens to a non-selected core? In the absence


of an applied field, it is believed that it
remains at or close to its original remanent
point. Thus, at the end of the temperature
change there will be cores on two different
hysteresis loops. It follows that spurious
signals would then be obtained upon interrogating the memory. Despite this limitation,
current compensation is successfully used over
a narrower temperature range providing operation
from 0° to 60°C.

Another approach that avoids the above problems completely is to place the memory in a container which is kept at a controlled temperature above the highest anticipated.1 Although this scheme has the merit of simplicity, it sacrifices some reliability due to the possibility of failure in the heater circuits. There is a further limitation inherent in the power requirement of the heaters. For example, to heat a memory of 3200 bits from -55° to +125°C, 10 watts are required.1 A power level of this magnitude is far beyond the supply available in satellites now and in the predictable future; it is also an order of magnitude above the power requirements of the memory itself.
Still another partially successful approach is to use multi-apertured cores or "transfluxors."2 Here one makes use of the geometry to provide a wider current tolerance than is possible with simple cores. Memories of this type have been operated over the range -20° to +100°C with non-destructive readout. The principal disadvantages of transfluxor memories are: (1) their complicated wiring, and (2) increased use of semi-conductors. The cores are also much larger than simple toroids.
Another approach has been to use ferrite cores in a word select mode of operation.3 This has the disadvantage of using a greater number of semi-conductors, resulting in decreased reliability and increased costs.
Finally, it has been proposed to abandon the square loop ferrites entirely and substitute metals or alloys in various configurations.4,5 Permalloy and other nickel-iron alloys have very low temperature coefficients of coercivity. The principal disadvantage is their high electrical conductivity. One is forced to use very thin film to avoid eddy-current losses and as a result, two new problems arise. First, thin films are not well suited to wide temperature cycling because a difference in coefficient of expansion between the film and substrate causes cumulative stresses to be set up. Second, metallic films are prone to destructive oxidation at elevated temperatures, especially in the presence of moisture. None of the thin metal film devices has yet found wide acceptance. It is believed that uniformity of individual storage elements has been a more serious problem here than with discrete ferrite toroids. Furthermore, these elements have to date been operated only in word select type memories. This results once again in the use of a greater number of semi-conductors than would be used in an equivalent size coincident current memory.

New Ferrite Cores

Since none of the previously proposed methods
of wide temperature memory operation seemed
to be free of serious limitations, we embarked
on a program to develop a square loop ferrite
with a low temperature coefficient of coercivity.
A family of such materials has now been found.
Fortuitously, some of these also have a disturb ratio higher than conventional cores over the range -55° to +100°C. Details of the chemistry and physics of the new core are outside the scope of this article, but will be given in another publication.6 A particular core was
chosen for the memory to be described on the
basis of optimum squareness, temperature coefficient, and speed. The dimensions of the toroid
are standard: 50 mils O.D. x 30 mils I.D. x
15 mils thick.
Important characteristics of the cores are
given in Figures 2 through 4. Figure 2 shows
a superposition of 71 kc hysteresis loops of
the wide temperature range core taken at -55°,
+25°, and +100°C. For comparison purposes,
Figure 1 shows the corresponding loops of a
conventional magnesium-manganese ferrite core
with the lowest temperature coefficient we have
found among the presently used materials. One
will observe that the new core has a much lower
temperature coefficient of coercive force
(approximately 0.13% per degree Centigrade as
compared with 0.5% per degree Centigrade for
the magnesium-manganese ferrite). The usual
decrease in saturation flux density with increasing temperature is also much lower for
the new core. Undoubtedly, this is connected
with a high Curie temperature (greater than
500°C vs. 300°C or less for the magnesium-manganese ferrites).
More striking, and more to the point, are
the pulse characteristics of the new core.
Figure 3 epitomizes this data, giving the
following characteristics as a function of
temperature for a constant drive of 1.0 ampere
turn: Switching time, ts, and peaking time, tp, in microseconds; undisturbed output uV1, and noise, dVz, in millivolts; disturb ratio, Rd.
This data was obtained with a pulse rise time,


tr, of 0.2 microseconds, a pulse width, td, of
10 microseconds, and 20 repeats of the disturbing current, whose magnitude was 0.5 ampere
turn. The most striking feature is simply that
a core can be operated over the range -55° to
+100°C with a constant current. As stated previously, conventional cores at a constant drive
cannot be operated over more than a 50°C spread.
Also important is the disturb ratio which is
greater than 0.69 up to 100°C. Conventional
cores have Rd up to 0.67. With Rd equal to 0.76
at room temperature, it is possible to operate
the new core easily on a triple coincidence
scheme. Although this possibility cannot be
exploited simultaneously with the wide temperature operation, it is a promising lead for future
work.
Figure 4 gives more detailed information on
the core characteristics as a function of
variable drive current. Other parameters of
the drive pulses were made the same as for
Figure 3. It should be stated at this time
that these figures show tentative values for
early cores. Recent improvements have lowered
the noise voltage from that shown with no
deleterious effects on other parameters.
Signal-to-noise ratios of 10:1 without time
strobing are now consistently obtained at room
temperature under normal drive conditions.
Since high reliability is one of the chief
attributes of ferrite memory cores, we felt
that a life test of the new type at an elevated
temperature was essential. In one such test,
cores have been aged in air at 100°C for over
eight months. Periodic test has shown no significant changes in pulse properties. In
another test, cores were aged in a vacuum at
10^-4 millimeters of Hg at 100°C for three months.
At the end of this period, the samples were
returned to room temperature conditions and retested. Again, no significant change was
observed.
Besides the 50-mil diameter toroids, sample
quantities of a 30-mil toroid, an 80-mil toroid,
and a multi-apertured structure have been made
of the new material. All have shown similar
temperature characteristics.
Memory Characteristics
Having established the unusual temperature
characteristics of the new core, system evaluation was carried out with the construction and
test of a model memory array. It was felt that
this-was an essential part of the evaluation
since, at times, components which have appeared
to be usable in individual tests have failed
when operated in systems. Memory elements in
particular are likely to show interactions,
delta noise problems, etc.

A standard type of coincident current array
with 40 x 30 cores was constructed for the test.
With a constant, uncompensated drive, the array
was successfully operated in a temperature test
chamber from -55° to +125°C. Figure 5 shows
the direct output from the sense winding under
worst pattern conditions at +lOOoC; Figure 6
shows the direct output from the sense windLng
at -55°C. The pulse conditions in both cases
are as follows:
Pulse width = 3
Pulse rise time
Repetition rate

~s
0.75~.

= 50

kc

The signal-to-noise ratio at peaking time of the "one" signal was greater than 50:1 in both cases. The variation in sense amplitude output over the temperature range is less than 2:1 (90 millivolts to 60 millivolts). This change is well within the dynamic range of a properly designed sense amplifier, particularly in view of the excellent signal-to-noise ratio. Since the cores nominally switch in a microsecond, they can be used in a 5-6 microsecond memory system.
During test, the array was operated in a
3:2 selection mode by using 0.667 NI in the X
lines and 0.333 NI in the Y lines. At 25°C
and worst pattern conditions, a signal-to-noise
ratio of better than 10:1 at peaking time was
obtained. This simple experiment indicates
that it is possible to operate in a true three-dimensional selection system. Further work is
planned along these lines.
Conclusions
It is believed that the described array is
a prototype for the first true wide temperature
range memory. The well-known resistance of
ferrites to oxidation, corrosion, and radiation
damage can also be used to advantage in adverse
environments. At the present time, a memory
for a satellite is being built with the new
type core. This memory has approximately one
thousand bits operating in a coincident current
mode over a temperature range of -55° to +100°C
without compensation of current or temperature.
Other interesting properties and applications
of the new memory core will be discussed in
later papers.


References

1. Miniature Memory Plane for Extreme Environmental Conditions, R. Straley, A. Heuer, B. Kane, and G. Tkach. Journal of Applied Physics, vol. 31, April 1960, pp. 126s-128s.
2. Temperature Characteristics of the Transfluxor, H. W. Abbott and J. J. Suran. IRE Transactions on Electron Devices, vol. ED-4, April 1957, pp. 113-119.
3. Design of a Reliable High Speed Militarized Core Memory, M. Stern and H. Ullman. Program of the Winter Conference on Military Electronics (IRE), Los Angeles, Feb. 1, 1961. (in abstract form)
4. Recent Advances in Magnetic Devices for Computers, D. H. Looney. Journal of Applied Physics, vol. 30, April 1959, pp. 38s-42s.
5. Millimicrosecond Magnetic Switching and Storage Element, D. A. Meier. Journal of Applied Physics, vol. 30, April 1959, pp. 45s-46s.
6. Square Loop Ferrites With Temperature Independent Properties and Improved Disturb Ratio, R. S. Weisz. Accepted for publication in Journal of Applied Physics.

Captions

Figure 1 - Hysteresis Loops of Standard One Microsecond Memory Core at -55°, +25°, and +100°C. Vertical Scale = 680 gauss/div. Horizontal Scale = 0.78 oe/div.
Figure 2 - Hysteresis Loops of Wide Temperature Range Core at -55°C, +25°C, and +100°C. Vertical Scale = 940 gauss/div. Horizontal Scale = 1.4 oe/div.
Figure 3 - Pulse Characteristics of Wide Temperature Range Core as a Function of Temperature at a Constant Drive of 1.0 Ampere Turn.
Figure 4 - Pulse Characteristics of Wide Temperature Range Core as a Function of Drive.
Figure 5 - Array Output at 100°C. Vertical Scale = 50 mv/div. Horizontal Scale = 0.5 μs/div.
Figure 6 - Array Output at -55°C. Vertical Scale = 50 mv/div. Horizontal Scale = 0.5 μs/div.


DESCRIPTIVE LANGUAGES AND PROBLEM SOLVING
Marvin Minsky
Dept. of Mathematics and Computation Center
Massachusetts Institute of Technology

Advances in machine problem solving
may depend on use of internal languages
for description and abstraction of the
outcomes of experiments. As more complex
problems are attempted there will have
to be less trial and error and more
systematic analysis of the results of
each trial. Learning on the basis of
experience will require a phase of
refinement in which the machine will
attempt, by analysis and inductive inference, to get as much as possible from
each experiment.
Introduction
Work on artificial intelligence is
proceeding at a slow, apparently steady,
rate. The complexity of problems being
attacked is growing slowly, as is the
complexity of the successful programs
themselves. In the past it seems to
have taken two or three years for each
significant advance and one may ask why
progress is so slow. Much of this time
has been spent on the development of
programming languages and systems
suitable for the symbol manipulation
processes involved. But much of the
difficulty has been conceptual also.
The methods which worked quite well on
easy problems did not extend smoothly
to the difficult ones. Continued progress
will require implementation of new ideas,
for there are some very tough problems
in our immediate path. It seems to us
that solution of these problems will
require the use of non-trivial formal
and descriptive language systems. These
are only beginning to appear as a working
part of the problem solving machinery
and it will take much ingenuity to
bring current notions into usable form.
The two papers of this session
represent important, and very different,
phases in the development of machine-usable language systems. In one case
we have a system which can process, to
find the meaning with respect to a small
universe, expressions in a very lifelike fragment of ordinary language.
In the second paper we find an
ambitious attempt at the beginnings of a
Theory of Computation, based in part on
the use of a symbol manipulation language
suited at once for both theoretical
analysis and for practical programming use.

Our purpose here is to indicate a
few of the considerations that seem to
point toward the incorporation of
complex linguistic processes into the
next generation of heuristic programs.
Some of these difficulties have arisen in the author's work, jointly with McCarthy, on the Advice-Taker system.1 In a recent paper2 the author discussed
the principles and mechanisms of a
variety of problem solving systems, but
did not dwell on the question of
extending these to really complex problems.
We assume the terminology of that paper.
When one attempts to apply the
techniques described there one discovers
that
1. The search problems become very
serious. One is faced not only with
greatly enlarged problem trees but also
with a greater variety of plausible
methods.
2. The problem of learning from
experience becomes qualitatively more
difficult. To learn the lesson of a
complex experience requires shrewd,
deliberate, analysis that cannot be
approximated by any of the simple
learning models based on averaging or on
correlation.

3. The classification and pattern
recognition methods must be on a descriptive level. Again, correlation or
matching methods must be replaced by
more sophisticated symbol-manipulation
processes.
4. Planning methods, Character
and Difference algebras, etc., threaten
to collapse when the fixed sets of categories adequate for simple problems have
to be replaced by the expressions of
a descriptive language. The use of lookup tables for choosing methods will have
to be supplemented by something more
like reasoning.
When we call for the use of "reasoning" we intend no suggestion of
giving up the game by invoking an
intelligent subroutine. The program
that administers the search will be just
another heuristic program. Almost
certainly it will be composed largely of
the same sorts of objects and processes
that will comprise the subject-domain


programs. Almost certainly it will be
recursively applied to itself so that
the system can be finite. But it does
seem clear that the basic (non-recursive)
part of the structure will have to be
more complex than is any current system.
The Need for Analysis
The simplest problems, e.g., playing
tic-tac-toe or proving the very simplest
theorems of logic, can be solved by
simple recursive application of all the
available transformations to all the
situations that occur, dealing with subproblems in the order of their generation. This becomes impractical in more
complex problems as the search space
grows larger and each trial becomes more
expensive in time and effort. One can
no longer afford a policy of simply
leaving one unsuccessful attempt to go
on to another. For each attempt on a
difficult problem will involve so much
effort that one must be quite sure that,
whatever the outcome, the effort will
not be wasted entirely. One must become selective to the point that no trial
is made without a compelling reason;
just as in any research, expensive
experiments must be carefully designed.
One must do a good deal of criticism and
analysis between experiments so that
each will be a critical test of a
significant portion of the search space.
The ability to solve a difficult
problem hinges on the ability to split
or transform it into problems of a lower
order of difficulty. To do this, without total reliance on luck, requires some
understanding of the situation. One must
be able to deduce, or guess, enough of
the consequences of the problem statement
to be able to set up simpler models of
the problem situation. The models must
have enough structure to make it likely
that there will be a way to extend their
solutions to the original problem.
The construction of less difficult
subproblems will be useful, by definition,
only if one has already a very good chance
of solving them efficiently. Otherwise
the search tree will grow beyond bounds.
This means we must have already built
up adequate solution methods for the
lower order problems, e.g., as a set of
more or less packaged subroutines. This
entails some formidable requirements:
Training Sequences
The machine is presumed to have
acquired its good subroutines through
earlier solution of less complex
problems. (We are not interested here

in the case in which these methods are
provided at the start.) Thus the machine
must have been exposed to a graded sequence
of problems. To be sure, given time limits, a machine will select a graded
subsequence from an unorganized variety
of problems. But a careful arrangement
will be necessary to insure that methods
learned in the problems that the machine
does manage to solve will be useful on
more difficult problems met later. In
any case one cannot rely on making large
jumps, either in machines or in humans.
Refinement Phase
Solving simpler problems is not
enough. To make progress one needs also
to "package" the successful method for
effective later use. We are not interested in the trivial case of recognizing
a problem once before solved, though this
can be difficult enough when there is
some disguise. The success must be
generalized to cover a substantial
variety of situations. To do this it
would seem that there should be a phase
of exploration and consolidation in
which the successful method is refined--its central innovation (if any) isolated
and packaged, in terms as general as
possible. One must explore its range of
application and construct an expression
describing this range. This may involve
inventing similar problems on which the
method, or close variant, works; then
constructing a plausible generalization.

Certainly people must go through
such phases. One cannot usually solve
hard problems with once-used but still
unfamiliar methods. One must first
"understand" the methods quite well; this
means becoming able to recognize situations in which they are applicable. It
is probably misleading to think of this
as "practice"--acquisition of facility
through repetition. Exercise in, e.g.,
mathematical technique is probably very
different from exercise in weight-lifting.
Its effect is not so much in reinforcing
methods, or paths already weakly laid
down, but is rather to provide the necessary data for some Inductive Inference
technique. The latter will replace the
special method by one of somewhat greater
generality.
Failure of the refinement phase to
yield a precise, abstractly stated conclusion can be concealed to a point.
One often encounters mathematical
situations in which one can answer
particular questions quickly, yet is
unable to state a satisfactory formal
generalization. This can happen through
the assembly of a set of different models


or examples which, as a group, show most
or all of the features of the unformulated
general theorem. One can answer some
question in the negative, by finding
inconsistency with an example. Consistency with all leads one to the affirmative. Often the examples themselves
are not formulated clearly, or completely
consciously. In such cases one will
find some statements seem "obvious" yet
(because of the incomplete understanding
which precludes giving any precise
explanation) are also felt to be
"intuitive." An incomplete formalization
or conceptualization, e.g., such a set
of examples, can be very powerful when
used at or near the top level. But if
not understood or "packaged" it could
become a serious nuisance later when,
because of its informality, it cannot
be used in deduction or in the construction of further abstractions.

Coding and Retrieval Problems
The compact representation of
results of previous experience requires
an adequate descriptive language. This
language must permit general statements
about both problem-domain matters and
about the problem-solving methods. It
must permit logical deductions to be
made. This raises several problems.
One problem that has been a great
nuisance to us arises in connection with
non-mathematical problems in which actions
affect the state of some subject domain.
Thus a move affects the positions
of pieces in a board game. When this
happens, some statements formerly deduced
about the situation cease to be true.
(In a mathematical domain a theorem, once
proved, remains true when one proves
other theorems!) One must then deduce
all the consequences of an action in so
far as it affects propositions that one
is planning to use. This might be done
through some heuristic technique which
can assess relevancy, or it could be done
through a logic which takes such consequences into account. The trouble
with the latter is that the antecedents
of all the propositions must contain a
condition about the state of the system,
and for complex systems this becomes
overwhelmingly cumbersome. Other
systematic solutions to the problem seem
about equally repellent. It is a problem
that seems urgently to require a heuristic
solution.
Our present proposal on this matter
is to make the system plan ahead. Whenever an important deduction is made, the
system is to try to discover which kinds
of actions could affect its validity.

Independent monitors are then set up to
detect when such actions are proposed.
The normal problem solving exploration
process proceeds independently of these
monitors, and is interrupted when one
of them detects a threat to the proposition it is defending. This model has
a certain introspectively attractive
character; it suggests a free conscious
exploration with more or less subconscious
trouble-detectors. Unfortunately, its
essentially parallel nature threatens
to make its use in serial computer programming rather expensive. We hope
someone will come up with a better idea.
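The following fragment is a rough illustration of this monitor scheme, not anything from the paper; it is written in present-day Python, and every name in it is invented for the sketch.

# A minimal sketch, assuming deductions and actions are plain strings and that
# each monitor is just a predicate saying whether a proposed action threatens
# the proposition it defends.
class Monitor:
    def __init__(self, proposition, threatens):
        self.proposition = proposition   # the deduction being defended
        self.threatens = threatens       # predicate over proposed actions

monitors = []

def deduce(proposition, threatens):
    """Record an important deduction and set up an independent monitor for it."""
    monitors.append(Monitor(proposition, threatens))

def propose(action):
    """Normal exploration proposes an action; a triggered monitor interrupts it."""
    triggered = [m.proposition for m in monitors if m.threatens(action)]
    if triggered:
        print("interrupted:", action, "threatens", triggered)
        return False
    return True

deduce("rook guards the king file", lambda act: act.startswith("move rook"))
propose("move pawn a2-a3")   # proceeds
propose("move rook h1-h5")   # interrupted by the monitor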
In any case, the retrieval problem
has to be faced. The problem of making
useful deductions from a large body of
statements (e.g., about the relevance
of different methods to different kinds
of problems) raises a new search problem.
One must restrict the logical exploration
to data likely to be relevant to the
current problem. This selection function
could hardly be completely built-in at
the start. It must develop along with
other data accumulated by experience.
Another rather serious problem
centers around the problem of abbreviations,
or proper names. The language must be
used together with an abbreviative
technique so that the most useful notions
can be designated by reasonably convenient
(short) representations. This is not
only a matter of convenience and compactness; it is a more or less inescapable
requirement of known inductive inference
techniques and thus requisite for formation of hypotheses or generalizations.
Unfortunately an abbreviation cannot
show all of the structure of the longer
expression it designates. This seriously
limits the possibilities of making formal
logical deductions. Ultimately the
machines will have to use mnemonic codings
in their internal languages, just as we
need to do this when we use their external
languages.
The systematic solution to the
abbreviation problem is, again, to revise
the whole body of propositions in current
use, in so far as they are going to be
used in the same deductive operations.
All the alternatives to this that we
can envision are of somewhat stopgap
nature. We content ourselves with the
observation that it is equally a major
problem for humans to make substantial
changes in basic abstractions, ways of
classifying or perceiving, and the like.
Once one has built up a structure depending
on a certain conceptual commitment, he
will stave off a revision of its foundation as though the cost of changing it were high. Otherwise, perhaps, people
would not argue so much. One may view
that phenomenon, if one likes, as a matter
of ego involvement. But it would be well
to remember that being wrong (and having
to change) has a real intellectual cost,
and not merely a social cost.
In any case, our ideas on this
subject are not yet in presentable
condition.
Conclusion
The need to be able to make abstractions in symbolic language is
already urgent in current attempts to
make machines prove theorems, 'play games,
etc. There are some very difficult
problems to be faced in this area. We
still maintain, with McCarthy, that "in order for a program to be capable of learning something it must first be capable of being told it." Results on "self-organizing systems" without explicit
provision for such abilities show very
little promise to date, and systematic
attempts in the direction of internal
language processing should be promoted.
References

1. J. McCarthy, "Programs with Common Sense," Mechanization of Thought Processes, H.M.S.O., London, 1959, pp. 75-84 (Vol. I).

2. M. L. Minsky, "Steps Toward Artificial Intelligence," Proc. IRE, Vol. 49, No. 1, Jan. 1961, pp. 8-30.
BASEBALL: AN AUTOMATIC QUESTION-ANSWERER

Bert F. Green, Jr., Alice K. Wolf, Carol Chomsky, and Kenneth Laughery
Lincoln Laboratory*, Massachusetts Institute of Technology
Lexington 73, Massachusetts

Summary
Baseball is a computer program that answers
questions phrased in ordinary English about stored
data. The program reads the question from punched
cards. After the words and idioms are looked up
in a dictionary, the phrase structure and other
syntactic facts are determined for a content
analysis, which lists attribute-value pairs
specifying the information given and the information requested. The requested information is
then extracted from the data matching the specifications, and any necessary processing is done. Finally, the answer is printed. The program's present context is baseball games; it answers such questions as "Where did each team play on July 7?"
Introduction
Men typically communicate with computers in a variety of artificial, stylized, unambiguous languages that are better adapted to the machine than to the man. For convenience and speed, many future computer-centered systems will require men to communicate with computers in natural language. The business executive, the military commander, and the scientist need to ask questions of the computer in ordinary English,
and to have the computer answer questions
directly. Baseball is a first step toward this
goal.
Baseball is a computer program that answers
questions posed in ordinary English about data
in its store. The program consists of two parts.
The linguistic part reads the question from a
punched card, analyzes it syntactically, and
determines what information is given about the
data being requested. The processor searches
through the data for the appropriate information,
processes the results of the search, and prints
the answer.
The program is written in IPL-V,1 an information processing language that uses lists, and
hierarchies of lists, called list structures, to
represent information. Both the data and the
dictionary are list structures, in which items
of information are expressed as attribute-value
pairs, e.g., Team = Red Sox.
*Operated with support from the U.S. Army, Navy,
and Air Force.

The program operates in the context of
baseball data. At present, the data are the month,
day, place, teams and scores for each game in the
American League for one year. In this limited
context, a small vocabulary is sufficient, the
data are simple, and the subject-matter is
familiar.
Some temporary restrictions were placed on the input questions so that the initial program could be relatively straightforward. Questions are limited to a single clause; by prohibiting structures with dependent clauses the syntactic analysis is considerably simplified. Logical connectives, such as and, or, and not, are prohibited, as are constructions implying relations like most and highest. Finally, questions involving sequential facts, such as "Did the Red Sox ever win six games in a row?" are prohibited. These restrictions are temporary expedients that will be removed in later versions of the program. Moreover, they do not seriously reduce the number of questions that the program is capable of answering. From simple questions such as "Who did the Red Sox lose to on July 5?" to complex questions such as "Did every team play at least once in each park in each month?" lies a vast number of answerable questions.
Specification List
Fundamental to the operation of the baseball
program is the concept of the specification list,
or spec list. This list can be viewed as a
canonical expression for the meaning of the
question; it represents the information contained
in the question in the form of attribute-value
pairs, e.g., Team = Red Sox. The spec list is
generated from the question by the linguistic
part of the program, and it governs the operation
of the processor. For example, the question "Where did the Red Sox play on July 7?" has the spec list:
Place = ?
Team  = Red Sox
Month = July
Day   = 7
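Purely as an illustration (the program itself is written in IPL-V; the Python rendering below and its names are assumptions of this note), the spec list can be pictured as an ordered set of attribute-value pairs in which "?" marks the information requested.

# A minimal sketch of a spec list as attribute-value pairs; "?" marks the query.
QUERY = "?"

spec_list = [
    ("Place", QUERY),      # the information requested
    ("Team", "Red Sox"),   # the information given
    ("Month", "July"),
    ("Day", 7),
]

def queried_attributes(spec):
    """Return the attributes whose value is requested."""
    return [attr for attr, value in spec if value == QUERY]

print(queried_attributes(spec_list))   # ['Place']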

Some questions cannot be expressed solely in terms of the main attributes (Month, Day, Place, Team, Score and Game Serial Number), but require some modification of these attributes. For example, on the spec list of "What teams won 10 games in July?", the attribute Team is modified by winning, and Game is modified by Number of, yielding

Team (winning)   = ?
Game (number of) = 10
Month            = July

Dictionary

The dictionary definitions, which are expressed as attribute-value pairs, are used by the linguistic part of the program in generating the spec list. A complete definition for a word or idiom includes a part of speech, for use in determining phrase structure; a meaning, for use in analyzing content; an indication of whether the entry is a question-word, e.g., who or how many; and an indication of whether a word occurs as part of any stored idiom. Separate dictionaries are kept for words and idioms, an idiom being any contiguous set of words that functions as a unit, having a unique definition.


The meaning of a word can take one of several forms. It may be a main or derived attribute with an associated value. For example, the meaning of the word Team is Team = (blank), the meaning of Red Sox is Team = Red Sox, and the meaning of who is Team = ?. The meaning may designate a subroutine, together with a particular value, as in the case of modifiers such as
winning, each, six, or how many. For example, winning has the meaning Subroutine A1 = winning.
The subroutine, which is executed by the content analysis, attaches the modifier winning to the attribute of the appropriate noun. Some words have more than one meaning; the word Boston may mean either Place = Boston or Team = Red Sox. The dictionary entry for such words contains, in addition to each meaning, the designation of a subroutine that selects the appropriate meaning according to the context in which the word is encountered. Finally, some words such as the, did, play, etc., have no meaning.
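Again as an illustration only, and with the field names invented for the sketch (the actual entries are IPL-V list structures, and the program's 14 parts of speech are not reproduced here), dictionary entries of the kinds just described might be pictured as:

# A minimal sketch of word and idiom dictionary entries as attribute-value pairs.
word_dictionary = {
    "team":    {"part_of_speech": "noun", "meaning": ("Team", "(blank)")},
    "who":     {"part_of_speech": "noun", "meaning": ("Team", "?"), "question_word": True},
    "winning": {"part_of_speech": "adjective", "meaning": ("Subroutine A1", "winning")},
    "boston":  {"part_of_speech": "noun",
                # ambiguous word: a selection subroutine picks one meaning in context
                "meanings": [("Place", "Boston"), ("Team", "Red Sox")],
                "selector": "choose Place or Team from context"},
    "the":     {"part_of_speech": "article"},          # no meaning
}
idiom_dictionary = {
    "red sox": {"part_of_speech": "noun", "meaning": ("Team", "Red Sox")},
}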
Data

The data are organized in a hierarchical structure, like an outline, with each level containing one or more items of information. Relationships among items are expressed by their occurrence on the same list, or on associated lists. The main heading, or highest level of the structure, is the attribute Month. For each month, the data are further subdivided by place. Below each place under each month is a list of all games played at that place during that month. The complete set of items for one game is found by tracing one path through the hierarchy, i.e., one list at each level. Each path contains values for each of six attributes, e.g.:

Month = July
Place = Boston
Day = 7
Game Serial No. = 96
(Team = Red Sox, Score = 5)
(Team = Yankees, Score = 3)

The parentheses indicate that each Team must be associated with its own score, which is done by placing them together on a sublist.

The processing routines are written to accept any organization of the data. In fact, they will accept a non-parallel organization in which, for example, the data might be as above for all games through July 31, and then organized by place, with month under place, for the rest of the season. The processing routines will also accept a one-level structure in which each game is a list of all attribute-value pairs for that game. The possibility of hierarchical organization was included for generality and potential efficiency.
Details of the Program
The program is organized into several
successive, essentially independent routines,
each operating on the output of its predecessor
and producing an input for the routine that
follows. The linguistic routines include
question read-in, dictionary look-up, syntactic
analysis, and content analysis. The processing
routines include the processor and the responder.
Linguistic Routines
Question Read-in. A question for the program
is read into the computer from punched cards.
The question is formed into a sequential list of
words.
Dictionary Look-up. Each word on the question list is looked up in the word dictionary and its definition copied. Any undefined words are printed out. (In the future, with a direct-entry keyboard, the computer can ask the questioner to define the unknown words in terms of words that it knows, and so augment its vocabulary.) The list is scanned for possible idioms; any contiguous words that form an idiom are replaced by a single entry on the question list, and an associated definition from the idiom dictionary. At this point, each entry on the list has associated with it a definition, including a part of speech, a meaning, and perhaps other indicators.
Syntax. The syntactic analysis is based on the parts of speech, which are syntactic categories assigned to words for use by the syntax routine. There are 14 parts of speech and
several ambiguity markers.
First, the question is scanned for ambiguities in part of speech, which are resolved in some cases by looking at the adjoining words, and in other cases by inspecting the entire question. For example, the word score may be either a noun or a verb; our rule is that, if there is no other main verb in the question, then score is a verb, otherwise it is a noun.
Next, the syntactic routine locates and brackets the noun phrases, [ ], and the prepositional and adverbial phrases, ( ). The verb is left unbracketed. This routine is patterned after the work of Harris and his associates at the University of Pennsylvania.2 Bracketing proceeds from the end of the question to the beginning. Noun phrases, for example, are bracketed in the following manner: certain parts of speech indicate the end of a noun phrase; within a noun phrase, a part of speech may indicate that the word is within the phrase, or that the word starts the phrase, or that the word is not in the phrase, which means that the previous word started the phrase. Prepositional phrases consist of a preposition immediately preceding a noun phrase. The entire sequence, preposition and noun phrase, is enclosed in prepositional brackets. An example of a bracketed question is shown below:

[How many games] did [the Yankees] play (in [July])?

When the question has been bracketed, any unbracketed preposition is attached to the first noun phrase in the sentence, and prepositional brackets added. For example, "Who did the Red Sox lose to on July 5?" becomes "(To [who]) did [the Red Sox] lose (on [July 5])?"
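A toy version of this right-to-left bracketing is sketched below; the part-of-speech tags and the small lexicon are invented stand-ins for the program's own categories, and the routine is of course not the authors' IPL-V code.

# A minimal sketch of right-to-left noun-phrase bracketing driven by parts of speech.
LEXICON = {
    "how": "NOUN_START", "many": "NOUN_IN", "games": "NOUN_END",
    "did": "VERB_AUX", "the": "NOUN_IN", "yankees": "NOUN_END",
    "play": "VERB", "in": "PREP", "july": "NOUN_END",
}

def bracket(words):
    """Scan from the end of the question toward the beginning, closing a noun
    phrase at a NOUN_END word and extending it while the tags allow."""
    out = []
    i = len(words) - 1
    while i >= 0:
        tag = LEXICON.get(words[i].lower(), "OTHER")
        if tag == "NOUN_END":                       # end of a noun phrase
            j = i
            while j - 1 >= 0 and LEXICON.get(words[j - 1].lower()) in ("NOUN_IN", "NOUN_START"):
                j -= 1
            phrase = "[" + " ".join(words[j:i + 1]) + "]"
            # a preposition immediately before the noun phrase wraps it in ( )
            if j - 1 >= 0 and LEXICON.get(words[j - 1].lower()) == "PREP":
                phrase = "(" + words[j - 1] + " " + phrase + ")"
                j -= 1
            out.append(phrase)
            i = j - 1
        else:
            out.append(words[i])
            i -= 1
    return " ".join(reversed(out))

print(bracket("How many games did the Yankees play in July".split()))
# -> [How many games] did [the Yankees] play (in [July])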
Following the phrase analysis, the syntax routine determines whether the verb is active or passive and locates its subject and object. Specifically, the verb is passive if and only if the last verb element in the question is a main verb and the preceding verb element is some form of the verb to be. For questions with active verbs, if a free noun phrase (one not enclosed in prepositional brackets) is found between two verb elements, it is marked Subject, and the first free noun phrase in the question is marked Object. Otherwise the first free noun phrase is the subject, the next, if any, is the object. For passive verbs, the first free noun phrase is marked Object (since it is the object in the active form of the question) and all prepositional phrases with the preposition by have the noun phrase within them marked Subject. If there is more than one, the content analysis later chooses among them on the basis of meaning.
Finally, the syntactic analysis checks to
see if any of the words is marked as a question
word. If not, a signal is set to indicate that
the question requires a yes/no answer.
Content Analysis. The content analysis uses the dictionary meanings and the results of the syntactic analysis to set up a specification list for the processing program. First any subroutine found in the meaning of any word or idiom in the question is executed. The subroutines are of two basic types: those that deal with the meaning of the word itself and those that in some way change the meaning of another word. The first chooses the appropriate meaning for a word with multiple meanings, as, for example, the subroutine mentioned above that decides, for names of cities, whether the meaning is Team = At or Place = Ap. The second type alters or modifies the attribute or value of an appropriate syntactically related word. For example, one such subroutine puts its value in place of the value of the main noun in its phrase. Thus Team = (blank) in the phrase each team becomes Team = each; in the phrase what team, it becomes Team = ?. Another subroutine modifies the attribute of a main noun. Thus Team = (blank) in the phrase winning team becomes Team (winning) = (blank). In the question "Who beat the Yankees on July 4?", this subroutine, found in the meaning of beat, modifies the attribute of the subject and object, so that Team = ? and Team = Yankees are rendered Team(winning) = ? and Team(losing) = Yankees.
Another subroutine combines these two operations: it both modifies the attribute and changes the value of the main noun. Thus, Game = (blank) in the phrase six games becomes Game(number of) = 6, and in the phrase how many games becomes Game(number of) = ?.
After the subroutines have been executed, the question is scanned to consolidate those attribute-value pairs that must be represented on the specification list as a single entry. For example, in "Who was the winning team ...", Team = ? and Team(winning) = (blank) must be collapsed into Team(winning) = ?. Next, successive scans will create any sublists implied by the syntactic structure of the question. Finally, the composite information for each phrase is entered onto the spec list. Depending on its complexity, each phrase furnishes one or more entries for the list. The resulting spec list is printed in outline form, to provide the questioner with some intermediate feedback.
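Schematically, and with all names invented for the sketch (not the program's own subroutines), the two kinds of modifier subroutines can be pictured as operations on attribute-value triples:

# A minimal sketch of modifier subroutines acting on (attribute, modifier, value) triples.
def modify_attribute(entry, modifier):
    """e.g. ('Team', None, '?') + 'winning' -> ('Team', 'winning', '?')"""
    attribute, _, value = entry
    return (attribute, modifier, value)

def change_value(entry, new_value):
    """e.g. ('Team', None, '(blank)') + '?' -> ('Team', None, '?')"""
    attribute, modifier, _ = entry
    return (attribute, modifier, new_value)

# "Who beat the Yankees on July 4?": beat marks the subject winning, the object losing.
subject = ("Team", None, "?")          # from 'who'
object_ = ("Team", None, "Yankees")    # from 'the Yankees'
subject = modify_attribute(subject, "winning")
object_ = modify_attribute(object_, "losing")
print(subject, object_)   # ('Team', 'winning', '?') ('Team', 'losing', 'Yankees')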

Processing Routine
Processor. The specification list indicates
to the processor what part of the stored data is
relevant for answering the input question. The processor extracts the matching information from the data and produces, for the responder, the answer to the question in the form of a list structure.
The core of the processor is a search routine
that attempts to find a match, on each path of a
given data structure, for all the attribute-value
pairs on the spec list; when a match for the whole
spec list is found on a given path, these pairs
relevant to the spec list are entered on a found
list. A particular spec list pair is considered matched when its attribute has been found on a data path and either the data value is the same as the spec value, or the spec value is ? or each, in which case any value of the particular attribute is a match. Matching is not always straightforward. Derived attributes and some modified attributes are functions of a number of attributes on a path and must be computed before the values can be matched. For example, if the spec entry is Home Team = Red Sox, the actual home team for a particular path must be computed from the place and teams on that path before the spec value Red Sox can be matched with the computed data value. Sublists also require special handling because the entries on the sublist must sometimes be considered separately and sometimes as a unit in various permutations.
The found list produced by the search routine
is a hierarchical list structure containing one
main or derived attribute on each level of each
path. Each path on the found list represents the
information extracted from one or more paths of
the data. For example, for the question "Where did each team play in July?", a single path exists, on the found list, for each team which played in July. On the level below each team, all places in which that team played in July
occur on a list that is the value of the attribute
Place. Each path on the found list may thus
represent a condensation of the information
existing on many paths of the search data.
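The matching step can be sketched as follows, under the simplifying assumptions that a data path is a flat set of attribute-value pairs and that derived attributes and sublists are ignored; the names are invented for the illustration.

# A minimal sketch: match every spec-list pair against a flat data path and
# collect the matching paths on a found list.
def pair_matches(attribute, spec_value, path):
    if attribute not in path:
        return False
    return spec_value in ("?", "each") or path[attribute] == spec_value

def search(spec_list, data_paths):
    """Return the found list: one entry per path matching the whole spec list."""
    found = []
    for path in data_paths:
        if all(pair_matches(a, v, path) for a, v in spec_list):
            found.append({a: path[a] for a, _ in spec_list})
    return found

data_paths = [
    {"Month": "July", "Place": "Boston", "Day": 7, "Team": "Red Sox"},
    {"Month": "July", "Place": "Boston", "Day": 7, "Team": "Yankees"},
    {"Month": "August", "Place": "Chicago", "Day": 2, "Team": "White Sox"},
]
spec = [("Place", "?"), ("Team", "Red Sox"), ("Month", "July"), ("Day", 7)]
print(search(spec, data_paths))
# -> [{'Place': 'Boston', 'Team': 'Red Sox', 'Month': 'July', 'Day': 7}]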
Many input questions contain only one query, as in the question above, i.e., Place = ?. These questions are answered, with no further processing, by the found list produced by one execution of the search routine. Others require simple processing on all occurrences of the queried attribute on the generated found list. The question
"In how many places did each team play in July?"
requires a count of the places for each team,
after the search routine has generated the list
of places for each team.
Other questions imply more than one search
as well as additional processing. For a spec
attribute with the value every, a comparison with
a list of all possible values for that attribute
must be made after the search routine has
generated lists of found values for that attribute.

Then, since only those found list paths for which all possible values of the attribute exist should remain on the found list as the answer to the question, the search routine, operating on this found list as the data, is again executed. It now generates a new found list containing all the data paths for which all possible values of the attribute were found. Likewise, questions involving a specified number, such as 4 teams, imply a search for which teams, a count of the teams found on each path, and a search of the found list for paths containing 4 teams.
In general, a question may contain implicit or explicit queries. Since these queries must be answered one at a time, several searches, with intermediate processing, are required. The first search operates on the stored data while successive searches operate on the found list generated by the preceding search operation.
As an example, consider the question "On how many days in July did eight teams play?" The spec list is

Day (number of)  = ?
Month            = July
Team (number of) = 8

On the first pass, the implicit question which teams is answered. The spec list for the first search is

Day   = Each
Month = July
Team  = ?

The found data is a list of days in July; for each day there is a list of teams that played on that date. Following this search, the processor counts the teams for each day and associates the count with the attribute Team. On the second search, the spec list is

Day              = ?
Month            = July
Team (number of) = 8

The found data is a list of days in July on which eight teams played. After this pass, the processor counts the days, adds the count to the found list and is finished.
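Continuing the same illustrative sketch (again with invented helper names, and assuming the flat data paths used above), the two passes for this question might be strung together as:

# A minimal sketch of the two-pass processing for
# "On how many days in July did eight teams play?"
from collections import defaultdict

def teams_per_day(data_paths, month):
    """First pass: for each day of the month, the set of teams that played."""
    by_day = defaultdict(set)
    for p in data_paths:
        if p["Month"] == month:
            by_day[p["Day"]].add(p["Team"])
    return by_day

def answer(data_paths, month="July", team_count=8):
    by_day = teams_per_day(data_paths, month)              # Day = Each, Team = ?
    qualifying_days = [d for d, teams in by_day.items()
                       if len(teams) == team_count]        # Team (number of) = 8
    return len(qualifying_days)                            # Day (number of) = ?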

Responder. No attempt has yet been made to
respond in grammatical English sentences. Instead,
the final found list is printed, in outline form.
For questions requiring a yes/no answer, YES is
printed along with the found list. If the search
routine found no matching data, NO is printed for
yes/no questions, and NO DATA for all other cases.


Discussion

The differences between Baseball and both automatic language translation and information retrieval should now be evident. The linguistic part of the baseball program has as its main goal the understanding of the meaning of the question as embodied in the canonical specification list. Syntax must be considered and ambiguities resolved in order to represent the meaning adequately. Translation programs have a different goal: transforming the input passage from one natural language to another. Meanings must be considered and ambiguities resolved to the extent that they affect the correctness of the final translation. In general, translation programs are concerned more with syntax and less with meaning than the Baseball program.
Baseball differs from most retrieval systems in the nature of its data. Generally the retrieval problem is to locate relevant documents. Each document has an associated set of index numbers describing its content. The retrieval system must find the appropriate index numbers for each input request and then search for all documents bearing those index numbers. The basic problem in such systems is the assignment of index categories. In Baseball, on the other hand, the attributes of the data are very well specified. There is no confusion about them. However, Baseball's derived attributes and modifiers imply a great deal more data processing than most document retrieval programs. (Baseball does bear a close relation with the ACSI-MATIC system discussed by Miller et al at the 1960 Western Joint Computer Conference.3)
The concept of the spec list can be used to
define the class of questions that the baseball
program can answer. It can answer all questions
whose spec list consists of attribute-value pairs
that the program recognizes. The attributes may
be modified or derived, and the values may be
definite or queries. Any combination of attributevalue pairs constitutes a specification list.
~i3.ny will be nonsense, but all can be ans'YTered.
The nmnber of questions in the class is, of
course, infinite, because of the numerical values.
But even if all numbers are restricted to two
digits, the program can answer millions of meaningful ~uestions.
The present program, despite its restrictions, is a very useful communication device. Any complex question that does not meet the restrictions can always be broken up into several simpler questions. The program usually rejects questions it cannot handle, in which case the questioner may rephrase his question. He can also check the printed spec list to see if the computer is on the right track, in case the linguistic program has erred and failed to detect its own error.

Finally, he can often judge whether the answer
is reasonable.
Next Steps
No important difficulty is expected in augmenting the program to include logical connectives, negatives, and relation words. The inclusion of multiple-clause questions also seems fairly straightforward, if the questioner will mark off for the computer the boundaries of his clauses. The program can then deal with the subordinate clauses one at a time before it deals with the main clause, using existing routines. On the other hand, if the syntax analysis is required to determine the clause boundaries as well as the phrase structure, a much more sophisticated program would be required.
The problem of recognizing and resolving semantic ambiguities remains largely unsolved. Determining what is meant by the question "Did the Red Sox win most of their games in July?" depends on a much larger context than the immediate question. The computer might answer all meaningful versions of the question (we know of five), or might ask the questioner which meaning he intended. In general, the facility for the computer to query the questioner is likely to be the most powerful improvement. This would allow the computer to increase its vocabulary, to resolve ambiguities, and perhaps even to train the questioner in the use of the program.
Considerable pains were taken to keep the program general. Most of the program will remain unchanged and intact in a new context, such as voting records. The processing program will handle data in any sort of hierarchical form, and is indifferent to the attributes used. The syntax program is based entirely on parts of speech, which can easily be assigned to a new set of words for a new context. On the other hand, some of the subroutines contained in the dictionary meanings are certainly specific to baseball; probably each new context would require certain subroutines specific to it. Also, each context might introduce a number of modifiers and derived attributes that would have to be defined in terms of special subroutines for the processor. Hopefully, all such occasions for change have been isolated in a small area of special subroutines, so that the main routines can be unaltered. However, until we have actually switched contexts, we cannot say definitively that we have been successful in producing a general question-answering program.
Acknowledgment
The Baseball program was conceived by
Fredrick C. Frick, Oliver G. Selfridge, and
Gerald P. Dineen, whose continued guidance is gratefully acknowledged.

References

1. A. Newell and F. Tonge, "An Introduction to Information Processing Language V," Commun. Assoc. for Computing Mach., Vol. 3, pp. 205-211; April, 1960.

2. See the project summary, by Z. S. Harris, in Current Research and Development in Scientific Documentation No. 6, pp. 52-53, National Science Foundation, May, 1960.

3. L. Miller, J. Minker, W. G. Reed, and W. E. Shindle, "A multi-level file structure for information processing," Proceedings Western Joint Computer Conference, Vol. 17, pp. 53-59, May, 1960.

A BASIS FOR A MATHEMATICAL THEORY OF COMPUTATION, PRELIMINARY REPORT
John McCarthy
M.I.T. Computation Center
Cambridge, Massachusetts
Abstract: Programs that learn to modify their own behaviors require a way of representing algorithms so that interesting properties and interesting transformations of algorithms are simply represented. Theories of computability have been based on Turing machines, recursive functions of integers and computer programs. Each of these has artificialities which make it difficult to manipulate algorithms or to prove things about them. The present paper presents a formalism based on conditional forms and recursive functions whereby the functions computable in terms of certain base functions can be simply expressed. We also describe some of the formal properties of conditional forms, and a method called recursion induction for proving facts about algorithms.

Computation is sure to become one of the
most important of the sciences. This is because
it is the science of how machines can be made to
carry out intellectual processes. We know that
any intellectual process that can be carried out
mechanically can be performed by a general purpose digital computer. Moreover, the limitations
on what we have been able to make computers do so
far seem to come far more from our weakness as
programmers than from the intrinsic limitations
of the machines. We hope that these limitations
can be greatly reduced by developing a mathematical science of computation.
There are three established directions of
mathematical research relevant to a science of
computation. The first and oldest of these is
numerical analysis. Unfortunately, its subject
matter is too narrow to be of much help in
forming a general theory, and it has only recently begun to be affected by the existence of
automatic computation.
The second relevant direction of research
is the theory of computability as a branch of
recursive function theory. The results of the
basic work in this theory including the existence
of universal machines and the existence of unsolvable problems have established a framework
in which any theory of computation must fit.
Unfortunately, the general trend of research in
this field has been to establish more and better
unsolvability theorems, and there has been very
little attention paid to positive results and
none to establishing the properties of the kinds
of algorithms that are actually used. Perhaps
for this reason the formalisms for describing
algorithms are too cumbersome to be used to describe actual algorithms.
The third direction of mathematical research
is the theory of finite automata. Results which
use the finiteness of the number of states tend
not to be very useful in dealing with present
computers which have so many states that it is
impossible for them to go through a substantial
fraction of them in a reasonable time.

The present paper is an attempt to create a
basis for a mathematical theory of computation.
Before mentioning what is in the paper, we shall
discuss briefly what practical results can be
hoped for from a suitable mathematical theory.
This paper contains direct contributions towards
only a few of the goals to be mentioned, but we
list additional goals in order to encourage a
gold rush.
1. To develop a universal programming
language. We believe that this goal has been
written off prematurely by a number of people.
Our opinion of the present situation is that
ALGOL is on the right track but mainly lacks the
ability to describe different kinds of data,
that COBOL is a step up a blind alley on account
of its orientation towards English which is not
well suited to the formal description of procedures, and that UNCOL is an exercise in group
wishful thinking. The formalism for describing
computations in this paper is not presented as
a candidate for a universal programming language
because it lacks a number of features, mainly
syntactic, which are necessary for convenient
use.
2. To define a theory of the equivalence
of computation processes. With such a theory
we can define equivalence preserving transformations. Such transformations can be used to
take an algorithm from a form in which it is
easily seen to give the right answers to an
equivalent form guaranteed to-give the same answers but which has other advantages such as
speed, economy of storage, or the incorporation
of auxiliary processes.
4. To represent algorithms by symbolic expressions in such a way that significant changes
in the behavior represented by the algorithms
are represented by simple changes in the symbolic
expressions. Programs that are supposed to learn
from experience change their behavior by changing the contents of the registers that represent
the modifiable aspects of their behavior. From
a certain point of view having a convenient
representation of one's behavior available for
modification is what is meant by consciousness.
5. To represent computers as well as
computations in a formalism that permits a
treatment of the relation between a computation
and the computer that carries out the
computation.
6. To give a quantitative theory of computation. There ought to be a quantitative
measure of the size of a computation analogous
to Shannon's measure of information. The
present paper contains no information about this.
The present paper is divided into two
sections. The first contains several descriptive formalisms with a few examples of their
use, and the second contains what little theory
we have that enables us to prove the equivalence of computations expressed in these formalisms.
The formalisms treated are the following:
1. A way of describing the functions that
are computable in terms of given base functions
using conditional expressions and recursive function definitions. This formalism differs from
those of recursive function theory in that it is
not based on the integers or any other fixed
domain.
2. Computable functionals, i.e. functions
with functions as arguments.
3. Non-computable functions. By adjoining
quantifiers to the computable function formalism,
we obtain a wider class of functions which are
not a priori computable. However, such functions
can often be shown to be equivalent to computable
functions. In fact, the mathematics of computation may have as one of its major aspects rules
which permit us to transform functions from a
non-computable form into a computable form.
4. Ambiguous functions. Functions whose
values are incompletely specified may be useful
in proving facts about functions where certain
details are irrelevant to the statement being
proved.
5. A way of defining new data spaces in
terms of given base spaces and of defining functions on the new spaces in terms of functions on
the base spaces. Lack of such a formalism is one
of the main weaknesses of ALGOL, but the business
data processing languages such as FLOWMATIC and
COBOL have made a start in this direction, even
though this start is hampered by concessions to
the presumed prejudices of business men.
The second part of the paper contains a few
mathematical results about the properties of the
formalisms introduced in the first part. Specifically, we describe the following:
1. The formal properties of conditional
expressions.
2. A method called recursion induction for
proving the equivalence of recursively defined
functions.
3. Some relations between the formalisms
introduced in this paper and other formalisms
current in recursive function theory and in
programming.
We hope that the reader will not be angry
about the contrast between the great expectations
of a mathematical theory of computation and the
meager results presented in this paper.
FORMALISMS FOR DESCRIBING COMPUTABLE FUNCTIONS AND RELATED ENTITIES

In this part we describe a number of new formalisms for expressing computable functions and related entities. The most important section is 1, the subject matter of which is fairly well understood. The other sections give formalisms which we hope will be useful in constructing computable functions and in proving theorems about them.

1. Functions Computable in Terms of Given Base Functions

Suppose we are given a base collection F of functions having certain domains and ranges. In the case of the non-negative integers, we may have the successor function and the predicate of equality, and in the case of the S-expressions discussed in (7), we have the five basic operations. Our object is to define a class of functions C{F} which we shall call the class of functions computable in terms of F.
Before developing C{F} formally, we wish to give an example, and in order to give the example, we first need the concept of conditional expression. In our notation a conditional expression has the form

(p1 → e1, p2 → e2, ..., pn → en)

which corresponds to the ALGOL 60 reference language (5) expression

if p1 then e1 else if p2 then e2 ... else if pn then en

Here p1, ..., pn are propositional expressions taking the values T or F standing for truth and falsity respectively.
The value of (p1 → e1, p2 → e2, ..., pn → en) is the value of the e corresponding to the first p that has value T. Thus

(4 < 3 → 7, 2 > 3 → 8, 2 < 3 → 9, 4 < 5 → 7) = 9
Some examples of the conditional expressions for well known functions are

|x| = (x < 0 → -x, x ≥ 0 → x)
δij = (i = j → 1, i ≠ j → 0)

and the triangular function whose graph is given in Figure 1 is represented by the conditional expression

tri(x) = (x ≤ -1 → 0, x ≤ 0 → x+1, x ≤ 1 → 1-x, x > 1 → 0)

Figure 1. Graph of the triangular function tri(x).


Now we are ready to use conditional expreSSions to define functions recursively. For
example, we have
n = (n-O

-l,n~ -+n.(n-l)~)

Let us evaluate 2! according to this definition. We have

2! = (2 = 0 → 1, 2 ≠ 0 → 2·(2-1)!)
   = 2·1!
   = 2·(1 = 0 → 1, 1 ≠ 0 → 1·(1-1)!)
   = 2·1·0!
   = 2·1·(0 = 0 → 1, 0 ≠ 0 → 0·(0-1)!)
   = 2·1·1
   = 2
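As a present-day illustration, not part of the paper, the definition above can be transcribed directly into Python, whose conditional expression evaluates only the branch it needs, which is the property required of conditional forms.

# A minimal sketch of the conditional-expression style of recursive definition.
def factorial(n):
    # n! = (n = 0 -> 1, n != 0 -> n * (n - 1)!)
    return 1 if n == 0 else n * factorial(n - 1)

assert factorial(2) == 2    # the evaluation of 2! traced in the text
assert factorial(5) == 120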

The reader who has followed these simple examples is ready for the construction of C{F}, which is a straightforward generalization of the above together with a tying up of a few loose ends.
Some Notation. Let F be a collection (finite in the examples we shall give) of functions whose domains and ranges are certain sets. C{F} will be a class of functions involving the same sets which we shall call computable in terms of F.
Suppose f is a function of n variables and suppose that if we write y = f(x1, ..., xn), each xi takes values in the set Ui and y takes its value in the set V. It is customary to describe this situation by writing

f: U1 × U2 × ... × Un → V

The set U1 × ... × Un of n-tuplets (x1, ..., xn) is called the domain of f, and the set V is called the range of f.
Forms and Functions. In order to make properly the definitions that follow, we will distinguish between functions and expressions involving variables. Following Church,1 the latter are called forms. Single letters such as f, g, h, etc. or sequences of letters such as sin are used to denote functions. Expressions such as f(x,y), f(g(x),y), x²+y are called forms. In particular we may refer to the function f defined by f(x,y) = x²+y. Our definitions will be written as though all forms involving functions were written f(e1, ..., en), although we will use expressions like x+y with infixes like + in examples.
Composition. Now we shall describe the ways in which new functions are defined from old. The first way may be called (generalized) composition and involves the use of forms. We shall use the letters x, y, ... (sometimes with subscripts) for variables and will suppose that there is a notation for constants that does not make expressions ambiguous. (Thus, the decimal notation is allowed for constants when we are dealing with integers.)
The class of forms is defined recursively as follows:
i) A variable x with an associated space U is a form, and with this form we also associate U. A constant in a space U is a form and we also associate U with this form.
ii) If e1, ..., en are forms associated with the spaces U1, ..., Un respectively, and f: U1 × ... × Un → V, then f(e1, ..., en) is a form associated with the space V. In this way the form f(g(x,y),x) is built from the forms g(x,y) and x and the function f.
If all the variables occurring in a form e are among x1, ..., xn, we can define a function h by writing h(x1, ..., xn) = e. We shall assume that the reader knows how to compute the values of a function defined in this way. If f1, ..., fm are all the functions occurring in e, we shall say that the function h is defined by composition from f1, ..., fm. The class of functions definable from given functions by composition only is narrower than the class of functions computable in terms of the given functions.
Partial Functions. In the theory of computation it is necessary to deal with partial
functions which are not defined for all n-tuplets in their domains. Thus we have the
partial function minus, defined by
minus(x,y)=x-y, which is defined on those pairs
(x,y) of positive integers for which
x is
greater than y. A function which is defined
for all n-tuplets in its domain is called a
total function. We admit the limiting case of a
partial function which is not defined for any
n-tuplets.
The n-tuplets for which a function described by composition is defined is determined in an obvious way from the sets of n-tuplets for which the functions entering the composition are defined. If all the functions occurring in a composition are total functions, the new function is also a total function, but the other processes for defining functions are not so kind to totality. When the word "function" is used from here on, we shall mean partial function.
Having to introduce partial functions is a
nuisance, but an unavoidable one. The rules for
defining computable functions sometimes give
computation processes that never terminate, and
when the computation process fails to terminate,
the result is undefined. It has been shown that
there is no effective general way of deciding
whether a process will terminate.
Predicates and Propositional Forms
The space of truth values, whose only elements are T (for truth) and F (for falsity), has a special role in our theory. A function whose range is the space of truth values is called a predicate. Examples of predicates on the integers are prime, defined by

prime(x) = (T if x is prime, F otherwise)

and less, defined by

less(x,y) = (T if x < y, F otherwise)
We shall, of course, write x < y instead of less(x, y). The following examples illustrate the values of conditional forms, including cases in which some of the p's and e's are undefined:

(1 < 2 → 4, 1 < 2 → 3) = 4
(2 < 1 → 4, 2 > 1 → 3) = 3
(2 < 1 → 1, 3 < 1 → 3) is undefined
(0/0 < 1 → 1, 1 < 2 → 3) is undefined
(1 < 2 → 0/0, 1 < 2 → 1) is undefined
(1 < 2 → 2, 1 < 3 → 0/0) = 2
The truth value T can be used to simplify certain conditional forms. Thus, instead of

|x| = (x < 0 → -x, x ≥ 0 → x)

we shall write

|x| = (x < 0 → -x, T → x)
The propositional connectives can be defined by conditional forms as follows:

p ∧ q = (p → q, T → F)
p ∨ q = (p → T, T → q)
¬p    = (p → F, T → T)
p ⊃ q = (p → q, T → T)
Consideration of truth tables shows that these formulas give the same results as the usual definitions. However, in order to treat partial functions we must consider the possibility that p or q may be undefined.
Suppose that p is false and q is undefined; then according to the conditional form definition p ∧ q is false and q ∧ p is undefined. This unsymmetry in the propositional connectives turns out to be appropriate in the theory of computation, since if a calculation of p gives F as a result, q need not be computed to evaluate p ∧ q, but if the calculation of p does not terminate, we never get around to computing q.

It is natural to ask if a function condn of 2n variables can be defined so that

(p1 → e1, ..., pn → en) = condn(p1, ..., pn, e1, ..., en)

This is not possible unless we extend our notion of function because normally one requires all the arguments of a function to be given before the function is computed. However, as we shall shortly see, it is important that a conditional form be considered defined when, for example, p1 is true and e1 is defined and all the other p's and e's are undefined. The required extension of the concept of function would have the property that functions of several variables could no longer be identified with one-variable functions defined on product spaces. We shall not pursue this possibility further here.
We now want to extend our notion of forms to include conditional forms. Suppose p1, ..., pn are forms associated with the space of truth values and e1, ..., en are forms associated with the same space V. Suppose further that each variable xi occurring in p1, ..., pn and e1, ..., en is associated with the space Ui. Then

(p1 → e1, ..., pn → en)

is a form associated with V.
We believe that conditional forms will
eventually come to be generally used in mathematics whenever functions are defined by considering cases. Their introduction is the same
kind of innovation as vector notation. Nothing
can be proved with them that could not also be
proved without them.
However, their formal
properties, which will be discussed later, will
reduce many case-analysis verbal arguments to
calculation.
Definition of Functions by Recursion. The definition

n! = (n = 0 → 1, T → n·(n-1)!)

is an example of definition by recursion. Consider the computation of 0!:

0! = (0 = 0 → 1, T → 0·(0-1)!)

We now see that it is important to provide that the conditional form is defined even if a term beyond the one that gives the value is undefined. In this case (0-1)! is undefined.
Note also that if we consider a wider domain than the non-negative integers, n! as defined above becomes a partial function, since unless n is a non-negative integer, the recursion process does not terminate.
In general, we can either define single
functions by recursion or define several functions together by simultaneous recursion, the
former being a particular case of the latter.
To define simultaneously functions f1, ..., fk, we write equations

f1(x1, ..., xn) = e1
. . .
fk(x1, ..., xn) = ek

The expressions e1, ..., ek must contain only known functions and the functions f1, ..., fk. Suppose that the ranges of the functions are to be V1, ..., Vk respectively; then we further require that the expressions e1, ..., ek be associated with these spaces respectively, given that within e1, ..., ek the f's are taken as having the V's as ranges. This is a consistency condition.
fi(x1, ..., xn) is to be evaluated for given values of the x's as follows.
1. If ei is a conditional form then the p's are to be evaluated in the prescribed order, stopping when a true p and the corresponding e have been evaluated.
2. If ei has the form g(e1*, ..., em*) then e1*, ..., em* are to be evaluated and then the function g applied.
3. If any expression fj(e1*, ..., enj*) occurs it is to be evaluated from the defining equation.
4. Any subexpressions of ei that have to be evaluated are evaluated according to the same rules.
5. Variables occurring as subexpressions are evaluated by giving them the assigned values.
There is no guarantee that the evaluation process will terminate in any given case. If for particular arguments the process does not terminate, then the function is undefined for these arguments. The possibility of termination depends on the presence of conditional expressions in the ei's.

The class of functions C{F} computable in terms of the given base functions F is defined to consist of the functions which can be defined by repeated applications of the above recursive definition process.
2. Recursive Functions of the Integers

In Reference 7 we develop the recursive functions of a class of symbolic expressions in terms of the conditional expression and recursive function formalism.
As an example of the use of recursive function definitions, we shall give recursive definitions of a number of functions over the integers. We do this for three reasons: to help the reader familiarize himself with recursive definition, to show how much simpler in practice our methods of recursive definition are than either Turing machines or Kleene's formalism, and to prove that any partial recursive function (Kleene) on the non-negative integers is in C{F} where F contains only the successor function and the predicate equality.

Let I be the set of non-negative integers {0, 1, 2, ...}, denote the successor of an integer n by n', and denote the equality of integers n1 and n2 by n1 = n2. If we define the functions succ and eq by

succ(n) = n'
eq(n1, n2) = (T if n1 = n2, F if n1 ≠ n2)

then we write F = {succ, eq}. We are interested in C{F}. Clearly all functions in C{F} will have either integers or truth values as values.
First we define the predecessor function pred (not defined for n = 0) by

pred(n) = pred2(n, 0)
pred2(n, m) = (m' = n → m, T → pred2(n, m'))

We shall denote pred(n) by n⁻. Now we define the sum

m + n = (n = 0 → m, T → m' + n⁻),

the product

m × n = (n = 0 → 0, T → m + m × n⁻),

the difference

m - n = (n = 0 → m, T → m⁻ - n⁻)

which is defined only for m ≥ n, the inequality

m ≤ n = (m = 0) ∨ (¬(n = 0) ∧ (m⁻ ≤ n⁻)),

the strict inequality

m < n = (m ≤ n) ∧ ¬(m = n),

the integer valued quotient

m/n = (m < n → 0, T → ((m - n)/n)'),

the remainder

rem(m/n) = (m < n → m, T → rem((m - n)/n))

and the divisibility of a number n by a number m

m|n = (n = 0) ∨ ((n ≥ m) ∧ (m|(n - m))).

The primeness of a number is defined by

prime(n) = (n ≠ 0) ∧ (n ≠ 1) ∧ prime2(n, 2)

where

prime2(m, n) = (m = n) ∨ (¬(n|m) ∧ prime2(m, n'))

The Euclidean algorithm defines the greatest common divisor if we write

gcd(m, n) = (m > n → gcd(n, m), rem(n/m) = 0 → m, T → gcd(rem(n/m), m))

and we can define Euler's φ-function by

φ(n) = φ2(n, n)

where

φ2(n, m) = (m = 1 → 1, gcd(n, m) = 1 → φ2(n, m⁻)', T → φ2(n, m⁻))

The above shows that our form of recursion
is a convenient way of defining arithmetical
functions. We shall see how the properties of
the arithmetical functions can conveniently be
derived in this formalism in a later section.
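The following sketch, an addition of this note rather than anything in the paper, transcribes a few of these definitions into Python using only succ and eq as base functions; ordinary Python recursion plays the role of the conditional form.

# A minimal sketch of arithmetic built from the base functions succ and eq.
def succ(n): return n + 1
def eq(a, b): return a == b

def pred(n, m=0):                 # pred(n) = pred2(n, 0); not defined for n = 0
    return m if eq(succ(m), n) else pred(n, succ(m))

def add(m, n):                    # m + n = (n = 0 -> m, T -> m' + n-)
    return m if eq(n, 0) else add(succ(m), pred(n))

def mul(m, n):                    # m x n = (n = 0 -> 0, T -> m + m x n-)
    return 0 if eq(n, 0) else add(m, mul(m, pred(n)))

assert pred(5) == 4 and add(3, 4) == 7 and mul(3, 4) == 12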
3. Computable Functionals

The formalism previously described enables us to define functions that have functions as arguments. For example,

Σ_{i=m}^{n} a_i

can be regarded as a function of the numbers m and n and the sequence {a_i}. If we regard the sequence as a function f we can write the recursive definition
sum(m, n, f) = (m > n → 0, T → f(m) + sum(m+1, n, f))

or in terms of the conventional notation

Σ_{i=m}^{n} f(i) = (m > n → 0, T → f(m) + Σ_{i=m+1}^{n} f(i))
Functions with functions as arguments are called
functionals.
Another example is the functional least(p) which gives the least integer n such that p(n), for a predicate p. We have

least(p) = least2(p, 0)

where

least2(p, n) = (p(n) → n, T → least2(p, n+1))
In order to use functionals it is necessary to have a notation for naming functions. We use Church's1 lambda notation. Suppose we have a function f defined by an equation f(x1, ..., xn) = e where e is some expression in x1, ..., xn. The name of this function is λ((x1, ..., xn), e). For example, the name of the function f defined by f(x,y) = x²+y is λ((x,y), x²+y). We have λ((x,y), x²+y)(3,4) = 13 but λ((y,x), x²+y)(3,4) = 19.
The variables occurring in a λ-definition are dummy or bound variables and can be replaced by others without changing the function provided the replacement is done consistently. For example, the expressions λ((x,y), x²+y) and λ((u,v), u²+v) and λ((y,x), y²+x) all represent the same function.
In this notation Σ_{i=1}^{n} i² is represented by sum(1, n, λ((i), i²)) and the least integer n for which n² > 50 is represented by least(λ((n), n² > 50)).
When the functions with which we are dealing are defined recursively, a difficulty arises. For example, consider factorial defined by

factorial(n) = (n = 0 → 1, T → n·factorial(n-1))

The expression

λ((n), (n = 0 → 1, T → n·factorial(n-1)))

cannot serve as a name for this function because it is not clear that the occurrence of "factorial" in the expression refers to the function defined by the expression as a whole. Therefore, for recursive functions we adopt an additional convention. Namely,

label(f, λ((x1, ..., xn), e))

stands for the function f defined by the equation

f(x1, ..., xn) = e

where any occurrences of the function letter f within e stand for the function being defined. The letter f also serves as a dummy variable. The factorial function then has the name

label(factorial, λ((n), (n = 0 → 1, T → n·factorial(n-1))))

and since factorial and n are dummy variables the expression

label(g, λ((r), (r = 0 → 1, T → r·g(r-1))))

represents the same function.
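As an illustration in modern notation (an assumption of this note, with invented helper names), the functionals sum and least and the use of lambda expressions carry over directly:

# A minimal sketch of the functionals above, with Python lambdas in the role
# of the lambda notation.
def summation(m, n, f):
    # sum(m, n, f) = (m > n -> 0, T -> f(m) + sum(m+1, n, f))
    return 0 if m > n else f(m) + summation(m + 1, n, f)

def least(p, n=0):
    # least(p) = least2(p, 0); least2(p, n) = (p(n) -> n, T -> least2(p, n+1))
    return n if p(n) else least(p, n + 1)

assert summation(1, 4, lambda i: i * i) == 30      # 1 + 4 + 9 + 16
assert least(lambda n: n * n > 50) == 8            # least n with n^2 > 50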
If we start with a base domain for our
variables, it is possible to consider a hierarchy
of functionals. At level 1 we have functions whose arguments are in the base domain. At level 2 we have functionals taking functions of
level 1 as arguments. At level 3 are functionals
taking functionals of level 2 as arguments etc.
Actually functionals of several variables can be
of mixed type.
However, this hierarchy does not exhaust
the possibilities, and if we allow functions
which can take themselves as arguments we can
eliminate the use of label in naming recursive
functions. Suppose that we have a function f
defined by
f(x) = ℰ(x, f)

where ℰ(x, f) is some expression in x and the function variable f. This function can be named

label(f, λ((x), ℰ(x, f)))

However, suppose we define a function g by

g(x, φ) = ℰ(x, λ((x), φ(x, φ)))

or

g = λ((x, φ), ℰ(x, λ((x), φ(x, φ))))

We then have f(x) = g(x, g), since g(x, g) satisfies the equation

g(x, g) = ℰ(x, λ((x), g(x, g)))

Now we can write f as

f = λ((x), λ((y, φ), ℰ(y, λ((u), φ(u, φ))))(x, λ((y, φ), ℰ(y, λ((u), φ(u, φ))))))

This eliminates label at what seems to be an excessive cost. Namely, the expression gets
quite complicated and we must admit functionals
capable of taking themselves as arguments thus
escaping our orderly hierarchy of functionals.
4. Non-Computable Functions and Functionals

It might be supposed that in a mathematical theory of computation one need only consider computable functions. However, mathematical physics is carried out in terms of real valued functions which are not computable but only approximable by computable functions.
We shall consider several successive extensions of the class C{F}. First we adjoin the universal quantifier ∀ to the operations used to define new functions. Suppose e is a form in a variable x and other variables, associated with the space of truth values. Then

∀((x), e)

is a new form in the remaining variables also associated with the space of truth values. ∀((x), e) has the value T for given values of the remaining variables if for all values of x, e has the value T. ∀((x), e) has the value F if for at least one value of x, e has the value F. In the remaining case, i.e. for some values of x e has the value T and for all others e is undefined, ∀((x), e) is undefined.
If we allow the use of the universal quantifier to form new propositional forms for use in conditional forms, we get a class of functions Ha{F} which may well be called the class of functions hyper-arithmetic over F, since in the case where F = {successor, equality} on the integers, Ha{F} consists of Kleene's hyper-arithmetic functions.
Our next step is to allow the description operator ι. ι((x), p(x)) stands for the unique x such that p(x) is true. Unless there is such an x and it is unique, ι((x), p(x)) is undefined. In the case of the integers ι((x), p(x)) can be defined in terms of the universal quantifier using conditional expressions, but this does not seem to be the case in domains which are not effectively enumerable, and one may not wish to do so in domains where enumeration is unnatural.
The next step is to allow quantification over functions. This gets us to Kleene's5 analytic hierarchy and presumably allows the functions used in analysis. Two facts are worth noting. First, ∀((f), p(f)) refers to all functions on the domain and not just the computable ones. If we restrict quantification to computable functions, we get different results. Secondly, if we allow functions which can take themselves as arguments, it is difficult to assign a meaning to the quantification. In fact, we are apparently confronted with the paradoxes of naive set theory.
Ambiguous Functions
Ambiguous functions are not really functions.
For each prescription of values to the arguments
the ambiguous function has a collection of possible
values. An example of an ambiguous function is
less(n), defined for all positive integer values of n. Every non-negative integer less than n is a possible value of less(n). If we define a basic ambiguity operator amb(x,y), whose possible values are x and y when both are defined and otherwise whichever is defined, we can define less(n) by
less(n) = amb(n-1, less(n-1)).
less(n) has the property that if we define
ult(n) = (n=0 → 0, T → ult(less(n)))
then
∀((n), ult(n)=0) = T.
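The behaviour of less and ult can be sketched by representing an ambiguous function by the list of its possible values (a representation assumed for this sketch, not a construction from the paper):

;; amb(x,y): both arguments' possible values are possible values of the result.
(defun amb (xs ys) (append xs ys))

;; less(n) = amb(n-1, less(n-1)); for n = 1 only the value 0 is defined.
(defun less (n)
  (if (= n 1)
      (list 0)
      (amb (list (- n 1)) (less (- n 1)))))

;; ult(n) = (n=0 -> 0, T -> ult(less(n))): every possible value is 0.
(defun ult (n)
  (if (= n 0)
      (list 0)
      (remove-duplicates (reduce #'append (mapcar #'ult (less n))))))

;; (less 4) => (3 2 1 0),  (ult 4) => (0)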
There are a number of important kinds of mathematical arguments whose convenient formalization may involve ambiguous functions. In order to give an example, we need two definitions. If f and g are two ambiguous functions, we shall say that f is a descendant of g if for each x every possible value of f(x) is also a possible value of g(x). Secondly, we shall say that a property of ambiguous functions is hereditary if whenever it is possessed by a function g it is also possessed by all descendants of g. The property that iteration of an integer valued function eventually gives 0 is hereditary, and the function less has this property. So, therefore, do all its descendants. Thus any function, however complicated, which always reduces a number will, if iterated sufficiently, always give 0.
This example is one of our reasons for hoping
that ambiguous functions will turn out to be
useful.
With just the operation amb defined above adjoined to those used to generate C{F}, we can extend C{F} to the class C*{F}, which may be called the computably ambiguous functions. A wider class of ambiguous functions is formed using the operator Am(x, p(x)), whose values are all x's satisfying p(x).
6. Recursive Definitions of Sets
In the previous sections on the recursive definition of functions, the domains and ranges of the basic functions were prescribed and the defined functions had the same domains and ranges.
In this section we shall consider the definition of new sets and the basic functions on
them. First we shall consider some operations
whereby new sets can be defined.
1. The Cartesian product A×B of two sets A and B is the set of all ordered pairs (a,b) with a∈A and b∈B. If A and B are finite sets and n(A) and n(B) denote the numbers of members of A and B respectively, then n(A×B) = n(A)·n(B).
Associated with the pair of sets (A,B) are two canonical mappings:
π_{A,B} : A×B → A defined by π_{A,B}((a,b)) = a
ρ_{A,B} : A×B → B defined by ρ_{A,B}((a,b)) = b
The word "canonical" refers to the fact that π_{A,B} and ρ_{A,B} are defined by the sets A and B and do not depend on knowing anything about the members of A and B.
The next canonical function is a function of two variables, γ_{A,B} : A,B → A×B, defined by
γ_{A,B}(a,b) = (a,b)
For some purposes functions of two variables, x from A and y from B, can be identified with functions of one variable defined on A×B.
2. The direct union A⊕B of the sets A and B is the union of two non-intersecting subsets, one of which is in 1-1 correspondence with A and the other with B. If A and B are finite, then n(A⊕B) = n(A)+n(B) even if A and B intersect. The elements of A⊕B may be written as elements of A or B subscripted with the set from which they come, i.e. a_A or b_B.
The canonical mappings associated with the direct union A⊕B are:
i_{A,B} : A → A⊕B defined by i_{A,B}(a) = a_A
j_{A,B} : B → A⊕B defined by j_{A,B}(b) = b_B
p_{A,B} : A⊕B → Π defined by p_{A,B}(x) = T if and only if x comes from A
q_{A,B} : A⊕B → Π defined by q_{A,B}(x) = T if and only if x comes from B
There are also two canonical partial functions: r_{A,B} : A⊕B → A, which is defined only for elements coming from A and satisfies r_{A,B}(i_{A,B}(a)) = a, and similarly s_{A,B} : A⊕B → B, which satisfies s_{A,B}(j_{A,B}(b)) = b.
3. The power set A^B is the set of all mappings f : B → A. The canonical mapping α_{A,B} : A^B × B → A is defined by α_{A,B}(f,b) = f(b).
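A small sketch of these canonical mappings in present-day Common Lisp (the tags :A and :B, and the function names, are inventions of the sketch, not the paper's notation):

;; Direct union A (+) B: elements are tagged with the set they come from.
(defun inj-a (a) (cons :a a))                 ; i
(defun inj-b (b) (cons :b b))                 ; j
(defun from-a-p (x) (eq (car x) :a))          ; p
(defun from-b-p (x) (eq (car x) :b))          ; q
(defun proj-a (x)                             ; r, a partial function
  (if (from-a-p x) (cdr x) (error "undefined: element comes from B")))
(defun proj-b (x)                             ; s, a partial function
  (if (from-b-p x) (cdr x) (error "undefined: element comes from A")))

;; Cartesian product A x B: gamma builds a pair, pi and rho take it apart.
(defun make-pair (a b) (cons a b))            ; gamma
(defun first-of (u) (car u))                  ; pi
(defun second-of (u) (cdr u))                 ; rho

;; Power set A^B, with its elements represented as functions;
;; the canonical mapping applies one: (f,b) -> f(b).
(defun apply-map (f b) (funcall f b))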
Canonical Mappings
We will not regard the sets A×(B×C) and (A×B)×C as the same, but there is a canonical 1-1 mapping between them,
g_{A,B,C} : (A×B)×C → A×(B×C)
defined by
g_{A,B,C}(u) = γ_{A,B×C}(π_{A,B}(π_{A×B,C}(u)), γ_{B,C}(ρ_{A,B}(π_{A×B,C}(u)), ρ_{A×B,C}(u)))
We shall write
(A×B)×C ≅ A×(B×C)
to express the fact that these sets are canonically isomorphic.
Other canonical isomorphisms are:
1. t_{A,B} : A×B → B×A defined by t(u) = γ_{B,A}(ρ_{A,B}(u), π_{A,B}(u))
2. d_1 : A×(B⊕C) → A×B ⊕ A×C
3. a : (A⊕B)⊕C → A⊕(B⊕C)
4. d_2 : A^C × B^C → (A×B)^C
5. d_3 : A^B × A^C → A^(B⊕C)
6. s_1 : (A^B)^C → A^(B×C)

We shall denote the null set (containing no elements) by 0 and the set consisting of the integers from 1 to n by n. We have
A⊕0 ≅ A
A×1 ≅ A
A×2 ≅ A⊕A
A×n ≅ A⊕...⊕A (n terms, associate to left by convention)
A^0 ≅ 1 (by convention)
A^1 ≅ A
A^n ≅ A×...×A (n terms, associate to left by convention)
Suppose we write the recursive equation
S = {Λ} ⊕ A×S
We can interpret this as defining the set of sequences of elements of A as follows:
1. Interpret Λ as denoting the null sequence. Then the null sequence (strictly, an image of it) is an element of S.
2. Since a pair consisting of an element of A and an element of S is an element of S, a pair (a1,Λ) is an element of S. So, then, are (a1,(a2,Λ)) and (a1,(a2,(a3,Λ))) etc.
Thus S consists of all sequences of elements of A, including the null sequence.
Suppose we substitute {Λ} ⊕ A×S for S in the right side of S = {Λ} ⊕ A×S. We get
S = {Λ} ⊕ A×({Λ} ⊕ A×S)
If we again substitute for S and expand by the distributive law we get
S = {Λ} ⊕ A×{Λ} ⊕ A×A×{Λ} ⊕ ...
which, if we denote the set {Λ} by 1, becomes
S ≅ 1 ⊕ A ⊕ A^2 ⊕ A^3 ⊕ ...
which is another way of writing the set of sequences. We shall denote the set of sequences of elements of A by seq(A).
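In Lisp terms (a sketch only, with NIL standing in for the null sequence Λ and CONS for the pairing map):

;; S = {LAMBDA} (+) A x S: a sequence is either NIL or an element of A
;; paired with a shorter sequence.
(defun null-seq () nil)
(defun prefix (a s) (cons a s))

;; The sequence a1, a2, a3 of seq(A):
;; (prefix 'a1 (prefix 'a2 (prefix 'a3 (null-seq)))) => (A1 A2 A3)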
Another useful recursive construction is
S = A ⊕ S×S
Its elements have the forms a or (a1·a2) or ((a1·a2)·a3) or (a1·(a2·a3)) etc. Thus we have the set of S-expressions on the alphabet A, which we may denote by sexp(A). This set is the subject matter of Reference 7, and the following paragraph refers to this paper.

When sets are formed by this kind of
recursive definition, the canonical mappings
associated with the direct sum and Cartesian
product operations have significance. Consider,
for example, sexp(A).
We can define the basic operations of LISP, i.e. atom, eq, car, cdr and cons, by the equations
atom(x) = p_{A,S×S}(x)
eq(x,y) = (r_{A,S×S}(x) = r_{A,S×S}(y)), assuming that equality is defined on the space A
car(x) = π_{S,S}(s_{A,S×S}(x))
cdr(x) = ρ_{S,S}(s_{A,S×S}(x))
cons(x,y) = j_{A,S×S}(γ_{S,S}(x,y))
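A sketch of these equations in executable form (the tagged representation with :ATOM and :PAIR, and the starred names, are assumptions of the sketch; real LISP of course builds the distinction into the machine representation):

;; sexp(A) following S = A (+) S x S: an element is either a tagged atom
;; or a tagged pair of two S-expressions.
(defun mk-atom (a) (list :atom a))
(defun cons* (x y) (list :pair x y))          ; cons = j(gamma(x,y))

(defun atom* (x) (eq (first x) :atom))        ; atom = p
(defun eq* (x y)                              ; eq, defined only for atoms
  (eql (second x) (second y)))                ; compares r(x) with r(y)
(defun car* (x) (second x))                   ; car = pi(s(x))
(defun cdr* (x) (third x))                    ; cdr = rho(s(x))

;; (car* (cons* (mk-atom 'x) (mk-atom 'y))) => (:ATOM X)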
Definition of the set of integers. Let Λ denote the null set and {Λ} be the set whose sole element is the null set. We can define the set of integers I by
I = {Λ} ⊕ {Λ}×I
Its elements are then
Λ, (Λ,Λ), (Λ,(Λ,Λ)), (Λ,(Λ,(Λ,Λ))) etc.
which we shall denote by 0, 1, 2, 3 etc. The successor and predecessor functions are then definable in terms of the canonical operations of the defining equation. We have
succ(n) = j_{{Λ},{Λ}×I}(γ_{{Λ},I}(Λ,n))
pred(n) = ρ_{{Λ},I}(s_{{Λ},{Λ}×I}(n))
PROPERTIES OF COMPUTABLE FUNCTIONS
The first part of this paper was solely concerned with presenting descriptive formalisms. In this part we shall establish a few of the properties of the entities we previously introduced. The most important section is the second, dealing with recursion induction.
7. Formal Properties of Conditional Forms
The theory of conditional expressions corresponds to analysis by cases in mathematics and is only a mild generalization of propositional calculus.
We start by considering expressions called
generalized Boolean forms (gbf) formed as follows:
1. Variables are divided into propositional
variables p,q,r, etc. and general variables
x,y,z, etc.
2. We shall write (p → x, y) for (p → x, T → y). (p → x, y) is called an elementary conditional form (ecf), of which p, x, and y are called the premise, the conclusion and the alternative, respectively.
3. A variable is a gbf, and if it is a propositional variable it is called a pf (propositional form).
4. If π is a pf and α and β are gbfs, then (π → α, β) is a gbf. If, in addition, α and β are pfs, so is (π → α, β).
The value of a gbf α for given values (T, F or undefined) of the propositional variables will be T or F in case α is a pf, or a general variable otherwise. This value is determined for a gbf (π → α, β) according to the table

value(π)        value((π → α, β))
T               value(α)
F               value(β)
undefined       undefined
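The table can be read as a one-line program; here is a sketch (with :T, :F and :U standing for true, false and undefined, an encoding assumed for the sketch and not part of the paper):

;; Value of the elementary conditional form (p -> a, b).
(defun ecf-value (p a b)
  (case p
    (:t a)          ; premise true: the conclusion
    (:f b)          ; premise false: the alternative
    (t  :u)))       ; premise undefined: the whole form is undefined

;; (ecf-value :t 'a 'b) => A,   (ecf-value :u 'a 'b) => :U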
We shall say that two gbfs are strongly equivalent if they have the same value for all values of the propositional variables in them, including the case of undefined propositional variables. They are weakly equivalent if they have the same values for all values of the propositional variables when these are restricted to T and F.
The equivalence of gbfs can be tested by a method of truth tables identical to that of propositional calculus. The table for ((p → q,r) → a,b) and (p → (q → a,b), (r → a,b)) is

p q r   (p→q,r)   ((p→q,r)→a,b)   (q→a,b)   (r→a,b)   (p→(q→a,b),(r→a,b))
T T T      T           a             a         a              a
T T F      T           a             a         b              a
T T u      T           a             a         u              a
T F T      F           b             b         a              b
T F F      F           b             b         b              b
T F u      F           b             b         u              b
T u T      u           u             u         a              u
T u F      u           u             u         b              u
T u u      u           u             u         u              u
F T T      T           a             a         a              a
F T F      F           b             a         b              b
F T u      u           u             a         u              u
F F T      T           a             b         a              a
F F F      F           b             b         b              b
F F u      u           u             b         u              u
F u T      T           a             u         a              a
F u F      F           b             u         b              b
F u u      u           u             u         u              u
u T T      u           u             a         a              u
u T F      u           u             a         b              u
u T u      u           u             a         u              u
u F T      u           u             b         a              u
u F F      u           u             b         b              u
u F u      u           u             b         u              u
u u T      u           u             u         a              u
u u F      u           u             u         b              u
u u u      u           u             u         u              u

According to the table, ((p → q,r) → a,b) and (p → (q → a,b), (r → a,b)) are strongly equivalent.
For weak equivalence the u case can be left out of the table. Consider the table

p q   (q→a,b)   (q→c,d)   (p→(q→a,b),(q→c,d))   (p→a,c)   (p→b,d)   (q→(p→a,c),(p→b,d))
T T      a         c               a                a         b               a
T F      b         d               b                a         b               b
F T      a         c               c                c         d               c
F F      b         d               d                c         d               d

which proves that (p → (q → a,b), (q → c,d)) and (q → (p → a,c), (p → b,d)) are weakly equivalent. They are also strongly equivalent. We shall write ≡s and ≡w for the relations of strong and weak equivalence.
There are two rules whereby an equivalence can be used to generate other equivalences.
1. If α ≡ β, and α1 ≡ β1 is the result of substituting any gbf for any variable in α ≡ β, then α1 ≡ β1. This is called the rule of substitution.
2. If α ≡ β, and α is a sub-expression of γ, and δ is the result of replacing an occurrence of α in γ by an occurrence of β, then γ ≡ δ. This is called the rule of replacement.
These rules are applicable to either strong or weak equivalence, and in fact to much more general situations.
Weak equivalence corresponds more closely to equivalence of truth functions in propositional calculus than does strong equivalence.
Consider the equations
1) (p → a,a) ≡w a
2) (T → a,b) ≡s a
3) (F → a,b) ≡s b
4) (p → T,F) ≡s p
5) (p → (p → a,b), c) ≡s (p → a,c)
6) (p → a, (p → b,c)) ≡s (p → a,c)
7) ((p → q,r) → a,b) ≡s (p → (q → a,b), (r → a,b))
8) (p → (q → a,b), (q → c,d)) ≡s (q → (p → a,c), (p → b,d))
All are strong equivalences except the first, and all can be proved by truth tables.
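Such truth-table checks are easily mechanized. The following sketch (reusing ecf-value from the sketch above, and otherwise an invention of this note, not the paper's) verifies equation 7 over all 27 assignments of :T, :F and :U to p, q, r:

(defun lhs (p q r)                                 ; ((p -> q,r) -> a,b)
  (ecf-value (ecf-value p q r) 'a 'b))
(defun rhs (p q r)                                 ; (p -> (q -> a,b), (r -> a,b))
  (ecf-value p (ecf-value q 'a 'b) (ecf-value r 'a 'b)))

(defun axiom-7-holds-p ()
  (loop for p in '(:t :f :u) always
        (loop for q in '(:t :f :u) always
              (loop for r in '(:t :f :u) always
                    (eql (lhs p q r) (rhs p q r))))))
;; (axiom-7-holds-p) => T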
These eight equations can be used as axioms to transform a gbf into any weakly equivalent one using substitution and replacement. In fact, they can be used to transform any gbf into a canonical form. This canonical form is the following. Let p1, ..., pn be the variables of the gbf a, taken in an arbitrary order. Then a can be transformed into the form
(p1 → a0, a1)
where each ai has the form
ai = (p2 → ai0, ai1)
and in general, for each k = 1, ..., n-1,
a_{i1...ik} = (p_{k+1} → a_{i1...ik0}, a_{i1...ik1})
and each a_{i1...in} is a truth value or a general variable.
For example, the canonical form of
((p → q,r) → a,b)
with the variables taken in the order r, q, p is
(r → (q → (p → a,a), (p → b,a)), (q → (p → a,b), (p → b,b)))
In this canonical form, the 2^n cases of the truth or falsity of p1, ..., pn are explicitly exhibited.
An expression may be transformed into canonical form as follows:
1) Axiom 7 is used repeatedly until in every sub-expression the π in (π → α, β) consists of a single propositional variable.
2) The variable p1 is moved to the front by repeated application of axiom 8. There are three cases: (q → (p1 → a,b), (p1 → c,d)), to which axiom 8 is directly applicable; (q → a, (p1 → c,d)), where axiom 8 becomes applicable after axiom 1 is used to make it (q → (p1 → a,a), (p1 → c,d)) and then axiom 8 is applied; the case (q → (p1 → a,b), c) is handled similarly.
3) Once the main expression has the form (p1 → α, β) we move any p1's which occur in α and β to the front and eliminate them using axioms 5 and 6. We then bring p2 to the front of α and β, using axiom 1 if necessary to guarantee at least one occurrence of p2 in each of α and β. The process is continued until the canonical form is achieved.
There is also a canonical form for strong equivalence. Any gbf a is strongly equivalent to one of the form (p1 → α, β), where α and β do not contain p1 and are themselves in canonical form. However, the variable p1 may not be chosen arbitrarily, but must be an inevitable propositional variable of the original gbf, and it can be chosen to be any inevitable variable. An inevitable variable of a gbf (π → α, β) is defined to be either the first propositional variable or else an inevitable variable of both α and β.
A gbf a may be put in strong canonical form as follows:
1) Use axiom 7 to get all premises as propositional variables.
2) Choose any inevitable variable, say p1, and put a in the form (p1 → α, β) by using axiom 8.
3) The next step is to eliminate occurrences of p1 in α and β. This can be done by the general rule that in any ecf occurrences of the premise in the conclusion can be replaced by T and occurrences in the alternative by F. However, if we wish to use substitution and replacement on formulas we need the additional axioms
(9) (p → (q → a,b), c) ≡s (p → (q → (p → a,a), (p → b,b)), c)
and
(10) (p → a, (q → b,c)) ≡s (p → a, (q → (p → b,b), (p → c,c)))
Suppose there is an occurrence of p1 in the conclusion; we want to replace it by T. To do this, we use axioms 9 and 10 to move in a p1 until the objectionable p1 occurs as the inner p1 of one of the forms
(p1 → (p1 → a,b), c)
or
(p1 → a, (p1 → b,c)).
In either case, the objectionable p1 can be removed by axiom 5 or 6, and the p1's that were moved in can be moved out again. Thus we have (p1 → α, β) with p1 missing from α and β.
4) Inevitable variables are then brought to the front of α and β, and so forth.
Two gbfs are equivalent (weakly or strongly) if and only if they have the same (weak or strong) canonical form. One way this is easy to prove; if two gbfs have the same canonical form they can be transformed into each other via the canonical form. Suppose two gbfs have different weak canonical forms when the variables are taken in the same order. Then values can be chosen for the p's giving different values for the form, proving non-equivalence. In the strong case, suppose that two gbfs do not have the same inevitable propositional variables. Let p be inevitable in a but not in b. Then if the other variables are assigned suitable values, b will be defined with p undefined. However, a will be undefined since p is inevitable in a, which proves non-equivalence. Therefore, strongly equivalent gbfs have the same inevitable variables, so let one of them be put in front of both gbfs. The process is then repeated in the conclusion and alternative, etc.
The general kind of conditional expression
(p1 → e1, ..., pn → en)
can be regarded as having the form
(p1 → e1, (p2 → e2, ..., (pn → en, u) ... ))
where u is a special undefined variable, and their properties can be derived from those of ecf's.
The relation of functions to conditional expressions is given by the distributive law
f(x1, ..., x_{i-1}, (p1 → e1, ..., pn → en), x_{i+1}, ..., xk) = (p1 → f(x1, ..., x_{i-1}, e1, x_{i+1}, ..., xk), ..., pn → f(x1, ..., x_{i-1}, en, x_{i+1}, ..., xk))
The rule of replacement can be extended in the case of conditional expressions. Suppose α is an occurrence of a sub-expression of an expression β. We define a certain propositional expression π, called the premise of α in β, as follows:
1) The premise of α in α is T.
2) The premise of α in f(x1, ..., xi, ..., xn), where α is part of xi, is the premise of α in xi.
3) The premise of α in (p1 → e1, ..., pi → ei, ..., pn → en), where α occurs in ei and the premise of α in ei is π, is ~p1 ∧ ... ∧ ~p_{i-1} ∧ pi ∧ π.
4) The premise of α in (p1 → e1, ..., pi → ei, ..., pn → en), where α occurs in pi and the premise of α in pi is π, is ~p1 ∧ ... ∧ ~p_{i-1} ∧ π.
The extension of the rule of replacement is that an occurrence of α in β may be replaced by α' if (π → α) ≡s (π → α'), where π is the premise of α in β. Thus in a subcase one need only prove equivalence under the premise of the subcase.
8. Recursion Induction
Suppose a function f is defined recursively by
1) f(x1, ..., xn) = E(x1, ..., xn, f)
where E is an expression that in general contains f. Suppose that A is the set of n-tuplets (x1, ..., xn) for which f is defined. Now let g and h be two other functions with the same domain as f and which are defined for all n-tuplets in A. Suppose further that g and h satisfy the equation which defined f. We assert that
g(x1, ..., xn) = h(x1, ..., xn)
for all (x1, ..., xn) in A. This is so simply because equation 1) uniquely determines the value that any function satisfying it has for arguments in A, which in turn follows from the fact that 1) can be used to compute f(x1, ..., xn) for (x1, ..., xn) in A.
We shall call this method of proving two functions equivalent by the name of recursion induction.
We shall develop some of the properties of the elementary functions of integers in order to illustrate proof by recursion induction. We recall the definitions
m+n = (n=0 → m, T → m'+n-)
mn = (n=0 → 0, T → m+m·n-)
Th 1. m+0 = m
Proof. m+0 = (0=0 → m, T → m'+0-) = m
Only the definition of addition and the properties of conditional expressions were used in this proof.
Th 2. (m+n)' = m'+n
Proof. Define f(m,n) = (n=0 → m', T → f(m,n-)). It is easily seen that f(m,n) converges for all m and n and hence is defined by this equation.
(m+n)' = (n=0 → m, T → m'+n-)'
= (n=0 → m', T → (m'+n-)')
m'+n = (n=0 → m', T → (m')'+n-)
It is easily seen that the functions g and h defined by the equations g(m,n) = (m+n)' and h(m,n) = m'+n both satisfy the equation for f. For example, it is clear that g(m',n-) = (m'+n-)' and h(m',n-) = (m')'+n-. Therefore, by the principle of recursion induction, h and g are equivalent functions on the domain where f is defined, but this is the set of all pairs of integers.
The fact that f(m,n) converges for all m and n is a case of the fact that all functions defined by equations of the form
f(n, x1, ..., xk) = (n=0 → g(x1, ..., xk), T → h(f(n-, r11(x1, ..., xk), ..., r1k(x1, ..., xk)), ..., f(n-, rm1(x1, ..., xk), ..., rmk(x1, ..., xk)), n, x1, ..., xk))
converge. We shall postpone discussing formal proofs of convergence.
In presenting further proofs we shall be more terse.
Th 3. (m+n)+p = (m+p)+n
Proof. Let f(m,n,p) = (p=0 → m+n, T → f(m',n,p-)). Again f converges for all m, n, p. We have
(m+n)+p = (p=0 → m+n, T → (m+n)'+p-)
= (p=0 → m+n, T → (m'+n)+p-)   using Th 2
(m+p)+n = ((p=0 → m, T → m'+p-))+n
= (p=0 → m+n, T → (m'+p-)+n)
Each of these forms satisfies the equation for f(m,n,p).
Setting m=0 in Th 3 gives
(0+n)+p = (0+p)+n
so that if we had 0+m = m we would have commutativity of addition.
In fact, we cannot prove 0+m = m without making some assumptions that take into account that we are dealing with the integers. For suppose our space consisted of the vertices of the binary tree, where m' is the vertex just above and to the left, m- is the vertex just below, and 0 is the bottom of the tree. m+n can be defined as above and of course satisfies Theorems 1, 2 and 3, but does not satisfy 0+m = m.
We shall make the following assumptions:
1. m' ≠ 0
2. (m')- = m
3. (m≠0) ⊃ ((m-)' = m)
which embody all of Peano's axioms except the induction axiom.
Th 4. 0+n = n
Proof. Let f(n) = (n=0 → 0, T → f(n-)')
0+n = (n=0 → 0, T → 0'+n-)
= (n=0 → 0, T → (0+n-)')
n = (n=0 → n, T → n)
= (n=0 → 0, T → (n-)')   axiom 3
Th 5. m+n = n+m
Proof. By Theorems 3 and 4, as remarked above.
Th 6. (m+n)+p = m+(n+p)
Proof. (m+n)+p = (m+p)+n   Th 3
= (p+m)+n   Th 5
= (p+n)+m   Th 3
= m+(n+p)   Th 5 twice
Th 7. m·0 = 0
Proof. m·0 = (0=0 → 0, T → m+m·0-) = 0
Th 8. 0·n = 0
Proof. Let f(n) = (n=0 → 0, T → f(n-))
0·n = (n=0 → 0, T → 0+0·n-)
0 = (n=0 → 0, T → 0)
Th 9. mn' = m+mn
Proof. mn' = (n'=0 → 0, T → m+m·(n')-)
= m+mn   axioms 1 and 2
Th 10. m(n+p) = mn+mp
Proof. Let f(m,n,p) = (p=0 → mn, T → f(m,n',p-))
m(n+p) = m·(p=0 → n, T → n'+p-)
= (p=0 → mn, T → m(n'+p-))
mn+mp = mn+(p=0 → 0, T → m+mp-)
= (p=0 → mn+0, T → mn+(m+mp-))
= (p=0 → mn, T → (mn+m)+mp-)
= (p=0 → mn, T → mn'+mp-)
Now we shall give some examples of the application of recursion induction to proving theorems about functions of symbolic expressions. The rest of these proofs depend on an acquaintance with the LISP formalism. We start with the basic identities.
car[cons[x; y]] = x
cdr[cons[x; y]] = y
~atom[x] ⊃ [cons[car[x]; cdr[x]] = x]
~atom[cons[x; y]]
null[x] = eq[x; NIL]
Let us define the concatenation x*y of two lists x and y by the formula
x*y = [null[x] → y; T → cons[car[x]; cdr[x]*y]]
We shall prove that
[x*y]*z = x*[y*z]
Now let
f[x; y; z] = [null[x*y] → z; T → cons[car[x*y]; cdr[x*y]*z]]
= [null[x] → [null[y] → z; T → cons[car[x*y]; cdr[x*y]*z]]; T → cons[car[x*y]; cdr[x*y]*z]]
= [null[x] → [null[y] → z; T → cons[car[y]; cdr[y]*z]]; T → cons[car[x]; [cdr[x]*y]*z]]
= [null[x] → y*z; T → cons[car[x]; [cdr[x]*y]*z]]
= [null[x] → y*z; T → cons[car[x]; f[cdr[x]; y; z]]]
From the above steps we see that [x*y]*z satisfies the equation for f. On the other hand,
x*[y*z] = [null[x] → y*z; T → cons[car[x]; cdr[x]*[y*z]]]
satisfies the equation directly.
NIL*x = x
x*NIL = x
Proof. NIL*x = [null[NIL] → x; T → cons[car[NIL]; cdr[NIL]*x]]
= x
x*NIL = [null[x] → NIL; T → cons[car[x]; cdr[x]*NIL]]
Let f[x] = [null[x] → NIL; T → cons[car[x]; f[cdr[x]]]]
x*NIL satisfies this equation. We can also write for any list x
x = [null[x] → x; T → x]
= [null[x] → NIL; T → cons[car[x]; cdr[x]]]
which also satisfies the equation.
Next we consider the function reverse[x] defined by
reverse[x] = [null[x] → NIL; T → reverse[cdr[x]]*cons[car[x]; NIL]]
It is not difficult to prove by recursion induction that
reverse[x*y] = reverse[y]*reverse[x]
and
reverse[reverse[x]] = x.
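For readers who wish to experiment, the two list functions of this section can be transcribed into present-day Common Lisp (conc and rev are this sketch's names for * and reverse, not the paper's), and the identities proved above by recursion induction can then at least be spot-checked:

;; x*y = [null[x] -> y; T -> cons[car[x]; cdr[x]*y]]
(defun conc (x y)
  (if (null x) y (cons (car x) (conc (cdr x) y))))

;; reverse[x] = [null[x] -> NIL; T -> reverse[cdr[x]]*cons[car[x]; NIL]]
(defun rev (x)
  (if (null x) nil (conc (rev (cdr x)) (cons (car x) nil))))

;; (equal (conc (conc '(1 2) '(3)) '(4)) (conc '(1 2) (conc '(3) '(4)))) => T
;; (equal (rev (conc '(1 2) '(3 4))) (conc (rev '(3 4)) (rev '(1 2))))   => T
;; (equal (rev (rev '(1 2 3))) '(1 2 3))                                 => T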
The converse statement, that all functions in C{succ, eq, ...} are partial recursive, is presumably also true but not quite so easy to prove.
It is our opinion that the recursive function
formalism based on conditional expressions
presented in this paper is better than the formalisms which have heretofore been used in
recursive function theory both for practical and
theoretical purposes. First of all, particular
functions in which one may be interested are more
easily written down and the resulting expressions are briefer and more understandable. This has been observed in the cases we have looked at, and there seems to be a fundamental reason why this is so. This is that both the original Church-Kleene formalism and the formalism using the minimalization operation use integer calculations to control
the flow of the calculations. That this can be
done is noteworthy, but controlling the flow in
this way is less natural than using conditional
expressions which control the flow directly.
A similar objection applies to basing the
theory of computation on Turing machines. Turing
machines are not conceptually different from the
automatic computers in general use, but they are
very poor in their control structure. Any programmer who has also had to write down Turing
machines to compute functions will observe that
one has to invent a few artifices and that constructing Turing machines is like programming. Of
course, most of the theory of computability deals
with questions which are not concerned with the


particular ways computations are represented. It
is sufficient that computable functions be
represented somehow by symbolic expressions, e.g.
numbers, and that functions computable in terms
of given functions be somehow represented by expressions computable in terms of the expressions
representing the original functions. However, a
practical theory of computation must be applicable to particular algorithms. The same objection applies to basing a theory of computation on
Markov's 9 normal algorithms as applies to basing
it on properties of the integers; namely flow of
control is described awkwardly.
The first attempt to give a formalism for
describing computations that allows computations
with entities from arbitrary spaces was made by
A. P. Ershov.4 However, computations with the
symbolic expressions representing program steps
are also necessarily involved.
We now discuss the relation between our formalism and computer programming languages. The
formalism has been used as the basis for the LISP
programming system for computing with symbolic
expressions and has turned out to be quite practical for this kind of calculation. A particular
advantage has been that it is easy to write
recursive functions that transform programs, which makes generators easy to write.
The relation between recursive functions and
the description of flow control by flow charts is
described in Reference 7. An ALGOL program can
be described by a recursive function provided we
lump all the variables into a single state vector
having all the variables as components. If the
number of components is large and most of the
operations performed involve only a few of them,
it is necessary to have separate names for the
components. This means that a programming
language should include both recursive function
definitions and ALGOL-like statements. However,
a theory of computation certainly must have
techniques for proving algorithms equivalent and
so far it has seemed easier to develop proof
techniques like recursion induction for recursive
functions than for ALGOL-like programs.
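As a sketch of the state-vector idea (the particular loop and the representation of the state as a list are assumptions of this note, not an example from the paper), an ALGOL-like summation loop becomes a recursive function of a single state vector:

;;  s := 0; i := 1;
;;  while not (i greater than n) do begin s := s + i; i := i + 1 end
(defun loop-step (state)
  (let ((i (first state)) (s (second state)) (n (third state)))
    (if (> i n)
        s                                          ; loop exit: the answer is s
        (loop-step (list (+ i 1) (+ s i) n)))))    ; one pass through the body

(defun sum-to (n) (loop-step (list 1 0 n)))
;; (sum-to 5) => 15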
References
1. Church, A., The Calculi of Lambda-Conversion, Annals of Mathematics Studies, no. 6, Princeton, 1941, Princeton University Press.
2. Church, A., Introduction to Mathematical Logic, Princeton, 1952, Princeton University Press.
3. Davis, M., Computability and Unsolvability, New York, 1958, McGraw-Hill.
4. Ershov, A.P., On Operator Algorithms (Russian), Doklady Akademii Nauk, vol. 122, no. 6, pp. 967-970.
5. Kleene, S.C., Recursive Predicates and Quantifiers, Transactions of the American Mathematical Society, vol. 53, 1953, p. 41.
6. McCarthy, J., letter to the editor, Communications of the Association for Computing Machinery, vol. 2, August, 1959, p. 2.
7. McCarthy, J., Recursive Functions of Symbolic Expressions and Their Computation by Machine, Part I, Communications of the ACM, vol. 3, April, 1960, pp. 184-195.
8. McCarthy, J., The LISP Programmer's Manual, M.I.T. Computation Center, 1960.
9. Markov, A.A., Theory of Algorithms (Russian), Moscow, 1954, USSR Academy of Sciences, Steklov Mathematical Institute.
10. Naur, P., et al., Report on the Algorithmic Language ALGOL 60, Communications of the ACM, vol. 3, May 1960.
11. Turing, A.M., On Computable Numbers, with an Application to the Entscheidungsproblem, Proceedings of the London Mathematical Society, ser. 2, vol. 43, 1937, p. 230; correction, ibid., vol. 43, 1937, p. 544.


INFORMATION RETRIEVAL: STATE OF THE ART

Don R. Swanson
SUMMARY

Certain aspects of science communication are
of especial importance to information retrieval.
The exponential growth of science raises many
questions related to increasing specialization.
"Quality identification" is suggested as a critical issue. All approaches to information retrieval share a common set of basic problem areas
and solutions. Semantics and redundancy are key
conceptual issues and give rise to difficulties
more likely to be overcome by meticulous thesaurus
compilation than by any sudden insight or "breakthrough. " The effectiveness of present retrieval
capabilities is largely unknown, though certain
recent studies are illuminating. Presentation
and display to the user are suggested as important
approaches to problems of information digestibility.

******

Information retrieval embraces only one
aspect of a broad class of science communication
problems. A proper perspective for assessing the
state of the information retrieval art can best
be achieved through considering first the broader
problem context. Certain elements of that context will be discussed before the subject of
information retrieval itself.
Important Aspects of Science Communication
Who Pushed the Panic Button?
The obvious fact that the world's store of
scientific information is increasing at an exponential rate has apparently enjoyed independent
discovery by scores of science information writers
during the past few years. On a suitably scaled
graph, any rising exponential curve tends to produce an hypnotic effect of impending crisis. Now
an atmosphere of alarm, since it is conducive to
action, is no doubt beneficial, but only provided
such action is properly directed. Increasing
volume of information is not in itself the real
problem. It can be inferred in fact that in the
context of increasing world scientific population,
the information increase rate is not at all out
of balance. The number of published papers, the
number of journals, the number of scientists, and
expenditures for research, all have been increasing at an annual rate of 5-7% for the past 250
years.
Since the earth's population increases
at only 1.6% annually, indiscriminate extrapolation of the exponential growth of science
clearly is absurd. Limiting factors will of course set in. Prof. Bar-Hillel2 has challenged
the view that anything at all is worsening (so far
as basic long term trends are concerned), and
rightly insists that the prophets of calamity must
produce more convincing evidence than that which
has yet come to light. Since on a scientist per

capita basis the quantity of scientific information remains about constant, the case for pushing a panic button is at best obscure. The real
issue must be identified as a changing partition
of an increasing supply of human knowledge among
the increasing number of scientists who generate
and assimilate that knowledge. More simply, a
scientist cannot now realistically expect to keep
on top of as large a portion of any particular
field or discipline (such as "physics" or "logic")
as he could some years ago. The problem must be
considered in terms of "backlog" as well as
"current production."
Many profound questions, presently unanswerable, should be diligently explored, not in an
atmosphere of crisis but in recognition of their
long term fundamental importance. Should scientific journals be organized to meet increasingly
more specialized needs? Does the manner of
journal organization itself influence the course
of scientific research? Can excessive specialization inhibit scientific creativity? Should
steps be taken to "purge" the world's scientific
literature, or at least to separate the important
from the unimportant? Are more surveys, review
papers, and monographs needed? How should education systems and information retrieval systems be
kept flexibly responsive to the increasing segmentation of scientific knowledge?
These and other issues must be identified so
that guidelines to future research may be
developed but further pursuit of the matter is
inappropriate here. So far as information retrieval itself is concerned, it will be assumed
from this point on that things are bad enough
and ought to be improved, regardless of the rate
at which they may or may not be getting worse.
The conceptual, rather than equipment, aspects
of the subject will be emphasized.
Mooers' Law
Mooers 3 has postulated that "an information
system will tend not to be used whenever it is
more painful and troublesome for a customer to
have information than for him not to have it."
By way of explanation, he observes that the
penalties for not being diligent in library
research are minor if they exist at all. Clearly
a greater amount of recognizable and rewarding
(even though duplicative) work can be accomplished if time is not spent in the library.
Effective and efficient information retrieval
systems have been installed and then removed for
lack of use; the customer's ability to retrieve
information outstripped his motivation. It is
suggested in effect that the diligent finding and

use of information should be rewarded and failure
to do so punished; in these circumstances we can
expect better retrieval systems.
Though Mooers may have overstated the case,
his observations are provocative. If this state
of affairs indeed exists, and certainly it must
to some extent, then the question of remedial
action is difficult and important. In addition
to the concepts of "reward" and "punishment",
two additional directions for improving "motivation" seem to be promising.
First, more attention ought to be paid to
active dissemination of scientific information
rather than passive interment in libraries wherein
the resurrection of information requires initiative and ingenuity on the part of the customer.
Perhaps the scientific community is in need of
something akin to "screening panels" whose job
is to direct newly acquired information to the
appropriate potentially interested users.
Secondly it is suggested that beyond, after,
and independently of "information retrieval"
per se, the proper presentation and display of
information to the customer may play an important
role in the palatability and digestibility of the
retrieved information. The extension of present
information retrieval boundaries in this direction
will be discussed later in the framework of research trends and goals.
Birth Control of Technical Reports
Scientists are painfully aware that much, if
not most, scientific literature is either so
repetitive,verbose, or of such low quality, that
it ought not to have been written in the first
place. In the opening address at the 1958
International Conference on Scientific Information,
Sir Lindor Brown made a sparkling appeal for the
exercise of restraint on the part of both authors
and publishers.
In professional journals, quality of information is at least partly controlled
through editorial sanction, but the now comparable
volume of unpublished technical reports is totally immune to such controlling influence. Dwight
Gray5 pinpoints the lack of bibliographic control
as an especially unfortunate aspect of technical
reports.
It is doubtful that any system of "information birth control" can be made practical, particularly one that appeals to voluntary self restraint; involuntary control, however, may be a cure worse than the disease. "Quality Control" applied more liberally may be a part of the answer, but any connotation of "control" raises the question of information suppression without competent and diligent exercise of the judgmental function. "Quality Identification" is suggested here as a
more appealing approach. Clearly in a situation
in which the volume of information exceeds the
individual scientist's digestive capacity, some
method of assisting him to discriminate between
the important and the unimportant literature can
be counted among the most crucial of requirements.

The referee system for professional journals
imposes a modest degree of quality control on
scientific information. It is at least thinkable
that recognized leaders in the various technical
fields could be imposed upon to implement a
"quality identification" system on a sizable
scale. However, a more practical technique
immediately at hand may lie in the idea of
"citation indexing." It is not unreasonable to
measure the importance of a published article or
technical report by the frequency of subsequent
citations to that article or report (possibly
taking into account who did the citing and why.)
This suggestion was ably described by Garfield in an article published six years ago. Experimental studies of citation patterns in physics literature have been carried out by Kessler.7
The idea of automatic abstracting has enjoyed considerable publicity and popularity during
recent years as an approach to information compression. Techniques for so doing have somehow
managed to come into being without the benefit of
any adequate description of requirements. The
situation can be rather easily understood by
first considering one plausible elementary rule
for "automatic abstracting" (or rather "automatic extracting"): "Select title and first
sentence of each paragraph." This rule is
obviously mechanizable. Furthermore, at the time
of this writing, no other rules have been demonstrated on the basis of which a convincingly
better abstract can be produced. The problem lies
in the fact that adequate measures for the
"quali ty" of an abstract (human or machine) have
never been developed. Ideally, of course, one
would like to cut the length of an article drastically without appreciable loss of information,
but unfortunately it cannot yet be demonstrated
that the percentage of information loss is
significantly smaller than the percentage of text
reduction. The question of a need for text
reduction even at the expense of information loss
is nonetheless open.
An interesting and constructive approach to
"volume reductions is contained in a recent Case
Insti tute report, a part of which concerns the
effect of condensation on reader comprehension of
journal articles. If indeed one can somehow
single out "measurable" factors in the area of
reader comprehension, then certainly experimental
observations of this kind are crucial to questions
of redundancy elimination and abstracting.

"Communication Habits of Scientists"
The scientific community itself has been the
object of attention of a number of "opinion
surveys" and scientific communication llbehavior
studies." These studies represent a commendable
attempt to describe characteristics and attributes
of our present system of scientific communication.
A comprehensive review of several dozen such
studies has been reported by Menzel.9 The stated
purpose of this review is to display the variety
of research that has been done on the flow of
information among scientists; it does not attempt,


however, to evaluate or recapitulate that which
is worthwhile and deserving of future emphasis.
The total amount of information of this kind
presently available tends to cause mild indigestion when one attempts to assess the relevance of
the whole matter to the implementation of practical improvements in scientific communication. In
their present state, however, these "behavior"
studies are intended only to be descriptive rather
than diagnostic. It is premature to attempt to
translate a description of present behavior into
future requirements; to discern any relationship
between the two requires considerable further
study.
Certain fragmentary results of a rather
general nature are worth noting, however, since
they have an interesting bearing upon some of
the questions raised here. Let us consider a
few excerpts from the Menzel review.
"The amount of time chemists devote to
reading on the job was found directly related to
the ease of their access to scientific literature.
Reading time was directly related to the availability of journals at chemists' desks, to the
location of library facilities in the chemists'
building, and to the existence of company library
facilities."

of current literature for news rather than
reference reflects a requirement or whether it
simply reflects the fact that current literature
information systems are inadequate for reference
purposes, and hence can be used only for news.
This identification of problem areas at
least suggests action in two directions. First
since the stimulant effect of scientific publication is known to be important, steps to enhance
such stimulant capability can be taken. If this
idea were coupled with "quality identification"
we might reasonably equate "quality" with
"stimulation ability." Once stimulating material
has been identified, provision could then be
made for wider circulation within the scientific
community. The conclusion that current literature
is not really used very much for reference, if
correct, is dramatic in its implication of a
requirement for a profound look at and perhaps
overhaul of our literature reference systems.
The spectrum of behavior studies reviewed
by Menzel does not apparently include any reports of active controlled experiments which
exert a perturbing influence on the scientific
community. Experiments with a fully controlled model system have been suggested by Kessler,7,10
and seem worthy of serious consideration.
The Conceptual Nature of Information Retrieval

"'one of the greatest stimulants to the use
of information is familiarity with its source.'"
"Sometimes pieces of work which have been
ignored by the scientific community prove to be
highly significant when someone finally stumbles
upon them in the back volumes.... It is suggested
tentatively that it is often necessary to publicize information repeatedly, lest it fail to enter
the stream of communications which will lead to
its ultimate user. From the point of view of the
consumer of the information, it seems sometimes
necessary to be exposed to the information
repeatedly before it will make an impact."
"On the basis of the inforr.lation on chemists'
reading time and on the number of articles abstracted in Chemical Abstracts in a given year,
it was concluded that only one half of one percent of the articles published in chemistry are
read by anyone chemist."
"a large portion of articles read and considered useful have been met with by chance;"
" 'there is thus a good deal of circumstantial evidence for the hypothesis that the
literature is used very much more for news than
for reference. ' "
The foregoing conclusions are provocative
and clearly have some bearing on the issue of
"literature search" versus "specific-information
retrieval," as well as on the matter of Mooers'
Law. It should be realized that the results as
they stand shed no light on cause-effect
relationships. It is not clear whether the use

All approaches to information retrieval are
reducible to a common set of elements, both from
the point of view of problems and solutions.
The objective of any information retrieval system
is to permit the originator of information to
communicate with an unknown user at some unknown
future time. We may presume that the user or
customer is faced with a large volume of information, i.e., a library which by many orders of
magnitude is too large to permit an exhaustive
direct examination search to meet whatever
requirements for information he might have. Thus
communication from originator to "user" must take
place via some abbreviated "representation" of the
contents of the library. This representation may
take many forms, for example: classification
systems, alphabetic subject indexes, coordinate
systems, facet classification systems, full text,
and others yet to be invented. They all incorporate certain common problems, and these problems have a common origin.
When any document is indexed, cataloged,
classified, or otherwise tagged, this process
constitutes an attempt to represent "information
content" of the article or document in a hignly
abbreviated forin. This "representation" is
intended to serve the purpose of information
retrieval but contains only a minute fraction of
the information contained in the document. Now
it is not usually supposed that the "representation" should reflect the total "information
content", yet at the same time there is no solid
theoretical reason that the purpose of information retrieval can be fully served in any other
way. Fortunately, the whole matter need not rest

on theoretical proof since the objective at hand
is of a purely practical nature.
As a practical matter, the encoding of
"information content" or IImeaning" of a document
in any highly abbreviated representation is an
intuitive process not amenable to formal description. Experiments have indicated that the
reproducibility of the intuitive judgment of
indexers or catalogers is relatively low. A
subject heading, index term, or descriptor, has
a IImeaning" which is a function of the vie'WpOint
of the observer or equivalently a function of the
context in which that heading, term, or descriptor is utilized.
The effectiveness of an information retrieval
system depends on the success with which the indexer and user can pursue the following strategy.
In creating an "abbreviated representation" of an article being indexed, an attempt is made by the indexer to foresee essentially all ways in which some unknown user might wish to recover the information. A rather high degree of redundancy is a result of a deliberate attempt on the part of the indexer to predict the customer's viewpoint. Similarly the search is conducted on a redundant basis, since in effect the user hunts in all likely (and in some not-so-likely) places. Both indexer and user must therefore play a "guessing game" to ascertain the viewpoint of the other. This game results in redundancy, but not so much so that large quantities of retrieved irrelevant information thwart the purpose
of the search. The foregoing description applies
to a successful information retrieval system.
The fact is, it is possible to find many systems
which seem to have no elements of such guessing
or redundancy and which, at the same time, are
not very successful.
Consider for the moment a retrieval system
in which the library "representation" is based solely on key words or descriptors. For this situation, the "guessing game", played by both
the indexer and searcher, can be aided by means
of a thesaurus. For our purpose here, we define
a thesaurus to be a collection of groups of words
wherein the various members of a group tend to
mean about the same thing for purposes of information retrieval. These word groups thus serve
as a reminder list to assist the indexer and
searcher in conjecturing how the same idea might
be expressed in many ways. (Experimental use of
a thesaurus is described for full text search in
references 11, 12.) In one form or another,
something akin to a thesaurus must be implicitly
present in any good retrieval system. The "see
also" portion of an alphabetic subject index or
of a classification system can be looked upon
as serving the purpose of a thesaurus.
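A toy sketch of the idea (the word groups and function names are invented for illustration and are not drawn from any system described here): the thesaurus is a list of groups, and a query term is expanded to its whole group before being matched against a document's index terms.

(defparameter *thesaurus*
  '((radiation irradiation rays)
    (mutation variation change)))

(defun expand-term (term)
  ;; the group containing TERM, or TERM alone if it is in no group
  (or (find term *thesaurus* :test #'member) (list term)))

(defun term-matches-p (query-term index-terms)
  (intersection (expand-term query-term) index-terms))

;; (term-matches-p 'rays '(radiation dosage)) => (RADIATION)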
The total amount of desirable redundancy in
any retrieval system, and the detailed manner in
which redundancy should be distributed between
indexing and searching, depends upon economics
and on the question of irrelevant retrieval.
For a system in which the encoding process is highly efficient, redundancy may be cheaper to come
by as a part of the input rather than as part of
the output. In other systems, however, such as
those in which the search workload is light, it
may be that redundancy can be purchased more
cheaply in the searching process.

An important problem area, more or less
distinct from questions of meaning, has not been
covered in the foregoing discussion. This area
concerns syntactic relationships among the various
index terms that might be assigned to a given
document. The effect of failure to utilize such
syntactic relationships is to increase the retrieval of irrelevant material but with no loss
of that which is relevant. To illustrate, if the
phrase "the effect of radiation on mutation" is
an appropriate description of the content of an
article, then the independent assignment of descriptors corresponding to "radiation" and "mutation" would not by itself preserve the syntactic relationships between these two terms implied by the original phrase. Irrelevant retrieval of documents in which the descriptors "mutation" and "radiation" were simply included
among a number of other descriptors in such a
way that those two were not related to one
another, could clearly result. The degree to
which this whole question is of importance in
terms of the amount of irrelevant retrieval that
might be caused by ignoring syntax has not been
experimentally established. In principle though
the problem is of considerable importance and a
number of approaches to its solution should be
mentioned.
First of all, a considerable amount of work
has been in progress for some years at the University of Pennsylvania under Z. Harris.13
The objectives of this effort are broader and
more fundamental than the question of preserving
syntactic relationships among index terms, but
the subject is nonetheless of considerable relevance. Eugene Wall14 ,15 has presented a lucid
categorization of information retrieval problems
in terms of vie'WpOint, generiCS, semantiCS, and
syntax. He describes the use of "role indicators'~
as a solution to problems of syntax, and claims
that effective and successful technical information systems which employ only twelve such
indicators have been in operation.
So far as full text searching is concerned,
an especially simple substitute for syntax,
namely, the "proximi t~" of terms has been suggested by the author l • This approach depends
on the fact that two terms which appear in the
text of an article in reasonable proximity to
one another (i.e., within a few sentences) have
a high probability of being syntactically
related.
"Effectiveness ll and "economy II can be identified as the two fundamental objectives of information retrieval research. "Effectiveness II has
to do with how well a system works, in terms of
both the percentage of relevant material retrieved and the accompanying amount of irrelevant

material. "Economy" of course refers to the cost
of operating the system, including indexing,
storage, file maintenance, and searching. In
particular, the "cost" to the user of reading
irrelevant material must be included. (Otherwise
the distinction is subject to total confusion by
the existence of a hypothetical library system of
100% effectiveness and essentially zero cost.
This library is an unorganized, unindexed, unmanned warehouse which the customer reads through
from beginning to end for each information request.)
Any research project on information retrieval
should be assessed in the light of its objectives,
and the division of objectives into the two categories of effectiveness and economy usually provides a perspective not otherwise apparent.

for its final justification on overall economy of
the resulting system. Up to the present time
large scale systems based on the merger philosophy have not been conspicuously noted for
economy and simplicity (to risk a serious understatement). (That fact was of course known from
the design stage on--what's worse though, at
least some of these systems after installation
have been victims of a rather severe onset of
Mooers' Law). The question of mechanization in
libraries is full of economic pitfalls, but not
to the extent of preventing soundly designed systems from being implemented with present hardware. Since this paper is intended to focus on conceptual problems, the matter of equipment will
not be further pursued.

The issue of mechanization is, of course,
tied to economy. It is obvious, though sometimes
overlooked, that computing machines cannot in
principle improve the "effectiveness" of anything
at all over what can be done with people, unless
response times or environmental factors are of
importance. The basic question is one of economy
alone. Speed itself may be of some relevance but,
in principle at least, one can duplicate the speed
of any automatic process with enough people working in parallel. Admittedly this argument is
oversimplified (though the remark on response
times and environment is a strong hedge), but
justified as a counteraction to the unfortunately
prevalent belief that mechanization of a process
that doesn't work to begin with will improve
matters.

A considerable amount of reported work in
the information retrieval field is indirectly
addressed to the question of economy rather than
effectiveness though not in an obvious sense and
not necessarily in connection with mechanization.
A number of ingenious logical-mathematical models
of information retrieval systems have been devised.
Many of these papers have interesting implications
on questions of file organization and search strategies (e.g. Estrin18, Moore19) and possibly
on the design of future machines; their promise
for leading to new insights with respect to the
more basic semantic problems that lie at the core
of "retrieval effectiveness" is less clear.

Apart from its intimate relationship to
economy, one other especially important factor
in mechanized information retrieval should be
recognized. The machine handling of the physical
contents of a library involves problems totally
different from the machine handling of a representation which permits that library to be
searched. Present general purpose computing
equipment, provided it is used with some ingenuity, is not badly suited to machine search of
representations, i.e., index and catalog type
information. So far as handling the total information content of a library is concerned, the
required storage jumps several orders of magnitude, and existing computers are largely inappropriate. The output of computers which handle and
search only an index-representation is necessarily
just a bibliographic listing of responsive documents (accompanied possibly by titles or brief
abstracts). Retrieval of the document itself
(from shelves, filing cabinets, or microfilm
storage) must then follow as a second stage process. Special purpose retrieval machinery can
be relatively efficient for this second stage,
though generally speaking well designed manual
filing systems offer serious economic competition.
Some equipment designs have been based on a
merged record (usually recorded on i'i1m) of the
machine-readable coded representation of a document with a non-machine-readable full text document image. This merger imposes awkward machin~
design problems from the beginning and must depend

It is at least plausible, however, that some
of the mathematical models might eventually lead
to fruitful results in a more fundamental sense.
Mooers 20 has developed a model in which certain
mathematical properties of different types of
retrieval systems can be compared. It would seem
useful to extend a model of this kind to include
formal representation of redundancy, and then to
investigate the relationship between retrieval
effectiveness and redundancy. In the terminology
of Mooers' model, the transformation relationships between the space of all retrieval prescriptions and the space of all document subsets
should be formulatable in such a way that the
effect of redundancy is clearly brought out. It
would be overly bold to predict at this point
whether such an approach would yield any new
fundamental insights, better file organization
and storage strategies, both, or neither.
A key element in understanding the "nature
of information retrieval" must certainly be the
types of questions posed by users of information
systems. It is in fact far more reasonable to
design library representations on the basis of
the way in which the users tend to organize the
subject matter rather than the way in which
indexers imagine that it ought to be organized,
yet it seems that this procedure is seldom
followed. This approach is embodied in a study carried out by Herner21 on the information system
of the U.S. Atomic Energy Commission. (Several
interesting results with immediate practical
implications, so far as machine design is concerned, were obtained. Ninety percent of the
questions involved three or fewer distinct

concepts and of all of the multiple concept
questions, ninety-eight percent involved logical
products rather than logical sums or differences.)
In connection with the Herner study, it would be
interesting to know the extent to which the user's
questions were predicated on some prior notion of
how the AEC library was organized; that is to say,
did their questions really reflect what they
wanted or were they conditioned by what they
thought the library could provide?
How Effective are Present Information Retrieval
Systems?
It is a curious fact that the above question
is essentially unanswerable in terms of any
objective "measurables." For any given information
retrieval system, those concerned with it are
generally not at a loss for an opinion as to how
well it works, but such opinions are seldom
accompanied by evidence. Some recent experimental research has been seriously addressed to
this point, and insight is being acquired. Large
scale experiments which attempt to measure retrieval effectiveness and at the same time compare
various factors crucial to such effectiveness, are
presently under way (under the direction of
Cleverdon) at the College of Aeronautics, Cranfield, England. Preliminary results have been
reported.
Within the framework of a small
scale experimental system, considerable attention
is given to the question of defining measures for
retrieval effectiveness in the author's investigation of text searching.11 The results of several
retrieval methods are compared with "direct examination" of the whole document collection.
The emerging results reported in these two
recent studies seem to be taking on a rather
interesting pattern. The ASLIB Cranfield project
had for its objective the measurement of three
variables: the indexing system (four specific
systems were compared), the time spent indexing,
and the experience or qualifications of the indexer. Preliminary results presented by Cleverdon
in a recent seminar17 indicate that it doesn't
much matter which indexing system is used, how
much time is taken in the indexing process, or
whether the indexer did or didn't have a lot of
experience. Nothing seemed to depend critically
on the searcher either. Similar "invariances"
were encountered in the reported text searching
experiments.11
On the whole retrieval effectiveness in these
experimental systems was relatively low (Cleverdon
reports recoveries in the neighborhood of eighty
percent but it should be noted that these figures
refer to "source" documents--i.e., those which
directly inspire the retrieval question. Retrieval
percentages for non-source documents have not yet
been reported for that project. The same distinction between source and non-source documents
was used by the author, 11 and it was found that
recovery of source documents was about twice as
effective as that for non-source documents.) In
general these experiments seem to indicate
mediocrity of retrieval effectiveness and

insensitivity to parameters that might reasonably
be supposed important. Though further analysis
is necessary before firm conclusions can be drawn,
certain hypotheses to explain the observed mediocrity and insensitivity can be suggested. If we
suppose that the thesaurus used by the author and
a "see also" structure of the indexing systems
tested by Cleverdon were about equally primitive
(insofar as covering any substantial range of
language redundancy is concerned) then one would
have expected the results to come out about as
they did. It has already been found that a substantial portion of the text searching ineffectiveness could reaso~~bly be,attributed to a
deficient thesaurus.
An interesting sidelight brought out by
Cleverdon in the NSF Seminar is that there presently exists no book or well organized doctrine
on how to compile subject heading lists nor does
there exist a systematic body of opinion on the
application of "see also" references. Cleverdon
observes that the latter particularly tend to
be applied in a haphazard manner. With inadequate
thesaurus groups, cross references, and haphazard
"see also" application, it should come as no surprise that the indexer and user find difficulty
in communicating via an information retrieval
system.
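To make the role of such thesaurus groups concrete, a minimal sketch follows (the groups, terms, and document below are invented and do not describe the author's actual thesaurus): each request term is expanded into its whole group before matching, so that a document is not missed merely because indexer and requester chose different words for the same notion.

# Illustrative sketch of thesaurus expansion of request terms before matching;
# the thesaurus groups and the sample document are invented.
thesaurus = {
    "aircraft": {"aircraft", "airplane", "aeroplane", "airframe"},
    "fatigue":  {"fatigue", "crack", "fracture", "failure"},
}

def expand(term):
    """Return the thesaurus group containing the term, or the term by itself."""
    for group in thesaurus.values():
        if term in group:
            return group
    return {term}

def matches(request_terms, document_words):
    """A document matches when every request term, after expansion, is covered."""
    words = set(document_words)
    return all(expand(term) & words for term in request_terms)

document = ["airframe", "crack", "propagation", "tests"]
print(matches(["aircraft", "fatigue"], document))    # True, matched through the groups
print(matches(["aircraft", "corrosion"], document))  # False, no group covers "corrosion"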

A few remarks can be made on the state of
automatic indexing. It has been pointed out that
the language redundancy problems associated with
retrieval effectiveness must be approached through
thesaurus-like compilation techniques. On the
basis of all present evidence, there is no
particular reason to believe that the process
is any more difficult for automatic indexing than
it is for human indexing. The more subtle aspects
of semantics and redundancy in a system based on
automatic indexing must in that case be thrown
into the search process, but with no obvious disadvantages. The real question is not so much
whether automatic indexing can be made to work
(though to be sure the matter must still be left
open) but whether it can be made economical.
Generally speaking, with present computing equipment it cannot, except for certain special
applications where input costs can be amortized
in other ways. Engineering achievements in the
area of direct printed page input and higher
speed memory readout may hold the final answer
to this question.
Trends and Goals
The nature of the basic problems of information retrieval is such that no sudden conceptual
breakthrough seems likely. Current inventories of
mathematical theories and techniques are applicable
to information systems only to a limited extent.
The tasks that clearly lie ahead must include
large scale language studies and laborious experimental investigations. The past several years
have seen increasing awareness of the significance
of some kind of thesaurus approach to problems of
multiple meaning, viewpoint, generics, and redundancy. This approach is not limited to natural language search techniques or to key-word descriptors, but finds its counterpart in the "see also" cross references of all types of subject
headings and classification systems. These compilation efforts which in effect attempt to build
"structures of relatedness II in indexing and retrieval systems should be closely tied to studies
of the types and forms of questions which users
ask of the system. Studies of users' questions
should be patterned not only after the kind that
are presently asked (e.g., Herner21) but should
be addressed also to the kinds of questions which
users ought to ask if they had different and
better information retrieval systems.
Retrieval techniques work best when they
deal with a limited subject area and are tailored
to the requirements of a limited group of users.
Working systems of this kind should form the
nucleus for experimental investigations so that
deeper insights can be obtained into the question
of how and why they behave as they do.
Syntactic studies should continue though it
may be anticipated that their practical relevance
to problems of information retrieval may not
materialize to the degree that many workers presently hope. It seems to the author that problems
of semantics tend to dominate practical requirements. It is a more or less obvious phenomenon
of language that essentially the same concept
can be expressed in many dozens of ways which
are not syntactic transformations of one another
and this fact alone suggests that syntacticians
may be eventually disappointed in the extent to
which their work finds practical application.
Storage and retrieval of condensed or "kernelized"
sentences13 suffers from the same illness as do
other methods of automatic abstracting; at present
there are no acceptable measures for the amount of
information loss in the kernelization process.
This entire area may well emerge as being of
considerable future importance, however, if for no
other reason than the keen interest of a considerable number of competent workers; it holds much
potential for taking off in new directions.
The presently primitive state of the automatic indexing art was mentioned briefly in the
last section, and it is clear that studies of
this kind shall and should continue. It is in
this area that advances in equipment capabilities
are urgently needed. A portion of these studies
should be addressed to the interplay between
natural language queries &ld mechanized information retrieval systems. The formulation of a
request in natural language for computer proceSSing is not subject to the current limitations
of high volume storage that natural language text
searching involves.
The extent to which one should be sanguine
about continued studies of the information
gathering habits of scientists, as though they
were a colony of bees or ants, is not really clear.
Certainly a rash of these studies has broken out
during the past four years or so and many of the
results have been interesting. It is possible

that these have largely run their course and that
now emphasis ought to shift to experimental studies
that are subject to greater control. Certainly
more investigation should be made of the obviously
observable parameters within the science communication sphere such as those provided by the technique of citation indexing. Other techniques
exist for "hooking" documents together by purely
pragmatic and intuitive estimates of relevance
within the framework of some particular purpose.
A set of citations as useful "relevance hooks"
would be based on an assumption that co-citation
implies similarity of meaning from the point of
view of the author doing the citing. In an
analogous sense, Fano22 suggests that the recognition of similarity of two documents in the
process of request formulation (i.e., the requestor asks for a document "like" one he already
has) provides relevance hooks similar to those
provided by co-citation. Networks of relevance
chains once assigned are susceptible to highly
efficient machine manipulation and have the added
virtue that the profound questions of semantics,
viewpoint, ambiguity, generics, and syntax, are
involved only to the extent of directly observable
external effects resulting from a large collection
of instances in which professional judgment is
applied to the question of "relatedness."
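A minimal sketch of the co-citation idea (the citing papers and cited items below are invented): two cited items become "hooked" together whenever some citing author lists them in the same bibliography, and the number of such co-occurrences serves as a crude, purely pragmatic estimate of relatedness.

# Illustrative sketch: counting co-citations as "relevance hooks".
# Each entry is the bibliography (cited items) of one citing paper; all names
# are invented.
from itertools import combinations
from collections import Counter

bibliographies = [
    ["A", "B", "C"],
    ["A", "B"],
    ["B", "D"],
]

hooks = Counter()
for bibliography in bibliographies:
    for x, y in combinations(sorted(set(bibliography)), 2):
        hooks[(x, y)] += 1   # x and y were cited together by the same author

# ("A", "B") is co-cited twice and so is presumed the most closely related pair.
for pair, count in hooks.most_common():
    print(pair, count)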
It was mentioned earlier in connection with
Mooers' Law that extending the boundaries of
information retrieval into the area of presentation and display may provide an approach to
solving problems of information indigestion on the
part of the user. Strictly speaking, an information retrieval system has served its purpose
once responsive documents have been delivered to
the customer, but if indeed the customer's attitude
is characterized by indifference, to the extent
that the retrieval system is not effectively used,
then we might surmise that a highly important and
critical problem area has been ignored. This area
is one which begins where information retrieval
ends. Let us imagine that a stack of several
dozen documents of several thousand words each has
been retrieved. Now the customer's original
motive in requesting those documents must be examined. If he seeks specific pieces of related
information, a considerable effort on his part may
be required to extract what he wants. If that is
indeed the case then further machine processing of
such retrieved data may serve a useful purpose.
It is of particular significance to note that
computer handling of the total retrieved text (in
contrast to the total text of the library) is
reasonable in terms of storage requirements. His
original requirement may of course have been of a
different kind, such as acquiring general familiarity with a subject area, but we shall consider
further only that situation which calls for specific small fragments of information to be brought
together and "correlated" in some abstract intellectual way. This type of requirement typically
occurs in a business or military intelligence
application. The question now is whether or not
the proper presentation, rearrangement, and display of numerous fragments of retrieved data may
play a significant role in stimulating the user to
perceive relationships that are not otherwise
apparent in a more laborious process of directly
examining the total contents of the retrieved
material. Experiments carried out by the author
and co-workers at Ramo-Wooldridge during the last
year or so have demonstrated that such stimulation
is indeed possible. (A report on this work is in
preparation.) When combined with one of the conclusions quoted in the Menzel review,9 namely
that the periodical literature tends to be used
more for "idea stimulation If than for reference,
these concepts of presentation and display take
on added significance. It is suggested that
future information retrieval work may exhibit
an interesting trend in the direction of automatic user stimulation.
The articles cited in the following list of
references themselves contain references to 160
other articles (probably some are duplicative).
If succeeding generations of citations are
counted, one wonders whether a closed system will
be encountered (it seems likely) and whether
subsets of those cited many times are in some
way more "central" or "important" to information
retrieval.
REFERENCES
1. Scientific and Technical Information as One of the Problems of Cybernetics, by G. E. Vleduts, V. V. Nalimov, and N. I. Styazhkin; Soviet Physics Uspekhi, Vol. 2, No. 5, p. 637, Sept.-Oct. 1959

2. The Real Problems Behind Information Retrieval and Mechanical Translation, by Y. Bar-Hillel; seminar, Feb. 13, 1961, sponsored by the Office of Naval Research

3. Information Retrieval Selection Study, Part II: Seven Retrieval System Models, by Calvin N. Mooers; Zator Company, Report No. RADC-TR-59-173, Contract AF30(602)-1900

4. Opening Address, by Sir Lindor Brown; ICSI,* p. 3

5. Technical Reports I Have Known, and Probably Written, by Dwight E. Gray; Physics Today, Vol. 13, No. 11, p. 24, Nov. 1960

6. Citation Indexes for Science, by Eugene Garfield; Science, Vol. 122, No. 3159, p. 108, July 15, 1955

7. Technical Information Flow Patterns, by M. M. Kessler; WJCC, May 1961

8. An Operations Research Study of the Dissemination and Use of Recorded Information, by the Operations Research Group, Case Institute of Technology, December 1960 (sponsored by the National Science Foundation)

9. Review of Studies in the Flow of Information Among Scientists, Bureau of Applied Social Research, Columbia University, January 1960 (sponsored by the National Science Foundation)

10. Background to Scientific Communication, by M. M. Kessler; IRE Transactions, Vol. EWS-3, No. 1, April 1960

11. Searching Natural Language Text by Computer, by Don R. Swanson; Science, Vol. 132, No. 3434, p. 1099, October 21, 1960 (sponsored by the Council on Library Resources)

12. Research Procedures for Automatic Indexing, by Don R. Swanson; presented at the American University Third Institute on Information Storage and Retrieval, Feb. 16, 1961 (sponsored by the Council on Library Resources)

13. Linguistic Transformations for Information Retrieval, by Z. S. Harris; ICSI,* p. 937

14. Documentation, Indexing, and Retrieval of Scientific Information, prepared by the Committee on Government Operations, U. S. Senate, Doc. No. 113 (report by Eugene Wall, p. 175-202)

15. Summary of Area 4 Discussion, ICSI,* p. 804-805

16. ASLIB Cranfield Research Project, Report on the First Stage of an Investigation Into the Comparative Efficiency of Indexing Systems, by Cyril W. Cleverdon, Sept. 1960 (sponsored by the National Science Foundation)

17. Seminar, February 17, 1961, by Cyril W. Cleverdon, held at the National Science Foundation, Washington, D.C.

18. Maze Structure and Information Retrieval, by Gerald Estrin; ICSI,* p. 1383

19. A Screening Method for Large Information Retrieval Systems, by Robert T. Moore; WJCC, May 1961

20. A Mathematical Theory of Language Symbols in Retrieval, by Calvin N. Mooers; ICSI,* p. 1327

21. Determining Requirements for Atomic Energy Information from Reference Questions, by Saul Herner and Mary Herner; ICSI,* p. 181

22. Summary of Area 6 Discussion, ICSI,* p. 1407

*ICSI: Proceedings of the International Conference on Scientific Information, National Academy of Sciences, National Research Council, Washington, D.C., 1959


TECHNICAL INFORMATION FLOW PATTERNS

M. M. Kessler
Lincoln Laboratory,* Massachusetts Institute of Technology

*Operated with support from the U. S. Army, Navy and Air Force.

Summary
A study of the bibliographies of a large
number of articles in physics and electrical
engineering indicates that definite patterns exist
for the flow of technical information. Quantitative data are pre sented on the flow of information
between countries, between cultural and functional groups, and between past and present. An
analysis of the numerical data indicates that
these flow patterns are deeply rooted in the dynamics and evolution of scientific thought and
engineering development. The analysis also
discloses that extreme asymmetry exists between journals in their capacity as carriers of
scientific information.

Introduction
Every project in the field of information
retrieval must face up to the following question:
"Will it promote the flow of meaningful information from originator to consumer? II No idea,
scheme or component, no matter how intellectually clever or technically elegant, has any worth
except insofar as it contributes to an information
flow system. This flow system has its carriers,
channels, sources, and sinks. It defines the
coupling between individual scientists across
field boundaries, political groupings, time, and
habits of tradition. It is clear that any new
scheme or component of information retrieval,
in order to be effective, must mesh into the flow
pattern and be properly matched to it. And yet
we know remarkably little about the information
circuits as they now exist. 1 This paper attempts
to map the flow of information by analyzing the
statistics of references that authors include in
their published papers. It is assumed that the
published journal article is the message unit and
that its citation by an author is recorded evidence
that the message has found a meaningful target.
The strength of this method is that within its
area of limitations it is quantitative and unambiguous. It does not depend on subjective opinions and questionnaires, and, very significantly, a wealth
of recorded data exists in journals of all countries, all sciences, and as far into the past as
we care to go. The weakness of the method is
that it certainly does not measure all scientific
communication. Other modes and circuits exist
that are not reflected in bibliographic citations
and conversely some citations may be irrelevant
to the information transfer process. The method
is nevertheless particularly applicable to the
problems of retrieval systems because the latter
is largely concerned with communication through
written papers.

This study is concerned with the literature of physics and closely related fields of application. Data are presented on the flow of information across political boundaries, from a basic science to an applied technology, and from past to present; the paper concludes with some remarks on the efficiency of various journals as carriers of information. The results are then discussed in terms of their application to the design of a system for scientific retrieval and communication.

Numerical Data

Communication across Political Boundaries
A number of journals were analyzed for the purpose of obtaining a rough measure of the flow of scientific information across national boundaries. The bibliographies in the indicated journals were sorted on the basis of country of origin.
Table I shows the results for the January 1957
issue of the Physical Review.
Table I
Geographic Distribution of References in the Physical Review (January 1957)

  Reference to         No. of References    % of Total
  Physical Review             994               48.0
  Other American              558               27.0
  British                     198                9.5
  European                    258               12.4
  Russian                      28                1.4
  All others                   33                1.6

We see that roughly half of the references in the Physical Review are to papers published in the same journal. Three quarters of all the references are to American journals. The remaining 25% of the references are distributed among a variety of European journals, mainly British. Note in particular the vanishing call on Russian reference material (1.4%).
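The tabulation behind Table I (and the tables that follow) is a straightforward counting operation; the sketch below, with a handful of invented reference records, sorts a bibliography into country-of-origin groups and converts the tallies to percentages.

# Illustrative sketch of the tabulation behind Tables I-VI: each reference is
# tagged with its country-of-origin group and the tallies are converted to
# percentages. The sample records are invented.
from collections import Counter

references = [
    "Physical Review", "Physical Review", "Other American",
    "British", "European", "Russian",
]

tally = Counter(references)
total = sum(tally.values())
for group, count in tally.most_common():
    print(f"{group:16s} {count:4d}  {100.0 * count / total:5.1f}%")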

We now consider the same process as it
operates on Russian physicists. Table II gives
the results of a count on the June and October
1957 issues of the Journal of Theoretical and
Experimental Physics.
Table II
Geographic Distribution of References in the Russian Journal of Theoretical and Experimental Physics (June and October 1957)

  Reference to         No. of References    % of Total
  JETP                        102               15.4
  Other Russian               201               30.5
  Physical Review             148               22.4
  Other American               57                8.7
  British                      63                9.5
  European                     66               10.1
  All Others                   22                3.3
Note that on its home grounds the Russian
JETP actually runs second to the Physical Review. The Rus sian physicists depend on British
and other European literature roughly to the
same extent as do the Americans, but they do
not have the strong partiality to their own chief
journal nor to any combination of journals in
their political group.

Similar counts were made on Nuovo Cimento and Physica, Italian and Dutch journals of physics. The results are shown in Tables III and IV.

Table III
Geographic Distribution of References in the Italian Nuovo Cimento (Jan.-June 1958)

  Reference to         No. of References    % of Total
  Nuovo Cimento               344               15.7
  Other Italian                38                1.7
  Russian                      84                3.9
  Physical Review             771               35.6
  Other American              331               15.4
  British                     223               10.5
  European                    245               11.2
  Others                      135                6.2

Table IV
Geographic Distribution of References in the Physica (1957)

  Reference to         No. of References    % of Total
  Physica                     225               21.6
  Other Netherlands            71                5.9
  Russian                      30                3.0
  Physical Review             249               23.9
  Other American               85                8.2
  British                     159               15.2
  European                    121               11.6
  Others                      108               10.4

In view of the political and cultural polarization of modern society into East and West
groupings, it is interesting to present the data
of Tables I to IV in such a way as to illustrate
the flow of physics from East to West and vice
versa. Table V groups all references to American and European journals and compares them
with those to Russian and other "iron curtain"
journals.
Table V
The Flow of Information along the East-West Political Axis
(per cent of total references)

                    Phys. Rev.   Nuovo Cimento   Physica    JETP
  Western Jnls.        96.9          90.1           86.4     50.7
  Eastern Jnls.         1.4           3.9            3.0     45.9
  All Others            1.6           6.2           10.4      3.3


To extend this picture somewhat beyond
pure physics, a count was made on the Journal
of Applied Physics (JAP) and the Proceedings
of the Institute of Radio Engineers (IRE). In
the latter case, we picked January, June, and
September 1957 as representative issues. A
special issue of the IRE devoted to a symposium
on transistor technology was treated separately.
The results are shown in Table VI. The Phys.
Rev. data are reproduced for comparison.
Table VI
Geographic Distribution of References in Phys. Rev., JAP, and IRE
(per cent of total references)

  References to   Phys. Rev.   Jn. App. Phys.   Proc. IRE   Proc. IRE Transistor Issue
  U.S.A.             75.0          70.5            77.0              78.0
  British             9.5          12.0            11.6               5.3
  European           12.4          11.7             8.7               8.6
  Russian             1.4           2.9             1.2               2.4
  Others              1.6           2.9             1.4               1.9

The data suggest the following conclusions.

1. The Physical Review is truly a definitive journal for physicists. It commands overwhelming dominance over all other journals as a carrier of information between physicists of all lands.

2. American physics is the chief source of information, not only for other Americans but for the international community of physicists.

3. American workers in fields of applied physics (as typified by authors in JAP and IRE) find their literature needs overwhelmingly satisfied by American journals.

4. European physicists draw heavily on American literature. Their coupling to the Russian literature is not significantly greater than that of American physicists.

5. The Western world is virtually self-sufficient with regard to physics. The Russian cultural sphere on the other hand draws heavily on the West for its information.
A close comparison of the various tables
suggests that these conclusions are valid even if
we take into account the language barrier between
East and West.

Flow of Information to an Applied Field

A significant special case of information flow and retrieval concerns communication across disciplinary boundaries. We suspect that a retrieval or communications scheme designed to process physics literature for the physicist is not the same as what is needed to process physics literature for the chemist, the engineer, or the biologist. A related problem, particularly important in the dynamics of applied research and development, is the flow of information from basic research scientists to production engineers. If the coupling is tight and information flows freely, one may expect a low lag time between basic discovery and application. A study of certain reference statistics indicates that definite patterns exist in this information circuit.

The June 1958 issue of the Proceedings of the IRE was chosen for study. This issue is a symposium of 22 papers (353 pages) devoted to transistor technology. Of the fifty authors, forty-one indicated affiliation with industry, eight with universities, and one with the government. Transistor technology is of great interest to industrial and defense workers and yet it is new enough to have rather simple and short routes to the underlying sciences. For these reasons it was thought that a detailed analysis of the transistor issue would be instructive. Table VII shows the distribution of references among journals.

Table VII
Journal Distribution of References in the Transistor Issue of IRE

  Reference to                  No. of References    % of Total
  Physical Review                     230                31.0
  Proc. IRE                           129                17.4
  J. Applied Physics                   69                 9.3
  Bell System Tech. Jnl.               52                 7.0
  British                              48                 6.5
  German                               61                 8.2
  Other Foreign                        36                 4.9
  Russian, etc.                        22                 3.0
  Miscel. Jnls. (American)             94                12.6

Tables VI and VII suggest that the transistor issue of the IRE shows the typical American pattern of bibliographic distribution and does not differ much from a random issue of the IRE. This invariance applies only if we consider the geographic distribution of references. Both cases follow the American pattern; over 75% of the references are to American and over 90% to Western journals. But when we analyze in detail the American journals for the two cases an entirely different picture emerges. Table VIII is a break-down of the references to American journals in the two samples of the IRE.
Table VIII
Detailed Analysis of IRE References to American Journals
(per cent of total references)

  Reference to        IRE     IRE Transistor Issue
  Physical Review      7.3           25.2
  JAP                  2.7            7.5
  Proc. IRE           23.0           14.1
  Other American      43.0           26.3

Whereas the average issue of IRE refers to the Physical Review 7.3%, the special transistor issue has 25.2% of its references to the Physical Review. Thus we see that the bibliographic count is sensitive enough to measure the degree of coupling between science and technology.

The authors who contribute generally to the IRE have a lesser coupling to the Physical Review than those who are preoccupied with the new and rapidly developing field of transistors. This analysis of information coupling between science and technology can be continued another step. We see from Table VIII that 25% of the references in the transistor issue of the IRE are to authors of papers in the Physical Review. We now ask who are these authors? Do they differ as a class from the usual authors in the Physical Review? In other words, do the contributors to

the IRE transistor issue make contact with a representative group of Physical Review authors, or is there some transitional group of physicists who serve as a bridge between the general population of physicists and the industrial group?

Table IX presents the institutional origins of authors who publish in the Physical Review, Journal of Applied Physics, and IRE. In all cases column A refers to all authors in a random issue of the journal. Column B refers to those authors in the journal who were cited as references in the transistor issue of IRE.
Table IX
Institutional Distribution of Authors
in Phys. Rev., JAP, and IRE

                     Phys. Rev.          JAP              IRE
  Affiliation         A      B         A      B         A      B
  University         66    16.4      40.3   31.0      25.8   14.1
  Industry            9    74.0      31.4   61.0      44.0   76.8
  Government         13     6.3      17.5    5.4      10.0    0.6
  Foreign             9     3.4       9.0    2.7      20.5    8.5
  Others              3     --        1.8    --        --     --

The data show that ordinarily 66% of Physical Review authors have university connections and only 9% are from industry. The sub-group referred to by IRE authors more than reverses this picture; only 16.4% are university affiliated and 74% are from industry. Similar trends are apparent in the other columns of Table IX. On the basis of these limited data, it seems reasonable to assume that the coupling between basic science and its applied technology is tighter for the newer technologies and that the coupling is made through an intermediate group of scientists who form an intellectual bridge between the university and industrial community.
Flow of Information from the Past

At any given time scientists draw heavily on the accumulated experience of the past. Indeed, one of the greatest assets of the journal-article mode of communication is that it conserves the record of the past in an orderly and chronological manner. This coupling to the literature of the past was studied by plotting the number of references as a function of time into the past. Thus a distribution curve of references was obtained with age as the independent variable. Graphs 1, 2, 3, and 4 show such distributions for four journals. The integrated curves are shown in Figure 5. Although the curves are orderly, their statistical base is rather limited and one should be careful with conclusions. One may speculate that vigorous and fast-growing fields will show a sharp early rise and level off rather soon. Another hypothesis, however, may suggest that a sharp early rise indicates a superficial scholastic approach and that a more basic acquaintance with the literature of science would extend the curve farther into the past. Both hypotheses may be true, namely, workers in fast-growing, competitive fields may have no time to search the literature and thus confine their sources to current material. It is at any rate clear that this phenomenon has to be understood and absorbed into a serious retrieval scheme.
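The distribution curves of Graphs 1-4 can be formed directly once each reference is dated; the sketch below (with invented years) tabulates references by age and also reports the median age, one crude reading of the "useful half life" taken up in the Discussion.

# Illustrative sketch: distribution of references by age, as in Graphs 1-4.
# The citing year and the list of cited years are invented sample data.
from collections import Counter
from statistics import median

citing_year = 1957
cited_years = [1956, 1955, 1955, 1952, 1950, 1948, 1940, 1931]

ages = [citing_year - year for year in cited_years]
distribution = Counter(ages)
for age in sorted(distribution):
    share = distribution[age] / len(ages)
    print(f"age {age:2d} yr: {share:.2f} of all references")

# The median age is one crude measure of the "useful half life" of the literature.
print("median age of references:", median(ages), "years")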

The Detailed Bibliographic Structure of a Single Journal

As a final example of bibliographic statistics, we mention a very detailed study that we made of the references in the Physical Review.* The study was made for other purposes and involved a complete recording on IBM cards of all the references in 26 volumes of the Physical Review. Some of the results are mentioned here because they relate to the subject of this paper. Excluding the unpublished and non-periodic literature, the authors in the volumes made 74,599 references to journal articles. Of this number 45,592 or 60% were to articles in the Physical Review. The next five most frequently used journals contributed another 13.8% of the references and the next twelve in order of frequency contributed 12.2%. Thus 18 journals accounted for 86% of all the references. The remaining 14% of the references were distributed among some 650 journals of which 240 were mentioned not more than once in all the 26 volumes and 420 were mentioned four times or less.

(*This work was done in collaboration with Mr. Frank Heart of the Lincoln Laboratory and will be published elsewhere.)
Another by-product of this study that is relevant to our discussion is the following. In spite of the strong definitive position of the Physical Review, it is nevertheless true that any given paper in the Physical Review has a very low probability of ever being used as a reference. Indeed, the largest single class of papers never appears in the reference literature at all. It is hard to assume that this large group of papers are never cited because they are worthless. Other reasons must be sought.

Discussion

The over-all impression left by the data may be summarized as follows:

a) If we consider the scientific paper as a message unit and the journal the message carrier, if we accept that inclusion of a paper in a published list of references indicates that the message found a relevant receiver, and if we regard the population of Physical Review authors as a representative group of physicists, then there is a massive asymmetry and an overwhelming inhomogeneity in the capacity of the many hundreds of journals to serve as carriers of the
scientific message. The imbalance may be due to
language barriers, cultural and political isolation,
reputation of the journal we have examined and
its availability, or to a combination of these. The
inhomogeneity exists, whatever the reason, and
in its most extreme form gives rise to the definitive journal, such as the Physical Review. The
data raise important questions about the design
philosophy of retrieval systems. In view of this
inhomogeneity, should the retrieval process and
the flow channels be the same for all carriers
or should they take account of the carrier's
capacity? If account is to be taken of the carrier's capacity, should the system be designed
to further reinforce the strong and efficient carriers at the expense of the less efficient, thus
reducing the noise, or should we on the contrary
take the attitude that the efficient carriers need
less attention than the weak, and therefore concentrate on the latter and raise their efficiency?
Is the definitive journal a desirable phenomenon
or does it in the long run inhibit the communication process?
How should we approach the complex problem of injecting the results of Russian research
into the main stream of American physics? Such
an injection is certainly desirable, but it is not
clear that a massive translation program and
wide distribution of the translated material is
the best way of doing it. Another method might
be to encourage a small number of practicing
American physicists to learn the Russian language and depend on them as the instrument of
injection. Considering that the money and effort
available for this purpose are limited, one should
not blandly accept either method.
A retrieval and communication system,
unless its contribution is trivial, will have sufficient feedback to strengthen or weaken the
various elements of the communication process
that now exists. It is therefore important to
understand these elements and to design our
system with them in mind.
b) The previous section concerned communication within the relatively homogeneous
group of physicists who are in the habit of publishing in the Physical Review. The problem of
communication across field boundaries or between scientists and engineers is a somewhat different matter. The limited amount of data that
we have collected must be considered as a
sample that only indicates the complexity of the
problem. It would seem that in this case there
is no definitive journal. Furthermore, meaningful communication seems to involve a chain of
intermediaries that form a bridge between pure
science and applied technology. Should a system
be designed to encourage traffic along this chain
of bridges, or should it attempt to short circuit
the chain? That this is not a purely academic
problem is clear from the experience of the
Chemical Abstracts, a major communicative
link between physics and chemistry. The

Chemical Abstracts attempts to short circuit any
bridging mechanism and bring physics directly to
each chemist by abstracting practically the entire physics literature. This, together with
other examples of short circuitry, has so overloaded its own channel that Chemical Abstracts
is becoming increasingly awkward and bulky.
They could have chosen not to include the physics
literature in their abstracts and to depend on the
various hyphenated journals, such as the Jnl. of
Physical-Chemistry, Chemical-Physics, Colloid
Chemistry, etc. etc., to act as injectors of
physics into the main stream of the chemical
literature. This would appear to loosen the
coupling between physics and chemistry and increase the time necessary for information transfer. But the decrease in volume of traffic could
well compensate for the looser coupling and produce a more efficient system. The phenomenon
of coupling between fields is at any rate important enough to be considered in the design of a
retrieval and communication system.
c) The flow of scientific information in
time, past to present, or the useful half life of
a scientific message unit is another important
element of our problem. Our numbers indicate
a useful half life of some five to ten years. A
rigorous definition and measure of this phenomenon is not easy to come by. But it is obvious
that no system of retrieval and communication
can long survive without some method of purging
its message population from time to time. This
is not just a matter of eliminating poor material
in favor of newer and better results. It is a
curious fact that even the masterpieces of scientific literature will in time become worth1e s s
except for historical reasons. This is a basic
difference between the scientific and belletristic
literature. It is inconceivable for a serious student of English literature, for example, not to
have read Shakespeare, Milton and Scott. A
serious student of physics, on the other hand,
can safely ignore the original writings of Newton,
Faraday and Maxwell. The removal of a scientific paper from the retrieval system should not
depend on a value judgment. The correct criterion should be based on the degree to which the
paper's information has been metabolized into the
flow stream of science. We could perhaps decide that once a paper has appeared in the citation literature a given number of times, it need
no longer be carried as an independent message
unit. This too needs more study.
At the other extreme we have the phenomenon of a large group of papers, perhaps the largest single group, that is never quoted in other
people's bibliographies. If these papers do not
enter the bibliographic literature within, say,
five years of publication, they may become effectively lost to the literature. In view of the
careful editing process, it is hard to believe
that this large group of papers is useless or
redundant. If they are useless or redundant, the
publishers and editors could well afford to review
their processing criteria. On the other hand, a continuing review of the citation literature over a traveling five-year period may reveal a group of papers that warrant more detailed channeling.
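One hedged way of mechanizing the two review rules just suggested (the citation threshold, the five-year window, and the sample records are invented for illustration): purge a paper once it has been cited often enough to be considered metabolized into the flow stream, and flag for review any paper never cited within five years of publication.

# Illustrative sketch of the purging and review rules suggested in the text;
# the threshold, the window, and the sample records are invented.
PURGE_AFTER_CITATIONS = 10   # taken here as "metabolized" into the literature
LOST_AFTER_YEARS = 5         # never cited within five years of publication

def classify(paper, current_year):
    """paper = (year_published, times_cited)."""
    year_published, times_cited = paper
    if times_cited >= PURGE_AFTER_CITATIONS:
        return "purge: absorbed into the flow stream of science"
    if times_cited == 0 and current_year - year_published >= LOST_AFTER_YEARS:
        return "review: effectively lost to the literature"
    return "retain as an independent message unit"

for paper in [(1950, 37), (1955, 2), (1953, 0)]:
    print(paper, "->", classify(paper, 1960))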
Concluding Remarks
Science and technology are approaching a crisis in communication. It would be a mistake
to define this crisis entirely in terms of retrieval
problems. Indeed, the break-down in communication between scientists must itself be evaluated
in terms of the maturing sociology of science.
Communication, after all, is not the only aspect
of science that is in crisis. There are problems
of technical manpower shortages, the increasing
cost of scientific research, the growing imbalance between basic and applied research, and
many others that relate to the emergence of science as a major instrument of national favor,
stature and propaganda. But even if we confine
our attention to the communication problems only,
we must still remember that no single channel or
mode is likely to solve all our needs. The components must be evaluated in terms of their contribution to the over-all system performance.
Many systems problems must be studied if a working solution is ever to be achieved.

a) No system can be designed in an
economic and social vacuum. Even the vital
functions of national defense are subject to restraints. We must estimate the social and economic limits that will govern our system and
optimize its function within these limits. How can
limited resources best be apportioned between
the various segments of the scientific community?
What part of the resources shall be assigned to
retrieval and other question-answer functions as
opposed to directing the flow of information on
the basis of generalized need-to-know criteria?
What are the information needs within well-defined fields like physics as opposed to the flow
of information across field boundaries?
b) If there is a critical failure of communication now, how will we know when improvement has taken place? What test procedures and
criteria of performance can we use to evaluate
the system? Unless a figure of merit can be assigned to the system as a whole, the contribution
of any given component is always in doubt. One
cannot test the performance of a system by
measuring each component separately in terms of
its own parameters. Experience with large multicomponent systems indicates that one cannot
generally arrive at a figure of merit analytically.
It is usually necessary to build a model and evaluate new components in terms of their effect on
the model.
c) To what extent should the system
operate only when interrogated and to what extent should it operate on the initiation of its
built-in logic?

d) Should the system be local, regional,
or national? To what extent can communication
media other than the journal article be exploited
(radio, television, newspapers, remote printers,
etc.)?
e) What is the probable cost of various
systems, both initial capital investment and
operating cost? Shall it be financially self-supporting or shall it be subsidized? Stability of
financing is of particular significance because
the system must have long-range continuity in
order to be at all effective.
f) Finally, we wish to stress that the
problems of science communication are not primarily equipment and hardware problems. Nor
are they primarily problems in indexing, abstracting, or retrieval. The significant problems are
in the area of systems design and systems logic.
At this time there is no single organization whose
professional competence and involvement embraces
the entire spectrum of problems. Such an organization is needed if a reasonably successful solution is expected.

Reference
1. See for example the series of papers in "Area I - The Collected Papers of the International Conference on Scientific Information," Washington, D. C. (1958)

[Figure 1: Distribution of References by Age, Physical Review 1957 (age of references in years, 0 to 40).]

[Figure 2: Distribution of References by Age, Russian JETP, February, March, April 1957.]

[Figure 3: Distribution of References by Age, Physica 1957.]

[Figure 4: Distribution of References by Age, J. Acoust. Soc. Am., Jan., Feb., March 1957.]

[Figure 5: Cumulative Distribution of References by Age (cumulative percentage of all references against age in years; curves for the Physical Review and JETP).]

A SCREENING METHOD FOR
LARGE INFORMATION RETRIEVAL SYSTEMS
Robert T. Moore
Data Processing Systems Division
National Bureau of Standards
Washington, D.C.
Summary
This paper is addressed primarily to describing a method for reducing the excessive processing
times sometimes encountered in large information
retrieval systems. Two tools are suggested:
(1) a screening system to allow multi-level
processing of material and (2) pre-processing
of the files to allow block rejection of documents
not answering a retrieval request.

encountered among "descriptions" of different
documents in the file, which may be applied to
compreSSing and structuring a large file of
retrieval information. Each of these ideas is
abstracted and discussed in a more general
context before being applied to the specific
system under consideration here.
Screening and the Use of Descriptors

A screen using descriptors to constitute the
first level of screening is described in detail,
and a pre-processing technique, based upon
clustering of descriptors, is developed. Two
methods of implementation on high-speed data
processors are described based on serial access
type storage and random access "dynamic storage,"
respectively.
This screening technique is compared for efficiency against two other familiar methods of employing descriptors: the ordinary serial descriptor list method and the inverted file technique.

Introduction
The primary purpose of this paper is to
describe a method of constructing an initial
screening stage for a multiple-component information retrieval system. The screening system (or
sub-system) utilizes document descriptions
comprised of independent single-word index terms,
herea.fter referred to as "descriptors ll which are
thus to be interpreted in their most restricted
connotation, and applies a method of file
organization and compression which has several
special advantages. Two different methods of
implementation will be discussed, the choice of
method being dependent upon the type of data
processing equipment to be used.
The designer of a large retrieval system is
faced with two separate but related problems. He
must minimize the expensive machine time required
to process a request for information. It is also
desirable to minimize the total storage, both
internal and peripheral, required for the index
file. The system described here represents an
attack primarily upon the processing time but
some of the ideas introduced show promise of
reducing the file size as well.
In the process of discussing this specific
screening system, several ideas of greater
generality are utilized, among them (1) the
"screening approach 11 , as applied to storage and
retrieval systems, and (2) the redundancy encountered among "descriptions" of different documents in the file, which may be applied to compressing and structuring a large file of retrieval information. Each of these ideas is abstracted and discussed in a more general context before being applied to the specific system under consideration here.

Screening and the Use of Descriptors

Advantages of the Screening Approach
It is useful to distinguish between two
alternative approaches to the problems of
information storage and retrieval. The importance of this distinction, it is hoped, will be
adequately illustrated in the discussion to
follow.
One approach is to look upon an information
retrieval system as a "mechanized librarian".
The user explains his problem or question to the
librarian, and is given suggestions as to where
in the library he may find the answer. If his
question turns out to have elicited no helpful
suggestions, he rephrases it and tries again.
The function of the librarian (mechanical or
human) is to compare the request with "stored"
information about the library and, after
considering various "aspects" of the question,
to make determinations of "relevance", thus
directing the user to the right documents. This
might be described as the searching approach,
since it involves the librarian's actively
seeking the relevant material.
Alternatively, we may design a retrieval
system which behaves in a manner that would be
considered obtuse, at best, in a human librarian.
It can single out all those references which
could not possibly be of any use and remove
these from consideration. It serves as an
information garbage disposal unit, which discards
the definitely useless and presents the
remainder to the user. Since this represents a
screening out of the undesirable documents (to
change metaphors), it may be called a screening
approach.
Of course, for perfect systems this
distinction is uninteresting. Any document that
"passes through a perfect screen 11 will be "found
by a perfect search" and vice versa. What is of
interest is the way in which an imperfect
searching system fails, as contrasted with the
comparable problems in an imperfect screening system. An adequate searching system will only
rarely "find" documents which are irrelevant, but
it may very well fail to locate documents which
a human reader of all the documents would classify
as pertinent to a request. An imperfect screen,
on the other hand, will not block good documents
nearly as often as it will allow documents to pass
which a human reader would know were irrelevant.
(These may be considered as the defining properties of the two types of systems. However, they
also reflect two different philosophies of system
design. )
In all of the discussion to follow, we
presume that the indexing is such that the systems
discussed are imperfect, since no one seems to
have produced a system which cannot be confounded
by the vagaries of "semantics". However, the
problem of human or mechanical error is not to be
considered.

In the National Bureau of Standards-Patent
Office HAYSTAQ project,1 we are finding it
necessary to adopt the screening approach as a
supplement to the searching approach in order to
solve the particular set of problems with which
we are faced.
In the HAYSTAQ program, our more
elaborate and sophisticated programs (topological
comparison of chemical structure diagrams,
comparison of logical expressions, etc.) are too
detailed and time consuming to be run against
every document in files of the size we will
eventually have to handle. Thus the needs of the
HAYSTAQ system point directly to the desirability
of a simple economical screen to reduce the
volume of input to the more elaborate routines.
Such a screen would also be a logical starting
point for the construction of new systems. The
main subject of discussion in the remainder of
this paper is a system of that type.

because of the "peripheral" vagueness in the
meaning of such termso An index using these
terms will also tend to become obsolete as
science and technology progress. Ideas and
techniques developed in one field often find
application in seemingly unrelated fields.
One can easily get a list of candidates for
the vocabulary by listing all objects, processes,
and relations that occur in a reasonable-sized
random sample of the documents in the library to
be indexed. The problem is then to choose that
small set of terms which has the greatest "power
of discrimination" among documents in the library.
In calculating power of discrimination, one must
also consider how often terms will be used in
framing requests. (See discussion of measure of
merit in the Appendix for an example.) It is not
our purpose to discuss solutions to this difficult
problem. Suffice it to say that it is closely
related to the problem of character recognition:
What is the cheapest way to distinguish between
centered binary images of the letters of the
alphabet? What bits in the images carry the most
information?
A descriptor set for a document will then be
a simple list of the descriptors applying to the
document (objects, processes and relations
discussed in the document) chosen from the
selected vocabulary. Then when a given request
is being processed, a document will be
screened out, if and only if there is one or more
descriptors in the request that is not in the list
applying to that document. If the screen is to
function in a fail-safe manner, the user must
keep this fact in mind when he sets up the
descriptor list of each request.
File Preparations

Descriptors for Screening

General Method

Although mutually independent descriptors
or index terms have been widely used since the
early years of information retrieval research,
more recently many workers in the field have
begun to question their value for projects
requiring indexing in depth. 2 They argue that
scientific discourse is too complex to allow
adequate indexing by means of unconnected oneword comments on a document. 3 While it is
demonstrable that Simple descriptor methods are
inadequate as a single solution to problems of
deep and exhaustive indexing~ this by no means
implies that they should be rejected entirely.
In fact, their very simplicity adapts them very
well for use in initial screening systemso
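A minimal sketch of the rejection rule just stated (the descriptor names and document descriptor sets are invented): a document survives the screen only if every descriptor of the request appears in its descriptor set.

# Illustrative sketch of the descriptor screen: a document is screened out if
# any descriptor of the request is absent from the document's descriptor set.
# The descriptor names and document sets are invented.
documents = {
    "doc-1": {"transistor", "germanium", "amplifier"},
    "doc-2": {"transistor", "silicon"},
    "doc-3": {"vacuum-tube", "amplifier"},
}

def screen(request, library):
    """Return the documents that are not screened out for this request."""
    return [name for name, descriptors in library.items()
            if request <= descriptors]    # the request is a subset of the descriptor set

print(screen({"transistor", "amplifier"}, documents))   # only doc-1 survives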

File Preparations

General Method

Rather than use a file which is encoded in
the language used by the human document analysts
and arranged in the order in which the documents
are prepared for machine storage, it is
altogether reasonable to transform it into a "prescreened" or "pre-searched" file. Such a file
would be organized so that documents could be
processed partially in parallel against a request.
Furthermore, many of the manipulations and
housekeeping operations which are independent of
the specific content of a request can be performed
during the file preparation process in advance of
processing of requests. (An example is the
"inverted file" investigated by Nolan and Firth4
and used in several Patent Office experiments.5)
It should be noted that such prepared files are
of great utility in either searching or screening
systems, although our attention here will be
confined solely to the latter application.
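The inverted file mentioned parenthetically above is one familiar example of such advance preparation; a minimal sketch (with invented records) turns the document-to-descriptor records inside out so that each descriptor points directly at the documents to which it applies.

# Illustrative sketch of an inverted file: the document-to-descriptor records
# are inverted ahead of time so that each descriptor carries the set of
# documents to which it applies. The records are invented.
from collections import defaultdict

records = {
    "doc-1": ["polymer", "catalyst"],
    "doc-2": ["catalyst", "platinum"],
    "doc-3": ["polymer", "extrusion"],
}

inverted = defaultdict(set)
for document, descriptors in records.items():
    for descriptor in descriptors:
        inverted[descriptor].add(document)

# A request is then answered by intersecting the posting sets of its descriptors.
request = ["polymer", "catalyst"]
print(set.intersection(*(inverted[d] for d in request)))   # only doc-1 qualifies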


When an information storage and retrieval
operation is to be carried out, one chooses an
appropriate symbolic language, which can be
manipulated by computer (English text is one

261
6.3
possible example). Document descriptions are
then encoded in this language. In many cases
there will exist several equivalent symbol
sequences which could describe a given document.
( "Equivalent" should be taken to mean that there
are rules within the formal language which allow
one to transform one expression into another and
the converse.)
From this point on, the symbolic language
will be regarded as fixed. The sequence of
symbols representing each document will be one of
a class of equivalent expressions, i.e., only
transformations of the symbol string from its
original form to an equivalent form are allowed.
One might think of the transformation
P v Q ~ Q v P in the propositional calculus as
an example of this type of transformation. In
this section, no further reference will be made
to the meaning of the symbol strings involved;
only syntactic or formal properties will be
employed.
We then consider possible redundancy within
a file. It is difficult to give an exact
definition of what is meant by "redundancy" in so
general a context. Reference to one of the
specific symbolic languages is needed in order to
be precise. Basically what is intended is that
if a certain configuration of specific symbols
occurs repeatedly in many document descriptions
in the file, it carries less information per
symbol (in the information theoretic sense) than
does a comparable configuration which occurs only
rarely.
If a "met as ymb ol " or "mOlecular symbol"
were asSigned to each commonly occurring" .
"
configuration (which is composed of the obJect

or "atomic" symbols in the original language),
the file could be in some sense recoded in terms
of these metasymbols. The recoded file would
potentially require substantially fewer bits to
be stored in memory and might be organized to
place documents with metasymbols in common near
one another to facilitate group inspection.
It is helpful to visualize a document
description as a set or as a circle on a Venn
diagram. The metasymbols are applied to intersections or "overlaps" of these sets. (See
Figure 1.) In at least some of the possible
symbolic languages (descriptors, propositional
calculus, predicate calculus, and topological
structure codes), it is possible to develop this
line of reasoning beyond the heuristic level by
imposing an appropriate Boolean algebra on the
symbol strings. The use of Boolean algebra
provides no startlingly new information about the
metasymbol techniques, but it serves as a
convenient and compact language in which to
discuss the topic. It permits us to abstract
only those aspects of the situation which are
useful, so that we do not become bogged down in
irrelevant detail.
In files in which there is an equivalence
class of expressions for each document in the
library, the size and frequency of occurrence of
overlaps are strongly dependent upon which
representation is chosen for each document. It
then becomes necessary to discover an algorithm
for finding an optimal canonical representation
for the file, i.e., one which maximizes the
overlaps and selects a unique representative for
each document. Fortunately, in the descriptor
case there exists a unique representation for

each document, so the problem is entirely avoided here.

[Figure 1: Venn diagram of three overlapping document-description sets; regions I-VII correspond to metasymbols.]

Utility of Metasymbols in Screening
A set of metasymbols for an index file (a
file of symbol strings for a library) is of some
intrinsic interest, since it is likely to
correspond in many cases to clusters of ideas
that are discussed together in the literature.
Such "co-occurrence" phenomena may lead to some
interesting information about the structure and
interrelationships of various scientific
disciplines represented in the basic library.
Such possibilities rest upon a number of
debatable epistemological assumptions about the
connection between the content of a library and
the syntactic relations between expressions used
to encode that content. The teeth of any such
arguments must be sharpened on a plentiful
supply of data before they can be expected to cut
into the problems of information retrieval.
Fortunately, the "conceptual significance"
of such metasymbols need not concern us here. So
long as they exist at all (which is tantamount to
saying "the file is not a completely random set
of symbol strings") we can put them to work in
screening our files. Their utility lies in the
possibility of "block screening". Every metasymbol represents the cornmon overlap of a set of
documents and may be said to apply to each
document in that set. Suppose that when a
request (sequence of symbols) is directed to the
file, comparison of this request symbol string
with a metasymbol indicates that no document to
which the metasymbol applies could satisfy the
request. Then
every document to which the metasymbol applies
can be rejected outright. No further information
about these documents, e.g., which other metasymbols
apply, what are their identification
numbers, etc. need be examined. In other words,
documents can be rejected in large blocks if
metasymbols are used.
Hence, after discovering a fruitful way of
recoding a file in terms of metasymbols, we
concern ourselves with finding methods of
exploiting this "block rejection" in as efficient
a manner as possible.
In fact, one of the best measures of the
value of meta-recoding a file will lie precisely
in the degree to which it can be applied to the
idea of block rejection of irrelevant documents.
If a document will eventually be rejected in
response to a given request, how early in the
process (on the average) can it be rejected? Is
rejection information used as soon as it is
obtained, or must rejected documents be carried
as excess baggage for a time? If a document is
to be accepted, how much extra manipulation is
necessitated by the recoding? How much extra
"bookkeeping" information is introduced by the
recoding?
Questions such as these must be answered
before any proposed system can be evaluated. We
will now describe a recoded file using descriptors
and see what kind of information such questions
elicit in this particular case.

A File Structure for Descriptor Screens
Since our discussion is limited to the
problem of designing a file structure for use in
descriptor screening, we can develop the set
analogy explicitly and demonstrate its utility,
as well as give a concrete example of a symbolic
language. It will become clear that the
simplicity of descriptors and the application of
the screening approach both contribute to the
workability of the structure.
Initially, the documents are indexed
according to which of the N descriptors apply to
them. However, it is more convenient for
screening to represent them in terms of their
"rejectors" or descriptors-that-do-not-apply.
Also, a binary number notation is often used in
connection with descriptor methods.
The N-word vocabulary is then organized in
some convenient order, e.g., alphabetical, and
an N-digit binary number or Boolean N-vector is
constructed for each document by putting a 1 in
the ith position if the ith descriptor applies
or a 0 in that position if it does not. Note
that this is the usual "fixed field" method of
handling descriptors, particularly in punched
card applications. This technique is of great
use here, although the logical complement of the
usual Boolean vector will be used (this is the
vector for the rejector list of the document).
This complemented number will be called the
rejector vector for the document. Since the set-
theory operations of intersection (A ∧ B), union
(A ∨ B), and complementation (Ā) correspond
directly to the Boolean or logical "and", "or",
and "complement" as defined on the rejector
vectors, any manipulation of rejector vectors
can be pictured in a suitable Venn diagram and
vice versa.
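To make this encoding concrete, here is a minimal sketch in
present-day Python; the three-term vocabulary and the documents are
invented for illustration. The Boolean N-vector sets bit i when the
ith descriptor applies, the rejector vector is its logical
complement, and the set operations named above become bitwise
operations on integers:

    # Descriptor and rejector vectors as N-bit integers (toy vocabulary).
    VOCAB = ["alloys", "computers", "crystals"]      # N = 3, alphabetical
    N = len(VOCAB)
    POS = {term: i for i, term in enumerate(VOCAB)}  # term -> bit position

    def descriptor_vector(terms):
        """Boolean N-vector: 1 in position i if the ith descriptor applies."""
        v = 0
        for t in terms:
            v |= 1 << POS[t]
        return v

    def rejector_vector(terms):
        """Logical complement of the descriptor vector: 1-bits mark the
        descriptors that do NOT apply (the 'rejectors')."""
        return ((1 << N) - 1) ^ descriptor_vector(terms)

    a = rejector_vector(["alloys"])              # 0b110
    b = rejector_vector(["alloys", "crystals"])  # 0b010
    common = a & b    # intersection of the two rejector sets
    either = a | b    # union of the two rejector sets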
Using these tools, we are equipped to talk
about the appropriate metasymbols. Giving up
the logician's prefix "meta" in favor of the
more suggestive chemist's term "molecular",
henceforth, in this paper, we shall call the
metasymbols "molecular rejectors". (Note that
these behave more like radicals than molecules,
so the chemical analogy cannot be pushed too
far.) A molecular rejector will be the intersection or "and" of some collection of rejector
vectors, that is, the set of 1-bits common to
all the rejector vectors in that collection.
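Continuing the same sketch, a molecular rejector for a chosen
collection of documents is just the bitwise AND of their rejector
vectors, and the screening use it supports is a single overlap test
(the function names are illustrative, not taken from the paper):

    from functools import reduce

    def molecular_rejector(rejector_vectors):
        """Intersection ('and') of a collection of rejector vectors:
        the 1-bits common to every vector in the collection."""
        return reduce(lambda x, y: x & y, rejector_vectors)

    def blocks(request_descriptor_vector, molecular):
        """True if the request shares a 1-bit with the molecular rejector,
        so every document it applies to can be rejected in one step."""
        return (request_descriptor_vector & molecular) != 0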

Our purpose here is to describe an
algorithm for selecting a set of molecular
rejectors for the file. In the interest of
efficiency as previously discussed, this
particular scheme defines molecular rejectors
which in many cases "make sense" only for a
subsection of the total file. The molecular
vocabulary then has a highly variegated "local"
aspect, and the different molecular rejectors
are not necessarily disjoint. Nevertheless each
document will have a unique molecular
characterization.
The guiding principle is that of maximizing
the average size of blocks rejected and of
carrying out this rejection as early and as
completely as possible. It is possible to
develop a rigorous theory of the redundancy of a
molecular rejector S, but this is an involved
process. Such an analysis is important in
choosing the method of storing atomic definitions
of molecular rejectors that minimizes storage
requirements. This will not be taken up here,
however.
The quantity of interest is the average
number of documents in a given subfile G rejected
by a given molecular rejector--a measure of merit.
This measure μ(S) will be the product of the
number of documents t(S,G) which would be rejected
by S and the probability P(S) that the request
will overlap with S: μ(S) = t(S,G)·P(S). (See the
Appendix for a discussion of possible approximate
functions for μ(S).)
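In programming terms the measure of merit is a one-line product once
the tally and an estimate of P(S) are in hand. The sketch below
assumes the subfile is a sequence of rejector vectors stored as
integers, and leaves the choice of P(S) estimator (see the Appendix)
to the caller:

    def tally(s, subfile):
        """t(S,G): number of rejector vectors in the subfile that contain
        every 1-bit of the candidate molecular rejector S."""
        return sum(1 for a in subfile if a & s == s)

    def merit(s, subfile, p_of_s):
        """mu(S) = t(S,G) * P(S); p_of_s is any rejection-probability
        estimator, such as those discussed in the Appendix."""
        return tally(s, subfile) * p_of_s(s)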
To find a structure for the file F we
proceed as follows:

1. Generate all 2^N − 1 possible molecular
rejectors for the file F. (Ingenuity may make it
possible to reduce significantly the number to be
generated.)

2. By appropriate manipulations, compute
μ(S) for each molecular rejector and select the
one of highest μ as S1. If there is more than
one S of the same maximum μ, life becomes
complicated. We could pick one of these at
random for S1. (See the Appendix for additional
discussion.)

3. Decompose the file F into two disjoint
subfiles, F1 and F2:

   a. F1 contains all documents which
      S1 rejects -- all of the t(S1,F)
      documents having at least the
      atomic rejectors, or 1-bits, of S1
      in their rejector vectors.

   b. F2 contains all documents of F not
      in F1 -- those which lack at least
      one 1-bit of S1.

4. Repeat the procedure of steps 1 through
3 for F2 to generate S2. (F2 might be called F1'
for consistent notation.) We then have F2, which
S2 rejects, and F2', which it does not.
This procedure may be repeated, always operating
on a "primed" file to produce an "unprimed"
file with the same subscript, a molecular
rejector with the same subscript, and a new
smaller "primed" file with its rightmost subscript
one greater than that of the subfile from which
it is derived. It is possible, when attempting
to apply this process, that the primed subfile
will turn out to contain a number of "rugged
individualist" rejector vectors which do not
group well with one another, i.e., all μ(S) in
this subfile are very small, less than 1, for
example. In this case, each rejector vector in
the subfile is given its own molecular rejector
vector, which completes its definition and
produces no new primed or unprimed subfile.

5. Apply the following procedure for F1
(note that it is slightly different than that
applied for F2): Since all F1 rejector vectors
have S1 in common, it is necessary to disregard
these bits in any further processing. This is
accomplished by "subtracting" S1 from all the
rejector vectors Ai of F1 (replacing each Ai by
Ai ∧ S̄1, i.e., removing the 1-bits of S1) before
proceeding. Then steps 1 through 3 may be
applied to the modified F1, generating S11, F11,
and F12'. In general, this process, when applied
to an unprimed subfile, generates a new smaller
unprimed subfile with the new subscript q to the
right of those subscripts applying to the old
file (Fij...p → Fij...pq), a new molecular
rejector with the same subscript (Sij...pq), and a
primed file with its rightmost subscript
increased by 1 (F'ij...p(q+1)).
In some cases of application of step 5, the
subtraction process will leave some of the
rejector vectors in the modified file (modified
Fl in the case explicitly considered) completely
empty of 1-bits. This means that these have
been completely characterized by the existing
list of molecular rejectors, so the corresponding document numbers, with complete molecular
rejector list, can be set aside for later
insertion into some storage unit. The process is
then continued on that part of the modified
subfile containing non-empty rejector vectors.
Another possibility is that the newly
generated subfile contains either only one
rejector vector or a set of identical vectors
(applying, of course, to different documents).
In both cases, the remaining rejector vectors
are assigned a single final molecular rejector
and classified completely. Hence no unprimed
file is generated in this case.
The procedure just outlined, when developed
in detail, leads to a complete characterization
of the rejector vector of every document in
terms of molecular rejectors. If A1 is the
original rejector vector and Sj, Sjk, ..., Sjk...p
are the molecular rejectors assigned to it,
then A1 = Sj ∨ Sjk ∨ ... ∨ Sjk...p.
Figure 2 is a rough flow chart for one possible
algorithm for accomplishing this, and Figure 3
sketches the "generation tree" for some file.
At this point, one might set up a
dictionary of a sort, defining each molecular
rejector in terms of the 1-bits it contains, and
then store the file described in terms of these
molecular rejectors. Actually, the subscripts of
the rejector with the most subscripts which
applies to a document suffice to specify all its
molecular rejectors, as study will indicate.
Such a procedure would lead to no particular
advantage, however. The full potentialities of
block rejection can only be realized if a somewhat more elaborate storage method is used. Two
such methods will be described in the next
section.
The File Growth Problem
In the majority of applications, the library
will not be a static collection of documents, but
one which is continually growing. Hence it is
necessary to examine the possibility of incorporating
the rejector vectors for new documents into
the structured index file. The file described
above can accommodate considerable expansion
easily, particularly if new material has the same
basic statistical properties as the original file,
i.e., all of the t(S,G) in the augmented file are
nearly proportional to those of the original.

It is not difficult to insert a new document
into an organized file of the sort described.
What is required is an algorithm which will give
a unique molecular characterization of the
rejector vector of the new document. If AM+1 is
the new rejector vector, AM+1 is compared with
S1, S2, ... until the Sj of lowest index which
applies to it (AM+1 ∧ Sj = Sj) is located. If
no such Sj exists, all of AM+1 becomes a new
single-index molecular rejector. If an Sj does
exist, then it is applied to the document and
subtracted from the rejector vector. The remaining
rejector vector is then compared with
Sj1, Sj2, ... until again either one of lowest
right index is found which applies or it is
found that none applies. Both cases are treated
substantially as before, and in general the
process is repeated until, in one manner or
another, the new rejector vector is fully
described by a set of molecular rejectors (at
most one of these will be a newly generated one).
In most cases a reasonable number of pre-existing
molecular rejectors will apply, and only a few
atomic rejectors will remain to be lumped into a
new molecular rejector.
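A sketch of this insertion rule, under the assumption that the
molecular vocabulary is kept in a dictionary mapping subscript tuples
such as (1,), (1, 2), (1, 2, 1) to nonzero bit patterns, with sibling
indices numbered consecutively from 1:

    def insert_document(new_vector, dictionary):
        """Return the list of subscript tuples describing the new rejector
        vector; at most one new molecular rejector is created and added
        to 'dictionary'."""
        assigned = []
        remaining = new_vector
        prefix = ()
        while remaining:
            # Look for the applicable rejector of lowest right index
            # among the children of 'prefix'.
            q = 1
            found = None
            while prefix + (q,) in dictionary:
                s = dictionary[prefix + (q,)]
                if remaining & s == s:       # S applies: A AND S == S
                    found = prefix + (q,)
                    break
                q += 1
            if found is None:
                # No existing rejector applies: the leftover bits become a
                # new molecular rejector with the next free index here.
                dictionary[prefix + (q,)] = remaining
                assigned.append(prefix + (q,))
                break
            assigned.append(found)
            remaining &= ~dictionary[found]  # "subtract" it, go a level down
            prefix = found
        return assigned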
Clearly, however, it will eventually become
profitable to reprocess the file and generate a
complete new set of molecular rejectors, because
new accessions will have rendered the old
statistics invalid, and a less efficient file
will have been built up. The file structuring
method outlined here does not really come into
its own until the file is large enough so that
the percentage increase (per year, say) is small,
and the statistics are really representative of
the type of information being inserted. (In
scientific and technical literature, progress and
changes in the areas attracting interest will
tend to cause systematic drifts in file
statistics. )

Storage and Screening Schemata
Conventional Techniques
The usual method of approaching the storage
and search problem consists of storing the index
file on some permanent medium (punched cards,
tapes, drums, discs, etc.), often storing
auxiliary tables such as dictionaries, and
providing an internally stored "search program"
for a high-speed digital data processor. In our
application, this program takes requests as input
and delivers as output the identification numbers
of those documents which pass screening.
The storage-and-screening method to be
described here is especially well adapted for
tape storage or other storage in which serial
access is necessary. It minimizes storage
requirements by using tape position to store
much of the information. All molecular
rejectors are defined exactly once on the tape,
and every document identification number is
listed exactly once. Relative position on the
tape determines which molecular rejectors reject
which documents.
In the process, various
marker symbols are used. These may, in practice,
be fields of a few bits in words used primarily
for molecular rejector definition, but in the
discussion to follow they are regarded as
independent entities.
In this storage, one begins with the marker
symbol M0 and then gives the atomic definition of
the molecular rejector Sl. It is helpful to
visualize this as the list of atomic rejectors
for S1, stored as a rejector vector. However,
in general this would be a comparatively
inefficient use of storage space, since S1
itself is composed primarily of 0-bits. The S1
definition is followed by M1, S11, M2, S111, M3,
etc., until the complete molecular sequence for
one or more documents has been completely
defined. Then a T (terminal) symbol is written,
followed by a list of the identification
number(s) of the(se) document(s).

At this point two different situations may
arise:
1. All of the molecular rejectors which
reject the first document(s) (suppose there are k
of these) may reject more documents, but not
constitute a complete characterization of the
rejector vectors for these documents. In this
case, Mk is written after the identification
numbers of the documents, followed by
S11...1 (k + 1 "ones" in subscript), Mk+1, etc.
until another document is completely described.
2. The molecular rejector S11...1 (k "ones"
in subscript) may apply only to the first
document rejector vector, but the other k−1
molecular rejectors apply to more documents
(in a more general case, this will be
k−p, 1 ≤ p ≤ k−1). Then Mk−1 (Mk−p) is written
after the document identification numbers
followed by S11...12 (k subscripts in all), Mk,
and so on, until another document rejector vector
is completely described.
This will suffice to suggest the general
method of listing document numbers and defining
molecular rejectors. This method is iterative.
Figure 4 gives a diagrammatic example of a
possible tape configuration.
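The layout can be suggested by the following sketch; the tuple
encoding of markers, definitions, and terminal symbols is invented
for the illustration, and the generation tree is supplied as nested
(rejector bits, completed documents, children) triples:

    def serialize(children, depth=0, tape=None):
        """Write the generation tree onto a flat 'tape' of records.
        Relative position on the tape is what ties documents to their
        molecular rejectors; the markers record the nesting depth."""
        if tape is None:
            tape = []
        for bits, docs, kids in children:
            tape.append(("M", depth))           # marker M_depth
            tape.append(("S", bits))            # atomic definition
            if docs:
                tape.append(("T", list(docs)))  # terminal symbol + doc numbers
            serialize(kids, depth + 1, tape)
        return tape

    # Toy tree: S1 with child S11 and grandchild S111; document A is
    # described by S1 v S11, document B by S1 v S11 v S111.
    tape = serialize([(0b0011, [], [(0b0100, ["A"], [(0b1000, ["B"], [])])])])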
The file is screened in a particularly simple
way. One takes the request descriptor vector and
begins at M0 on the tape. The request is compared
with each molecular rejector in turn until one of
the following conditions occurs: (1) a T symbol,
along with a list of one or more identification
numbers, is reached; or (2) following Mi, the
molecular rejector definition contains 1-bits
where the request also contains 1-bits.
In the first case, none of the molecular
rejectors characterizing the rejector vector of
the listed documents has any bits in common with
the request; hence the documents in question must
possess at least those descriptors given in the
request, and the documents have passed screening.
Therefore, the identification material may either
be printed out or passed on to a more powerful
routine which calls in a more detailed request
and compares it with a more detailed index file.
In the second case, the "bit overlap"
between request and molecular rejector means that
none of the documents to which the rejector
applies can possibly satisfy the request, since
they all lack at least one descriptor that is
asked for in the request. Therefore, all further
molecular rejectors pertinent to these documents,
as well as their identification numbers, can be
ignored, and the tape can be advanced immediately
to the next occurrence of an Mj with j < i, and
screening can be begun again at this point. (See
Figure 4 for an example of this process.)
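A sketch of this screening pass over the serialized records of the
previous sketch; the skip target used here is the next marker at the
same or a shallower depth, which is the block-rejection move the text
describes:

    def screen(tape, request_bits):
        """Single scan of the serialized file, rejecting whole blocks."""
        accepted = []
        k = 0
        while k < len(tape):
            kind, payload = tape[k]
            if kind == "M":
                depth = payload
                _, bits = tape[k + 1]   # the rejector this marker introduces
                if request_bits & bits:
                    # Block rejection: advance past this rejector's subtree.
                    k += 2
                    while k < len(tape) and not (tape[k][0] == "M"
                                                 and tape[k][1] <= depth):
                        k += 1
                    continue
                k += 2
            elif kind == "T":
                accepted.extend(payload)   # these documents pass screening
                k += 1
            else:
                k += 1
        return accepted

    # With the toy tape above, a request overlapping S11 (bit 2) rejects
    # documents A and B in one skip: screen(tape, 0b0100) == [].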
Clearly this procedure avoids the necessity
of examining many of the molecular descriptors in
the file. The full rationale for the file
structuring procedure now becomes clear, since it
assigns molecular rejectors in such a way as to
maximize the number of documents screened out in
a single operation and also to increase the
probability that rejector-request overlaps will
occur early in the screening process.
Serial storage has the one drawback that it
is costly to add new rejector vectors (with
document identification numbers), since these
will in general belong somewhere in the middle
of the file, and recopying a part of the file tape
will be necessary to accomplish this. It is also
input-output limited, since there is a bare
minimum of "comput at ion II to be done. The best use
of this screen would be as the input-controlling
subroutine of a larger routine (as suggested
above), preferably one with much greater internal

processing. (It may be used in just this way in
connection with the HAYSTAQ chemical structure
search program.)
Random Access Integrated Program-File
If very large random-access addressable
storage is available constituting, for example,
more than half of the total required storage,
another method becomes feasible. The screening
program and the file may be integrated into one
unit, through which, in a manner of speaking,
control "propagates" until it reaches an output
point where print-out occurs.
In this method, the same basic organization
is used as was discussed under Conventional
Techniques, but the molecular rejector and its
marker are replaced by a subroutine which
examines the request and, depending upon what
bits occur in the request, passes control
directly on to the next subroutine or diverts it
to the subroutine which replaces the Mj (j < i).
These subroutines might actually have the
rejector vectors of their molecular rejectors
stored with them, which are compared with the
request, but other more efficient alternatives
are available. (See Figure 5 for a sample flow
chart.)
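The idea can be suggested in present-day terms by compiling each
record into a small routine that returns the index of the next
routine to run; the closures below stand in for the subroutines of
the text and reuse the toy record format of the earlier sketches:

    def compile_program(tape):
        """Each record becomes a routine which, given the request, returns
        the index of the next routine: either the one immediately
        following or a jump past the rejected block."""
        routines = []

        def marker(depth, bits, here):
            jump = next((j for j in range(here + 2, len(tape))
                         if tape[j][0] == "M" and tape[j][1] <= depth),
                        len(tape))
            def routine(request, accepted):
                return jump if request & bits else here + 2
            return routine

        def terminal(ids, here):
            def routine(request, accepted):
                accepted.extend(ids)       # these documents pass screening
                return here + 1
            return routine

        for k, (kind, payload) in enumerate(tape):
            if kind == "M":
                routines.append(marker(payload, tape[k + 1][1], k))
            elif kind == "T":
                routines.append(terminal(payload, k))
            else:                          # "S" records: definition only
                routines.append(lambda request, accepted, nxt=k + 1: nxt)
        return routines

    def screen_compiled(routines, request_bits):
        accepted, k = [], 0
        while k < len(routines):
            k = routines[k](request_bits, accepted)
        return accepted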
This sort of file-program has the probable
drawback of requiring more storage than the
procedure outlined earlier (experimental data are
needed here), but it is faster in that it avoids
any necessity of passing over rejected material.
It proceeds directly from one comparison to the
next, with great consequent gains in speed.
A compromise between these two methods would
be to keep part of the screening routine distinct
from the file, replacing the marker Mi with an
instruction which loads an appropriate counter
with the address of the molecular rejector which
is associated with Mj in the prior procedure.
This counter would control the "work area" of the
routine and would normally advance by one, unless
modified by an "M instruction".
Such methods could be most efficiently
applied in systems with very limited control
units, having only a few indexing, logical, and
conditional transfer instructions in their
repertory but very large address fields, and a
very ,large bank of addressable memory, for
example, disc storage. The random-access feature
would make it quite simple to enter new data into
the file, since appropriate adjustment of
transfer (or indexing) instructions can allow
"logically contiguous" material to be stored in
widely separated sections of storage. (The
author would like to acknowledge the relationship
between this and the comparable techniques
employed in the various list processing
routines.8 It may be possible to use list
processing techniques directly here without loss
of efficiency, but this seems unlikely, at least
as long as elaborate compilers and interpretative
routines are necessary to the use of list
processing.)
Evaluation: Comparison with
Ordinary and Inverted Files
The Files from a List Viewpoint
The best way to evaluate the worth of the
new file structure described here is to compare
it with the more familiar ordinary and inverted
files. Then the questions posed in the section
on Utility of Metasymbols in Screening can be
answered in a comparative way, and the new file
put into some sort of perspective. In order to
compare the files and the methods by which
requests are handled, we need to describe them in
a common language. The terminology of list
processing (used loosely) is well adapted to this
description. We will find it convenient to
describe some files in terms of descriptors,
others in terms of rejectors.
The ordinary file. In this type of file,
the primary list is the list of M document
numbers. Each document number then serves as
the point of attachment for an index sublist.
These appended sublists may be either descriptor
lists or rejector lists, although the latter may
be cheaper to process. In the processing, a
request descriptor sublist is compared with the
sublists for each document in turn to see whether
all requested descriptors are present (or that no
request descriptor occurs as a rejector), and
the document number is added to an "accepted"
list if it passes.
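A sketch of ordinary-file processing, assuming each document carries
a descriptor sublist stored as a set (an illustrative structure, not
the paper's):

    def ordinary_screen(file_sublists, request_terms):
        """Primary list = document numbers; accept a document if every
        requested descriptor appears on its sublist."""
        wanted = set(request_terms)
        return [doc for doc, descriptors in file_sublists.items()
                if wanted <= descriptors]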
The inverted file. A file is normally
inverted with respect to its descriptors. Then
the primary list is made up of the N descriptors.
To each descriptor is attached a sublist of those
documents to which it applies. A request
descriptor list can be processed by use of a
"candidate list ". This list is initially the
document number sublist for the first descriptor
that appears i.n the request. Each additional
request descriptor is processed by obtaining from
the file the document number sublist attached to
it and comparing this sublist with the candidate
list. All candidate numbers which do not appear
on the descriptor sublist are removed. That is,
only those that appear on both the descriptor
sublist and the candidate list are retained
(thus taking the "and" or intersection of the two
lists). If the file were to be inverted with
respect to its rejectors, one would combine the
sublists for all those rejectors appearing as
descriptors in the request. The result would be
a list of rejected documents, which would have to
be "subtracted" from the list of all documents.
The new file. In the case of the file described in earlier sections of this paper, a more
elaborate and unconventional list structure is
required. The molecular symbols Sij...p
participate in the structure, as well as document
numbers and rejectors. The primary list is a
list of single-subscript molecular symbols.

Each has attached to it two sublists:
1. A rejector sublist which defines the
molecular rejector in terms of its atomic
rejectors;

2. A sublist of all two-subscript molecular
symbols which have the subscript of their primary
molecular symbol as the first subscript. In the
case of a "rugged individualist" rejector vector
vThich has been~made a single index molecular
rejector, this sublist will contain the document
number or numbers which the rejector vector
would sincle out for rejection.
In turn, each two-index molecular symbol will
have attached its two sublists, and so on,
creating a highly branched tree of lists. The
logic of the screening procedure for this set of
lists is essentially that previously described
under Storage and Screening Schemata. We
observe that, in practice, the "list" method
would be impractical here, since the molecular
symbols are excess baggage.

Before leaving this topic, it is interesting
to examine the relationship of the new type file
structure to the ordinary and inverted file
types. The ordinary type, it turns out, is
nearly a special case of the new type and the
inverted file is a close relative of another
special case.
If every rejector vector of the new file is
treated as a "rugged individualist", then there
are no molecular symbols with more than one
subscript. Hence, the file becomes a single list
of molecular symbols, each "defined" by a
rejector list and paired with one (or more)
document numbers. With the exception of the
superfluous molecular symbols, this is a refined
version of the ordinary file. However, unless
all M documents have different rejector vectors
(or lists), there will not be M entries in this
primary list, since all documents with any given
vector are combined in one entry. This is
clearly a sensible innovation in any case.
On the other hand, if each of the molecular
rejectors contains exactly one atomic rejector,
the file structure that results becomes a multilevel file inverted with respect to rejectors.
If the file were inverted with respect to
rejectors, and each of the document sublists
inverted again with respect to the remaining
rejectors, and so on repeatedly, a closely
related file structure would be obtained. In
this case, however, every document appears b
times in the file, if it has b rejectors. In
the one-rejector case it appears only once.
Basically, the relational diagram of the
multiply-inverted file would contain Figure 3
as a subdiagram.

In a sense, then, the new file uses the
"inversion" idea in a more generalized form,
but applies it in a more effective way in order
to avoid repeated storage of the same
information.
Detailed Intercomparison of System Efficiencies
Obviously a conclusive comparison of the
relative efficiencies of the three file types is
a matter for experimental test on large real files
by using real requests. No such tests have been
performed to date. However, several possible
tests are under discussion at the National Bureau
of Standards at the present time.
One interesting test could be made on
chemical structures. Descriptors can be chosen
either from the chemical elements or from the
functional groups which can occur in the
structures. Since the occurrence or absence of
these in structures is unambiguous, the worst
problems of finding a "fail-safe" vocabulary are
avoided. The "library" indexed by these
descriptors might have as "documents" single
structures, sets of structures, and/or information
about properties, depending upon the application
envisioned.
In the absence of empirical knowledge we are
reduced to the procedure of examining degenerate
cases, which may be quite informative. The cases
of interest are (a) a random sample of questions
which elicit nearly the whole file as answers,
and (b) a random sample of requests which reject
nearly the entire file or the whole file. The
details of the comparison depend, of course, upon
the exact way in which each of the procedures is
programmed. What one would like to know is
how many instruction executions on some computer
are required to process the average request. The
best we can do here is to count "manipulations",
but this provides a good indication.
In the case in which nearly the whole file
answers the request, the two more conventional
files are probably superior, but for different
reasons. The descriptor inverted file will be
the quickest for the simple reason that, given a
sensible vocabulary, a request can have few, if
any, descriptors in it if it is going to get
anywhere near 100% response. Granted that
only a few descriptors can occur in a high-
percentage request, at most two or so lists need be
intercompared. For one, the job is done
immediately; for two or three, by any efficient
algorithm, probably fewer than M document
identification-number comparisons are required.

In the case of the ordinary file, M
descriptor or rejector lists must be compared
with the request. If vectors are used, this
amounts to M "and"-and-test sequences (multiple
precision if necessary).
In the case of the new file, we observe that
every set of identification numbers (with the
same rejector vectors) has a "last" Sjk...p
associated with it alone. But there is then a
superstructure of molecular rejectors with fewer
subscripts erected over this base. There are no
more than 2^N different rejector vectors possible,
and the actual number in the file will often be
somewhat smaller. If M > 2^N by any great
amount, it is possible that there are fewer than
M − 2^N "super-structure" molecular rejectors. In
any event, every request satisfied by nearly
every document will have to be compared with
nearly every molecular rejector's rejector list,
and if M ≤ 2^N this will be more expensive than
the ordinary file. (Note that if this is really
done in list-processing form, rather than in
"vector" form, time will be saved by the fact
that there are fewer rejectors per molecular
rejector than there are per individual rejector
vector. In fact, as far as the number of descriptor-
rejector comparisons is concerned, the values
will be comparable.)
Shifting to the opposite extreme now, we see
the new type file come into its own. Here
nearly every document is rejected. We observe
that the single-subscript rejectors may be nearly
all that need be examined. In the majority of
requests having 100 percent rejection this will
be true, assuming the sticky problem of finding
a good measure of merit μ(S) has been solved
satisfactorily. Unfortunately it is difficult
to estimate the probable number of these single-
subscript rejectors. It will obviously be
between 0 and 2^N, and considerably less than the
latter. If the degenerate "one atomic rejector
per molecular rejector" case is considered, there
would be N (or fewer); the number may lie near N,
in any case. At least, it will be much less than M.
In some near-total rejection cases a request
may filter fairly far down the "rejector tree"
before being blocked, but because of the use of
μ, this will be in the minority of cases. Hence
the average number of molecular rejector-request
comparisons will be small, generally
much smaller than M, particularly if M > 2^N.
Here, as in the total acceptance case, the
ordinary file still requires M request list-document
list comparisons and will lose out so
far as efficiency is concerned.
The inverted file is a bit more complicated.
Suffice it to say that if a given document has k
out of the r descriptors requested, it is
rejected k times, in effect. Until a request
descriptor is reached that does not apply to that
document, the document appears in the "candidate
list." Even after it is stricken from that list,
other lists must be manipulated in which it is
carried as excess baggage, and for each of these
it must be classified as "not in the candidate
list" and discarded. Since the molecular
rejector file rejects each document only once
(in a sense less than once since the document in
question is likely to be rejected along with
others), it is more efficient in this respect.
To summarize our results, then, the new
file is likely to be superior to the ordinary
file if the screen is powerful enough to average

a very high percentage of rejection. If it is
not this powerful, then it is hardly worth using
in any case. If M > 2^N, it is better in all
cases. We also conclude that the molecular
rejector file is very probably better than the
inverted file in the high-rejection cases, but
there is a much stronger need for data in this
case.


A Final Efficiency Problem
Even granting that a case can be made for
the contention that the new type of file is
superior to the other two if heavy screening is
possible, a more difficult question remains. An
estimate is needed of the savings which will
result from use of the pre-processed file, as
contrasted with the expense of writing, debugging,
and then using the file-organizing routine. This
question appears to be unanswerable without data,
but it does suggest the desirability for a
potential user to consider the "request-traffic"
of his file. A file with light traffic will
naturally take longer to pay for itself than one
with heavy use. Because of the possibility of
periodic reprocessing of the file because of new
documents (as discussed before), the use must be
heavy enough to pay for this, at least.


Conclusion
Two basic ideas helpful in constructing
information retrieval systems have been discussed.
First, the advantages of the screening approach
to the file organization in the development of
systems were argued. Secondly, the desirability
of pre-processing files to take advantage of
block rejection was suggested. These two ideas
were then applied to the problem of setting up
a nearly fail-safe screen using descriptors or,
more accurately, rejectors. An algorithm was
sketched for generating descriptions of documents
in terms of "molecular rejectors". Another
algorithm was outlined for storing on tape a file
of the type described. Also discussed was an
alternative "storage" system, using the idea of
"dynamic storage" in which information is conveyed
by branch points in a program.
The new type of file was then compared with
the better-known ordinary and inverted files.
The conclusion of the theoretical reasoning was
that the system shows enough promise of increased
efficiency to warrant detailed testing of real
files.

References

1. For the logical (propositional calculus) features: Harold Pfeffer,
   Herbert R. Koller, and Ethel C. Marden, "A First Approach to Patent
   Searching Procedures on Standards Electronic Automatic Computer,"
   American Documentation, Vol. X, No. 1, Jan. 1959, pp. 20-26. For
   topological network features: Herbert R. Koller, Ethel Marden, and
   Harold Pfeffer, "The HAYSTAQ System: Past, Present and Future,"
   Preprints of Papers for the International Conference on Scientific
   Information, Washington, D.C., Nov. 16-21, 1958.

2. "Depth" refers to the degree to which the complete content of a
   document is represented by the descriptor set (or other
   representation) assigned to it. Deep indexing, then, leaves out
   very little of the content of any document in the file.

3. D. D. Andrews and Simon M. Newman, "Activities and Objectives of
   the Office of Research and Development in the U.S. Patent Office,"
   Journal of the Patent Office Society, Vol. 40, No. 2, Feb. 1958,
   pp. 79-85.

4. It is hoped that actual machine testing of some of the theories
   described here will be possible in the near future.

5. F. E. Firth, An Experiment in Literature Searching with the IBM 305
   RAMAC, San Jose, California: IBM, November 17, 1958; J. J. Nolan,
   Principles of Information Storage and Retrieval Using a Large Scale
   Random Access Memory, San Jose, California: IBM, November 17, 1958;
   Jacob Leibowitz, Julius Frome, and Don D. Andrews, Variable Scope
   Patent Searching by an Inverted File Technique, Patent Office
   Research and Development Reports ... No. 14, U.S. Department of
   Commerce, Washington, D.C., Nov. 17, 1958; Jacob Leibowitz, Julius
   Frome, and F. D. Hamilton, "Chemical Language Coding for Machine
   Searching," Abstracts of Papers, 135th Meeting, American Chemical
   Society, Boston, 5-10 April 1959, p. 3G.

6. Victor H. Yngve, "The Feasibility of Machine Searching of English
   Texts," Preprints of Papers for the International Conference on
   Scientific Information, Washington, D.C., Nov. 16-21, 1958, Area 5,
   pp. 161-169; Victor H. Yngve, In Defense of English, Preprint of
   Paper presented at An International Conference for Standards on a
   Common Language for Machine Searching and Translation, Cleveland,
   Sept. 6-12, 1960, 8 p.

7. If tape is used, it is clearly most efficient to replace Mi with
   the number of words between Mi and the appropriate Mj, so that the
   tape can be advanced immediately without having to examine each
   word until Mj is found. For large files (many tapes), tape number
   and tape position could be specified in place of Mi.

8. J. C. Shaw, A. Newell, H. A. Simon, and T. O. Ellis, "A Command
   Structure for Complex Information Processing," Proceedings of the
   Western Joint Computer Conference -- Contrasts in Computers, Los
   Angeles, California, May 6-8, 1958, pp. 119-128; John McCarthy,
   "Recursive Functions of Symbolic Expressions and Their Computation
   by Machine, Part I," Communications of the ACM, Vol. 3, No. 4,
   April 1960, pp. 184-195; Victor H. Yngve, "The COMIT System for
   Mechanical Translation," Proceedings of the International Conference
   on Information Processing, UNESCO, Paris, France, June 15-20, 1959,
   pp. 183-187.

Appendix
Details of Selecting an Optimal File Structure

A. Introduction

Here we address ourselves to the details of
the choice of an optimal molecular rejector
language for the file. The desirability of this
molecular description will be assumed.
The concept of an optimal language needs
considerable amplification. We wish to minimize
the average number of molecular rejector-request
comparisons needed to "screen" a given request,
assuming that the amount of machine time used in
screening one request is proportional to the
number of comparisons performed. This assumption
will be strictly true for fixed field processing
in which N is less than or equal to the number of
bits in a word, and on the average true when fixed
field "vectors" must be processed with multiple
precision. It is not really valid if variable
field (list-type) processing is used. In the
latter case a different analysis is needed.
The notion of an average is necessary since,
as the section on "Evaluation" points up, the
number of comparisons for a given request is
strongly dependent upon the nature of the request.
But then an unpleasant question asserts itself:
"Over what do you average?" The answer one would
like to give is "We want the average over all the
requests which will ever be presented to the file."
If this average is minimized, we have the best
file, by definition. Unfortunately this is an
unhelpful answer to the question since it cannot
be found until all requests have been processed.
Then it is a bit late to be of any use.
Hence we must resort to imperfect predictors
of what this average would be for each molecular
vocabulary in order to choose the best one. Each
prediction method can then be used in a
vocabulary generation algorithm. The reciprocal
of the predicted average may be used as the
"quality" of the file.
However, another competing criterion is
involved, that of the complexity of the algorithm
needed to apply the prediction of averages to the
choice of a vocabulary. Such an algorithm, when
programmed, must be able to find a "best"
vocabulary in some reasonable running time.
Unfortunately, in order to get manageable
routines, we have to cut corners and make shaky
approximations in various places. Ultimately,
we must balance time saved in screening against
time consumed in processing, and stop at some
(at this time) ill-defined "break-even" point.

B. Development of the Prediction Method

One can break down the prediction methods
into two types: external sample methods and file
sample methods. In the external sample methods,
a sample of requests is gathered from potential
users, and these are used as predictors of the
kind of requests likely to be directed to the
file. On the other hand, to the extent to which
the content of the file is representative of the
distribution of interest of people using the file,
statistics about occurrence of molecular
descriptors in the file can be used to predict
requests without external data from potential
users. Our tentative hypothesis is that this
latter method is probably quite adequate. It is
obviously cheaper, since it reduces the amount of
error-free machinable data that must be prepared.
One quite serious problem in the choice of
the best molecular vocabulary arises from the
question of how many alternative complete
vocabularies must be generated. If we have a
method of assigning each vocabulary a value,
clearly we could generate all possible
vocabularies, assign each a value, and then
choose the best. However, the number of
operations involved grows as some "fierce"
exponential of the number of documents in the
file and the vocabulary size.
The most desirable method would follow a
direct path from molecular rejector to molecular
rejector, in every case choosing exactly that one
which will lead to the best final file. What
this requires is that at every point we be able
to assign to each candidate-rejector S a
"measure of merit" μ(S) which predicts the
ultimate quality of the file which will result
if it is chosen. It is by no means clear that
such a μ can be found which avoids effectively
"trying out" nearly every possible resultant file
(another exponential factor). At this point,
too, we will find it necessary to make imperfect
predictions in order to find a manageable process.
Our procedure might be described as
pessimistic, or cautious. When choosing a
molecular rejector at any point, we consider the
worst possibility for the number of comparisons in a
file determined by the choice, and then choose
the one that gives the "strongest guarantee
against the worst". In other words, at each
point of choosing a molecular rejector, we are
in a position to assert, "The maximum possible
number of comparisons to which I might obligate
myself if I choose this rejector is k." Then we
choose the rejector with minimum k. Every
rejector will, in fact, do better than this
estimate when the whole vocabulary is chosen (this
will be clarified later), and it is possible that
one of the apparent underdogs would come out best
in the end, but to check this, one would have to
"try it and see", which means another case of
exponential growth in number.
We reason as follows:

1. For each molecular rejector S there will be
a probability P(S) that a "random" request will
have one or more bits in common with S. This we
will call the "rejection probability" of S, the
probability that S will be able to reject all
the documents to which it applies. How one
computes P(S) will be taken up later.

2. For a (sub)file F with R different rejector
vectors (documents with like vectors are already
grouped), the number t(Si,F) of vectors to which
a given Si applies is calculated. The subfile to
which it applies is Fi.

3. We then note that if a request Q is directed
to the (sub)file F, it has a probability P(Si) of
being rejected by Si, and hence a probability
1 − P(Si) of passing on to the file Fi.

4. Being pessimistic, we assume that after Si
is removed, each of the Fi rejector vectors has
to be given its own molecular rejector. Hence
if Q "passes through" Si, t(Si,F) further
comparisons must be made. But only
(1 − P(Si)) × 100 percent of the requests are passed
through. This leads to an average of
(1 − P(Si))·t(Si,F) comparisons.

5. In addition, Q must be directed at the
R − t(Si,F) other vectors. Assume all of these
must be treated as rugged individualists, at one
comparison per vector.

6. Totalling the number of comparisons
(including that with Si itself), we get
K = 1 + (1 − P(Si))·t(Si,F) + R − t(Si,F) = R + 1 − t(Si,F)·P(Si).

7. We observe that if t(Si,F)·P(Si) > 1, then
K < R. Since R comparisons would be involved if
each vector were processed separately, in this
case use of the molecular rejector Si will be an
improvement over "ordinary" processing. If
t(Si,F)·P(Si) < 1, all the vectors should
probably be stored individually.

8. Furthermore, we note that if this reasoning
is applied to the subfiles Fi and F − Fi,
respectively, it is clear that each of these will
probably do better than the t(Si,F) and R − t(Si,F)
comparisons ascribed to them, further justifying the
fact that 4 and 5 are pessimistic estimates.

9. It is clear that we minimize the number of
comparisons that the (sub)file F contributes to
the total by maximizing fi·P(Si), where fi is the
number of documents in Fi. Therefore, we choose
as the measure of merit μ(Si) the number
μ(Si) = fi·P(Si).

10. Suppose μ(Si) = μ(Sj). We would like to
find a further criterion for choosing Si or Sj
as "most likely to succeed". Suffice it to say
that the one of largest P(S) is probably best.
If P(Si) = P(Sj), a random choice is the best we
can do.
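As a numerical check on steps 4 through 7, the fragment below
evaluates the pessimistic count K and the merit for one invented
candidate rejector; the figures are illustrative only:

    def pessimistic_comparisons(R, t, P):
        """K = 1 + (1 - P)*t + (R - t) = R + 1 - t*P  (steps 4-6).
        R: distinct rejector vectors in the subfile; t: vectors S applies
        to; P: estimated rejection probability of S."""
        return R + 1 - t * P

    # Invented numbers: 40 distinct vectors, a candidate S applying to 10
    # of them, with estimated rejection probability 0.6.
    K = pessimistic_comparisons(R=40, t=10, P=0.6)   # 40 + 1 - 6 = 35 < R
    mu = 10 * 0.6                                    # merit f_i * P(S_i) = 6 > 1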

C. Estimates of the Rejection Probability P(S)

Clearly the tally t(Si,F) can be obtained
simply by counting and presents no important
problems. The rejection probability P(Si) is
more difficult, however. A number of methods of
evaluating or estimating P(Si) will be presented
here, in order of decreasing sophistication
(and complexity, in general).

1. Outside Sample; Complete Request Method.
Here we assume that information is available
from an outside sample. Every possible set of
descriptors which might make up a request
Q (all 2^N − 1 of them) is tested against the
sample, and a percentage p(Q) is computed.
p(Q) is the percent of sample requests which are
exactly the request Q, in other words, the
probability that a random request will be Q. A
table of the p(Qi) for all 2^N − 1 of the Qi is
stored. For each Si, all those requests overlapping
with Si, identified as Rij, are generated. Then
we have P(Si) = Σj p(Rij). (This is the usual
probability that at least one of a list of
independent events will occur. In this case the
"event" is the occurrence of exactly the
particular Rij as a request and hence is
independent of that for any other Rik, k ≠ j, even
if Rij ∧ Rik has many 1-bits in it.)

2. File Sample; Complete Request Method. The
procedure is exactly the same as above, except
that the file itself is used as the sample.
This is not a trivial process, since the file is
encoded in terms of its rejectors rather than
descriptors. It is best done for each Qi by
looking for exactly the complement of Qi in the
file, counting the number of these to get N(Qi),
and then dividing by M to get p(Qi) = N(Qi)/M.
If the file is uncomfortably large, one might
select a manageable random subfile for this
purpose.

3. Atomic Request Method: Either Sample. Here,
for each atomic descriptor Di, we simply
evaluate the probability P(Di) that this
descriptor will be found in a given request. We
then estimate the number p(Qj) of the above
analysis by listing the set {Dk} that occur in
Qj (symbolically, Qj = ∨k Dk). We say
p(Q) ≈ P(D1)·P(D2)···P(Dk) = Π P(Dk) (iterated
product). Then the formula for P(Si) is
computed as in C.1. This computation is based
upon the well-known formula for the probability
that a number of independent events will occur
together. We know that, in fact, the occurrence
of descriptors in a request will not be
independent; they will tend to "cluster". C.1
and C.2 take this into account, but it is thrown
away here. Clearly a considerable computation is
avoided, however.

4. The No-Sample Complete-Request Method. Here
we assume that all requests are equally probable.
Thus we need only to evaluate the percentage of
all possible requests which overlap Si to get
P(Si) to within some multiplicative constant
(which can be ignored). There are 2^N − 1
requests. If Si has bi bits, there are N − bi bits
that do not "intersect" Si and 2^(N−bi) − 1 requests
made up only of those bits. Hence there are
(2^N − 1) − (2^(N−bi) − 1) = 2^N − 2^(N−bi) requests which do
overlap with Si. This P(Si) equals 2^N − 2^(N−bi), to
within the constant; the important quantity is
1 − 1/2^bi, which clearly increases
directly with bi. (This can be found neatly
on a computer with one-bit right shifts as one
counts the number of bits in Si and then
subtracts.)

Clearly, all of these methods are
programmable and not too forbidding when
considered separately. The problem arises from
the fact that the μ(S) subroutine will need to
be used an astronomical number of times in
generating a file structure and, in fact, is
likely (it appears to the author) to account for
a high percentage of the running time of the
file-organization routine.
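By way of illustration, two of the estimators are sketched below.
The atomic-method function is a shortcut that folds the summation of
C.1 into an independence calculation rather than a literal
enumeration of requests, so it should be read as an approximation in
the spirit of C.3 rather than the procedure itself; the second
function follows C.4 directly:

    from math import prod

    def p_atomic(s_bits, p_descriptor, vocab_bits):
        """Approximate P(S) from per-descriptor request probabilities,
        assuming descriptors occur in requests independently.
        p_descriptor maps bit position -> P(D_i)."""
        avoid = prod(1.0 - p_descriptor[i] for i in range(vocab_bits)
                     if s_bits & (1 << i))
        return 1.0 - avoid

    def p_no_sample(s_bits):
        """C.4: with all requests equally probable, the useful quantity is
        1 - 1/2**b, where b is the number of 1-bits in S."""
        b = bin(s_bits).count("1")
        return 1.0 - 0.5 ** b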


Figure 2. Flow Chart of File Organization Routine.
(The chart shows the working stores used by the routine: an F-store
holding the current subfile of rejector vectors, an R-store holding
the molecular rejectors selected so far, a V-store holding the
remaining vocabulary, and a Prime store holding primed subfiles. Its
boxes cover loading the stores, generating the molecular rejectors
containing terms in the V-store, computing μ(S) for each and
selecting the S of maximum μ(S), "subtracting" S from the rejector
vectors, dumping completed document identification numbers with
their molecular rejectors on output, and reloading from the Prime
store when the F-store is empty.)
Note: The Prime store is a storage area for primed subfiles. Each
primed subfile is stored along with all those molecular rejectors
applying to every document in the subfile. The left branch
corresponds to Step 4 in the text; the right to Step 5.


Figure 3. File Generation Tree.

Figure 4. (a) Tape Configuration; (b) Generation Tree for File;
(c) Documents in File; (d) Request example.

c. Documents in File:

Document    Molecular Description of Rejector Vector
A           S1 ∨ S11
B           S1 ∨ S11 ∨ S111
C           S1 ∨ S12 ∨ S121
D           S1 ∨ S12 ∨ S122
E           S1 ∨ S13
F           S2 ∨ S21

d. Request Q has bits in common with S11, S122, and S2, but with no
other molecular rejectors; hence, by c., it should reject A, B, D,
and F, and C and E should be accepted. Processing starts at M0 and
proceeds word by word until an overlap occurs; the tape is then
advanced immediately to the next appropriate marker, screening
proceeds word by word to the next overlap, and so on (the circled
reference points of the original figure are not reproduced).

