Thursday, September 6, 2007

Elements of Scientific Research and Discovery

ELEMENTS OF SCIENTIFIC RESEARCH AND DISCOVERY:
A Study with CER in High-Temperature Superconductivity

Sakir Kocabas

Dept. of Space Engineering, Istanbul Technical University,
Maslak 80626, Istanbul, Turkey.

Abstract

In this paper we describe a program, CER, which models some of the research activities carried out in the process of the discovery of high-temperature superconductors in 1986 and 1987. These activities include goal and strategy choosing, literature searches, proposing experiments, expectation setting, designing and conducting experiments, data collection, generating and testing hypotheses, modifying hypotheses, and generating explanations.

CER’s design includes many of the elements of scientific research and discovery and provides a step toward a complete computational model. The system has 17 discovery operators which use over 150 methodological rules many of which are general and applicable to other domains of physics and chemistry.

Keywords: Scientific discovery, autonomous operators, methodological rules, consistency, completeness, hypothesis generation, scientific explanation.

1 Introduction

Among the computational models of scientific discovery developed until recently, the KEKADA system (Kulkarni & Simon, 1988; 1990) is interesting in the way it simulates several different research tasks. The CER system described in this paper introduces many improvements on KEKADA by providing a more detailed representation of scientific research and thus constitutes a more comprehensive computational model. CER is designed to model the discovery of high-temperature oxide superconductors in an interrelated series of tasks such as proposing research goals, choosing research strategies, proposing experiments, designing and conducting experiments, data collection, generating and testing hypotheses, verifying, modifying or deleting hypotheses, the supervision of goal satisfaction, and generating explanations.

Our reasons for choosing high-temperature superconductivity as the domain for modeling scientific research are as follows: 1) Research in high-temperature superconductivity is well documented in short articles and research reports published in several leading science journals, 2) it is relatively easy to trace and reconstruct the historical progress of the discoveries, the introduction of new hypotheses, the development and transformation of the theoretical ideas, and their influence on subsequent research in this field, 3) a study into the discoveries in this field should enable us to build more realistic models of scientific research and discovery, and 4) we would be able to test our results in an active field of research.

CER's domain knowledge is largely based on about 1200 research reports and short articles on high-temperature superconductivity, published in science journals such as Physics Today, Nature, Japanese Journal of Applied Physics and Physical Review-B between September 1986 (when Bednorz and Müller's paper first appeared in Zeitschrift für Physik) and November 1988. Some of the information was obtained through interviews with researchers in high-temperature superconductivity. Most of CER's methodological knowledge was elicited from the research reports. The system's general theoretical knowledge, which includes detailed information about all the chemical elements in the Periodic Table as well as information about various oxide compounds, was drawn from physics and chemistry handbooks.

The reports mentioned, contain numerous details of the elements of scientific research such as research goals, strategies, experiments, expectations, experiment results, hypotheses, verification (or falsification) of the expectations and earlier hypotheses, explanations, and also suggestions for further research. Based on these articles and research reports, we have constructed a chronological order of the theoretical and empirical developments in this field.

This paper is organised as follows: Section 2 presents a summary of the developments which led to the discovery of high-temperature superconductors in a chronological order. Section 3 describes the CER system in terms of its knowledge representation, interactions, its operators and its initial knowledge. Section 4 provides an example of how CER simulates the discovery of high-temperature superconductors. In section 5, generality of CER's knowledge representation, system operators, and search methods are discussed. Finally, Section 6 sums up the conclusions drawn from the CER experiment with suggestions for future work.

2 High-Temperature Superconductivity

Superconductivity is a phenomenon occuring at low temperatures in some electrical conductors, in which conduction electrons undergo a collective transition to an ordered state with many unique and remarkable properties. These include the disappearence of electrical resistivity, appearence of large diamagnetism and other unusual magnetic effects, substantial alteration of many thermal properties and the occurence of quantum effects otherwise observable only at the atomic and subatomic level (see e.g. Langenberg, 1987).

2.1 The Discovery of Superconductivity and the Early Developments

The story of superconductivity begins in Leiden in 1911 when H. Kammerlingh Onnes, after succeeding to liquify helium, decided to conduct physical tests on various materials. While measuring the electrical resistivity of mercury, he noticed that the resistivity fell sharply toward zero at 4.2K. Subsequently other superconducting elements, alloys and compounds were found. The temperature at which the transition to superconducting state occurs is called the critical temperature (or Tc). In 1933 W. Meissner and R. Ochsenfeld discovered that a metal cooled into the superconducting state in a magnetic field expels the field from its interior ("Meissner effect").

Superconductivity remained to be a much studied but puzzling phenomenon for nearly half a century after its discovery. A great deal of experimental information was accumulated on its occurence and properties, and several useful phenomenological th
eories were developed. Then in 1957, J. Bardeen, L.N. Cooper and J.R. Schreiffer proposed the first successful micro-physical theory of superconductivity (the BCS theory). This theory can explain how and why the electrons in a conductor may form an ordered superconducting state and makes predictions about many properties of superconductors, which are in good agreement with experimental data. Many of the current theories on superconductivity are variations of the BCS theory.

Onnes' discovery of superconductivity can be characterised by a general research goal such as: "Investigate the physical system in extreme conditions," rather than a specific goal. The extreme conditions are physical conditions such as extremely high/low temperatures, pressures, magnetic fields, electrical fields and gravitational fields. In fact many earlier discoveries in the physical sciences are the products of the activities to accomplish goals suchas these. Modern high-energy physics and astrophysics still have much to study within the framework of this general goal.

2.2 The Discovery of High-Tc Superconductors by G. Bednorz and A. Müller

When Bednorz and Müller started their research, a number of superconducting oxides such as Ba(Pb,Bi)O3 and Li Ti2O4 were known. Also, the superconductivity of oxide coated aluminium films and granules was known. A complete explanation of the origins of the superconductivity in Ba(Pb,Bi)O3 was not available, but was of great intrerest to pysicists in the early 1980s (see Beasley & Geballe, 1984). A summary of the reconstruction of the events that led to the discovery of the La-Ba-Cu-O superconductor, based on the interviews of Khurana (1987a; 1987b) with Bednorz and Müller, is as follows:

1- A. Müller noted that the critical temperature (Tc) of the films composed of small, oxide coated grains of aluminium, was about twice that of pure aluminium which has a Tc of 1.1K.

2- He found this very interesting and wondered whether a similar enhancement in Tc could be made in metallic oxide superconductors whose critical temperatures were already in the 10-15K range.

3- But the theorists told him that because of its low Tc, aluminium was described by the weak-coupling BCS theory, but that the ehancement was not possible in superconductors with Tcs 10-15K, which were better described by the so-called strong-coupling th
eory.

4- Müller decided not to listen to the theorists (see, Khurana, 1987b). Bednorz and Müller thought that they could raise the Tc oxide superconductors.

5- They were motivated by the unusual properties of the superconducting phase of BaPb(1-x)Bi(x)O3 and LiTi2O4 . Despite their low electron density these oxides have relatively high Tcs. They argued that the lower electron density in oxide superconductors was probably compensated for by the enhanced electron-phonon interaction. Bednorz and Müller saw in BaPb(1-x)Bi(x)O3 a possibility of finding superconductors with higher Tcs.

6- They reasoned that increasing the electron density in oxides to values comparable to those in real metals, might allow to increase the Tc.

7- They regarded the metallic behavior in electrical conductivity as an important indicator in selecting compounds for superconductivity tests. They carried out a detailed and determined search for superconductors with high Tcs among oxides with metallic properites.

8- Bednorz and Müller were aware that the resistivity of thin films of BaPb(1-x)Bi(x)O3 increases before the onset of superconductivity, especially if the films do not have the right amount of oxygen.

9- Based on their knowledge about the electrical properties of oxides, they focused on oxides containing copper or nickel in mixed valency states, that is oxides in which a fraction of the transition metal ions are in one valence state and another fraction in a different valence state. They made systematic and careful study measurements on the resistivies.

10- Bednorz also searched the literature for known oxides of copper and nickel and studied carefully whatever was known about their high temperature properties.

11- Their experimental study started with LaNiO3, an oxide in which nickel is in valence +3. They tried to change the electronic bandwidth of the material internally, by substituting aluminium for nickel. (Al and Ni have very low electrical resistivities, high electron densities and have similar valency states.)

12- The substitution was not successful: La-Ni-Al-O became an insulator on cooling.

13- Then they tried substitutions on the lanthanum sites. They tried yttrium substitutions, but ended up with insulating materials. At the time they missed discovering the 90K superconductor. They had the wrong combination - yttrium with nickel instead of copper.

14- The focus of research shifted to copper, because partial substitutions of copper for nickel in LaNiO3 improved the metallic properties.

15- In the course of a literature search, Bednorz learned about the work of C. Michel, L. Er-Rakho and B. Raveau of Universite de Caen on La-Ba-Cu-O, soon after it was published in 1985. Having worked already with a few copper substitutions, Bednorz realized that they could do something with copper alone and with the partial replacement of lanthanum by two-valent barium.

16- They totally replaced Ni with Cu in LaNiO3 and turned their attention to the lanthanum sites for substitutions in the La-Cu-O structure. They tried the partial substitutions of Ba for La in this compound.

17- During these partial substitutions, they discovered the superconductor La(5-x)Ba(x)Cu5O5(3-y) for x=1, x=0.75 and y >>0 which showed onset of superconductivity above 30K.

18- Their first paper received little attention. But S. Tanaka's group at the University of Tokyo and P. Chu's group at the University of Houston independently confirmed superconductivity in La-Ba-Cu-O after they saw the paper by Bednorz and Müller in Zeitschrift für Physik.

19- The Tokyo group by this time (December 1986) independently determined La(2-x)Ba(x) CuO(4-y) to be the superconducting phase, as had Bednorz, Müller and their collaborator Takashige and both groups had obtained further evidence of superconductivity by measuring the Meissner effect.

2.3 Discussion

In contrast to Onnes' discovery of superconductivity, Bednorz and Müller's discovery of high-Tc superconductivity was directed by specific goals. From the account given above, their research goals can be formulated as follows: 1) Explain the unusual increase in Tc of oxide coated films and granules of superconducting metals (e.g. of aluminium), 2) explain why Ba(Pb,Bi)O3 has a high Tc despite its low electron density, 3) study the possibility of the existence of oxide superconductors with higher Tcs.

Bednorz and Muller were motivated by the relatively high Tc of the oxide coated films of aluminium, and by the properties of the oxide superconductors Ba(Pb,Bi)O3 and LiTi2O4. The former had unusual properties: In addition to the low electron densities, it had a small and in some cases non-detectable heat capacity increase at the transition temperature. The origins of the superconductivity in Ba(Pb,Bi)O3 was not known, but was of great intrerest to the scientific community in the early 1980s.

In their experiments, Bednorz and Müller did not try to substitute Y for La in La-Ba-Cu-O, probably because that they had tried the Y substitution earlier on LaNiO3 with unsuccessful results and generalized their reasults. On the other hand, their experimental strategy worked well, so that it would later lead Paul Chu and his group to the discovery of a new class of oxide superconductors, Y-Ba-Cu-O, with much higher Tcs (around 90K) than that of La-Ba-Cu-O.However, the hypothesis "Substitution of Y for La does not improve Tc," did not prove to be useful. A similar situation was observed later, in the substitution of elements with magnetic moments for Y in Y-Ba-Cu-O compounds, with results to disprove the earlier hypothesis that the presence of atoms with magnetic moments markedly suppress superconductivity.

2.4 Later Developments

We will not list the subsequent developments here, for this would far exceed the limits of this paper. Instead, we will present a brief discussion as follows: After the results of Bednorz and Müller were confirmed by Tanaka's group at the university of Tokyo and Chu's group at the University of Houston, research on the La-Ba-Cu-O based compounds accelerated. The proliferation of research groups involved in high-Tc superconductivity introduced parellelism into research in this field with each group having its own set of priorities, specific goals and strategies.

While many groups continued to search for materials with higher Tcs, some focused their efforts on the physical and chemical structure of the new materials, some groups were conducting experiments to test the validity of the current theoretical explanations of the phenomenon, and still others on how to improve the methods of manufacturing these materials. Yet, the research strategies adopted by Bednorz and Müller were followed by most of those research groups that were directly invol ved in finding materials with higher Tcs. For a long time (over 12 months), experiments were conducted to find compounds with higher Tcs in the La-Ba-Cu-O based materials.

It appears that a combination of "hill-climbing" and "depth-first search" research strategy was adopted in research during this period, determined by the early discovery of Y-Ba-Cu-O superconductor which has about three times the Tc of the original La-Ba-Cu-O compound. The strategy shift took place only after almost all the possible substitutions on the Y-Ba-Cu-O were exhausted. In fact the successful substitution of non rare-earth elements was coincided with the strategy switch to the Ba(Pb,Bi)O3 oxide, which led to the discoveries of Tl-Ba-Cu-O and Bi-Ba-Sr-Cu-O superconductors, one after the other, both with Tcs above 100K.

Because of the involvement of many research groups and the variations in their goals and research strategies, many hypotheses were generated on the possible causes of the increase and decrease of the Tc of these materials, some of which were in contradiction with one another. As a result, many of the earlier hypotheses were abandoned (e.g. the hypotheses on the effects of the existence of magnetic elements, crystal structure, the importance of Cu-O chains and Cu-O planes), but some others are still in effect e.g. the hypotheses on the effects of the substitutions of Al, Ag.While some predictions of the BCS theory were refuted e.g. in La-Ba-Cu-O compounds, the energy gap was not related to Tc in the way the BCS theory predicts and the phonon mechanism, and the isotope effect in La-Ba-Cu-O was too small (about 1/10 of the BCS prediction), others remained in effect e.g. the existence of electron pairs.

3 An Overview of CER

CER consists of a knowledge base, an explanation based generalization program (Mitchell, Keller, & Kedar-Cabelli, 1986), a set of system operators each consisting of a series of condition-action rules, a classifier (Nilsson, 1965), and a small set of primitive Prolog function definitions to assist input, output and list processing. In this section the system's organisation and representation of descriptive and definitive knowledge is explained first. This is followed by a desription of the system's control architecture and prescriptive knowledge. The system's prescriptive knowledge is represented by a series of operators, which carry out its research activities. Each operator is described in terms of its inputs, activities and outputs.

Some of these operators use a linear classifier in choosing goals, strategies, methods and processes from among the alternatives. The classifier itself is described in terms of its inputs and outputs.

3.1 The System's Organisation and Representation of Descriptive and Definitive Knowledge

CER's descriptive and definitive knowledge is represented as categorized predicate statements in Prolog, organised in the following categories: 1) logical knowledge, 2) formal knowledge, 3) epistemic knowledge, 4) theoretical, hypothetical, empirical knowledge, 5) factual knowledge, 6) historical knowledge. This categorization is based on the criteria developed by Kocabas (1989), and was implemented in (Kocabas, 1991). Examples of the system's descriptive and definitive knowledge presented in relation with their categories below, also indicate the dimensions of its initial knowledge.

Logical knowledge of CER contains the definitions of logic functions and some logical relationships between types of expressions, e.g.,

"larger" is a transitive relation.
"same_group" is a reflexive relation.

Formal knowledge contains domain definitions, class memberships and class-superclass relationships. Some examples of CER's knowledge in this category are:

Ni is an element in LaNiO3.
"Na" is the symbol of sodium.
Its initial formal knowledge includes statements such as.
Superconductivity is a physical property.
Theoretical analysis is a strategy for explanation.
Gathering knowledge is a strategy for explanation.
Experimentation is a strategy for studying a phenomenon.
Element substitution is a process.

CER's factual knowledge includes factual statements about its domain objects. An example of its factual knowledge is:

The price of Scandium is over US$ 50,000 per kilogram.

CER's theoretical knowledge contains theoretical, hypothetical and empirical statements acquired or generated by the system. Examples of theoretical knowledge are:

The Tc of Y-Ba-Cu-O compound is 91 K.
Specific heat and high-Tc superconductivity are related.
Presence of magnetic ions reduce superconductivity.
Superconductivity and electron density are positively related.
Aluminium, copper, gold and silver have high electrical conductivity.
Substitution of elements with high electrical conductivity improves electron density.

Some of the arguments in theoretical statements refer to processes. CER's knowledge of processes and events are represented by qualitative schemas (Forbus, 1984).Reference to a process in a high level theoretical expression appears as follows:

the(reduces,process(substitution/of,tve,y,ybco),tc).

which means that a tetravalent element substitution forY in Y-Ba-Cu-O superconductor reduces Tc. The process is represented by a schema as follows:

process([substitution/of,[tve,y,ybco]],
substances([E,ybco,...]),
preconditions([formal(tve,E), formal(same_group,E,y),...]),
process_conditions([...]),
products([...])).

where "substances" indicate the substances involved in the reaction, "preconditions" indicate the conditions for the event to take place (e.g., E is a tetravalent element, E is in the same grup as yttrium), "process conditions" describe the physical conditions (such as temperature, pressure, etc.), and "products" indicate the products obtained.

CER's epistemic (or meta) knowledge contains statements about its domain predicates, hypotheses, etc. It is important to know in record, if certain acquired or generated hypotheses can explain a phenomenon under consideration. Some examples of such statements are:

Superconductivity is an important property.
The superconductivity of Ba(Pb,Bi)O3 is not explained.
The isotope effect holds for metal superconductivity.
Electron-phonon interaction mechanism explains superconductivity.

In these statements "isotope effect" and "electron-phonon interaction" refer to an experimental process that can be tested indirectly.

Historical knowledge contains the records of the historical development of research in superconductivity starting from Onnes' discovery of the phenomenon in 1911. It also includes the records of the conducted experiments to avoid repetititons and the records of the refuted hypotheses. Some examples of CER's "historical" knowledge are:

"Superconductivity was discovered by H.K. Onnes in 1911."
"Al was substituted for Ni in formula LaNiO3 to yield compound LaAlO3."

CER's initial knowledge is the logical, formal, epistemic, theoretical, factual and historical knowledge in its knowledge base, as exemplified above. The system's initial knowledge also includes detailed formal, theoretical and factual knowledge about all the chemical elements in the periodic table, covering more than forty properties for each element. Since CER's descriptive knowledge is maintained as separate from its prescriptive knowledge, new descriptive domain knowledge can be added to it. The system also accepts, by interaction, the results of the experiments it has proposed. Some other interactions occur during its activity (e.g. assigning time limits to strategies, methods and experiments). These are indicated in the section that describes the system's behaviour below.

During its run, some information such as the time limits to its goals, strategies, methods and processes are provided to CER by interaction. Also, simulation is carried out externally to represent the activities of the operators that have not yet been implemented, and the outcomes are given to the system.

3.2 CER's Control Knowledge

The design of a comprehensive computational model of discovery must ultimately include all the essential elements of scientific research. Accordingly, it must include the following research tasks: Proposing research goals, choosing a goal among alternatives, formulating a framework for the selected goal, proposing strategies to achieve the goal, choosing a strategy among alternatives, gathering and organising knowledge, identifying and resolving conflicts. It must also include the tasks of proposing, designing and conducting experiments, setting expectations for designed experiments, testing and data collection, generating hypotheses, verifying and modifying hypotheses, supervision of goal satisfaction, generating explanations, and constructing theories by finding higher level relationships in hypothetical and empirical knowledge.

CER's design includes seventeen system operators for its research tasks, and a control opeator with an explanation based learning subsystem. Its operators are organised in two levels, the former are level-1 operators and the latter is its only level-2 operator. Each operator consists of a set of condition-action rules. Currently, as explained below, only five of these operators have been implemented.

In order to accomplish its research objectives, the program has to carry out its research tasks in a certain order. For this reason, it must know which operator to activate first, and what other(s) after the completion of a certain task. The system has a hierarchic homuncular control architecture (Kocabas, 1991b), and learns its control knowledge by explanation based generalization (Mitchell, et al., 1986). The control operator activates the task operators according to the state of a message list that functions like a blackboard.

3.3 CER's Methodological Knowledge and Research Operators

CER's level-1 operators are named as follows: Goal Setters, Goal Choosers, Framework Setters, Strategy Proposers, Knowledge Gatherers, Knowledge Organisers, Conflict Identifiers and Resolvers, Experiment Proposers, Experiment Designers, Expectation Setters, Experimenters, Experiment Data Collectors, Hypothesis Generators, Hypothesis Verifiers, Goal Satisfaction Supervisers, Explanation Generators, and Theory Builders.

Among these, only the Goal Setters, Goal Choosers, Framework Setters, Strategy Proposers and Experiment Proposers have been implemented, while Knowledge Gatherers, Conflict Identifiers and Resolvers, Experiment Designers, Hypothesis Generators and Goal Satisfaction Supervisers are partially developed. The rest have yet to be developed from design to computational implementation. CER is a major project still under development, and the reason for including the latter in the system description at all, is to indicate the scope for its future development.

As will be explained below, some of CER's operators (namely, Goal Choosers, Strategy Proposers, Knowledge Gatherers and Experiment Proposers) employ a simple trainable linear classifier (Nilsson, 1965) in choosing goals, strategies, methods and experiment materials among alternatives. The use of such classifiers is essential in decision making that involves conflicting constraints. We can now describe the system's operators in terms of their inputs, activities and outputs. In their descriptions, occasionally their acronyms are used for conciseness.

3.3.1 Goal Setters

Most intelligent activities can be considered as oriented towards achieving a set of goals, though some of these can be ambiguous or general. CER's goal proposing rules (GS) are confined to scientific interest. CER proposes research goals such as explaining an unusual phenomenon, studying an unexplained phenomenon, and more specifically, studying the possibilities of improving the desired properties of a compound.

Well-defined goals usually have a time limit for their satisfaction. More general and more important research goals are normally given longer time limits. One of the GS rules of CER assigns time limits to the proposed goals, and another one checks if the goals are correctly formulated, and asks them to be reformulated as necessary. The system currently has thirteen such rules. High level description of some of CER's GS rules are as follows:

If a phenomenon shows unexpected or unusual characteristics, then make it your goal to explain it.
If a phenomenon has not been explained, then make it your goal to explain it.
If a phenomenon is not explainable, then make it your goal to study it.
If a physical property is an important property, then try to enhance that property on some substance.
If a goal is entered to the agenda, then assign a time limit to it.

Inputs to Goal Setters are descriptive and definitive knowledge in the knowledge base, by means of which the system can generate its research goals. Inputs to this operator can be epistemic statements such as

The superconductivity of Ba(Pb,Bi)O3 is not explained.
The superconductivity of oxide coated films of aluminium is not explained.
Superconductivity is an important property.

and formal statements like

Superconductivity is a physical property.
Oxide superconductivity is a kind of superconductivity.

and theoretical statements such as

LiTi2 O4 has superconductivity.
Ba(Pb,Bi)O3 has superconductivity.

The outputs of Goal Setters are expressions labeled as messages with the internal label "goal" and an assigned time limit. A goal can be like

Explain the superconductivity of Ba(Pb,Bi)O3.
Improve superconductivity in an oxide compound.

The time limits to goals are given by interaction, which indicate the length of time that the goal can be on the agenda. Any goal whose time limit has expired is dropped from the agenda by Goal Satisfaction Supervisers (described below).

3.3.2 Goal Choosers

Choosing between goals, strategies and experiments can be an important task in scientific research, and in many cases may require, in addition to methodological knowledge, a large amount of commonsense and theoretical knowledge. CER currently has two Goal Choosers (GC), which have both been implemented. One of these rules uses the system's classifier to choose between proposed goals. The classifier uses several criteria: cost, reward, time limit, achievability (by existing knowledge and by existing technology), and likeliness to be achieved by other researchers before. The values of these parameters are qualitative and some of them are given to the classifier externally, while others (such as cost, availability) are drawn from the system's knowledge base. The rules of this operator are as follows:

If there are more than one goals, then use the classifier to choose a goal, and label it as the current goal. If there is only one goal, then label it as the current goal.
If the classification fails, then select a goal randomly.

So, the input to this operator is a set of goals with assigned time limits, and the qualitative values given to the parameters of the classifier for each goal, while the output is a single goal, with the internal label "current goal".

3.3.3 Framework Setters

When a scientist pursues a specific research goal, s/he does not have to recall the unrelated knowledge, but considers knowledge relevant to the research project. S/he would recall the less related knowlege for analogy only when the current research strategies, methods and techniques do not produce the expected results (see, e.g. Lenat & Feigenbaum, 1987).

CER's Framework Setters (FS) copy all knowledge related with the current goal from the static memory into the system's dynamic memory. The program currently has four FS rules. These have not yet been fully implemented, and therefore the task is simulated externally. Some examples of the FS rules are as follows:

If the current goal is to explain a hypothesis, then retrieve from the static memory all the relevant information about the current goal.
If the current goal is to improve a property, then retrieve all the relevant information about that property.

The "relevant information" is retreived syntactically, through the predicate names and the other arguments of the statement that expresses the current goal. The inputs of Framework Setters are the messages, which indicate the focus of the framework (e.g., a set of properties and/or objects).

3.3.4 Strategy Proposers

When a research goal is chosen, there may be alternative strategies for it. Moreover, a research strategy may use different methods. Some research goals are simply achieved by literature search, some others by theoretical analysis, and yet others by experimentation. Scientists choose the most appropriate research strategies and sometimes change them during the course of research to achieve their goal in an economic way. CER's Strategy Proposers (SP) perform their task in accordance with the
system's current research goal. One of the SP rules assigns time limits to the strategies. The system currently has fully implemented fifteen SP rules, one of which use the classifier to choose between alternative strategies, on the basis of cost, time required, and likeliness to succeed. The parameter values to the classifier are given externally. The following are some of CER's SP rules:

If the current goal is to explain a phenomenon, then make a list of all the strategies for finding and explanation.
If the current goal is to improve a property P1 and another property P2 is positively related with P1 and a process S1 improves P2, then propose experiments to apply S1.
If the current goal is to improve a property P1 and a process S1 improves P1, then propose experiments with S1.
If the current goal is to improve a property P1 and a process S1 improves P1 and a process S2 causes S1, then propose experiments with S2.

Inputs to this operator can be formal statements such as

Gathering knowledge is a strategy for finding an explanation.
Theoretical analysis is a strategy for finding an explanation.
Experimentation is a strategy for studying a phenomenon.

and theoretical statements such as

Element substitution improves metallic conductivity.
Electron density and metallic conductivity are positively related.
Metallic electrical conductivity and oxide superconductivity are positively related.
Applying pressure improves superconductivity.
Presence of magnetic ions reduces superconductivity.
Half filled electron bands cause superconductivity.

The inputs also include the message that states the current goal. The outputs of this operator are messages that indicate the considered strategies and current strategy with the assigned time limits.


3.3.5 Experiment Proposers

In scientific research, when experimentation is selected as the research strategy, experiments are proposed in accordance with the type of experimental study. For example, experiments in physics and chemistry can be divided into several major groups that can indicate the type. In addition to the general rules about experiment proposals and designs, rules specific to particular types of processes and techniques can also be defined.

The tasks of this operator include the following: Making a list of relevant processes and techniques for the research, determining the relevant test properties for an experiment, determining the experiment materials having these properties, and choosing the best material from a list of candidates. The last task is carried out by classification over a set of parameters such as availability, likeliness to yield success, cost and relative hazards (i.e. radioactivity, explosiveness, toxicity, flammability and corrosiveness). CER currently has eleven Experiment Proposers (EP), three of which use the classifier to choose experiment materials, substitution elements and substituting elements among alternatives. Some of the rules of this operator are as follows:

If the current goal is to study whether the derivatives of a compound with a specific property have the same property, and the current strategy is experimentation, then choose a process for experimentation.
If the current strategy is experimentation applying a particular process, then record that process as the current process.
If a process has been chosen, then assign a time limit for it.
If the current strategy is to study a phenomenon by experimentation, and a process has been chosen, then determine the relevant properties for the experiments.
If the current goal is to improve a desirable property, the current strategy is experimentation, and a process has been chosen, then select the experiment materials with the relevant properties.
If there are alternative experiment materials for the same process, then choose the best material by classification.

Inputs to Experiment Proposers can be formal statements such as

Polimerization is a process.
Element substitution is a process.
Condensation is a process.
Oxidation is a process.
Ni is an element of LaNiO3.
Y and La belong to the same group.
Sr and Ba belong to the same group.
"Related" is a reflexive relation.
Two properties P and Q are related, if they are positively or negatively related.
Al and Ni have similar electrical conductivity.

and theoretical statements like

Superconductivity and Meissner effect are related phenomena.
Metallic electrical conductivity and thermal conductivity are positively related.
LaNiO3 has metallic electrical conductivity.
LiTi2O4 has metallic electrical conductivity.
Ba(Pb,Bi)O3 has metallic electrical conductivity.
Al has high electrical conductivity.
Ni has high electrical conductivity.
Cu has high electrical conductivity.

Inputs to Experiment Proposers also include the messages that indicate the current goal and current strategy. The outputs are messages indicating the relevant processes, relevant properties, the current process, experiment materials, current experiment materials, substitution elements and substituting elements.

3.3.6 CER's Other Operators

Apart from those described above, the system has twelve other designed operators which have not been implemented. These are: Experiment Designers, Knowledge Gatherers, Goal Satisfaction Supervisers, Conflict Identifiers and Resolvers, Hypothesis Generators, Knowledge Organisers, Expectation Setters, Experimenters, Experiment Data Collectors, Hypothesis Verifiers, Explanation Generators, and Theory Builders. The tasks of each operator are described below.

Experiment Designers (ED). The tasks of this operator include determining the test properties, providing the required process description and providing experiment forms, and assigning a time limit to the current experiment. CER currently has five ED rules. The inputs to this operator are the messages output by the EP rules and some formal knowledge (e.g. process descriptions) from the knowledge base. The outputs are messages indicating test properties, process conditions and experiment forms.

Knowledge Gatherers (KG). An intelligent system gathers knowledge, and filters and translates this knowledge into its own representation language in an intelligible and memorizable form. In order to avoid memory overflow with irrelevant informatio
n, it must have a set of relevance criteria for the kind of knowledge it needs for its current goal set. CER's Knowledge Gatherers has been designed in view of these problems. The system currently has five rules. The inputs to KG's are messages issued by Goal Choosers and Strategy Proposers, and formal statements from the knowledge base. Its outputs are the messages that indicate the current method, and the relevant properties for knowledge gathering.

Goal Satisfaction Supervisers (GSS). An intelligent system that can generate its own goals must also be able to determine when these goals are satisfied during the course of its activities. At present, Karp's (1990) GENSIM/HYPGENE is probably the only system with some capability to supervise the fulfilment of its goals. CER currently has six such rules which monitor goal satisfaction by comparing the research results with the current research goal.

Conflict Identifiers and Resolvers (CIR). Conflicts in knowledge systems can appear in the forms of a) inconsistency, b) incompleteness, and c) incoherence. Identification and resolution of conflicts is an important task in maintaining a dependable knowledge system. In order to be resolved, conflicts must first of all be correctly identified in an efficient way. Conflict identification is much dependent on the knowledge organisation and representation of a system. CER's categorised predicate logic representation Kocabas, 1989) facilitates the identification and resolution of conflicts. As a result, the system can identify several types of inconsistency and incompleteness. Incoherences mainly arise from category and type confusions in language. However, the current version cannot handle incoherence problems.

CER can identify contradictions between i) factual statements, ii) theoretical statements and factual statements, iii) theoretical statements, iv) formal statements, v) historical and theoretical statements, vi) theoretical and formal statements, vii) historical and factual statements. The system has seven rules for identifying contradictions, which record them according to their types. (We did not include all the possible combinations of intercategorical contradictions, but the above seven combinations already provide a detailed classification for identifying contradictions.)

CER identifies a state of incompleteness in its knowledge base in the following forms: i) A factual statement (of an experiment) that is not explainable by a theoretical, hypothetical or empirical statement. ii) a theoretical statement which is not explainable by a more general theoretical statement, iii) absence of an effect, property or object (which can be explained by generating exclusion hypotheses). CER has three rules for identifying these types of incompleteness. One of its conflict identification and resolution rules assigns time limits to the activity of this operator, to maintain functionality against lengthy searches.

CER's resolution of conflicts is facilitated by its systematic identification of them in different types. Each type of conflict is resolved in a separate way. For example, a contradiction between two factual statements is resolved by referring to the corresponding facts (by experimentation and/or observation). On the other hand, if the contradiction is between a theoretical and a factual statement, then the validity of the factual statement is checked and as a result, either the generality of the hypothesis is reduced or the factual statement is discarded. An incomplete state can be resolved by generating a goal to find an explanation or to study the phenomenon involved. Such goals are then considered for research.

Hypothesis Generators (HG). Hypothesis formation is one of the most important activities in scientific research, and has been studied extensively. Various methods of hypothesis formation are described in summary by Lenat and Feigenbaum (1987), and by Darden (1987), who classifies hypothesis formation methods as i) induction, ii) retroduction (or abduction), iii) abstraction, and iv) analogy.

The tasks of CER's Hypothesis Generators include: generating hypotheses on the variations in the physical, chemical, etc., properties of the system under study, using induction, abstraction and abduction and further generalization of the new hypotheses. CER currently has forty HG rules most of which were extracted from the research reports on high-Tc superconductivity. The rules have not yet been implemented. Some of the HG rules are as follows:

If a physical effect P1 cancels another effect P2, then hypothesize that there is another effect related with P1 and P2.

If the value of a property P1 changes in parallel with the changes of the value of another property P2 in a physical system, then hypothesise that P1 and P2 are related.

An example of how CER's HG rules generalize experiment results is as follows:

If P is the result of a set of experiments and E is the corresponding expectation, then generate hypotheses from the result in the following way:

Process: S
Property: P
Expectation: E = The process S has the effect R on P.
Expected Qualitative Variation of P : Ve
Experimental Variation of P: V

If V = Ve, then formulate the hypothesis: The process has the effect R on P.
If V = - Ve, then formulate the hypothesis: The process has the opposite effect of R on P.
If V = 0, then formulate the hypothesis: The process has no effect of R on P.

The results of experiments are generalized into hypotheses in stages. (For example, if the experiment result is stated as a factual statement: "Substitution of Al for Ni in LaNiO3 did not improve conductivity," this is generalized in an increasing degree of abstraction in stages as follows:

a) Substitution of Al for Ni in LaNiO3 does not improve conductivity.
b) Substitution of Al _for Ni does not improve conductivity.
c) Substitution of Al does not improve conductivity.

If in a later experiment, say, the result is

d) Substitution of Al for Bi in Ba-Pb-Bi-O improves conductivity,

then the hypothesis (c) is deleted. Note that (d) does not contradict (b). This example contains multiple levels of abstraction (Darden, 1987).

Knowledge Organisers (KO). The activities of this operator include the organisation of acquired and generated knowledge, translation of gathered knowledge into the system's representation, and classification of predicate expressions into their categories in accordance with CER's categorization scheme.

Expectation Setters (ES). The activities of this operator include: determining which test properties (e.g. crystal structure, conductivity, specific heat) are expected to change values qualitatively in an experiment and whether the experiment should improve, reduce or should not effect these properties.

Experimenters (E). Experimenters must have the necessary technical knowledge and skills on experiment materials, apparatus, procedures, processes and measurements, and safety measures to conduct the experiments under controlled conditions.

Data Collectors (DC). Data collection is an important stage of research in experimental research. CER currently has only one DC rule which dictates the measurement of qualitative and quantitative test properties.

Hypothesis Verifiers (HV). The activities of this operator include checking if a newly generated hypothesis contradicts knowledge in the dynamic memory and proposing to repeat the experiments to test the results when there is a contradiction.

Explanation Generators (EG). The task of this operator is to search for an explanation to a factual statement or a hypothesis in the system's existing knowledge. An explanation to a proposition (e.g. the results of an experiment) is a set of hypotheses in the knowledge base, from which the former is derivable. For example, consider the hypothesis: "Copper substitution improves conductivity." This is explainable by (or deducible from) the following hypotheses:

An increase in electron density improves conductivity.
Copper substitution causes an increase in electron density.

So, to find an explanation to a factual statement or a hypothesis, CER searches for a set of more general, related hypotheses such that the former is deducible from this set.

Theory Builders (TB). This operator is designed to discover higher level relationships between domain concepts. CER's knowledge organisation allows the integration of quantitative discovery systems such as BACON. Therefore, quantitative discovery can be incoporated into this subsystem.

3.4 CER's Classifier

CER uses only one classifier (Nilsson, 1965; Hunt, 1975) for several different classification tasks.1 In experimental scientific research, preference is normally given to materials that are less costly, more easily available, more likely to yield success, and less hazardous. It is easy to see that these criteria can be in conflict with one another. For example, a particular material can be cheap, but highly toxic; another material can be easily available, but less likely to yield successful results in the proposed experiments. In such cases, the problem is to find the best material against a set of conflicting parameters. Purely rule based methods cannot resolve such problems efficiently, unless some supplementary methods are used to eliminate a large number of such rules, for, as the number of classification parameters increase, the number of rules required for classification can increase exponentially. Whereas a classifier can pack n! sets of rules in a vector of n parameter ranges. Additionally, classifiers can provide approximate solutions with incomplete data. For reasons of space, we will not give a detailed description of CER's classifier here, but will merely provide an outline of its features instead.

CER's classifier is invoked by some of the system's rules, and uses different evaluation matrices for different classification tasks such as choosing goals, strategies, methods, processes and experiment materials from among alternatives. These matrices can be created, developed and modified on the run without impairing the system's activities, and can be saved for future use. The classifier can be trained in two different ways: 1) by directly providing qualitative values to each parameter (e.g. cost, availability, relative hazards, etc.), 2) by learning from failure, in which case, the parameter values given to the correct object are used as increments in modifying the matrix. CER's classifier can build its evaluation matrices entirely by this second type of learning. In other words, learning by failure is applicable to it even when the evaluation matrix is blank. However, in this method learning is incremental and therefore slow.
--------
1 The classifier is a short program (about 4K), also implemented in Prolog.


3.6 The System's General Behaviour

Having described CER's control and research operators, we can now examine its behavior in the framework of its scientific research activities. The system's behavior depends on the control knowledge that it has acquired through training. Therefore, w
e can only present a description of its behavior in reference to a set of control rules that it has acquired after a training session. What follows is the general description of the system's behavior after such a training session.

In its first run, the program generates a set of research goals from the records about unexplained phenomena, important physical properties and contradicting hypotheses in its knowledge base. As illustrated in Figure 2, CER generates basically two kinds of research goals: 1) finding an explanation to a phenomenon, 2) studying a phenomenon. Once it has formulated its research goals, it assigns time limits to them. Its next task is to choose a research goal to focus on. When the research goal is selected, a research framework has to be drawn, so that the system can focus on the relevant aspects of the research problem, by recalling the relevant information about the objects, properties and relations from its static knowledge base. Next, the system proposes research strategies, and then selects a strategy from the general strategies that it knows.

CER knows two general strategies for explaining a phenomenon: Gathering knowledge from external sources, and theoretical analysis. If the current goal is to explain a phenomenon, the system can try to find the explanation by several different methods: Gathering knowledge from books, journals and software, and from experts. The program chooses its strategies and methods by classification. If the explanation is still not found, then the next strategy, theoretical analysis over the new knowledge state may provide the answer.

The system has two strategies for studying a phenomenon: Theoretical analysis and experimentation and/or observation, for which there may be a number of alternative processes and techniques. For example, in studying a physical phenomenon, new materials with certain properties may need to be synthesized, which may require the application of certain processes.

In order to study a phenomenon, CER proposes strategies, and chooses a strategy. If the strategy is experimentation, then it proposes processes for experimentation, and chooses a process. Then it finds the appropriate materials for the experiments, from which the system chooses the best experiment materials. After the experiment materials are determined, experiments are designed, expectations are stated, and then experiments are conducted. The experiment results are compared with the current goal, and if the goal has not been satisfied, alternative materials, processes, and strategies are tried. The research continues until all goals are achieved, or their time limits expire.

CER's search in pursuing its goals can be viewed as a combination of heuristic search and best-first search. Heuristic search is employed by the activities of the system's methodological rules. The program's search in choosing goals, strategies, methods, etc., can be viewed as best-first search, for its behavior in these cases is dependent on the weights given to the alternative choices. CER's control message list functions like a constantly changing agenda, by means of which the system's control operator directs its activities.

4 Simulation of the Discoveries of High-Tc Superconductors

In this section we will describe the steps of CER's behavior when placed in the problem situations faced by the physicists before the discovery of the high-Tc superconductors. CER models a complex series of tasks leading to a discovery, and differences in CER's background knowledge leads it to choose different goals, strategies, methods and experiments, and hence, to different research routes and results. Since some of its operators have not been fully developed computationally, certain activities of the system were simulated externally. In such cases we will discuss the reasons for the interactions, what kind of background knowledge was necessary for CER to make the choices independently, and what might have led Bednorz and Müller to make the choices they did. These are provided in separate paragraphs enclosed in square brackets, following the description of the system's activities.

For reasons of space, we will only provide a summary of CER's behavior in pursuing only one of the goals it proposes in some detail. For simplicity, the descriptions, do not contain any reference to the training of CER's control operator for the control rules. Also, for the same reasons, the classification activities of the system's classifier are described only in terms of the inputs and outputs. The section ends with an overview of the simulation.

4.1 The Simulation

When CER is run, its Goal Setters is activated, which proposes three research goals, on the basis of the system's descriptive and definitive knowledge. These are as follows:

G1: Explain the superconductivity of the oxide coated films of aluminium.
G2: Explain the superconductivity of Ba(Pb,Bi)O3.
G3: Study the possibility of improving superconductivity in an oxide compound.

The Goal Setters assigns time limits to the proposed goals. The time limits are provided by interaction. (In the particular run we describe here, we assigned G2 the shortest and G3 the longest time limits.)

[CER uses its grammatical and theoretical knowledge about the objects and events for formulating these goals. It's Goal Setters operator produces them from statements like: "The superconductivity of oxide coated aluminium films is not explained," "The superconductivity of Ba(Pb,Bi)O3 is not explained," and "Oxide superconductivity is an important property," in the system's knowledge base. The program does not derive such high level knowledge from its theoretical domain knowledge, but is provided with it in its grammatical knowledge. The acquisition of such knowledge requires theoretical, technical, and commonsense knowledge.]

When the goals are proposed in this way, the control operator passes the control to CER's Goal Choosers operator, which uses the system's classifier to choose the goal which must be focused on first. This operator chooses second goal (G2) as the current goal against four criteria: reward, achievability, likeliness to be achieved by others before, and time limits. Each metric has three ranges: high, medium and low.

Once the current goal is chosen, control passes to the Framework Setters operator, which retrieves the other relevant knowledge about Ba(Pb,Bi)O3, and its components Ba, Pb, Bi and O, from static memory to dynamic memory.

[The "relevant knowledge" includes information about the various physical and chemical properties of the compound and its components. CER's Framework Setters operator has not been fully implemented, and its activities are simulated externally.]

Then the Strategy Proposers operator is activated, which considers the strategy of explanation by gathering knowledge and theoretical analysis. Time limits to the strategies in view of the current goal are determined by interaction. The operator u
ses the classifier to choose between the two strategies, and gathering knowledge is selected as the current strategy on the basis of cost, required time, and likeliness to succeed.

[Since the current goal is to explain a phenomenon, only the strategy rules that are related with explaining a phenomenon are activated. CER has only two strategies for explaining a phenomenon, which are knowledge gathering and theoretical analysis. The values to the parameters of the classifier are provided by interaction. To decide about the costs and the likeliness of success requires commonsense and theoretical knowledge. Currently, CER's knowledge base does not contain such knowledge.]

Then control passes to CER's Knowledge Gatherers operator, which uses the classifier to choose between several methods of knowledge gathering such as consulting books, journals, domain experts and software. The classifier chooses to consult domain experts on the basis of the values provided to the parameters of cost, required time, and likeliness to succeed. When the activities of Knowledge Gatherers come to an end, control passes to the Goal Satisfaction Supervisers operator, which decides whether the current goal has been satisfied.

[The outcomes of the supervision of goal satisfaction is provided externally, as this operator has not been implemented. At this stage, if the Goal Satisfaction Supervisers operator decides that the goal has not been satisfied, and the time limits assigned to the current method and strategy has not expired, the control operator transfers control to the Knowledge Gatherers and Strategy Proposers, to try the alternative methods and strategies. This is facilitated by the deletion of the current method and strategy after they have been tried, so that the system can focus on the remaining methods and strategies only.]

After failing to find an explanation for the unusual properties of Ba(Pb,Bi)O3 within the assigned time limit, CER focuses on the first goal (G1) and thus tries to find an explanation for the unusual behaviour of oxide coated films of aluminium. It considers the strategies of explanation and tries knowledge gathering first and then theoretical analysis. These strategies fail to produce an explanation.

When CER ends its research on these two goals, the remaining one (G3) becomes the current goal. Control passes to the Strategy Proposers operator, which proposes the strategy, "Improve metallic conductivity by experimentation," together with two a
lternatives, "Apply P1," and "Apply P2" where P1 and P2 are alternative processes in CER's knowledge base.

[The SP operator proposes these strategies on the basis of CER's theoretical knowledge about superconductivity such as "Metallic electrical conductivity and oxide superconductivity are positively related", "The process P1 improves oxide superconductivity," and "The process P2 improves oxide superconductivity." We do not know what alternative strategies Bednorz and Müller considered, but CER's alternative strategies "Apply P1" and "Apply P2" refer to dummy proceses, and were added to monitor the behavior of the system. In formulating strategies at this stage, various other physical properties (or variables) related with the electronic properties of substances could be considered. Such properties include specific heat, thermal conductivity, average electronegativity and ionic states, from which alternative strategies such as "Improve thermal conductivity," "Reduce average electronegativity," etc could be formulated. However, CER's current knowledge base lacks information about whether there is a relationship between these properties and superconductivity.]

Next, time limits are assigned to the strategies. CER employs its classifier to choose between strategies against cost, time limits and likeliness to succeed. The values to the parameters are provided by interaction for each strategy. The classifier chooses to "improve metallic electrical conductivity by experimentation" as the current strategy.

[Time limits are provided by interaction. Normally, it would require technical, theoretical and commonsense knowledge to estimate how long a strategy should be in the agenda.]

Once the current strategy is decided, the activity of Strategy Proposers come to an end, and control passes to the Experiment Proposers. This operator lists the known processes related with the strategy, and "element substitution" is selected by interaction as the current process from among several other chemical processes. A time limit is given to the current process. This operator also determines the relevant properties for the experiment materials by referring to the knowledge base to find out which physical properties are positively related to metallic electrical conductivity. Metallic thermal conductivity is found to be positively related to metallic electrical conductivity. The operator now determines the experiment materials on which the element substitution is to be applied, by using the relevant properties (in this case metallic electrical and thermal conductivity) as the basis.

The Experiment Proposers operator searches for oxides with these properties in the knowledge base, and finds several such oxides, which include LaNiO3, LiTi2O4, and Ba(Pb,Bi)O3. It uses the classifier to choose the best experiment material among these on the basis of availability, cost and hazards in use. LaNiO3 is chosen as the current experiment material. The substitution elements in LaNiO3 are determined as [La,Ni,O]. The Experiment Proposers operator uses the classifier to choose the best substitution element, which selects Ni as the current substitution element on the basis of likeliness to succeed.

[We do not know what other oxides with metallic properties Bednorz and Müller have considered for the experiments. Again, the values given to the classifier by interaction reflect the theoretical and technical knowledge required for choosing the best experiment material. After LaNiO3 is selected for the experiments, the choice of the substitution element in this compound also requires some theoretical knowledge about the elements and the possible products involved. For the substitutions, La, or even O could be selected depending on the values given to the classification criteria. As we shall see below, CER proposes substitutions on La, as Bednorz and Müller have subsequently tried. We do not know if they ever considered the substitution of oxygen, a substitution that was to be tried much later by physicists, who used sulphur as a substitute, for this element is in the same group with oxygen.]

Once the substitution element is determined, the Experiment Proposers operator has to determine the likely elements to substitute for La in LaNiO3.The operator proposes Au, Ag, Cu, and Al for their high electrical conductivity as the substituting elements. Once again, the classifier is used to determine the best element among these. The values given to the parameters (availability, cost, likeliness to succeed, and relative hazards in this case), cause the classifier to choose Al as the current substituting element.

[Among the proposed substituting elements, gold (Au) is expensive besides being a rather less reactive element. Similar arguments apply to silver (Ag), though to a lesser extent. Copper (Cu) and aluminium (Al) are widely available and much cheaper elements, in addition to being more reactive than the former.]

Once the substituting element is selected as Al from among [Au,Ag,Cu,Al], experiments are proposed. The proposed experiments are: Substitutions of Al for Ni in LaNiO3.A time limit is set for the experiments. Now control passes to the Experiment Designers operator, which determines the test properties, and issues an experiment form that describes the process conditions. Control passes to Experimenters, Data Collectors and Hypothesis Generators.

The results of the experiments are generalized by the Hypothesis Generators, and are added to CER's knowledge base. Control passes to CER's Goal Satisfaction Supervisers operator. The results fail to satisfy the current goal. The time limit on the
experiments expires. The message that refers to the current substituting element (Al) is deleted from the message list.

[The activities of Experiment Designers, Experimenters, Data Collectors and Hypothesis Generators are simulated externally, as these operators have not been implemented. Accordingly, the experiment results and their generalizations are provided by interaction.]

By this time, CER's message list includes messages about the current goal, current strategy, current process, current experiment materials (which include LaNiO3, LiTi2O4, and Ba(Pb,Bi)O3), current substitution elements [La,Ni,O] and the current substituting elements [Au,Ag,Cu].

Because the current strategy is still active, control passes to CER's Experiment Proposers operator. By forceful interaction, this operator is given La as the current substitution element. Otherwise the operator would normally choose Ni again. This is because the current Experiment Proposers of CER tries all the substituting elements for Ni before passing to another substitution element (e.g. La).

[Forceful interaction is carried out to trace Bednorz and Müller's research course during their experiments on LaNiO3.Bednorz and Müller seem to have made an interesting "breadth-first search" choice there in choosing La instead of continuing with the Ni substitutions with the other substituting elements.]

In the absence of any other criteria, the Experiment Proposers operator finds the substituting elements as Sc, Y and Ac by their being in the same group as La in the Periodic Table.

[At this stage of their experiments, Bednorz and Müller could not have chosenY for La on the basis of electrical conductivity, for La and Y have the same electrical resistivity (57.10-8 ohm.m.).]

The Experiment Proposers operator uses the classifier to choose the substitution element from among Sc,Y and Ac, which chooses Y as the substituting element, against the criteria of availability, likeliness to succeed, cost, and relative hazards. Similarly, the Y substitutions fail and the related hypotheses are added to CER's knowledge base as the time limit on the current experiments expire.

[Here, Y is a clear choice among its alternatives in the same group, for Sc (scandium) is a very expensive element (about $ 50,000/kg), while Ac (actinium) is both radioactive and expensive. CER's knowledge base contains fact ual information about the prices, and theoretical information about the radioactivities of the elements.]

As the current strategy and process are still valid, the Task Direction operator activates the Experiment Proposers once again. This time Ni is chosen as the substitution element by forceful interaction. The substituting elements that remain valid from the previous experiments are Au, Ag and Cu.The classifier is used to choose the current substituting element, which it chooses as Cu.The proposed experiments are: Substitutions of Cu for Ni in LaNiO3. A time limit is given for the experiments.

Control now passes to the Experiment Designers operator, which determines the test properties, and issues an experiment form that describes the process conditions. Experiments are conducted ans the results of the experiments are generalized by the Hypothesis Generators, and are added to CER's knowledge base. Control passes to CER's Goal Satisfaction Supervisers operator. This time the experiments succeed: Substitutions of Cu for Ni in LaNiO3 improves electrical conductivity. However, the time limit on the current experiment material (LaNiO3) expires.

[At this stage of their research, Bednorz and Müller conduct a literature search on the oxides of copper and lanthanum. This is when they learn about Michel et al's work on Ba-La-Cu-O (see, Khurana, 1987b). But the latter's work does not include electrical conductivity measurements of these compounds at low temperatures.]

CER's Task Direction operator activates Experiment Proposers operator, which finds the experiment materials as [La-Cu-O, LiTi2O4, Ba(Pb,Bi)O3,...]. The classifier chooses La-Cu-O as the current experiment material. The operator determines the substitution elements as La,Cu, and O.The classifier chooses La as the substitution element. One substituting element, Ba, is found by the use of external knowledge.

[This list of substituting elements could also include Be, Mg, Ca, and Sr, which are in the same group as Ba, but Bednorz and Müller rely on the new information from their literature search.]

Control passes to the Experiment Designers operator again, which determines the test properties, and issues an experiment form that describes the process conditions. The results of the experiments are generalized by the Hypothesis Generators, and are added to CER's knowledge base. Then, control passes to CER's Goal Satisfaction Supervisers operator, which reports the satisfaction of the current goal. The Ba substitutions in La-Cu-O are successful, as they lead to the discovery of the La-Ba-Cu-O superconductor with a Tc of 30K. Figures 6.3 and 6.4 summarize CER's simulation of the experiments leading to the discovery.

4.2 Overview of the Simulation

CER assigns time limits to all goals for their place in the agenda. In scientific research, assigning time limits to goals, strategies, methods and experiments is necessary, although they do not have to be accurate or even explicitly stated. Every researcher has a rough idea, at the beginning of his/her research as to how long a particular problem should be on his/her agenda. Lenat's (1979) AM system also allocates time and space constraints to its search activities.

CER's behavior considerably depends on the outcomes of these classification activities. Different parameter values given to the classifier may change the course of research from one goal, strategy, method, process, and experiment materials to another. Therefore, due to the time limits assigned to each of these, some of the activities implied by the alternative research paths may never be carried out.

Whether the time limits assigned to goals, strategies, methods, etc. has expired, is monitored externally in the current implementation of CER. Real times can be assigned by time related functions of the host programming language. CER also gives emphasis to expressing goals, strategies and methods as clear as possible.

In pursuing the third goal (G3), the integrated uses of CER's classifier plays a more important part than in pursuing the earlier goals, as it is used, not only in choosing between strategies and methods, but also in choosing the experiment materials, substitution elements and the substituting elements. The parameters of the system's classifier cover a practical but realistic range of properties or features such as availability, cost, hazards in use and likeliness to yield success. Some of these parameter values are drawn from knowledge represented in CER's knowledge base (e.g. from statements about the costs and relative hazards of the materials), while some others (e.g. availability, likeliness to succeed) require more background knowledge and experience.

5 Evaluation of CER

It must have become clear from the description of the system's simulation of the discovery of the high temperature superconductors that, in itself, CER's simulation does not constitute an outstanding achievement, as it extensively relies on knowledge provided to it by interaction for its critical decisions. However, this should not overshadow an important feature of the program: CER sets an example to how a complex empirical discovery program can be developed and trained in carrying out itsresearch tasks.

Scientific research and discovery is a complex process, and any computational model may turn out to be incomplete. Nevertheless, a comprehensive design for such a model serves as a good starting point. In this section, we will first take a critical look at the system's organisation of descriptive and definitive knowledge and its research and discovery operators which control the system's methodological rules. Then we will discuss the generality of the system's research strategies and methods, and conclude the section with a discussion on CER's methods of learning and discovery, together with some suggestions for future research.

5.1 Knowledge Organisation

Current discovery systems employ either a frame representation, a rule based or predicate logic representation, or some combination of these. CER's representation of descriptive and definitive knowledge in categorized predicate statements is based on the knowledge organisation methods that we developed earlier (see, Kocabas, 1989), and is different from the methods employed by other researchers.

CER also uses qualitative schemas (Forbus, 1984) for representing processes, but its schemas are somewhat simpler than those of IDS (Nordhausen & Langley, 1987), AbE (Karp, 1990), and COAST (Rajamoney, 1990). On the other hand, the program integrates schemas with its categorized predicate logic representation. This integration enables the system to reason about processes in an efficient way. To illustrate, consider the theoretical statement taken from CER's knowledge base,

the(reduces,process(substitution/of,tve,y,ybco),tc).

which states that the substitution of a tetravalent element forY in the Y-Ba-Cu-O superconductor reduces Tc. As is seen, in this statement, the process identification appears as an argument to a theoretical statement. This means that, new hypotheses from such theoretical statements can be generated by perturbations over the arguments before any direct reference to the process descriptions themselves. The categorization also facilitates the integration of methodological and control knowledge with the system's descriptive and definitive knowledge.

5.2 System Operators

CER's design aims at modeling various tasks carried out in scientific research. Its research operators cover a wide range of research activities, in which they can serve as a starting point for the design of such models. CER's task-based homuncularorganisation suggests that the computational modeling of scientific research and discovery can focus on different aspects of research. Accordingly, one can look at the discovery systems from a number of different angles. This indicates that a whole series of computational models can be developed for studying a number of different aspects of scientific research and discovery. In this way, the relationships between the elements of scientific research can be studied in a systematic way and in finer detail. CER's design enables us to make comparisons between the existing models of discovery, by their system operators, research tasks and activities. By looking at the research operators of these systems and comparing them with a comprehensive model of discovery, we can map their scope in the general framework of scientific research.

CER's operators contain some of the methodological rules of problem generators, problem choosers, strategy proposers and decision makers of Kulkarni and Simon's (1988) KEKADA with minor modifications or improvements.2 This supports their claim that these rules are general. However, an important difference between CER and KEKADA is the former's use of a classifier, while the latter relies on a series of rules for choosing processes and experiment materials.

Classifiers provide three main advantages over the rule based methods in such classification activities: 1) They are more flexible, for it is much easier to dynamically modify or change the evaluation matrix of a classifier than modifying or changing methodological rules, 2) They can pack more information per unit of computational space, and 3) They can be trained, while rules cannot.

In addition to the rules adapted from KEKADA, the system has many other rules that are general and applicable in research in other domains of physics and chemistry. For example, consider one of the rules of CER's Goal Setters:

If a physical property is an important property, then enhance that property in some substance.

-----------
2 For example, the PG1 and EP1 rules of KEKADA are also employed by CER.

This rule is not only applicable to superconductivity, but also to a series of other physical properties such as heat resistance, electrical resistivity, magnetism, radioactivity, semiconductivity and various forms of chemical reactivity.

Also, the majority of CER's hypothesis generating rules were abstracted from the research reports on high-Tc superconductivity, and are general. Only a small percentage of the system's rules are domain specific. This is important, considering th
e number of methodological rules the program uses.

CER does not generate new methodological rules. Currently, there are only a few systems with this capability in a nontrivial sense, such as Lenat's (1983) EURISKO. However, the addition of such rules to CER's operators, and their adaptation to the system's control structure is easy. This is due to the clarity of program's knowledge representation and its trainable homuncular organisation.

5.3 Research Strategies and Methods

CER's main strategies, methods and techniques are the strategies of explanation and theoretical and empirical study, methods of knowledge gathering, and the techniques of experimentation (i.e., various chemical processes). The system's strategies are quite general and applicable to research in different fields of science (see, Figure 6.2). For example, the system's behavior in trying to find explanations to the superconductivity of oxide coated films of aluminium and Ba(Pb,Bi)O3 can be considered as a general model in searching for explanation in scientific research. Similarly, the system's strategies for studying a phenomenon (by theoretical analysis and experimentation) are applicable to research in any experimental science.

Many scientists try to explain a scientific problem by their immediate knowledge before they refer to external sources (e.g. libraries, experts), and and carry out theoretical analysis of the concepts involved. There may be variations in the order of these activities, but the system's design allows these variations by its hierarchic control architecture, and even introduces some new ideas on modeling parallelism in scientific research.

The design idea of the system emphasizes that clarity in goals, strategies, methods and techniques in research activities is important. Lack of clarity in these can bring the activity of an intelligent system to a halt. CER addresses this problem both by the clarity of its knowledge representation, and by its mechanisms for reformulating its goals and strategies as necessary. (However, in the current version, the latter are conducted in a rather simplistic way, as they consist of rewriting the goal or strategy.)

A significant improvement of the system over the earlier systems is its assignment of time limits to the activity of its goals, strategies, methods and experiments. Lenat's AM (Lenat, 1979) and EURISKO (1983), also have this capability, but the latter are formal discovery systems. Assignment of time limits is very important in intelligent activities (such as scientific research which requires a high degree of planning), for such activities are constrained by the principles of economics and resource management.

CER's design includes two main types of theory revision: by experimentation, and by conflict resolution. The system can identify several different types of contradictions between different categories of knowledge. Although the conflict identification and resolution rules have not been fully implemented, they can easily be developed. This is one of the directions for the future development of the system.

In addition to its theory revision capabilities, CER has one important feature: It can revise its research strategies, methods and processes. This is a feature that very few other discovery systems (e.g. KEKADA) currently possess, but not to the extent that CER does.

5.4 Methods of Learning and Discovery

CER conducts search at three different levels: 1) search in directing tasks through the control rules of its level-2 control operator, 2) knowledge level search by the action rules of its level-1 operators, and 3) symbol level search by its classifier. In this, the system integrates the methods of learning at three different levels, and also provides an example to knowledge level control of symbol level systems at the same time, in the way its action rules control the system's classifier.

The system's methods of learning includes learning by experimentation, by generalizations (of its hypotheses), explanation based learning (of its control rules), and learning by classification. The program's generalizations of its experiment results and hypotheses consist of replacing individual objects names (e.g., of an element) with class names that appear as arguments in the expressions.

5.5 Control Knowledge and its Organisation

CER's control architecture has some similarities with blackboard control architecture (Hayes-Roth, 1985), particularly in that its message list can be seen as functioning like a blackboard. However, there are some basic differences. First, the former's knowledge organisation is entirely different from those implemented in the blackboard systems. Second, the latter systems use a global blackboard, while CER uses its message list only as a medium of communication between its operators, and can also use a multiplicity of such message lists in a hierarchic homuncular organisation (Kocabas, 1991b). Third, the system's operators only represent methodological knowledge as opposed to the "knowledge sources" of the blackboard systems, which may also represent descriptive and definitive knowledge. Finally, blackboard systems use sophisticated schedulers to determine which configuration of the blackboard should cause the activation of which knowledge sources, whereas CER can learn its control knowledge.

Several other systems such as LEX (Mithchell, Keller, & Kedar-Cabelli, 1986), PET (Porter & Kibler, 1986), and PRODIGY (Minton & Carbonell, 1987) also have the capability of learning control knowledge by explanation based methods. Among these, PET has the additional capability of learning control in sequences of activities or "episodes". However, these systems operate in relatively narrow domains. They do not employ message lists as CER does, but use the states of their dynamic memory as the problem states. Consequently, they are "flat" systems as opposed to CER's essentially hierarchic control architecture.

A significant point about the organisation and control of CER's methodological knowledge is that it provides a prototypical model of how scientific research and discovery can be taught. According to this model, methodological knowledge (in the form of action rules) can be taught by instruction, while the control and coordination knowledge, by explanation based methods.

What makes CER interesting in this respect is that it illustrates how an intelligent system can be "trained" at least in certain aspects of scientific research such as proposing and choosing research goals, strategies and methods. Its "open" control architecture provides the system flexibility, in which its descriptive and definitive knowledge and its methodological rules can be freely amended or changed. So, if the system performs poorly after the changes, it can be retrained. When new methodological rules are added, the system's control structure can adapt to these changes more easily than the control procedures applied in many other discovery systems. CER's control operator can also be trained by other methods such as classification besides explanation based generalization. However, for this task, the classification requires a large number of training examples, and therefore, is cumbersome.

5.6 CER's behavior on further research in high-Tc superconductivity

We have described how CER models the discovery of the La-Ba-Cu-O superconductor. With its increased knowledge after the discovery, the system can also reproduce the subsequent discoveries of the Y-Ba-Cu-O, Tl-Ba-Ca-Cu-O and Bi-Ca-Sr-Cu-O superconductors. In modeling the discovery of the Sr-La-Cu-O superconductor with a Tc of 40K, CER creates the goal to study the derivatives of La-Ba-Cu-O.

Superconductivity is an electron-related phenomenon and CER can use the other electron-related physical properties in directing its future research. It would therefore propose to use Bi and Pb substitutions for Cu in La-Ba-Cu-O and Y-Ba-Cu-O superconductors on the basis of electronegativity. (Bi, Pb, Tc, Re, Hg, Ag and Cu have the same electronegativities, but among these Tc is very expensive besides being radioactive.) Similarly, the system would propose the substitutions of Ta and Al for Ti in LiTi2O4 on the same basis. We know that in the later experiments, Cu has been successfully replaced by Bi yielding Cu-free superconductors with even higher Tcs. Since its field of application is still active both in theoretical and experimental sense, CER's methods can be put to further test against future developments and more lessons can be learned about the issues of computational modeling of scientific research in an active research area.

Our observations over the research activities in high temperature superconductivity suggest that modern scientific research is not totally a serial activity as it was in the past centuries. It can branch off into smaller research projects each of which can be carried out independently and in parallel. This creates a number of coordination problems which were not faced by the scientists of the earlier centuries. How must the parallel projects be carried out for maximum efficiency in terms of time and resources? How can they effect the overall course of the main research programme? These are some of the questions that face modern scientific research programmes.

As an illustrative example from CER's own field of application to the problem of coordination in research, it is interesting to see that the Tl-Ba-Cu-O and Bi-Ca-Sr-Cu-O superconductors with transition temperatures over 100K were not discovered until after all the rare-earth element substitutions on the Y-Ba-Cu-O superconductor were completed by research groups around the world. Was this because all these research groups were following an established set of heuristics on strategy choice and continued to follow it? Or was it because the decisions about the next strategy were dependent on the complete results of experiments on the La-Ba-Cu-O based superconductors? Is there a more effective set of heuristics on strategy choice? How do the adopted strategies effect the course of research? We hope that this study will open the way to find answers to some of these questions.

6 Conclusions

In this paper we have described CER, a system that models the discovery of high temperature oxide superconductors. The program constitutes a more comprehensive computational model of scientific research and discovery than its predecessors. Our observations into research and discoveries in high temperature superconductivity indicate that scientific research is directed by methodological rules distinct from the rules used in other intelligent human activities in everyday problem solving (e.g. rules derived from commonsense and technical knowledge.)

The system's design clearly reflects our observation that scientific discovery is not a process in itself, but the end result of a series of processes which is called scientific research. Planning is no doubt an important and indivisible component of scientific research, and is guided by some methodological rules like those of CER, as well as by commonsense and technical knowledge. Although planning can take place in various stages of scientific research, such as strategy choosing, experiment design, experimentation, and data collection, there is more to scientific research than planning. This is because there are a series of research activities which cannot be characterized as planning (e.g., conflict identification and resolution, hypothesis formation, hypothesis verification, and generating explanations).

The CER system improves on its predecessors by providing a more detailed treatment of the elements of scientific research. Consequently, the system has a more varied and detailed search space than any previously developed discovery system. Another important improvement introduced by the system is the assignment of time limits to its tasks and activities in pursuing its goals, strategies, methods and experiments. CER's design also emphasizes the need to reformulate its research goals, strategies and methods where necessary, to maintain uninterrupted activity in research.

The program conducts search at three levels: symbol level, knowledge level and control level. The symbol level search is carried out by by the program's classifier, while knowledge level search is conducted by its rules of action, and control level search by the system's control operator. CER's knowledge organisation facilitates the integration of search at different levels, and the use of such different methods of learning as classification, abstraction, and explanation based learning.

Modern scientific research is a complex enterprise usually requiring a large number of small but necessary inventions and discoveries of tools, techniques and subsidiary hypotheses before its main goals and strategies are accomplished. Even the design of an experiment requires a great deal of background knowledge about the methods, materials, processes, experimental tools and their proper arrangement, and about the conditions of measurement. Therefore, a few hundred rules in a computational system can only provide a sketchy model for scientific research and discovery.

Nevertheless, what is interesting and stimulating about the computational modeling of discovery is that it provides us the insights on the reasoning behind the critical decisions taken by the scientists during the course of their research. Moreover, human mind too has its own limitations, and a successful computational model surfaces the unproductive decisions taken by the scientists as well as the good ones during their research. It also provides a panaromic view of the process of research and may even help to develop a methodology (or methodologies) for better strategies for research in different fields. Usually a large number of rules that are used in any research leading to a discovery are informal rules acquired in years of experience and are not recorded or taught in a systematic way.

CER provides a detailed reference for evaluating the research methods, and the methods of representation and control of various models of scientific discovery, as different models may represent different aspects of discovery. The system also provides guidelines for finding how (or how much of) a particular discovery can be effectively represented by a computational model with the methods of machine learning.

The system's pool of methodological knowledge can be used as a starting point for exploring more methodological rules in different domains of research. Currently, there is little known about the methods used by scientists in generating original methodological knowledge, and the role of generalization, abstraction and analogy in generating such knowledge.

The system's homuncular control architecture introduces a new perspective into the design of computational models of research and discovery. An advantage of this hierarchy is that, a system can learn its control knowledge for distributing its tasks over its subsystems by explanation based learning. We have described how explanation based generalization has been applied to train CER's control operator. This control architecture can be extended downwards to the system's action rules, adding more flexibility into the representation of the system's methodological and control knowledge.

Another advantage of the hierarchic organisation of such operators is to enable the system to make maximum possible use of parallelism. However, as Lenat and Feigenbaum (1987) argued in a similar context, there is a limit to what parallelism can offer in modeling scientific research. This is because, scientific research uses a good deal of planning which requires much sequential reasoning. In any case, we hope that the hierarchic homuncular architecture of CER will provide a computational testbed for the uses and the limits of parallelism in such systems.

References

...

Kocabas, S. (1989). Functional Categorization of Knowledge: Applications in computational modeling of scientific research and discovery. PhD Thesis. Dept. of Electronic and Electrical Engineering, King’s College London, University of London.

...

Reference on Superconductivity

Beasley, M.R. and Gaballe, T.H. (1984). Superconducting Materials. Physics Today. October 1984, 60-68.

Edelsack, E.A. (1973). Fundamentals of Superconductivity. In The Science and Technology of Superconductivity. W.D. Gregory, W.N. Matthews Jr, E.A. Edelsack (eds.) Vol. 1, 5-24. New York: Plenum Press.

Khurana, A. (1987a). Search and Discovery: Superconductivity seen above the boiling point of Nitrogen. Physics Today. April, 1987, 17-23.

Khurana, A. (1987b). Search and Discovery: Bednorz and Müller win Nobel Prize for new superconducting materials. Physics Today, December, 1987, 17-19.

Langenberg, D.N. (1987). McGraw Hill Encyclopedia of Science and Technology. 609-617.

Thursday, August 30, 2007

ISLAM AND SCIENCE

ISLAM and SCIENCE

Dr Sakir Kocabas

1. Introduction

The contribution of Islamic civilization to scientific development has been a subject of debate both by Muslim and non-Muslim writers since the 19th century. The controversial position of Muslims in such debates is due to the fact that they have lost their effectiveness in science and in other fields of life after a certain stage in history.

When we look at the "Islam and Science" debates in the last two centuries, we see several approaches in general. The first one, which was developed by European historians of philosophy and science in the last century and lasted until World War II, claims that Muslims have not contributed to civilization in the fields of science and philosophy anything significant beyond being the "commentators" of Ancient Greek philosophy. We see such eminent philosophers as Bertrand Russell1 among the proponents of this view.

The second approach is the one developed by some Muslim writers in need of intellectual defense for Islamic civilization against the views of European historians of science and philosophy. According to this approach, science and philosophy are not only not needed for human happiness and well being in any case, but even harmful activities. Consequently, there is nothing to worry about the fact that Muslims are lagging behind in science and technology.

The third approach, which is also developed by some Muslim scholars, claims that Muslims have conducted pioneering studies in science and philosophy in the Classical Era of Islamic civilization, but were held back in scientific activity by devastating external influences such as the Crusades and the Moghul invasions.

The fourth approach, which was adopted by some western historians of science such as Sarton2, and more recently by Huff3, recognizes that Muslims made major contributions to science in the early period of the Islamic civilization, and attempts to explain the decline of scientific activity in terms of social, cultural, economic and legal issues.

The fifth approach which is being advocated by some Muslim writers4, claims that the Muslims' earlier contributions to science have nothing to do with Islam, as these studies were conducted in a non-Islamic, secular spirit, and if at all, Muslims will only be successful in the same way. The proponents of this approach also want to extend the separation of "science” and “religion" which took effect after the Renaissance and Reformation in Europe. They carry the related arguments about Christianity over Islam without a detailed conceptual analysis, and try to establish an "Islam and science" division in the Muslim mind.

Our approach will be a different one bringing a detailed analysis and a new synthesis to the subject, although carrying elements from some of the above stated approaches. We will try to describe our approach in some detail below, but before that, let us take a closer look at the other five approaches.

The first approach has lost its credibility today among people who are acquainted with history of science, but nevertheless is still being argued by some short sighted writers. We will show below with examples why this approach is based on mistaken views. The second approach is being defended by naive Muslims in good will, without being aware of what they advocate seriously contradicts the concept of knowledge (=‘ilm) and a series of other related concepts in the Qur'an. The third approach on the other hand, is in the error of exaggerating the relationships between scientific motivation and social, political and economic conditions. In this study, we will start from the point reached by the fourth approach. We will attempt to unearth the basis of the problem, namely, what have happened after a period of intensive learning, investigation leading to the development of a scientific research tradition in the Classical Era of Islamic civilization, i.e. the 8-11th centuries. But before we do that, let us try to explicate why the fifth approach starts from mistaken premises.

In order to decide about how much of the historical reality is reflected by the claim that Islam had nothing to do with the Muslims' contribution to science during the Classical Era (roughly 8-11th centuries), we have to bear in mind a few historical facs. It is well known that the Europe of the Middle Ages turned to scientific investigation as it turned away from the views of the Church about the world and nature. Whereas, we observe that the Arabic and Turkic populations who did not have any noticeable scientific activity before Islam, began to produce their scientific and philosophical works after they accepted Islam. Therefore, the Muslims in that era conducted their scientific and philosophical studies with the motivation and within the intellectual framework that they acquired largely from Islam.

To see that it was Islam that gave the Muslims of the Classical Era such great motivation in science and philosophy compared to their contemporaries, we have to understand what sort of conceptual transformation that Islam had made in their minds. What were their concepts of the world and reality before their acceptance of Islam? And how did their concepts of kmowledge and reality transformed by their acceptance of Islam? We can ask similar questions for the christian peoples in Europe in the Middle Ages: What sort of conceptual transformation took place in Europe after the 12th century? What sort of conceptual changes lay behind the scientific development that took place after the 16th century?

Some contraversial european historians, before answering such questions kept claiming that scientific thought in Europe was developed from ancient Greeks, in an attempt to blur the effects of Islam on the Renaissance, the Reformation and the subsequent scientific endavour. This opinion has long lost its historical credibility, but there may still be some who would like to defend it. A simple question should be sufficient to bring this claim into perspective: Why had the Europeans not been able to start the Renaissance and develop their scientific thinking much earlier, as they had the works of the ancient Greek philosophy and science in their hands for over a thousand years? Why had Europe waited another 1200 years for the development of scientific thought in Europe? The answers to these questions can be: Either the Europeans were unable to understand the works of the ancient Greeks during that period, and began to understand them only after reading the works of the Muslim thinkers such as Ibn Rushd, or the ancient Greek thought by itself did not provide the concepts and motivation good enough to initiate such a scientific enterprise, or both. (Despite their controversial nature we can say that all these possibilities are worthy of a detailed investigation.)

When we study the history of Islamic thought, we see that the Muslims have been in a large scale and multi-dimensional research activity in science and philosophy until the 12th century, lasting as sporadic and isolated activities until the 15th century. During this period, on the one hand they studied the works of ancient Greek and Indian scholars5, and on the other they developed completely novel approaches and methods of systematic investigation of nature. We will explain this with examples below.

The development of these different approaches and methods require a certain creative intellectual capacity. Scientific creativity and its relation to motivation and conceptual structures is a subject which we have been studying recently6. In order to have an insight to the nature and extent of the conceptual transformation that Islam has brought to the Muslims in the Classical Era let us now take a look at certain concepts and their interrelations in Islam, particularly in the Qur'an.

2. The Concept of Knowledge in The Qur’an

Throughout the history of Islam, a great many things have been said about the concept of knowledge (= ilm) by Muslim and non-Muslim writers7. These will be outside of the scope of our present concern. We have conducted a 6-month study over the concept of knowledge (= ilm) in the Qur'an in 1986. A summary remarks on the grammar of the concept "ilm" in the verses8 of the Qur'an in which the word takes place in its root form and derivatives can be given as follows:

1) First of all, the concept of knowledge (= ilm) in the Qur'an is an indivisible, holistic concept. (In this respect, there is a certain important difference between the concept of "ilm" in the Qur'an and that of today's Muslims. We shall return to this point below.)

2) The Qur'anic concept of knowledge covers all knowledge, without making any distinction between sciences as such.

3) The concept of knowledge in the Qur'an does not allow the qualification of true/false knowledge, because the word is always used in certain relation to reality. On the other hand it can be used in indicating as someone having or not having the knowledge of something, as in "... has the knowledge of" or "... has not the knowledge". (See verses: 006.108, 006.119, 030.029)

4) The Qur'anic concept of knowledge indicates that reality is grasped with knowledge, and the limits of knowledge determines the extent of the awareness of reality and consciousness. (027.084)

5) In the Qur'an, it is stated that Allah has encircled (= wasia) everything with knowledge. This might mean that knowledge extends through space. (006.080)

6) Also, the expression "a knowledge from Allah’s presence" (= min ladunna ‘ilma) takes place in verses as a particularly different kind of knowledge, in contrast to the knowledge that extends through space. (018.065)

7) The fact that Allah has power over everything, can be known by man. (065.012)

8) The verb "to know" (= alima) is applicable to both individual human beings and to a nation (= qawm) as in the expression "a nation that knows". (002.230,006.097, 007.032)

9) "Those who know and those who do not, cannot be the same". (039.009)

The conceptual grammar of "knowledge" which we have summarized here, constitutes a part of a wider conceptual network in the Qur'an. The importance and necessity of knowledge in Islam, emerges as a prerequisite for being human. Only through knowledge man wants to know how far his abilities and responsibilities extend.


3. The Importance and Necessity of Knowledge

What is it that makes man important, while he occupies such a small space-time region on a small planet compared with the astronomical dimensions of the heavens that emcompass hundred billions of galaxies? An answer can be given to this question in Islamic thought is: What makes us important is the fact that Allah has given us some superior qualities among His creation, notably our superior ability of learning and thinking, by which we can increase our contact with reality and consciousness.

For a man believing in Allah, the most important question of science is this: "How does Allah administer or rule the heavens and the earth?" Another question that follows this is: "Can human mind comprehend how Allah rules the heavens and the earth?" This may come as a surprise to some readers, but both questions have answers, and the answer to the second question within the Quranic framework, is a definite "yes". The answers are given in the Qur'an as follows:

"Have they not studied the Administration of the heavens and the earth, and what things that Allah has created? (= awa lam yanzuru fi malakut as samawati wal ardi wa ma khalaqallahu min shay)" (007.185)

"Allah is He, who created the seven heavens and of the earth the like of them. The instruction (= amr) is sent down through them, so that you know that Allah has power over everything, and that Allah has encompassed everything with a knowledge." (065.012)

As can be seen, in the first verse above, man is motivated to a systematic study of the Administration of the heavens and the earth, or in other words, over the “laws” and “principles” of the interactions which take place in them. The word "study" (= yanzuru fi) in this verse indicates reference to both observation and thinking, and the term “nazariya” (= theory) has been derived from this word in the classical era of Islamic thought. Accordingly “nazara fi”, can be understood as “systematic observational and theoretical study”.

In the second verse, we see how the concept of instruction (= amr) that emerges as a key concept in the "Administration of the heavens and the earth" is closely related to the concept of knowledge (= ‘ilm). In this verse, the expression "so that you should know", clearly indicates that man can know how Allah keeps the events happening in the heavens and the earth under His knowledge and His control. The knowledge of this should indeed be the most important gain for mankind.

Notice that the question: "How does Allah direct the heavens and the earth?" is, in a narrower sense, also the fundamental question of physical science: How do various phenomena take place in nature in an ordered way? How did the order that we see in the universe come about? We shall return to this below.

It is with these motivations that the Muslims in the classical era turned towards understanding themselves and the space that they lived in. This was the motivation behind their occupation with such sciences as mathematics, logic, physics, chemistry, botany and astronomy. The prominence given to knowledge, learning and thinking in Islam, is a pointer to the importance that is given to man. The Prophet (s.a.) had taught Muslims through the Qur'an, the importance of knowledge, learning, and thinking. And the Muslims, through their books between 8-12th centuries, taught Christians and Jews in al-Andalus how to correctly reason about nature. Regretfully, due to serious changes in their concept system, which started to take place in the 11th century, Muslims began to loose motivation to science, and as a result, their influence in science gradually dwindled and died out in the subsequent centuries. The present state of the Muslims regarding scientific activity constitutes a complete contrast with that of the Islamic civilization in the classical era. The concept of knowledge has now lost its significance, while some secondary or obscure concepts have acquired prominence in the Muslim's mind today.


4. The Concept of Knowledge and Motivation in Islamic Civilization

The concept of knowledge (= ‘ilm) had such a central role in the Islamic civilization during the Classical Era, that this made the famous orientalist Franz Rosenthal9 to coin his observation that there had been no other civilization in history, in which the concept of knowledge had played such a central role as in the Islamic civilization in the classical era, including the Western civilization.

It can be said that, correct thinking develops over correct premises, a correct concept system, and correct observation and inference methods. Correct thinking affects motivation to research positively, as it leads to a better understanding of the world, and to useful discoveries and inventions. Systematic knowledge rests on a correct concept system. For this reason, in the process of the improvement and increase of systematic knowledge, the concept system on which it stands, plays an important role. In our own study on scientific creativity, we are convinced that the concept system has a critical role over scientific motivation and scientific creativity.

With the birth of Islam, the development of tendencies towards learning, quickly turned into a campaign, and within a short span in historical terms, it caused the formation of scientific motivation among Muslims. The campaign for learning had already started in the Medina period, when the Prophet (s.a.) introduced a policy of releasing the prisoners of war (who were then treated as slaves) on the condition of teaching Muslim children reading and writing.10 In the same years, it is also known that Muslims opened a school in Medina. Such an educational policy could not even be dreamt of in those ages. In this era, Muslims considered learning and acquisition of knowledge as a paramount duty of being human.

The conceptual change, and the learning campaign that developed in parallel, started giving its results in scientific development within less than a century, during the time of the Umayyads. This was followed on in an even stronger form during the Abbasids (750-1254), particularly by the establishment of Bayt al Hikma (= the House of Wisdom) by the Abbasid ruler Harun al Rashid. This royal institute, being unique in its kind in the history of science, had started as a center for translation for all the works of ancient cultures, from poetry to medicine, astronomy and philosophy, but soon turned into a center for original scientific work.

With their pioneering work in chemistry, the Umayyad prince Khalid bin Yazid (665-704), Ja’far al Sadiq (700-765), Jabir bin Hayyan (approx. 721-805), Zunnun al-Misri (d. 860), Al Razi (= Alrhazes, 860-925), Ibni Sina (= Avicenna, 980-1037) and Al Matruji (? - 1007) laid the foundations of modern chemistry as an experimental science. In physics, we see Al Kindi (= Alkindus, a 796-872) and again in physics with his research in optics Ibn Haytham (a. 965-1051); in mathematics Al Khawarizmi (a. 780-850) and Thabit bin Qurra (a. 834-901); in zoology Jahiz (a. 776-869); in astronomy Bayruni (a. 973-1051), al Zarqali (1029-1087), and Ibn Shatir (d. 1375); in medicine Al Razi (= Alrhazes a 864-925) and Ibn Sina (= Avicenna, y 980-1037), and in medicine, physics and philosophy Ibn Rushd (=Averroes, 1126-1198) among the prominent Muslim pioneers of science and philosophy.11 We can now take a brief look at the important contributions of Muslims to chemistry, physics and mathematics in this period.

The Muslims' contribution to the development of chemistry as an experimental science has been crucial in several respects. First, as can be seen in Jabir bin Hayyan's collection, contrary to the ancient Greek tradition, Muslims thought that chemical substances are composed of a set of basic properties combined in certain proportions, and that these properties could be separated and recombined to yield new substances12. Notice that this idea introduces the concepts of analysis and synthesis and an accompanying methodology for research into chemistry for the first time in its history. Secondly, their claim that these basic properties are held together in balance (= mizan) in chemical substances, introduces the notions of stability and equillibrium13. Third, but not the least important, they have described their chemical experiments in such a way that it is possible to see all the parameters of a modern chemical experiment. For example, we see everything in the description of an oxidation experiment of mercury conducted and described by Al Matruji14, that the description of a modern chemical experiment should include: a) reaction materials, b) their quantities, c) reaction equipment used, d) reaction conditions, e) reaction products, and f) their quantities. The same approach can be seen in the experiments of other early Muslim alchemists. It can hardly be denied that these are very important contributions to the development of chemistry as an experimental and theoretical science. Using this methodology, Muslim scientists had isolated inorganic substances such as alkalis and hydrochloric, nitric, and sulphuric acids and ammonia. They conducted distillation experiments on organic substances and separated several basic organic substances.

In the field of physics, Ibn Haytham's work on parabolic and spherical mirrors, and glass magnifiers, and his work on the refraction of light in general formed the basis of optics. Ibn Haytham studied the behavior of light when passing from a less dense medium to a denser medium, and led the way for the discovery of the famous sinusoidal law of refraction of light15. This was the second law discovered in physics after Archimedes' well-known discovery.

Perhaps the most important contribution of Muslims to the development of classical physics has been an indirect one; by the invention of algebra. Al Khawarizmi's (780-850) introduction of the concept of equation and the use of variables in place of numbers in mathematical problem solving, is regarded as one of the most important abstractions in the history of mathematics. The first abstraction in mathematics was the invention of decimal numbers and the introduction of zero. The second important abstraction was transforming geometry into an axiom system by Thales (640-546 B.C.) and Euclid (4th century B.C.) After Khawarizmi’s invention of algebra, the next important abstraction in mathematics took place nine centuries later, with the introduction of the concept of function by Newton and Leibniz in the 17th century. With their invention of algebra, the Muslims had shown how complex arithmetical problems could be easily solved by symbolic equations. This was a serious departure from the geometrical techniques used by ancient Greeks. The development of algebra, coupled with geometry, has led to the development of analytical geometry and trigonometry, whose foundations were also laid by Muslim mathematicians. Without algebra, there could be no analytic geometry, calculus and classical physics, and consequently no industrial revolution. In astronomy, we know that the Muslims had not only discovered that the earth was spherical, but also measured the distance between the longitudes during the Abbasids. They had also considered the heliocentric system of planets several centuries before Galileo and Copernicus. In fact, Copernicus’ book contains astronomical drawings identical to those drawn by Tusi several centuries earlier. 16

When we compare the studies that the Muslims conducted in science between 8-11th centuries to the scientific developments in Europe since the 16th century, of course they might look minimal. However, for a correct evaluation, the Muslims' success in several fields of science in these centuries, must be viewed in comparison and contrast with the happenings in the other parts of the world during the same centuries. Then the Muslims would stand out as unrivalled in scientific activity and motivation among their contemporaries. The methodological error that is frequently fallen into in the so called "Islam and Science" debates is the result of an illicit evaluation.

The important point that must be noted is: The Muslims' studies in this period had led to the development of a "research tradition" based on experimentation and observation for the first time in the world history as we know it today.17 Right here we are faced with the central questions of our analysis: Why did the Muslims not continue their activities in science after such a successful start. Why did not they develop a systematic and multi-dimensional research that formed the basis of sciences such as physics, chemistry and astronomy? What had happened in the history of Islam that scientific and philosophical studies gradually slipped out of the field of interest of Muslim scholars? What kinds of psychological, social, economic and political factors caused Muslims to leave research in the fields of science? These are the questions that must be asked in the "Islam and Science" debates until the correct answers and explanations are found.

The decline of Muslim scientific activity after a brilliant and successful start has attracted the attention of a number of Muslim writers and historians since the 17th. century with the Ottoman scholar Katip Chelebi, down today. Katip Chelebi's insightful observations had related the decline to a conceptual problem, but his warnings did not start a serious movement in the Ottoman administration. Until recently most of the proposed explanations, were directed to explain the decline with political causes such as the Crusades in the 12th. Century and the destruction (especially of libraries and men of knowledge) by the Moghul invasions in the 13th. Century; and to economic causes such as the decline of the importance of the Silk Road, and the accompanying shift of economic power from Islamic countries towards Europe as a result of the geographic discoveries that took place in the 15th and 16th Centuries. All these explanations may have some truth in it, but they do not explain the loss of motivation in science in Muslim societies as a whole.

Huff,18 a leading figure in the field of comparative historical study of science, states that Muslims had made a brilliant start in the middle age, and quickly gained and established a clear superiority over China, India and Europe in almost all the fields of scientific activity, but that their activity started to decline after the 12th century. He examines the causes of decline through several interrelated issues which can be summarized in three categories: 1) the role of the scientist in the society, 2) the basic beliefs of the scientist about nature, and 3) the existence of the social and legal institutions that support the development of science.

When he examines the role of the scientist, Huff classifies the intellectuals in the Islamic society of the Middle Age into three classes: fuqaha (= jurists), mutakallimun (= theologians), and philosphers. He recounts al-Kindi, al-Farabi, al-Razi (Alrazes), Ibn Sina (Avicenna), al-Biruni, and Ibn Rushd among the Muslim philosophers who have contributed the development of early modern science. He states that the philosophers could not maintain influence on their societies after some their ideas became the target of the theologians, causing the former to lose support for their activities from the Muslim population. In this period, the theologians used the opinions of al Ghazali and Ibn Taymiya to attack philosophers. Huff also states that the fuqaha (=jurists) from time to time severely criticized the theologians themselves. According to Huff, the Muslim philosophers did not have clearly defined and valid social roles in their society. Those who studied philosophy and science mostly had an additional duty accepted by their societies. For example, Ibn Rushd was also a jurist, and Ibn Shatir was a muwaqqit preparing prayer timetables for the daily prayers of Muslims.

From the viewpoint of the basic beliefs of the scientist about the nature, Huff relates several principles for scientific inquiry, some of which can be listed as: 1) a rational and objective investigation of nature in order to understand it operations is possible and desirable, 2) such an investigation makes use of empirical methods, 3) might make use of mathematics and deductive reasoning, 4) the scientist should eschew all voices of authority, tradition and popular opinion in question of how nature functions, except to the extent that the information is rationally verifiable, and 5) the scientist must practice systematic doubt, and sometimes endure a prolonged uncertainty in his disciplined search for an understanding of natural phenomena. Most of these principles were practiced by the Muslim scientists, as they had developed and applied empirical and methods, particularly in chemistry and optics, and they were the first to use mathematics in a field of science, namely in astronomy.

Huff also notes the negative role of the doctrines developed by the theologians, which rendered systematic studies of nature a meaningless activity. Lastly, he states that in the Medieval Muslim world, the legal and social institutions were not developed to support the activities of Muslim scientists. The colleges (= madrasa) that were developed by Muslim foundations focused on teaching classical Arabic, Qur’anic interpretation (= tafsir), prophetic tradition (= hadith), logic (= mantiq) and theology (= kalam), and limited teaching of mathematics and medicine. Later, college education became confined to “religious sciences”. Study of astronomy and mathematics were the subjects of observatories, while medical studies were carried out in hospitals. The educational system of the Madrasa were based on mastering the subject of individual teachers, rather than being organized in faculties. The curricula of these colleges were determined by the foundation (= waqf) who provided its finances, and as they did not give much respect for sciences other than the “religious sciences”, studies in experimental sciences and philosophy were left to individual efforts. In contrast, the European universities founded several centuries after their Muslim counterparts obtained their legal and educational independence soon after.

In summary, Huff tries to explain the decline of science in Islamic civilization primarily by the failure of Muslims in developing the necessary institutions and in providing legal autonomy to scientific activities. These are important insights into the nature of this historical phenomenon, and we agree with Huff on these accounts to a certain extent, but we have to go deeper below the social and legal causes. Social and legal order in a society is continuously shaped and reshaped by the beliefs and motivations and the order of concepts of the society. For this reason we claim that the deeper causes of the decline has to be searched for in the conceptual changes that began to take place in the language and minds of the Muslims around 11th century.

For a historical change in such proportions, we never found satisfactory the explanations that rested only on political and economic causes. We were considering since the early 1970s that there had to be more convincing explanations for such a grand scale decline, but were not able to figure out the real causes. Years later, after a debate when doing a PhD in London in 1986, a question began to emerge as to whether the Muslims used certain Qur'anic concepts such as "aql" (= using intellect) and "ilm" (= knowledge) correctly, i.e. in the same grammar as in the Book. Then, following a three-month study on the concept of "aql", we had realized that this word was used quite differently from its native grammar in the Qur'an.

Our later studies on several concepts which we called "The AMR Constellation of Words", revealed an important conceptual network which was directly related to the cosmic order. These concepts were completely ignored by today's Muslims, while they had at least partially constituted the basis of the ideas of many Muslim scholars about reality in one form or another, between 8-12th centuries, from Al Kindi to Ibn Rushd.
This study was indicating that there had been a serious conceptual break in the history of Islamic thought. It began to look clear that this conceptual break gradually diminished the motivation for scientific research among Muslims, and as a result of this had they failed to develop the necessary social institutions and legal support, but not the other way round. In the next section we discuss this conceptual degeneration in some detail.

5. The Conceptual Disintegration in the 11th Century: The Division of the Concept of Knowledge

We can say that the most important conceptual change in the history of Islamic thought had taken place on the concept of "ilm" (= knowledge). This concept is used by today's Muslims as divided into two broad and disjunctive categories as "ilm ad-din" (= religious knowledge) and "ilm ad-dunya" (= worldly knowledge). In a study conducted on this concept both in the Qur'an and the six hadith books which compile the sayings and actions of the Prophet (s.a.), we noticed that there was no trace of such a division in these sources. On the contrary, the concept appears in these sources as an indivisible whole. This indicated that such a conceptual division on knowledge as "religious knowledge - worldly knowledge" had been introduced in the latter half of the 10th century, about a century later from when the hadith were compiled. We can now say that the conceptual division of "knowledge" began to take serious effect in the Muslim thought in the 11th century, and before the end of the 12th century with the exile of Ibn Rushd, it became an accepted norm in the Muslim world from Cordova to Baghdad. In this way, the integral concept of knowledge (= ‘ilm) with its close relation to the concept of reality (= haqq), left its place to a divided concept of knowledge and reality.

At this point some might argue that the qualification of knowledge as "religious knowledge - worldly knowledge" had resulted from a necessity due to the increase in the amount and variety of knowledge. We maintain that this conceptual division was introduced by the 10th century Muslim theologians (particularly by the Ash’arites) for certain other purposes. These purposes primarily included the "protection of Muslims from certain heretical beliefs and ideas". Whatever their aims were, the conceptual changes introduced by the theologians can hardly be taken as a mere manifestation of goodwill, considering the drastic results that they entailed. They introduced both a simplistic and a contradictory set of concepts in place of a rich and complex, but a consistent conceptual structure in the Qur'an. Simplistic, because it overlooked the fact that the Qur’anic concept of knowledge includes all expressions which reflect reality; and contradictory because it ended up denying reality itself.

The division of knowledge as "religious knowledge - worldly knowledge" in the 11th century resulted in questioning the status of physics, chemistry, astronomy, mathematics and logic, which were now regarded as "worldly knowledge". In the beginning it had been stated that these sciences "had, positively or negatively, nothing to do with the religion"19, then the idea that knowledge of these sciences are superfluous, began to creep in the minds of Muslims. Quite often, these sciences were considered as "useless knowledge" as opposed to the "religious knowledge" which was regarded as "useful" by definition. Muslims who were engaged in such sciences, while previously being supported by the public and rulers alike, began to lose support, or even became isolated from society.

As an example, we can cite the case of Ibn Rushd (= Averroes) who was the chief justice in Cordova and the doctor of the Caliph during the Andalusian Umayyads in the 12th century. Ibn Rushd was persecuted by the theologians for his ideas about science and philosophy, when the latter had established their political power in the Umayyid administration. The theologians publicly burned Ibn Rushd's books and wanted to him to be sentenced to death, but he was narrowly saved by the Caliph owing to his earlier services to the state, and was exiled to North Africa instead. Reactions to men of science to the extent of the Inquisition, are in general not observed in the history of Islam. But, as we can now see more clearly, the dismissal of philosophy as heresy, and science as a useless enterprise, has deeply influenced the motivation of the Muslims away from such activities conceptually, psychologically and politically.

On the other hand, many early Muslim philosophers also had based some of their ideas about reality on several concepts adopted from ancient philosophers like Plato and Aristotle. Such controversial concepts as existence (= wujud), infinite past (= qidam), and intellect (= ‘aql), and the logical concepts such as universal (= kulli) and particular (= juz’i) were being used in philosophical and theological discussions instead of the rich network of Qur’anic concepts about the whole reality. Unending debates were taking place between the philosophers and the theologians where the latter were also using the same concepts as their opponents, in addition to the evasive dialectic methods. This led to the gradual isolation of the philosophers from their society, as they could not defend their position effectively in a concept system detached from the Qur’an. It is no surprise that the most original contribution of Muslims have been in the fields of chemistry and algebra, where they relied on the Qur’anic concepts of “balance” (= mizan), and “shay”.

The most interesting outcome of this conceptual break was yet to come. The division of the concept of knowledge, which appears as a unified concept in the Qur'an, as "religious knowledge - worldly knowledge", and the almost unanimous acceptance of this division by the Muslims after the 11-12th century meant the acceptance of secularism by the Muslims in thought as a doctrine. In other words, by accepting such a conceptual division, Muslims would have effectively accepted the separation of their "religion" from their "world". Despite this, when they were faced with the situation of forcefully accepting secularism as a legal principle after nine centuries, they reacted strongly. The rationality of this reaction needs to be evaluated in the same framework with the quiet acceptance of the division of the concept of knowledge in the 11th century.

The conceptual change that took place during the 11th century did not confine to the concept of knowledge, but also spilled over a series of other concepts related with "creation". Abandoning the understanding of a cosmic order based on the concepts of “haqq” (= reality) and "amr" (= instruction) and a set of related concepts in the Qur'an, the theologians adopted a view of physical space based on the hypothesis of continuous creation-annihilation. This hypothesis was developed from a simplistic concept of "creation" which resulted from the reduction of a dozen concepts in the Qur'an related with "creation". In this process, the theologians reduced such concepts in the Qur'an as khalaqa, jaala, baththa, nabata, fatara, banaa, sawwara, sawwa, etc., that mean designing, making, evolving, giving form, bringing into existence, constructing, growing, etc., into a single concept "creating" (= khalaqa) and in this way, turned a rich, complex and consistent concept system into a single concept which swallowed all the details of the original set of tightly related concepts.

As a result of this conceptual reductionism, many Muslims soon found themselves in a position to deny that substances had any essential properties, and by denying the principle of causality in any form, they had locked their minds in a concept system that made scientific explanations quite impossible. (Imagine the development of empirical sciences such as physics, chemistry and astronomy, without accepting the principle of causality or that the substances have characteristic properties.) Yet it is clear from the verses related with the word "amr" (= instruction) and a set of other related concepts in the Qur'an, that the properties of substances are manifestations of a set of instructions that make the substances themselves.

There are about 250 verses (= ayah) in the Qur'an in which the word "amr" (= instruction) and its derivatives take place. Some of these verses state that Allah rules the heavens and the earth with His "amr" (= instructions). Indeed, it is clearly stated in the verses that mention the movements of celestial objects, that they move in accordance with the instructions that had been revealed to (or loaded in) the heavens during their formation:

"... Then He decreed it (the heaven) as the seven heavens, and revealed in (or loaded in) each heaven its instruction." (041.012)

"The sun, the moon and the stars are subjected [to remain in their courses] (= musakharat) by His instruction." (007.054, 014.033, 016.012, 022.065)

From the grammar of the word "amr" that occurs in these verses, a conceptual framework emerges, which indicates that a set of instructions that are distributed in space, where they can be joined or dispersed. In this framework, it is understood that physical events take place in an order determined by the complex interactions, unifications and distributions of the instructions in the physical space. However, the cosmic order is not totally unchangeable and free from divine intervention. As can be seen from the verses related with the word "izn", Allah may intervene to any space-time regions with new instructions and can alter its physical properties according to His will. In this way He can bring about changes that would otherwise be impossible with the existing instructions in that space-time region. Similarly, He can also prevent the happenings that would have resulted from the existing instructions otherwise. Indeed, from the verses related with "amr" and "izn", it is clear that Allah intervenes with some events by means of new instructions which He "sends down" periodically. What we call "miracles" are also partly explainable in this framework. We say "partly", because we do not exactly know what can and cannot be materialized within the interactions of the existing instructions in a space-time region.

On the other hand, in verses related to the word "sakhara" (= make subject to / give under use or control) in the Qur'an, Allah states that He has "made subject to mankind whatever is in the heavens and the earth":

"Do you not see that Allah has made subject to you whatever in the heavens and the earth (= sakhara la kum)?" (031.020)

“He has made subject to you whatever is in the heavens and the earth, all from Him (= wa sakhara la kum ma fi-s samawati wa-l ‘ardi jami’an minhu); in this there are indeed signs for a nation who reflect (= qawmin yatafakkarun).” (045.013)

The words "amr", "izn", "sakhara", "sultan", "qadr" and "qada" constitute an extremely remarkable conceptual network in the Qur’an. We have made a detailed study on this subject, and we intend to publish this work soon in a book titled "Foundations of Scientific Thought in Islam".

The denial of the essential properties of substances and causal relationships between physical events by the theologians and Al Ghazali (1058-1111) in the 11th century, was criticized in detail by Ibn Rushd (1126-1198) in the 12th century 20. Ibn Rushd also took seriously the erroneous division of the concept of knowledge by the theologians, and wrote a book titled Fasl al Maqal. 21 In this book he tried to demonstrate the indivisibility of science and religion, both in logico-philosophical and legal terms (as he was both a philosopher and the chief justice of Cordova.) Despite his serious warnings on this matter in both of his books Fasl al Maqal and Tahafut al Tahafut, his work did not receive sufficient attention and understanding by Muslims of his times and of later centuries.

The continuation of conceptual disintegration in subsequent centuries resulted in the abandonment of the research tradition developed until the 12th century. (It would be impossible to do research in experimental sciences such as physics and chemistry, in an intellectual framework where causality and the essential properties of substances were denied.) The philosophical disagreement between Al Ghazali and Ibn Rushd was debated until the 15th century when the Ottoman sultan Mehmed II decided to resolve it by an academic debate. The sultan asked a committee to be set up from scholars to discuss the issue in a free spirit. This is related by Osman Turan, a 20th century Turkish historian as:

"Sultan Mehmed had gathered the scholars of the age around him. He wanted to resolve the disagreement between Al Ghazali and Ibn Rushd. For this reason, he formed a commission under the chair of Hodja-Zadeh who had publications in philosophy. However, because of the complexity of the matter for the scholars of that age, the problem remained unresolved, and the controversy between philosophy and religion continued." 22

However, after Sultan Selim I (1512-1520), the official educational policy in the Ottoman Madrasas gradually shifted towards the Ash’arites’ views from that of the Ma’turidi theologians who give more prominence to reason and rationality. As a result, interest in experimental sciences further declined in the Ottoman institutions.

The reasons for the failure of the Ottoman universities (= the Madrasa) in competing with the European universities in the field of mathematical and physical sciences, can be found in its mistaken views about philosophy and science. But at the root of this failure was the gradual abandonment of the research tradition developed by the early Muslim scientists. However, in contrast to their failure in physical sciences, sporadic technological achievements by the Ottomans continued until the 17th century, particularly in the military technology. The real Ottoman success was in political science and administration which can partly be explained by the effect of the Enderun (= the Royal College) which was independent of the Madrasa system. Additionally, the Ottomans followed the original political concepts of Islam to a certain extent, rather than that of the 14th century theologian Ibn Taymiyyah. This is another topic that needs a careful and detailed study in itself.

Despite their earlier technological successes, the poor performance of the Ottomans in science and philosophy in comparison to the developments in Europe, was noticed early enough by some Ottoman scholars. For example, Katip Celebi (1609-1657) in his Mizan ul-Hak, had alerted the Ottoman administration about the complete failure of the Madrasa in the study of physical sciences and mathematics by its dismissal of such sciences as "of philosophy", despite the clear advancements made in European universities in these sciences. But such warnings were bound to fail to produce real concern within the distorted concept system that had started to settle in the Muslims' minds a few centuries earlier.

6. Ottoman Initiatives for Renewal and Muslims in the 20th Century

By the turn of the century, the Ottomans started to feel deeply the decline in the structure of the state from economy to defence as a result of their failure in the fields of philosophy and science, and consequently in technology. However, the Ottoman administrators and intellectuals (with a few exceptions like Katip Celebi, Koci Bey and Ahmed Cevdet Pasha) tried to reverse the decline only by a series of social and political measures such as in the reforms of Islahat (= recovery), Tanzimat (= reorganisation) and Meshrutiyet (= constitutional reform). They could not see that their problems would not be resolved by mere political and social measures, as they were rooted much deeper in the conceptual plane, and were affecting the Muslims' motivation to science in an extremely negative way. The failure in science in turn, was deeply affecting, in an indirect way, the social and political structure and its institutions.

Finally, the educational campaign started by Sultan Abdulhamid II towards the end of the 19th century, which at times met serious opposition from Muslims themselves, proved to be insufficient. In the end, the Ottoman state left the stage of history following a decisive defeat by European states who commanded scientific and technological superiority in World War I.

The attempts to solve conceptual problems by political and social measures continued in the Turkish Republican Era. The political measures such as the cultural reforms like changing the alphabet and enforcing new outfits; the social reforms like the adoption of constitution and legal codes from the French, Swiss and Italian laws in the early period of the Republic (1923-1933); the industrialization campaign in the second stage (1950-1960); the heavy-industrialization phase in the third stage (1970-1975); and finally, the policies of liberalization and market economy (1980-1990), can be regarded as the continuation of the efforts in the same direction as started in the last century.

As the conceptual structure that was inherited from the Ottomans could not be changed by forceful social, legal and pseudo-cultural measures, the Republican policies of development, which themselves were infected with the same conceptual diseases albeit in the opposite end, have failed to yield real success. The positivist policies that were introduced in the Republican era, were aimed at suppressing Muslims in effect, if not in form, extinguishing the social motivation still further by straining the illicit "religion - science" division even further, instead of trying to resolve it.

The Republican governments failed to develop an effective "science policy" despite the illustrious slogans they produced about science. Until recent years, there was even no academy of sciences in Turkey, and the one that exists now looks at cultural affiliations rather than academic achievements as criteria for nominations. The reasons why there is still no proper academy of sciences, no ministry of science and technology, and no policy of science and technology in this country must be seriously investigated by the thinking people in this country.

7. Other Recent Attempts

About two decades ago some Muslim writers in the UK and Pakistan had initiated a programme which they called “Islamization of Knowledge”. These writers had proposed that the scientific theories developed by "Western" scientists be examined carefully and modified in accordance with the “Islamic viewpoint” (inevitably this will be their own viewpoint based on a concept system adopted from some relativist "Western" philosophers), and thus be "Islamized". Whatever underlying goodwill their proponents may have had in mind, we considered such attempts as spurious as they did not reflect any serious consideration of the conceptual problems involved. As could be expected, this project had ended in failure within a few years from its inceptions.

Another attempt came from the distinguished Muslim historian of science Nasr, who proposed mystical foundations for motivation in scientific research for the Muslim scientists of the future 23. Nasr claimed that the early Muslim scientists, particularly the alchemists were motivated by the prospect of gaining the knowledge of the hidden.

Both programs were bound to fail because of the errors they precluded. The proponents did not see that the decline of the Islamic civilization in the field of science, and consequently in many other fields of life, was the result of the Muslims' abandonment of their own research tradition, and deeper below, their loss of motivation for learning and research. No program can succeed before Muslims regain their motivation to science, which in turn, can happen only when the Muslims abandon the disintegrated concept system and reclaim the rich and consistent concept system in the Qur'an. Then only, will they begin to see reality as it is, and free themselves from the need of any slogans and the evasiveness of the shallow policies of change.

As to Pervez Hoodbhoy's24 comments: We have to take seriously his criticisms on the ill-formed notion of science of today's Muslims. However, we have also to consider the serious errors in his analyses, and will show how his analyses are based on a series of mistaken premises.

One of the serious errors of Hoodbhoy is his claim that science is a secular activity25. His claim is based on his mistaken views about the scientific motivation of Muslims during the 8-11th centuries. He seems to forget that the Muslims in that era conducted their studies and research within a certain concept system and the related understanding of "being human" that they learned from Islam. Early Muslims had great respect for truth (= haqq) and knowledge (= ‘ilm), and would stand all kinds of hardships for truth and knowledge.

If Hoodboy were correct in his claims that the Muslims scientists and philosophers in that era owed their success to a secular view of the world, why had other more secular societies not been able to demonstrate such a remarkable scientific activity around the same centuries? The fact is that Muslims in those centuries had an unrivalled position in terms of scientific activity, despite that they had never obstructed the activities of other cultures that lived with them.

Hoodbhoy correctly identifies that the "Islamic" countries are in a "crisis of science" down from popular to the administrative levels, and that this crisis has been causing them a complete destruction in many fields of life. However, he continues to state that, when examined carefully, it would be seen that this crisis is of political nature in its essence26.

We have only to remember that this diagnosis had already been given by the Ottoman and the Turkish Republican intellectuals and administrators much earlier, and that the political measures taken to that effect had until today, produced nothing but failure. We can see that deep in the heart of this crisis, there lies problems of conceptual nature, rather than political or economic problems.

Finally, after having described the tragic situation of today's Muslims as regards to science, Hoodbhoy finds the only solution for Muslims to initiate a scientific revival in a secular approach, with the acceptance of "science - religion" duality. We have just described how a similar approach fails in Turkey since the early days of the Republic. The reasons for the failure was first of all, that this approach was obstructing the Muslims' motivation to learning and to research in general, let alone their scientific motivation. Secondly and more importantly, the conceptual division entailed by such a secular approach, contradicts the unifying concept of knowledge and a host of related concepts in the Qur'an.

It would be naive to expect Muslims to have any sustainable motivation for scientific activity within such a contradictory and secular conceptual framework. Besides, any other motivation would be indistinguishable from anything within the existing framework of modern scientific culture which itself cannot offer any new solutions to the existing problems of modern science. Fame or the feeling of superiority, are the main motivations for science in the contemporary secular culture, but not any love for truth and reality. More scientists with the same motivations would not earn mankind any better world than this unjust world that we live in.

It should be noted that the conditions of contemporary world do not support isolated scientific activities, except in some extreme cases. This means that scientists are in much closer contact with each other than they were in the earlier centuries. Therefore Muslim scientists would do better if they focused on more basic problems of modern science, rather than on the particular problems of modern science, but only through a concept system which has perfect contacts with reality. This is because, at the roots of the illnesses of the contemporary culture, there lies a complex conceptual network inherited from the distant past, which has already begun to hinder further progress in science.

8. Back to the Future

We insist that, the Muslims' scientific revival in the future can be realized neither by Hoodbhoy's programme, nor with that of the other Muslim writers that he criticizes. Because none of them seem to be aware of the conceptual disintegration behind the problems, and how it obstructs the scientific motivation of the Muslims of today. These writers find the solution only in the changes of external factors with new additions to a disintegrated concept system. What drives men/women to learning is their motivation, which can only develop in a concept system that feeds it. We have seen a live historical example that it is possible to extinguish cognitive motivation in a concept system and the related cultural environment by introducing changes to that concept system, as that has happened to the Islamic civilization. The question now lingers is: How can we revive a disintegrated concept system which was in perfect order when it started? Is there any easy way of reconstructing a degenerated concept system?

We are not going to answer these questions in this study. This subject will be dealt with in detail in a book that we intend to publish soon. What we can say here is that, before Muslims realize the necessity for such a conceptual restructuring, they cannot give much hope for the future of Islamic civilization. Unless and until they regain the rich concept structure that they abandoned around the 11th century, the Muslims will not possess the cognitive motivation by which they can make real progress in science and other fields of civilization. Finally, we can also say that, if Muslims, in an effort which we can call going "back to the future", succeed in regaining the concept system that they began to abandon around the 11th. Century, then the history of mankind shall once again witness surprising scientific developments from the hands of Muslims, surprising even in the standards of the fast changing scientific and technological conditions of our time.


References

1. Russell, B. (1969). History of Western Philosophy. George Allen & Unwin, London, p. 420.

2. Sarton, G. (1927-48). Introduction to History of Science. Williams & Wilkins, Baltimore, Vol. 3, Chapter 5.

3. Huff, T.E. (1993). The Rise of Early Modern Science: Islam, China and the West. Cambridge U.P.

4. Hoodbhoy, P. (1992). Islam ve Bilim. (Turkish Tr.), Cep Kitaplari, Istanbul.

5. Scientific activity was institutionalized in the famous Bayt al-Hikma (= House of Wisdom) established by Abbasid caliphs in Baghdad in the 9th century, from where it quickly spread throughout the then Islamic world from Spain to Central Asia.

6. Kocabas, S. (1993). Elements of Scientific Creativity. Working Notes: AAAI Spring Symposium Series, 23-25 March 1993, Stanford, USA, pp. 39-46.

7) See: Rosenthal, F. (1970). Knowledge Triumphant. E.J. Brill, Leiden.

8) The figures in the parantheses indicate the numbers of the chapters and verses of the Qur'an. E.g. (006.108) means the 6th chapter, 108th verse.

9) Rosenthal, F. (1970). Knowledge Triumphant. E.J. Brill, Leiden.

10) See, Hamidullah, M. (1966). The Prophet of Islam. Tr. into Turkish by M.Said Mutlu. Irfan Yayinevi, Istanbul, p. 14. (The author relates this information from classical Islamic sources, Ibn Sa’d, Suhaili, and Ibn Hanbal.)

11. The reader is referred to the following sources for the Muslim contribution to early modern science:
- Sarton, G. (1927-48). Introduction to History of Science. Williams and Wilkins, Baltimore.
- Nasr, S.H. (1989). Islamic Science. Insan Yayinlari, Istanbul.
- Huff, T.E. (1993). The Rise of Early Modern Science. Cambridge U.P., Cambridge.
- Demirci, M. (1996). Beyt-ul Hikme. (In Turkish). Insan Yayinlari, Istanbul.
- Akin, O. and Desay, M. (1993). Five Great Scholars of Algebra. (In Turkish). MEB Yayinlari, Ankara.
- Leicester, H.M. (1971). The Historical Background of Chemistry. Dover, N.Y.

12. See: Leicester, H.M. (1971). The Historical Background of Chemistry. Dover, New York. p. 66.

13. Ibid, p. 66.

14. Ibid, p. 71.

15. Topdemir, H.G. (1991). Ibnul Heysem'in Optik Arastirmalari (= The Optical Studies of Ibn Haytham). Bilim, Felsefe, Tarih. No. 1, pp 187-190.

16. See, Huff, T.E. (1993). The Rise of Early Modern Science. Cambridge U.P, Cambridge. p. 56 and 58.

17. This view is also supported by Huff (1993).

18. See, Huff, T.E. (1993).

19. Al Ghazali in his book Al Munkiz min ad-Dalal states that logic and mathematics which are counted as philosophical sciences, have nothing to do, positively or negatively, with the religion. We do not argue about his intentions about this qualification, but observe that it is quite open to misunderstandings.

20) See, Averroes (1978, pp. 316-321). Tahafut al-Tahafut. Tr. by Simon van Den Bergh. Luzac, London.

21) Averroes (1976). Kitab Fasl al-Maqal (On the Harmony Between Religion and Philosophy). Tr. by G. F. Hourani. Luzac, London.

22) Turan, O. Turk Cihan Hakimiyeti Mefkuresi Tarihi. Vol I-II, p. 542.

23) Nasr, S.H. (1991). Islamic Science. Insan Yayinlari, Istanbul.

24) Hoodbhoy, P. (1992). Islam ve Bilim. (Turkish Tr.) Cep Kitaplari, Istanbul.

25) Ibid, p. 17.

26) Ibid, p. 21.

Tuesday, August 21, 2007

How Allah Directs Natural Phenomena

HOW ALLAH DIRECTS NATURAL PHENOMENA

Sakir Kocabas

Summary


In this work we look into how Allah directs what we call “natural phenomena”. Let us remind from the outset that our study is based on the ayahs (= verses) of the Qur’an. For a more detailed study, the sayings of Prophet Muhammad (s.a.w.) on the subject need also be taken into account. Yet, relying on the general principle that there can be no contradiction between the ayahs of the Qur’an and the sayings of the Prophet, we believe that the conclusions that can be derived from the Qur’an about this subject will be sufficient to draw a correct frame to start with.

Before we go on to explore the main subject of this study, we need to recall some of the ayahs in the Qur’an about how Allah, the Creator of the heavens and the earth, has established the order in the heavens and the earth and how He maintains it. For this reason we will first see the ayahs of the Qur’an, which state that Allah is the Real Ruler (= Malik al-Haqq). Secondly, we examine in some detail, the ayahs that express how Allah has established the order in the heavens and how He maintains it. Next, we attempt to bring clarity to the concepts of “physical phenomenon” and “natural phenomenon”. After these definitions we attempt to explore our main subject: How Allah controls and directs physical phenomena and natural phenomena. Finally, we end our survey with a summary of the conclusions.

1. Allah is the Real Ruler (= Malik al-Haqq)

In the Qur’an, there are more than 30 ayahs (= Qur’anic verses) that state that Allah is the Creator of the heavens and the earth.1 Some other ayahs state that Allah’s is the dominion of the heavens and the earth (= lahu mulk as-samawati wa-l ard). Moreover, Allah also states in the Qur’an that He is the Real Ruler (= Malik al-Haqq):

“Exalted is Allah, the Real Ruler; be not in haste with the Qur’an before its revelation to you is completed, but say: My Sustainer, increase me in knowledge.” (Ta-Ha 20/114)

We learn from this ayah an important name of Allah: Malik al-Hakk (= the Real Ruler). We need to dwell on the meaning of this name. Briefly, the above ayah expresses in a clear and succint way that whatever happens in the heavens and the earth happens under the direction and control of Allah. The following ayah on the other hand, states clearly that there is nothing in the heavens and the earth that escapes His knowledge:

“Allah is He who created the seven heavens, and of the earth the like of them [in number]; the instruction (= amr) descends through the midst of them [all]; that you may know that Allah has power over all things and that Allah has encircled all things with knowledge (= wa annallaha qad ahata bi kulli shay’in ‘ilma).” (Talaq 65/12)

“But the god of you all is Allah; there is no god but He; He surrounds all things with knowledge (= wasia kulli shay’in ‘ilma).” (Ta-Ha 20/98)

These ayahs state that Allah has encircled and surrounded everything with knowledge. Moreover, as stated in many other ayahs in the Qur’an, “He is well informed of the actions of His servants (= wallahu khabeerun bi ma ya’malun); “He sees their actions” (= wallahu baseerun bi ma ya’malun); “He hears and sees” (= innahu huwas sami’ul baseer); and as stated in Mulk/67, “He sees everything” (= innahu bi kulli shay’in baseer); in Fatir/38 “Allah knows the secrets of the heavens and the earth (= innallaha ‘alimul ghaybis samawati wal ard); and in the same ayah “indeed He knows the secrets of the hearts (= innahu ‘aleemun bi zatissudur); and in Yunus/10 “… nothing in the heavens and the earth that weighs as an atom (= misqala zarratin), or [anything] smaller or greater than that escapes from His attention (= wa ma ya’zubu ‘an rabbika), all [of this] are in an open book.” It is clearly understood from these ayahs that, all that happens in the heavens and the earth is like an open book to Him, and that nothing happens in the heavens and the earth outside His knowledge.

2. Allah’s is the Administration of the Heavens and the Earth

In the previous section we saw the ayahs which state that Allah is the Real Ruler, and that there is nothing that escapes His knowledge. Yet, His power is not limited to this, for Allah holds in His hand, the Administration (= malakut) of all things:

“Exalted is He in whose hand the is administration of all things (= fa subhan allazi bi yadihi malakutu kulli shay’); you will return to Him.” (Ya-Sin 38/83)

From these verses it is clear that Allah holds the Administration of the heavens and the earth, and He is the Real Ruler. The limited power that He gives to some people in this world for a determined period, is only by His will, and He takes it back when He will.

After we saw the ayahs that state that Allah is the Real Ruler of the heavens and the earth, we are faced with two questions: How does Allah rule the heavens and the earth? Can the human mind comprehend how He rules the heavens and the earth? At first sight these questions may seem to be impossible to answer, yet the answers to both questions can be derived easily from the Qur’an. The next ayah invites mankind to conduct observational and theoretical study on the Administration of the heavens and the earth:

“Have they not studied the administration of the heavens and the earth and what things that Allah has created? (= awa lam yanzuru fi malakut as-samawati wal ardi wa ma khalaqallahu min shay)” (A’raf 7/185)

Another ayah below expresses how Allah actually realizes the Administration, and that this is to be known by mankind:

“Allah is He who has created the seven heavens, and of the earth the like of them; the amr (= instruction) descends through the midst of them so that you may know (= li ya’lamu) that Allah has power over all things and that Allah has encircled everything with knowledge (= wa annallaha qad ahata bi kulli shay’in ‘ilma).” (Talaq 65/12)

In this ayah the word “amr” (= instruction/command) is a particularly important keyword for understanding the administration of the heavens and the earth. As can be understood from the ayah, Allah rules the heavens and the earth by His amr. But what is amr and what is its function in the administration? In order to understand this, we have to look into the ayahs in the Qur’an that this word and its derivatives occur. We attempt to explain briefly what this word refers to in the ayahs next.3

3. The Word Amr in the Qur’an and the Order in the Heavens

In the Qur’an, in relation to the creation and the administration of the heavens, the word amr appears in the ayahs in three principal frames:4

1) The amr that has been revealed into the heavens during their creation, by which the primary order has been established . (We call this the “primary amr”.)

2) The amr that is sent by Allah to influence and control the events in the world. (We call this the “secondary amr”.) With this amr, which He sends down by His angels and which is directly subject to His permission (= izn), Allah can change the current order in any region of space, and can create new and unseen events by it.5

3) The amr that will terminate the current order in the heavens and the earth, which in the Qur’an is called the amr of the Hour (= amr as-Saah).

We can now take a closer look at how the word amr takes place in these three contexts.

The order in the Heavens: The primary amr

The use of the word “amr” together with the words “sakhara” (= make dependent) and “qadr” (= measure) in the ayahs in the first sense above, is closely related with how the order has been established and maintained in the heavens. This is made clear by the ayahs which state that the seven heavens have been revealed in (or loaded with) their instructions with their creation, and that the states of the heavens and of the objects in them are maintained by this amr:

“And in two days He decreed (= qada) them [the heaven and the earth] as the seven heavens, and revealed in each heaven its instruction (= wa awha fi kulli samain amraha) ...” (Fussilat 41/12)

“The sun, the moon and the stars are all subjected [to remain in their courses] by His instruction (= musakharatun bi amrihi).” (A’raf 7/54, Ibrahim 14/33)

“And one of His signs is that the heaven and the earth stands with His amr (= an taqum as-samau wal ardu bi amrihi) ...” (Rum 30/25)

“Did you not see that Allah has made subject to you whatever is on the earth? Ships flow by His amr; He holds the heaven from falling on earth so that it would not fall, except by His permission (= izn); Allah is Most Kind and Most Merciful to mankind.” (Haj 22/65)

As can be seen, the order in the heavens is established and maintained by the (primary) amr that has been revealed in them by Allah. In this case, whatever happens in the heavens must happen in accordance with this amr, so long as there is no other intervention by Allah.6

This understanding leads us to an interesting concept of science, such that in this conceptual framework the aim of scientific investigation becomes understanding and explicating the structure and distribution of the instruction that has been revealed in the heavens by the Creator. An understanding of science as such, would not only explain many things about the order and harmony in the known space, but also would bring clarity to the issue of the creation and formation of the objects in the space. No cosmology developed to date has the basic concepts by which the extremely complicated, and yet excellent order that we observe from the micro-world to the macro-world, can be explained in a consistent way. How did, from a small number of basic physical forces, evolve the wonderously rich physical, chemical, biological and psychological interactions in the world? Is there a cosmic plan behind all this? These are the questions which occupy the minds of many scientists working in the fields of physics and cosmology.7

From the ayahs related with the primary amr in the Qur’an, we can infer that the order in the heavens emerge as a result of the interactions of the instructions (= amr) dispersed in all regions of space (= makan). In this case, the word “amr” (= instruction) emerges as a fundamental concept directly related with “being”. In information physics, the concept of information is used as a basic concept in explaining the degree of order (or the negative entropy) of physical systems.8 But there are categorical differences between the concepts of amr and information: It seems that the primary amr is a set of instructions which not only determines the order in a region of space, but also brings about what we call “matter” itself. From this, we can say that the concept of amr, unlike the concept of information, is a concept related with “being”, or in philosophical terms, is an ontological concept. (Indeed, in a number of ayahs the word amr occurs in close relation with the word kun (= be); see e.g. ayahs in Baqarah 2/117, Al-i Imran 3/47, Maryam 19/35, Mu’min 40/68, Ya-Sin 36/82).

At this point, we are faced with the question whether the primary amr is sufficient in itself or not, in maintaining the order in the heavens as an important question. Before we make a judgement on this issue, we need to consider the ayah:

“It is Allah who holds the heavens and the earth from collapse (= yumsiku-s samawati wal arda an tazula); if they should collapse, there is none, not one can hold them therafter; verily He is most Forbearing, oft Forgiving.” (Fatir 35/4)

This ayah brings several possibilities in mind. The first one is that the verb “holds” (= yumsiku) can be understood as “holds with His amr”, so that when the effects of the primary amr is obliterated by Allah, there would be no one other than Him to bring back the order. The ayah in Rum 30/25 that we saw earlier strengthens this possibility. The second one is that the heavens are protected from reduction or collapse, such that the word “tazula” which is a derivative of “zawal” (= reduction, collapse) may be pointing to such possibility.9

Another possibility is that, the cosmic order which has been established by the primary amr cannot go on indefinitely by itself, and that Allah maintains the order by His secondary amr. It is also imaginable that both possibilities can be the case. Other possibilities than what we said here in a theoretical and speculative framework need also be considered and investigated.

Allah’s intervention to the events in this world: The secondary amr

Let us continue with the relationships between the word amr and the order in the heavens. Since the order in the heavens has been established by the primary amr that has been revealed in them, one might think: If we have a complete understanding of the primary amr which have been revealed in the heavens, and consequently to all systems in them, we can understand all that happens in them. In this way we can possess complete knowledge about these happenings, and see the future. (This could be the final vision of the contemporary understanding of science.) Yet, as explained in some detail below, the problem is not as simple as this. The main reason is that the amr is not something that cannot be changed, and that its effects cannot be overridden once it has been revealed in these systems. An ayah which we saw earlier, explicitly states that other instructions are being (continuously or periodically) sent down by Allah:

“Allah is He who created the seven heavens, and of the earth the like of them [in number]; the instruction (= amr) descends through the midst of them [all]; that you may know that Allah has power over all things and that Allah has encircled all things with knowledge.” (Talaq 65/12)

Indeed, as will be seen when the conceptual frames of the words amr and all other related words (haqq, qadr, qada, izn, sakhara, sultan, ‘aql, and ruh), the amr is not something that consists of the primary amr. We understand from the ayahs in which the words amr, haqq, izn, qadr, and qada occurs, that the effects of the primary amr can be cancelled, overridden, or entirely new conditions can be created by new amr (the secondary amr) sent down by Allah.

The secondary amr sent by the Real Ruler is what is (in terms of the verbs used in the Qur’an in association with it) determined (mubrim), decided on (mustaqir), measured to a measure (qadaran maqdura), differentiated (yufraqu), directed or administered (yudabbir), sent (mursil), sent down (munzil), distributed (muqassimat), decreed (qada), and infused in (yulqi). The angels are sent down with this amr, and they descend through the heavens with it, and after the amr is obeyed (ata), applied/done (maf’ul) and completed (balagha), the amr ascends (ya’ruj) to Allah, and returns (yurji) to Him. The effects of this amr are sometimes made visible (zahara) to mankind. These verses indicate that the completion of the cycle of amr can be regarded both as periodical and continuous.

Some of the other ayahs directly related with the secondary amr are:

“… an amr from Allah’s presence …” (Maida 5/52, Duhan 44/5)

“Or do they determine the amr? We indeed are the determiner (= mubrimun).” (Zukhruf 43/79)

“…all amr have been decided on (= wa kulli amrin mustaqir).” (Qamar 54/3)

“[Allah] directs the amr from the heaven to the earth (= yudabbir al-amri min as-samai ilal ard); then it ascends (= ya’ruj) to Him in [part of ] a day the measure of which is thousand years in your count.” (Sajda 32/5)

“[Allah] sends / sends down the amr (= mursil/munzil).” (Duhan 44/5, Talaq 65/5)

“Allah’s amr is a measure measured (= qadaran maqdura).” (Ahzab 33/38)

“[A nigt] in which all wise [or mighty] amr are differentiated (= fiha yufraqu kulli amrin hakeem).” (Duhan 44/4)

“The angels and the Spirit (= Ruh) descend in that [night] by the leave (= izn) of their Sustainer from [or with] all amr.” (Qadr 97/4)

“When He decrees an amr, He says: ‘Be!’, and it is (= iza qada amran yaqulu lahu kun fa yakun).” (Baqara 2/117, Al-i Imran 3/47, Maryam 19/35, Mu’min 40/68, Ya-Sin 36/82)

“The amr of Allah has come (= ata amrullah) …” (Nahl 16/1)

“… the amr of Allah is done [or applied] (= wa kana amrullahi maf’ula).” (Nisa 4/47)

“… the amr of Allah has become visible (= zahara amrullah) …” (Tawba 9/48)

“Our amr is but a single [act] like the twinkling of an eye.” (Qamar 54/50)

“… Allah has power over His amr (= wallahu ghalibun ‘ala amrihi), but most among mankind know it not.” (Yusuf 12/21)

There are ayahs in the Qur’an indicating that the angels are given the task of applying the secondary amr. Some of these are,

“We [angels] descend only by the leave of your Sustainer (= wa ma natanazzalu illa bi amri rabbika)…” (Maryam 19/64)

“And to Allah bow all that is in the heavens and in the earth whether moving [living] creatures or the angels; for none are arrogant [before their Sustainer].” (Nahl 16/49-50)

“And they [the angels] speak not before He [speaks], and they act by His amr.” (Anbiya 21/27)

Another two ayahs related with the secondary amr, which particularly attract our attention are,

“Nor can a soul die except by Allah’s leave (= izn), the term being fixed as by wiriting…” (Al-i Imran 3/145)

“For each [person] there are [angels] before and behind him; they protect him from the amr of Allah (= yahfazuna min amrillah). Verily, never will Allah change the condition of a nation until they change what is in their soul; but when Allah wishes the punishment of a nation, there can be no turning it back, nor will they find beside Him any to protect.” (Ra’d 13/11)

As can be seen, the first ayah states no person/soul (= nafs) dies except by Allah’s leave (=bi iznillah) which is associated with an amr. This ayah also shows that all the physical conditions determined by the primary amr would not be sufficient to cause the death of a person, however the seem to be deadly. This issue is made clear by the statement in the last ayah, “they protect him form the amr of Allah” so that these protectors [angels], protect that person from the unbearable and deadly effects of the primary amr.10

All these ayahs clearly indicate that the primary amr that has been revealed in the heavens, may not explain every event that happens in the heavens and the earth, despite that the order in the heavens has primarily been established and is maintained by it. Still, we must not overlook the fact that the primary amr has a basic function in maintaining the cosmic order.

We also see from these ayahs that the decree and its application of the amr that overtakes or overcomes the primary amr in a region of space is totally dependent on Allah’s leave (= izn). This is very important, because without Allah’s leave, the primary emr continues its function, and all makan (= spaces) and all the objects in them continue to carry the properties determined by it. The following ayahs clearly state this,

“Did you not see that Allah has made subject to you whatever is on the earth? Ships flow(= tajree) by His amr; He holds the heaven from falling on the earth so that it would not fall, except by His permission (= izn); Allah is Most Kind and Most Merciful to mankind.” (Haj 22/65)

“… the sun, the moon and the stars are in subjection by His amr; verily in this are signs for a nation who use intellect.” (Nahl 16/12)

As can be seen from these ayahs, the properties and motions of the objects in the heavens are formed by the primary amr, and as long as Allah does not send another amr on them by His izn, their properties and motions will continue. The effective use of objects in the heavens and the forces in them by mankind requires studying these properties and the physical forces that determine these properties. We can say that the basic physical forces emerge as the result of the order (= mizan) that has been placed in the heavens.

We can acquire, by all the activity that we call “scientific research”, only the knowledge of the effects of the primary amr and the established order (= mizan). In the future, even if we should have an excellent knowledge of physics and computation, we can only have the knowledge of predicting the effects of the primary amr in a certain region of space.11 As we will explain shortly, when Allah interferes with His amr in any “natural penomenon” it would be impossible to make any reliable prediction about the processes of that phenomenon by physical methods, because it is impossible to know by any scientific method, when and where the secondary amr will take effect.

The ayahs related with this issue clearly shows that, even if we have a complete knowledge of the primary amr that has been revealed in the heavens, we would still not have a complete and absolute knowledge about the world. Even if we used all our scientific research methods, we cannot obtain even the trace of knowledge of the secondary amr that Allah may send by His leave (= izn), to cancel out or partially or completely change the effects of the primary amr in a region of space. This tells us that even when we possess an excellent science and technology, we should put our reliance and trust only in Allah.

***

We see in some of the ayahs of the Qur’an that a close relationship is made between using intellect and understanding cosmological events. Since the order in the heavens and the earth is the work of Allah, the study and research for understanding this great work should be a paramount duty for mankind, because in this way, the true might of Allah can be better understood and better appreciated.

Besides, these ayahs clearly motivate mankind to reason about the creation of the heavens and to understand the amr which lay beneath the cosmological events, and their effective use for the benefit of mankind. These ayahs also ask mankind to take lessons from such events, and guide them to realize that the life of the Hereafter which Allah promises is far superior to the life of this world. Some of the related ayahs are,

“Indeed, in the creation of the heavens and the earth; in the alternation of the day and night; in the ships which flow (= tajree) in the ocean; in the rain which Allah sends down from the sky and revives the earth with it after its death; in the beasts of all kinds that He scatters throughout the earth; in the redirection (= tasreef) of the winds, and the clouds which they trail like their slaves between the earth and the sky, are signs for a nation who use intellect.” (Baqara 2/164)

“In the alternation of the day and night; in the sustenance which Allah sends down from the sky and revives the earth with it after its death; and in the redirection (= tasreef) of the winds, are signs for a nation who use intellect.” (Jasiya 45/5)

“And such are the parables We set for mankind, but none use intellect on them except those who have knowledge (= wa ma ya’qiluha illal ‘alimun).” (Ankabut 29/43)

In the first two verses above, the word “ya’qilun” (= those who use intellect) refers to those who can see the connection between these ayahs and reality. The same term appears in the following verses which state that the life of the Hereafter is superior to the life of this world:

“… But the home [or life] of the Hereafter (= dar al akhira) is better for the righteous; will you not use intellect?” (A’raf 7/169)

“The things you have been given are but the provision and the glitter of the life of this world; better is Allah’s reward and more lasting. Will you not use intellect?” (Qasas 28/60)

On the other hand, the following ayah states that those who do not use intellect are like cattle, or even lower in guidence:

“Do you think that most of them listen or use intellect? They are like cattle, and even more misguided (= bal hum adall).” (Furqan 25/44)

In the ayah below, a surprising stetement appears: “Whatever is in the heavens” have been given to the use and benefit of mankind:

“[Allah] has subjected (= sakhara) to you whatever is in the heavens and the earth, all from Him; verily in this there are signs (= ayat) for a nation who reflect (=qawmin yatafakkarun).”

In this ayah the expression “all from Him” indicates that the use of all the objects and events, or all the physical forces that form and determine their properties and motions are given, without exception, potentially to all men who strive to understand them. In this ayah we also see that the word “sakhara” (= subjected to) is linked with the phrase “a nation who reflect”. This means that the effective use of the objects and the physical forces in the heavens and the earth will be accomplished by collaborative study and the use of intellect by people as nations, rather than as isolated individuals. This would require of course, a public orientation and participation.

The end of the order in the heavens: The amr of the Hour

The amr which is termed in the Qur’an as the amr of the Hour (= amr-us saah), is the instruction which will terminate the order established with the primary amr. As understood from the ayahs related with the Day of Standing (= yawm al qiyama) in the Qur’an, this amr will take effect in a day which will encompass Resurrection and the Day of Reckoning (= yawm al hisab). This subject is dealt with in detail in another study titled “The Day of Standing in the Qur’an and Traditions”, which we hope to have translated into English soon. After these explanations, we can now go on to the definitions of the terms “physical event” and “natural phenomenon” within this conceptual framework.

4. The Definition of “Physical Event”

After this brief inquiry into the ayahs related with the word amr, we can now attempt to provide a definition of “physical event” within this framework: A physical event or physical phenomenon, is an event which happens only within the context of the primary amr which has been revealed in the heavens with their creation. The characteristic feature of such events is that they are repeatable (or repeatedly observable) by humans in the laboratory and observation conditions. We can say that causality in physical events arises as a result of the order or symmetry (= mizan) laid with the primary amr. Causality can be said to be relevant only in the space of large (or macro) scale interactions. Symmetry is also in effect in most interactions in micro space, but here causality leaves its place to uncertainty due to some fundamental properties of light (= photons?) invariably used in the measurements.

The uncertainty in physical events arises in two categorically different forms: uncertainty in the microworld, and uncertainty in the macroworld. We stated that the first arises from the basic properties of light used in observations and measurements. The second type of uncertainty arises from the difficulties of determining the initial conditions of certain complex physical events. This can be called the statistical uncertainty. The physicists believe that many physical phenomena can be modeled by differential equations. In such models, the main problem is to determine the initial conditions, so that starting with these conditions at a time to, the equations would give the status of the event at time t1. In many physical events, the initial conditions are too complicated to know, but in many others, these are known within statistical limits.

5. The Definiton of “Natural Phenomenon”

The term “natural phenomenon” is mostly used for macro scale events observed in the world and in space. Solar and lunar eclipses, various meteorological events, and earthquakes are regarded as phenomena in this framework. In today’s understanding of science all natural phenomena are believed to be a composition of mere physical events. However, unlike accurately predictable events such as the solar and lunar eclipses, the unpredictibility of meteorological events and earthquakes have led the scientists to consider them in a separate class as “chaotic events”.

Based on this classification, causality in natural phenomena needs to be considered in two different frames. Space events such as the solar and lunar eclipses can be explained by the effects of the physical forces, such as gravity, which determine the orbits of the objects in space. But as the explanations of meteorological and tectonic events require taking into account of a number of different effects at the same time, a complete explanation of such an event becomes impossible. The term “butterfly effect” for meteorological phenomena has been coined by some scientists for this purpose. In meteorological and geological phenomena, many effects such as the particular spatial configuration of the planets in their orbits according to the earth and the sun, solar explosions (or “solar spots”), the impact of large meteors on the earth, and other space events can be at work together. All these effects contribute to the uncertainties in the predictions and explanations of such events.

6. Allah’s Intervention in Physical Events

Earlier, we saw from the ayahs in the Qur’an that Allah intervenes in physical events in any region of space and time by His secondary amr as He wishes (= yuridu/arada) to do so. When Allah’s amr comes to a space-time region, it may result in three different effects: 1) The obstruction or cancellation of the effects of the primary amr in the same region, 2) The strengthening the effects of the primary amr, 3) The emergence of an entirely new set of effects in the same region.

In the first case, Allah’s new amr (the secondary amr) interacts with the primary amr that occupies the same place so as to cancel or weaken its effects.

In the second case, Allah’s amr interacts with the primary amr so as to increase its effects, and in this way, it strengthens and/or focuses the current effects.

In the third case, Allah’s new amr brings about entirely new and previously unseen effects either by opening new space for itself, or by interacting with the primary amr in the same space.

7. Allah’s Intervention to Natural Phenomena

A number of ayahs in the Qur’an clearly describes examples of how Allah intervenes in and directs what we call meteorological and geological events. We shall see some of these ayas shortly. But first, let us consider what may happen when Allah intervenes in natural phenomena. In such cases we can think of four different effects: 1) Delaying or initiating the occurrence of the natural event in order to disperse its distructive effects which would otherwise be caused under the effects of the primary amr, 2) Focusing and directing the effects of the natural event, 3) Increasing the strength of the effects of the event, 4) Bringing about natural events with perceived effects of such kind as previously unseen. Let us now see some of the ayahs which exemplify these four different effects:

“Did you not see that Allah makes the clouds move gently, then joins them together and then makes them into a heap? Then you see the rain emerge from their midst; and He sends down from the sky mountain masses (of clouds) in which is hail; He strikes with it whom He will, and He turns it away from whom He will. The vivid flash of His lightning almost blinds the eyes.” (Nur 24/43)

In this ayah the expression “He strikes with it whom He will, and He turns it away from whom He will” exemplifies the effects of the first and the second type that we listed above. Some ayahs which could be taken as examples for the other two cases are,

“[The nation of] Aad, behaved arrogantly in the land with no just reason, and they said: ‘Who is mightier than us?’ Could they not see that Allah, who created them, was mightier than they? Yet they denied Our signs.” (Fussilat 41/15)

“So We sent against them a furious wind through the days of disaster that We might give them a taste of punishment of humiliation in this life; but more humiliating still, will be the punishment of the life to come. And they will not be helped.” (Fussilat 41/16)

“And the [nation of] Aad; they were destroyed by a furious wind, exceedingly violent.” (Haaqqa 69/6)

“[The nation of Samood] rebelled against the amr of their Sustainer; so the stunning noise [of a thunderbolt] seized them even while they were looking on.” (Zariyat 51/44)

The expression “stunning noise” can be regarded as an example of a previously unseen “phenomenon”. (How “natural” could it be regarded is another matter.) In another unusual “phenomenon”, the nation of Lot were destroyed by the collapse of the land swallowing the whole city with all its population, together with a hail of baked stones [lava stones or meteorites?] from above:

“[The angels] said: O, Lot!, we are messengers from your Sustainer; they [your nation] shall not touch you; depart with your kinfolk with the dead of the night, and none of you look back; as for your wife, she shall suffer the fate of the others. In the morning their hour will come. Is not the morning near?” (Houd 11/81)

“And when our amr came, we turned it [the city] upside down, and let loose upon it a shower of baked-stones spread layer on layer.” (Houd 11/82)

These ayahs describe, clearly with no need for more comments, how these nations were destroyed by what we may still call “natural events”.

At this point, regarding our current subject, we are faced with two extremely important questions: If Allah severely punishes sometimes entire nations, what could be the reasons for such punishments? Could it be known or predicted how such punishments will happen? The answers to these questions can be found in the Qur’an in ayahs where the word “sunnatullah” occur. In the Qur’an, the word “sunnatullah” occurs in reference to certain forms of conduct for people and nations. These laws are not changeable, not for even the Messengers of Allah:

“… no change will you find in the laws of Allah (= fa lan tajida li sunnatillahi tabdeela); and no turning off will you find in the laws of Allah.” (Faatir 35/43)

In the Qur’an these laws of conduct are also referred as “the law of the ancients [past nations]” (= sunnatul awwaleen) in some ayahs. The codes of these laws, or in other words the conditions for them to take effect can be found in the ayahs where the words “haqq” and “sunna” take place. We can summarize some of the conditions of these laws as follows:

- To behave arrogantly in the land with no just reason.

- To secretly devise evil plots.

- To kill the messengers of Allah, and those who instruct with equity.

- To make friends with the subjects of Satan (= shayateen) against Allah.

Let us now see the related ayahs. The first is related with the arrogants:

“[The nation of] Aad, behaved arrogantly in the land with no just reason (=fastakbaru fil ardi bi ghayri-l haqq), and they said: ‘Who is mightier than us?’ Could they not see that Allah, who created them, was mightier than they? Yet they denied Our signs.” (Fussilat 41/15)

The Aad paid in this world the due of their evil conduct by being destroyed by a furious storm:

“So We sent against them a furious storm through the days of disaster that We might give them a taste of punishment of humiliation in this life; but more humiliating still, will be the punishment of the life to come. And they will not be helped.” (Fussilat 41/16)

Another great sin which deserves punishment in this world is to secretly devise evil plots (= makr-us sayyia) against people:

“[Their] arrogance in the land and [their] plotting evil (= istakbaran fil ardi wa makr-us sayyia); evil plots will harm only their authors. Are they looking for other than the law of the ancients? “… no change will you find in the laws of Allah (= fa lan tajida li sunnatillahi tabdeela); and no turning off will you find in the laws of Allah.” (Faatir 35/43)

Related with those who devise evil plots, the following ayahs need also be considered:

“Do those who secretly devise evil plots (= allazina makaru-s sayyiat) feel secure that Allah will not cause the earth to swallow them up, or that the wrath will not seize them from directions they little percieve.” (Nahl 16/45)

“Or that He may not seize them in the midst of their going to and fro, when they cannot escape?” (Nahl 16/46)

“Or that He will not give them over to slow destruction? Yet your Sustainer is Compasionate and Merciful.” (Nahl 16/47)

The expression “your Sustainer is Compassionate and Merciful” can be uderstood as that Allah will save those who were wronged by the hands of the plotters of evil. It can also mean that Allah gives long respite to those who devise evil plots, beside openly warning them off by His words about the due results of their deeds, so that they may take heed and give up their evil deeds.

From these ayahs we understand that such plotters of evil are to be sternly punished in this world in four different ways:

1) Allah will bury them in the ground by a terrible disaster.

2) They will be seized by the wrath from a direction they hardly percieve.

3) They will be seized when they go about doing their business.

4) They will be subjected to a slow destruction.

The past nations which murdered Allah’s messengers, or forcefully drove them away from their homes have been destroyed according to these laws (see, e.g. Isra 17/76-77). Those people who unjustly murdered the individuals who instructed equity among them, have also been destroyed in accordance with these laws:

“As to those who deny Allah’s revelations, and slay the Prophets and slay with no just reason those who instruct with equity among mankind; announce them a grevious penalty.” (Al-i Imran 3/21)

“And We have destined for them intimate companions [shayateen]; who make past and future seem fair to them; well was the word justified against them [or: they deserved the fate] which overtook the parties (= umam) of the jinn and men who have gone before them. They shall assuredly be lost.” (Fussilat 41/25)

The expression “well was the word justified against them” (= haqqat ‘alayhim-ul qawl) in the last ayah can be taken to refer to the laws that have been applied to the ancients.

What we have discussed to this point are the laws by which mankind are punished in this world when they exceed the limits. We can now return to our second question above: Could it be known beforehand how the punishment will come into effect?

It is not possible to predict by which “natural event”, and in fact how sunnatullah will take effect. But since the limits of the conditions of these laws are given in the Qur’an, the pending disaster can be estimated by closely observing the behavior of the society, particularly the behavior of those who command and exercise power in it (e.g., its leaders, elites and the wealthy), as to whether the conditions are fulfilled or not.

There are many ayahs in the Qur’an about how Allah has punished the wrongdoing nations in the past some of which seem to be relevant to the conditions of our time:

“Many a cities have insolently opposed the command of their Sustainer and His Messengers, and we called them to a severe account (= hasabnaha hisaban shadeeda); We punished them with exemplary punishment.” (Talaq 65/8)

“When we decide to destroy a settlement, we first send our command to those of them who live in comfort; if they trangress, so that the word is proved true against them; then We destroy them utterly.” (Isra 17/16)

“We did not wrong them, but they wronged their own souls; when the amr of your Systainer comes (= lamma ja’a amru rabbuka), the deities they invoked other than Allah availed them nothing; they only hastened their ruin.” (Houd 11/101)

Lastly, apart from these ayahs, the following ayah is of great interest regarding what may be expected to happen in the future in this world:

“There is no city (= qarya) but shall be destroyed or sternly punished before the Last Day; that is decreed in the Book (= kana zalika fi-l kitabi mastura).” (Isra 17/58)

It would be the duty of all men and women who think, keep contact with reality and take heed, to be prepared as needs to be prepared before the truth of these ayahs become visible.

Conclusion

Many ayahs in the Qur’an declare that Allah is the Creator of the heavens and the earth, and their Real Ruler. He has encompassed and encircled all things with knowledge. Nothing in the heavens and the earth can ever escape His knowledge. Allah intervenses and directs with His secondary amras He wishes, what we call “natural phenomena” which normally happen within the framework of the primary order which He has established in the heavens by His primary amr. He punishes with them whom He decides among mankind, and turns the destruction away from whom He wishes. Allah has made clear by His laws termed as “sunnatullah” in the Qur’an, under what conditions He sends His punishment on people and nations. These laws are also somewhat related with the order (= mizan) which has been established by their creation. If the limits of these laws are well known, it can be sensed when they will take effect, but except as Allah will, it cannot be known exactly when and how they will take effect. We must strive to understand, by using all the means available to us, the happenings in the heavens and the earth and try to understand reality as a whole. When we do this study in a perfect manner and keep equity about ourselves and our position in this world at the same time, we would better appreciate the true might of Allah (= haqqa qadrihi). We will also realize, as has been written in the Qur’an, that the life of the Hereafter is much superior to the life of this world and will give direction to our lives accordingly. Allah knows the best of all things. Was-salaam.


Notes

1 We call what is termed in the Qur’an “the heavens and the earth” as “the universe”. The validity of the term “universe” is being questioned by some physicists such as David Deutsch, who would prefer the term “multiverse” instead. The repeated use of the expression “the heavens and the earth” in the Qur’an can be viewed as to stress the particular importance of the earth in the heavens, with its being the home of millions of different living species including the humans, and its surprisingly suitable conditions for the sustenance of life. This particular place of the planet earth in the universe has been of great interest to many physicists and cosmologists in recent decades. (See, Ref. 7-b)

2 The expression “yanzuru fi” in this verse refers to both observation and systmatic (theoretical) thinking, and muslim scientists in the Classical Era have derived the term “nazariya” (= theory) from the root of this verb “nazara fi”.

3 For more detailed explanations please see,

- Kocabas, S. “Islam’da Bilginin Temelleri”. Iz Yayincilik, Istanbul, 1997.

(An extended English version of this book is being prepared, and we hope to publish it in the future.)

4 The word “amr” occurs in the Qur’an in other frames than the two we have given here. See, Ref. 3 for details.

5 We must emphasize that the word “amr” is not classified in the Qur’an as “primary amr” and “secondary amr”. We have introduced this distinction from the differences of frames of the use of this word. But the appropriateness of this classification can even be seen from the personal pronouns that are used with the word “amr” in the Qur’an. What we call “the primary amr” corresponds to the uses in the ayahs where the word appears almost exclusively in the form “His amr” with the third person singular pronoun, and our term “the secondary amr” corresponds to the uses of the word as “the amr of Allah”, “Our amr”, and simply “the amr”.

6 Whether this amr can be understood as the “operating system” of the heavens, or as a kind of “software” loaded in the heavens, is a subject that deserves to be seriously considered.

7 See;

a - Davies, P. (1992). “The Mind of God”. Touchstone Books. New York.

b - Barrow, J.D. & Tipler, F. (1996). “The Anthropic Cosmological Principle.” Oxford: Oxford University Press.


8 For information physics, see: Stonier, T. (1990). “Information an the Internal Structure of the Universe”. London: Springer-Verlag.

9 The current theoretical framework about elementary particles involves the protection of the basic building blocks of material existence (e.g. protons and electrons) from decay. As an example, consider protons which are accepted to be one of the basic constituents of hydrogen atoms: Unlike free neutrons; protons can stay for a long period (at least 1030 s.) without decaying into lighter particles (mesons and leptons). Had there not been in effect a particular form of baryonic symmetry between elementary particles, there would be no atoms, and no living and inanimate objects as we know them in the world. The word “tazula” may be taken to refer to preventing such kind of collapse.

10 For detailed information on this subject, see Ref. 3.

11 Despite this, we believe that all scientific research in this direction need to be continued by all means, because only those who have knowledge can see the limits of current scientific knowledge, and can better appreciate the true might of Allah. Also, we need to remember the ayah: “... and say: Could those who know be like those who know not? ...” (Zumar 39/9)

Thursday, August 16, 2007

Integration of Research Tasks in Modeling Discoveries in Particle Physics

INTEGRATION OF RESEARCH TASKS IN
MODELING DISCOVERIES IN PARTICLE PHYSICS
Sakir Kocabas

Pat Langley
(langley @ cs.stanford.edu)
Robotics Laboratory, Computer Science Dept.,
Stanford University, Stanford, CA 94305 USA

Abstract:
This paper describes a discovery system, BR-4, which integrates several research tasks in modeling the discovery of certain quantum properties and conservation laws by physicists in this century. The program is directed by consistency and completeness constraints, and has the capabilities of theory formation and theory revision in its domain, and of explaining its knowledge state by these constraints . BR-4 is capable of formulating new elementary particles and particle reactions, and proposing observations to test their existence. The program revises its domain theory when it detects formal and theoretical contradictions, and when its domain theory conflicts with observational data.

-----------
* Also affiliated with ITU, Faculty of Space Sciences and Technology, Istanbul, Turkey.

** Also affiliated with the Institute for the Study of Learning and Expertise, 2451 High St., Palo Alto, CA 94301 USA.

1. Introduction
Computational modeling of discovery has been the focus of attention by several research groups in the last ten years, and a number of models with different capabilities have been developed. These capabilities include goal selection, experime nt design, data collection, expectation setting, quantitative reasoning, concept formation, hypothesis formation, theory formation, theory revision, explanation, and paradigm shifts by qualitative models. In current models only a few of these discovery tasks have been integrated in one system. The integration of discovery tasks continues to be a difficult problem in this reearch area of artificial intelligence.

The subject of this paper is an integrated discovery model BR-4, with the capabilities of theory formation, event prediction, data acquisition, explanation, and theory revision. Before we describe the system and its behavior, it is appropri ate to present some background information about its task domain, particle physics.

1.1. The Domain of Particle Physics

Particle physics studies the nature of elementary particles - the building blocks of matter - and interactions among these entities. The basic phenomena in this field take the form of reactions, similar in many ways to those found in chemistry. For instance, two such observed reactions* are

p + p --> p + n + pi
pio --> g + g


where the symbols p, n, pi, pio and g represent the proton, neutron, pion, pion-zero and gamma particles, respectively.

As in chemistry, physics require that reactions among elementary particles obey certain conservation laws. For instance, one of the most basic laws states that any such reaction conserve electric charge of the particles involved. Electric charge is an example of a quantum property, and one of the main tasks in particle physics concerns the assignment of values for quantum properties such that observed reactions conserve those properties. Thus, both of the above reactions conserve electric charge provided we assign the commonly accepted charges 1 to p, 0 to n, 1 to pi, 0 to pio, and 0 to g. Other assignments are also possible for this pair of reactions, but they would not be consistent with other observed particles.

-----------
* Typically, physicists infer the occurence of such reactions from tracks in cloud chambers and similar evidence. We will not attempt to model this inference process here, and instead will simply treat reactions as though they are directly observed.

The concern with conservation also explains why some particle reactions are never observed. For example, the process of beta decay,

n --> p + e + /nu,

in which a neutron decays into a proton p, an electron e, and an antineutrino /n , has been widely detected, in contrast, the decay of protons, as in the reactions

p -> pi + pio
p -> /e + g

has never been seen despite its inherent plausibility. All three reactions satisfy conservation of energy and electric charge, yet only the first occurs in nature. However, one can explain the absence of the other reactions by the existence of another quantum property, the baryon number, that must also be conserved and that these two reaction would violate. Thus, another central task in particle physics involves the explanation of unobserved reactions through the postulation of new qantum numbers.

Other activities include the postulation of new particles, either on theoretical or empirical grounds, and the prediction of reactions that satisfy known conservation laws. Testing such predictions leads into the realm of experimental particle phsics, which we will not address here. But the above pursuits cover a wide range of behaviors that occur in this scientific field.

The above analysis of the discovery tasks suggests that six basic operations play a central role in particle physics. First, one must have a representation to receive and evaluate data about domain objects and events, Second, for a given set of particles, quantum numbers and observed reactions, one must be able to determine a set of quantum values that satisfy conservation for those reactions. Third, one must have a mechanism to explain the currently observed and unobservable reactions in terms of the constraints of the model. Fourth, one must be able to posit new quantum properties that account for the absence of unobserved reactions. Fifth, one requires an operator that posits new particles and determine their role in known reactions. Finally, one must have some mechanism for predicting reactions that have not yet been observed, but which follow from the current theoretical model. We have incorporated these operators into BR-4, where they play a central role in the process of theory formation and revision. (We will refer to them as Read-Data, Determine-Values, Explain-Event, Posit-Property, Posit-Particle, and Predict-Reaction, respectively.)

Operators of this sort must alter some internal representation that contains hypotheses about the particles, properties, and reactions that exist. This representation can take many forms, but following Valdes-Perez et al. (1993), one can view it as two matrices. One matrix lists particles against quantum properties, with each matrix entry specifying the value for a specific particle on a specific prorperty. The other matrix lists particles against reactions, with an entry containing the total number of times the particle occurs in the reaction. In this light, the operator for determining quantum values alters entries in the first matrix, whereas each of the other three operators (Posit-Property, Posit-Particle, and Predict-Reaction) extends one or both matrices along one of their dimensions.

In the next section we describe the knowledge representation and the discovery operators of BR-4 together with its control structure in modeling several different discovery taks ith illustrative examples from particle physics. This will be followed by a discussion on the system's methods and proections for future work. The paper ends with a summary of the conclusions drawn from this research.

2. The System's Knowledge Representation and Behavior
In this section we describe the program's knowledge representation methods and its behavior in modeling certain discoveries in particle physics. The program uses a structured knowledge representation similar to qualitative schemas as in AbE (O'Rorke et al, 1990) and the other recent discovery models.

2.1. Knowledge Representation

BR-4's knowledge organization distinguishes descriptive and prescriptive knowledge. The former type of knowledge is represented as frames, and the latter as a series of operators and functions. The program has six operators which are named as follows: Read-Data, Determine-Values, Explain-Event, Posit-Property, Posit-Particle and Predict-Reaction.
The main data items of BR-4 are elementary particles and their reactions. Both are represented as frames in the system's knowledge base. Particle frames include the name of the particle, the quantum properties and their values. The general form of a particle frame is as follows:

frame: P (frame name)
class : particle
q1 : v1
q2 : v2
.......
qn : vn.

where P is the name of the particle, q1,...,qn the quantum properties, and v1,...,vn the corresponding quantum values, which can be -1, 0, or 1.
Particle reactions are represented in a similar way, this time containing information about the reactions, such as the particles involved, the reaction conditions, the physical status of the reaction, and its validity under the current theory. The general form of a particle reaction frame is as follows:

frame: reaction
class : physical event
actual status : A
logical status : L, logical_status(N,L)
reactants : R
products : P
active properties : Q, active_properties(N,Q)
reactants properties : Rp, reactants_properties(Q,Rp)
products properties : Pp, products_properties(Q,Pp)
conditions : (Rp = Pp) or (Rp =/= Pp).

where A indicates whether the reaction has been physically observed or unobserved, and L indicates whether the reaction is valid or invalid under the current theoretical knowledge of the system. R and P are the lists of the particles involved in the reaction as the reactants and the products respectively. Q indicates the vector of quantum properties that play an active role in the reaction, while Rp and Pp are the quantum value vectors of the reactants and the products. Normally, particle reactions are added to the program's knowledge base (e.g. for the reaction (n --> p + e + /nu) as follows:

frame: r1
class = reaction
actual status = observed
reactants = [n]
products = [p,e,/nu].

Such input reaction frames are then transformed into the form below by the Read-Data operator acting on the parent frame:

frame: r1,
class = reaction
actual status = observed
logical status = valid
reactants = [n]
products = [p,e,/nu]
active properties = [q0, q1]
reactants properties = [1, 0]
products properties = [1, 0]
conditions = {[1,0] = [1,0]}.

The amended slots are added after their values are calculated by the Read-Data operator. In this wa, the system's domain theory is built, onwhich BR-4's other operators act as described below in a control structure summarized in Figure 1.

___________
| Read Data | <-- new data
|___________|
| |
__|____|___ ___________
| |--->| Explain |
| | |_Event_____|
| | _____|_____
| |--->| Determine |<------
| Domain |<---|_Value_____| |
| Theory | _____|_____ |
| |--->| Posit |_______|
| |<---|_Property__| |
| | _____|_____ |
| |--->| Posit |_______|
| |<---|_Particle__|
| | _____|_____
| |--->| Predict |
|___________|<---|_Reactions_|


Figure 1. BR-4's general control structure in the
discovery of quantum properties

2.2. Theory Formation and Revision

The program starts with a simple domain theory about several particles and a small number of observable reactions. BR-4's theory formation activites are driven by its Explain-Event operator which acts on particle reaction frames, looking for reactions which cannot be explained with the system's consistency and completeness contraints. The consistency condition states that any observed particle reaction must be valid by the system's domain theory, where validity is defined as compliance with the quantum conservation laws. An inconsistent reaction in this sense, is unexplainable by the Explain-Event operator.

There are two heuristics for eliminating such contradictions. One is to revise the quantum values of particles in a depth-first search with backtracking through the space of values, until a consistent value set is found. The second heuristic is to introduce a hidden particle to balance the reaction, in either the input or the output, positing that it actually takes part in the reaction but for some reason is not directly observable. The system then computes the property values for this particle, identifying it with an already known particle, or creating an entirely new particle. The first heuristic is applied by the Determine-Values operator and the second one by Posit-Particle.

The completeness condition is defined over unobserved reactions. Any unobserved particle reaction must be violating some quantum conservation law. If the domain theory of BR-4 contains an unobservable reaction that does not seem to violate a quantum conservation law, then this is also an unexplainable event for the Explain-Event operator. This means that the system's domain theory is incomplete regarding the unobserved reaction. In such cases, the system's Posit-Property operator takes control, which posits a new quantum property also to be conserved in observed particle reactions, but not by the unobserved reactions. Determining the values of this property requires search, first for the particles in the missing reaction, and an embedded search for the values of particles in other reactions. This search is carried out by the Determine-Values operator, and as before, if the system arrives at a partial combination of values that rules out an observed reaction or fails to eliminate the unobserved one, it backtracks and considers alternative paths until it finds an acceptable set.

We can extend the notion of incompleteness to include theories that do not explicitly specify all reactions that follow from them, as occurs when BR-4's Posit-Particle postulates a ne particle. In this situation, the system's Predict-Reactions operator systematically generates all possible reactions (decays and collisions) of the ne particle involving one, two or three other known particles. For each such tentative reaction R, the program predicts that R will occur if it conserves all known properties.

3. Illustrative Examples From Particle Physics
In this section we describe the behavior of BR-4 on three examples of discovery fom the history of particle physics, involving the neutrino, baryon and lepton numbers, and electron and muon numbers.

Table 1. The quantum values of particles known prior
to the discovery of the neutrino.
-----------------------------------------------------
Particle mass charge spin

g 0.0 0 1
e 0.51 -1 1/2
p 938.26 1 1/2
n 939.55 0 1/2
/e 0.51 1 1/2

n 0.0 0 1/2
/n 0.0 0 1/2
-----------------------------------------------------

3.1. Discovery of the Neutrino

Until the early 1930's, scientists knew only a few elementary particles, shown in Table 1 along with their mass and their values on the three known quantum properties, energy, charge and spin. The known reactions were also limited to a small number:


p + p --> p + p
e + /e --> g
g --> e + /e


This situation changed after the discovery of the neutron in 1932, when experiments on beta decay revealed the reaction
n --> p + e

in which a neutron decays into a proton and an electron. However, this reaction was problematic in that it violated the conservation of energy and spin, with the total energy and spin counts unbalanced in the reaction. Rather than abandon the conservation law, physicists postulated the presence of a new particle,* also generated during beta decay, that would balance out the missing energy and spin. Although not visible in the reaction, they inferred the property values for this particle from the values for the other particles in the decay process. They concluded that this neutrino has zero rest mass, no electrical charge, and a spin of one half.

Given the reactions above and the quantum numbers in Table 1, BR-4 responds in a similar manner. The system's Explain-Event operator cannot explain the fourth reaction, as it detect passes control to Posit-Property. This operator considers to assign alternative spin values in an attempt to find a consistent set of values that would balance the reaction. But in this case, BR-4 is not allowed to modify the spin values, as these are assumed to be correctly established by observation. This leaves revision of the unbalanced reaction as the

------------
* In the early 1930's there were serious debates among physicists as to the validity of the conservation laws in the subatomic world.

Table 2. Particle reactions that were (a) observed and (b) not observed
in experiments after the introduction of the particles in Table 1.
-----------------------------------------------------------------------
a) Observed reactions b) Unobserved reactions

p + p --> p + p p --> /e + g
n --> p + e + /nu p --> /e + e + /e
/e + e --> g p --> /e + g + g
g + p --> e + /e + p
/nu + p --> n + /e
nu + n --> p + e
-----------------------------------------------------------------------

only solution as the control passes to the Posit-Particle operator, which adds an extra particle to the output side of the reaction, giving

n --> p + e + nu.

Using the conservation laws, Determine-Values computes the charge and spin of the new particle, nu, as 0 and 1/2 respectively. Another possible revision would have added a new particle with opposite properties to /n, to the input side of the reaction, but physicists favored the former solution as they were thinking in terms of a decay process.

However, the inclusion of the neutrino and its antiparticle leaves the theory incomplete, in that they imply reactions with other known particles. BR-4's Predict-Reactions operator finds no decays for the neutrino, but it does find three collision reactions that are consistent with the theory:

/nu + p --> n + /e
nu + n --> p + e
nu + /nu --> g

which are predicted to be observed in experiments. The first two of these were later detected by physicists. The third reaction has a very low probablity and is rather difficult to detect.

3.2. Proposing Baryon and Lepton Numbers

The discovery of the neutrino left physicists with seven elementary particles,* having the properties and values shown in Table 1. Physicists realized that the existence of these particles, combined with known quantum conservation laws, implied a variety of reactions. Subsequent observations revealed evidence for the predicted reactions in Table 2 (a) but not for those shown in Table 2 (b). For some reason, the three predicted decays of the proton did not occur in nature. To explain this, physicists proposed a new quantum property, known as the baryon number.**
--------------------
* The neutrino-antineutrino distinction as experimentally verified in the late 1950's.
** Stuckelberg proposed this new quantum property in 1938 as the protonic charge which was later to be called the baryon number.

Table 3. The quantum values for elementary particles known in 1953
after the discovery of baryon and lepton numbers.
--------------------------------------------------------------------
Particle mass charge spin baryon lepton

g 0.00 0 1 0 0
e 0.51 -1 1/2 0 1
p 938.26 1 1/2 1 0
n 939.55 0 1/2 1 0
/e 0.51 1 1/2 0 -1
nu 0.00 0 1/2 0 1
/nu 0.00 0 1/2 0 -1
mu 105.60 -1 1/2 0 1
/mu 105.60 1 1/2 0 -1
pi 139.60 1 - 0 0
/pi 139.60 -1 - 0 0
pio 135.00 0 - 0 0
---------------------------------------------------------------------

BR-4's Predict-Reactions operator proposes the same reactions, but the Explain-Event operator cannot explain the absence of the reactions in Table 2 (b). The program selects the first reaction, p --> /e + g, and turns it into a set of inequalities, each based on a different combination of values for the particles involved. In this case, it would generate the four ineualities

1 =/= 0 + 0
1 =/= 1 + 1
0 =/= 1 + 0
0 =/= 0 + 1

The Determine-Values operator then selects one of these value sets, say the first, p =1, /e = 0, g = 0, and tests them in the observed reactions, say n --> p + e + /nu, this time treating it as an equality, and obtains

n = 1 + 0 + /nu

which leaves the property values for n and /nu unspecified. Two value sets are possible for this pair, n = 1, /nu = 0 and n = 0, /nu = -1. The first value set is consistent with all the then known reactions, while the second set is inconsistent with the reaction nu + n --> p + e. At any point, detection of an unbalanced reaction that violates conservation of the new property causing backtracking to one of the alternative value sets. If the search exhausts all such sets produced from observed reactions, the system backtracks further and considers alternative value sets generated from the unobserved reactions.

Given the experimental results in Table 2, BR-4 arrives at the value zero for all particles except the proton and neutron, to which it assigns the value one. These settings correspond to those obtained by physicists for the baryon number, which successfully explain the absence of the reactions in Table 2 (b).

Alternatively, by using the value set in the third inequality above, BR-4 would propose another quantum property by assigning the following values to particles: p = 0, n = 0, /e = -1, g = 0, and e = 1. These values correspond to the lepton numbers of elementar particles (see, Table 3).

Table 4. Some particle reactions that were (a) observed and (b) not
observed in experiments after the discovery of baryon and lepton numbers.
----------------------------------------------------------------------
a) Observed reactions b) Unobserved reactions

pi --> /nu + mu mu --> e + g
/pi --> mu + /nu pi --> /mu + g
mu --> e + nu + /nu pi --> /e + g
/mu --> /e + /nu + nu
pio --> g + /e + e
pio --> g + g
pio --> e + e + /e + /e
----------------------------------------------------------------------

In 1935, Yukawa had proposed the existence of additional particles with the mass of about 100 MeV in the nucleus. The reasoning behind Yukawa's proposal, which we have not attempted to model, involved energy calculations on atomic nuclei. Later, in the 1940s, observations on cosmic rays revealed five such particles: the muon (mu) and anti-muon (/mu), the pion (pi) and anti-pion (/pi), and the pion-zero (pio), along with the property values in Table 4. Baryon and lepton numbers could explain the possibility and absence of the reactions of these particles in the 1950s. Some of these reactions are given in Table 5(a) and 5 (b).

3.3. Electron and Muon Numbers

With the discovery of the baryon and lepton numbers, physicists had produced a theory, involving 12 elementary particles and four quantum properties plus the relativistic masses of the particles, that was apparently consistent and complete. Table 3 reflects this state of physical knowledge. Some skepticisms remained, such as for the neutrino, which seemed very difficult to observe for theoretical reasons. However, in 1953, experiments revealed indirect evidence for the reaction

/nu + p --> n + /e.

Unfortunately, this reaction occurred when the anti-neutrino n had been generated through beta decay (n p + e + n ), but not when produced through muon decay (m --> e + nu + /nu).

To resolve this dilemma, scientists postulated that the two reactions actually generated two distinct types of neutrinos, calling the former an electron neutrino (ne) and the latter a muon neutrino (nu_mu). This distinction (and the analogous one for anti-neutrinos) introduced two additional rows in the table of particles. However, it also produced the unobserved reactions shown in Table 5(b), which physicists again sought to explain by introducing yet another property, which they named the electron number.

Our model cannot directly explain the historical distinction into two classes of neutrinos, but we believe it constitutes a variation on the heuristic for postulating new particles that originally led to inference of the neutrino. Once this distinction has been made, BR-4 realizes that its current theory is incomplete, in that it cannot explain the unobserved reactions involving the muon neutrino and its antiparticle. Postulating a new property, it searches the space of values using the same process as it used for the baryon and lepton numbers. The resulting values agree with those proposed by physicists for the electron number, but are not sufficient to rule out the unobserved reaction (pi --> /mu + g). Explanation of this omission requires introduction of yet another quantum property, this one corresponding to the muon number, which physicists postulated in 1962.

Table 5. Some particle reactions that were (a) observed and (b) not
observed in experiments after introducing distinction between electron
neutrinos (nu_e) and muon neutrinos (nu_mu).
----------------------------------------------------------------------
a) Observed reactions b) Unobserved reactions

pi --> /mu _ nu_mu mu --> e + g
/pi --> mu + /nu_mu /nu_mu + p --> n + /e
mu --> e + /nu_e + nu_mu nu_mu + n --> p + e

/mu --> /e + nu_e + /nu_mu pi --> /mu + g
pio --> g + e + /e pi --> /e + g
pio --> g + g
pio --> e + /e + e + /e
----------------------------------------------------------------------

4. Discussion of the Framework
Now that we have seen some examples of BR-4's operation, we can consider the implications of the model for research on scientific creativity, related work on scientific discovery, and some directions for future research on this topic.

4.1 Implications of the Model

Modern scientific research is one of the most complex human activities, requiring the use of different types of general and specific knowledge. It can also involve more than a dozen different search spaces ranging from scientific problem formulation through data collection and evaluation, to hypothesis formation, theory formation and theory revision (see, Kocabas, 1993). Within the research activities, different types of discovery and creativity can be distinguished as logico-mathematical, formal, theoretical and empirical discovery. Current computational models have shortcomings in capturing the details of historical discoveries for reasons described by Tweney (1990). However, this should not diminish their usefulness, as they can provide an overall look into the structure of the developments of theories both in their formation and revision processes. They can also be useful in analyzing the historical progress of scientific ideas and of the possibility of alternative ideas together with their implications.

In this study, our aim has not been to model the historical details of particle physics, but to show that certain computational mechanisms can account for theory formation and revision in this domain. The basic mechanisms in BR-4 -- search guided by heuristic knowledge -- bears close resemblance to those implicated in normal human problem solving, as studied by Newell and Simon (1972), as well as many others.

If correct, this view suggests that some of the creative activities in particle physics has much in common with everyday reasoning. However, modern scientific reasoning is much more reliant on logico-mathematical, theoretical and methodological knowledge than everyday reasoning in addition to empirical and commonsense knowledge. Unlike simple search spaces dealt with in everyday reasoning, it also has to deal with a number of different search spaces at the same time if it has to result in discoveries, or even to make progress at all (see, e.g., Klahr, 1994; Kocabas, 1993). Our model operates only in the spaces of empirical hypothesis and theory formation, event prediction, problem formulation and theory revision.

Previous models of scientific discovery, such as those described by Langley, Simon, Bradshaw, and Zytkow (1987), have taken a similar stance on the creative process. However, most such work has focused on limited aspects of scientific reasoning, such as the discovery of laws or the formation of structural theories.

With BR-4, we have attempted to cover a broader range of the discovery process within a unified framework. We described how the system formulates new problems whenever new data reveals its current theory to be either inconsistent or incomplete. In handling problems of inconsistency, BR-4 relies on depth-first search guided by algebraic and domain heuristics to explore the space of values for quantum properties, resorting to the postulation of new particles only if its search fails.

In dealing with incompleteness, the model predicts new reactions that follow from the introduction of new particles and posits new quantum properties to explain why some of these reactions never occur. The introduction of new particles and new properties constitute important examples of theory formation.

Our system does not provide a detailed account of the historical record, but it does explain several impressive discoveries at a more abstract level, using simple mechanisms of a familiar kind. This limited success provides further evidence that at least some types of scientific creativity does not require any special processes, but can be explained as a straightforward extension of existing theories of human cognition.

4.2 Related Work on Scientific Discovery

Our computational model of discovery draws many of its ideas from earlier work in this area. BR-4 is a direct descendant of Zytkow and Simon's (1986) STAHL, which modeled a variety of qualitative discoveries in the history of chemistry. The detection of inconsistencies in reactions played a central role in this system, with one of its responses being the introduction of new elements like phlogiston, which served much the same role in early chemistry as the neutrino did in particle physics.
Rose and Langley (1986) described STAHLp, a rational reconstruction of the earlier system that showed all of its discoveries could be explained in terms of inconsistencies and their resolution. In addition, they used the system to model a number of other reaction-oriented discoveries from the history of science. Moreover, their approach showed that dependency-directed reasoning simplified the theory revision process, letting their STAHLp handle problems with a search-control scheme that relied on simple hill climbing.

The BR-3 system, presented by Kocabas (1991), extended this framework to include the detection of incomplete theories, and the postulation of new properties to explain the absence of reactions. Kocabas applied this idea to the history of particle physics, using it to explain both the origin of several quantum numbers and the particular values assigned to them by scientists. In related work (Kocabas, 1994), he described another system TREV which formulates new particles and new reactions, but this system does not integrate these functions in its discovery process. BR-3 was the immediate precursor of BR-4, differing mainly in that the former lacked the ability to postulate new particles and to predict new reactions.

Valdes-Perez (in press) has described an alternative approach to discovery in particle physics, which he has implemented in the PAULI system. This scheme use a variation on linear programming to search the space of property values, subject to constraints that reflect observed and unobserved reactions. Also, Fischer and Zytkow (1992) have reported on GELL-MANN, a system designed to explain the formation of the quark theory, which also carries out a search through a space of parameter values subject to constraints.

A more general framework, proposed by Valdes-Perez, Simon, and Zytkow (1993), views the process of formulating structural models in terms of matrix operations. They show how many existing systems, including those described above, can be viewed in this light, with the basic operations involving the extension of a matrix along one or more dimensions and the revision of entries in the cells of the matrix. Our own BR-4 system also fits well into this framework, as suggested by our use of Valdes-Perez et al.'s terminology in Section 2.

Other research on theory revision seems less closely related. Rajamoney's (1990) COAST system designs experiments to distinguish between alternative structural models in physics, and Karp's (1990) HypGene uses a similar idea for biological theories. Kulkarni and Simon (1990) describe KEKADA, a computational model that integrates theory revision, experiment design, and problem formulation to model Krebs' discovery of the urea cycle. Shrager and Langley (1990) consider the relations among these systems in more detail.

4.3. Directions for Future Work

Although BR-4 provides an abstract account for some important developments in the history of particle physics, there remains considerable room for extensions to the model. One direction for improvement involves the notion of explanation. In some sense, the current system formulates explanations when it finds that a newly observed reaction is consistent with the existing theory or when it proposes a new property that rules out an unobserved reaction. However, BR-4 does not generate an explicit proof or other structure that connects assumptions and observations. In future work, we plan to model the explanatory process in more detail, with the system deducing the presence or absence of specific reactions from declara tive statements of quantum properties and conservation laws. In turn, this may let us recast BR-4's operators in terms of an abduction process (Ng & Mooney, 1990; O'Rorke et al., 1990) that modifies assumptions to explain known phenomena.

We also hope to extend the system to handle the introduction of componential models, which describe particles at one level as combinations of more primitive particles. Langley et al.'s (1987) DALTON took some initial steps along these lines to explain the relations between chemical molecules and elements, but we believe that we can adapt BR-4 to explain the origins of the quark theory and its alternatives. The basic task here involves explaining why elementary particles with some quantum properties exist and others do not. The constraints of consistency and completeness, which play such a central role in BR-4, seem well suited for this problem, which involves postulating new component particles (quarks), then searching the space of quantum values and their compositions that satisfy certain constraints (e.g., symmetry) for known particles and violate these constraints for nonexistent ones.

Finally, like most other models of scientific discovery, BR-4 ignores the interactions that occur among different researchers. Scientists cooperate along some dimensions, with theorists passing on predictions to experimentalists, who in turn report their observations to theorists. They also compete in developing theories to explain new findings, in discovering evidence for predicted events, and by noting errors in others' reasoning. The history of particle physics is rich in examples of such interactions, and we believe that some revisions to BR-4 will let us model some of them. In particular, we plan to assign different facets of the system's domain knowledge to different agents, which would communicate through a common representation; we will also let different agents explore different branches when search suggests alternative solutions.

5. Concluding Remarks
In this paper we presented BR-4, an abstract computational model of scientific discovery. We examined the system's behavior on three problems from particle physics, showing that it can replicate, though in a schematic way, important steps in the historical development of this field, some of which were considered major discoveries when first introduced. In particular, BR-4 proposes the existence of the neutrino to avoid violating conservation of spin, it invents baryon and lepton numbers to explain the absence of reactions involving proton decay, and it postulates electron and muon numbers to rule out unobserved neutrino reactions. In addition, the system can determine appropriate quantum values for each particle, and it can predict the reactions implied by a set of particles and quantum properties.

The BR-4 model accomplishes these feats using simple processes that play a central role in many aspects of human cognition. The system employs four basic operators for determining property values, creating new properties, positing new particles, and predicting reactions. Moreover, it uses consistency and completeness constraints to selectively apply these operators, and it incorporates depth-first control scheme to carry out search when necessary. The simplicity of these mechanisms, and their similarity to other processes observed in human behavior, suggest that one can explain some aspects of scientific creativity in similar terms.

References
Fischer, P., & Zytkow, J. M. (1992). Incremental generation and exploration of hidden structure. Proceedings of the ML92 Workshop on Machine Discovery.

Karp, P. (1990). Hypothesis formation as design. In Shrager, J., and Langley P., eds., Computational models of scientific discovery and theory formation. Morgan Kaufmann, San Mateo, CA.

Klahr, D. (1994). Extended abstract: Children, adults and machines as discovery systems. Machine Learning, 14, 313-320.

Kocabas, S. (1991). Conflict resolution as discovery in particle physics. Machine Learning, 6 , 277-309.

Kocabas, S. (1992). Elements of scientific research: Modeling discoveries in oxide superconductivity. Proceedings of the ML 92 Workshop on Machine Discovery. Aberdeen, Scotland. (pp. 63-70).

Kocabas, S. (1993). Elements of Scientific Creativity. In Technical Report: Artificial Intelligence and Creativity . AAAI Press, pp. 39-45.

Kocabas, S. (1994). Goal directed discovery and explanation in particle physics. In Working Notes: Goal Driven Learning, AAAI Spring Symposium Series. (pp. 54-61).

Kulkarni, D. and Simon, H. (1988). The processes of scientific discovery. Cognitive Science, 12, 139-175.

Langley, P., Simon, H. A., Bradshaw, G. L., & Zytkow, J. M. (1987). Scientific discovery: Computational explorations of the creative processes. Cambridge, MA: MIT Press.

Ng, ?. and Mooney, ? (199?). ...

O'Rorke, P., Morris, S. and Schulenburg, D. (1990). Theory formation by abstraction. In Shrager, J., and Langley P. eds. Computational models of scientific discovery and theory formation. Morgan Kaufmann, San Mateo, CA.

Rajamoney, S.A. (1990). A computational approach to theory revision. In Shrager, J., and Langley P., eds., Computational models of scientific discovery and theory formation. Morgan Kaufmann, San Mateo, CA.

Rose, D., & Langley, P. (1986). Chemical discovery as belief revision. Machine Learning, 1, 423-451.

Shen, W. M., & Simon, H. A. (1989). Rule creation and rule learning through environmental exploration. Proceedings of the Eleventh International Joint Conference on Artificial Intelligence (pp. ***--***). Detroit, MI: Morgan Kaufmann.

Shrager, J. and Langley, P. Computational approaches to scientific discovery.In Shrager, J., and Langley P., eds., Computational models of scientific discovery and theory formation. Morgan Kaufmann, San Mateo, CA.

Tweney, R.D. (1990). In Shrager, J. and Langley, P. (eds.) Computational Models of Scientific Discovery and Theory Formation. Morgan Kaufmann, San Mateo, CA.

Valdes-Perez, R. E. (in press). Discovery of conserved properties in particle physics: A comparison of two models. Machine Learning.

Valdes-Perez, R. E., Zytkow, J. M., & Simon, H. A. (1993). Scientific model building as search in matrix spaces. Proceedings of the Eleventh National Conference on Artificial Intelligence (pp. 472-478). Washington, DC: AAAI Press.

Zytkow, J.M. and Simon, H. (1986). A theory of historical discovery: The construction of componential models. Machine Learning, 1, 107-137.

Goal Directed Discovery and Explanation in Particle Physics

GOAL DIRECTED DISCOVERY AND EXPLANATION
IN PARTICLE PHYSICS
Sakir Kocabas

Department of Artificial Intelligence
Tubitak - MRC, PK 21 Gebze, Turkey

Abstract:
This paper describes a goal directed discovery system, TREV, which models the disvery of certain quantum properties and conservation laws by physicists between 1920 and 1960. The program is directed by completeness and consistency constraints, and has the capability of explaining its knowledge state by these constraints. TREV is capable of formulating new elementary particles and particle reactions, and proposing observations to test their existence. According to the results of such observations, the program can revise its knowledge base (e.g. its hypotheses about the particles), until it achieves a consistent and complete theory of its domain.

1. Introduction
Goal directed discovery has been the focus of attention by several researchers in the last ten years, and a number of computational models with different capabilities have been developed. Among these systems, BACON (Langley, Simon, Bradshaw & Zytkow, 1987), has the capabilities of data collection, quantitative reasoning and hypothesis formation; IDS (Nordhausen & Langley, 1993) and FAHRENHEIT (Zytkow, 1987) have the features of data collection, qualitative and quantitative reasoning, and hypothesis formation; GLAUBER (Langley, et al., 1987), concept formation and the discovery of qualitative laws; STAHL (Zytkow & Simon, 1986), STAHLp (Rose & Langley, 1986), REVOLVER (Rose & LANGLEY, 1986), concept formation (i.e., the componential models of chemical substances or quark compositions of elementary particles) and theory revision; MECHEM (Valdes-Perez, 1992) discovery of reaction pathways; AbE (O'Rorke, Morris & Schulenburg, 1990), theory formation, explanation and theory revision by using qualitative schemas; GALILEO (Zytkow, 1990), theory formation; KEKADA (Kulkarni & Simon, 1988), goal selection, hypothesis formation, experiment design, and expectation setting; COAST (Rajamoney, 1990) and ECHO (Thagard, P. and Nowak, G., 1990), theory formation, theory revision and paradigm shifts by qualitative models; and BR-3 (Kocabas, 1991), theory formation and theory revision.

The subject of this paper is a goal directed discovery model TREV, with the capabilities of theory formation, experiment design, data acquisition, explanation, and theory revision. Before we describe the system and its behavior, it is appropriate to present some background information about its task domain, particle physics.

1.1 The Domain of Particle Physics

Until the last decade of the 19th century, material substances were thought to be consisting of indivisible atoms. Towards the end of that century, experiments with cathode ray tubes revealed the first elementary particle (the electron), which was to be identified as one of the basic components of an atom. Early in the 20th century, other elementary particles, the proton and the neutron were discovered. Later, observations on cosmic rays revealed a number of other particles such as the muon, pion, kaon, the neutrinos and the lambda particles. There are now well over a hundred elementary particles known, some of which are listed with their quantum properties in Table 1. Most of these particles are unstable, and quickly decay into a series of lighter and more stable particles such as the electron and neutrino, and into gamma rays. For example, a neutron decays to produce a proton, an electron and an antineutrino; and a pion decays into an antimuon and a neutrino:

n --> p + e + /nu
pi --> /mu + nu.
Particles also interact with one another under natural and experimental conditions, producing other elementary particles or gamma radiation. These reactions are called "particle transmutations". An example to such interactions is the high-energy electron-proton collision, which produces a neutron and a neutrino:

e + p --> n + nu.

The theoretical possibility of such particle reactions depend on a series of quantum conservation laws. According to these laws, quantum properties such as spin, lepton number, electrical charge, baryon number, strangeness, energy, and momentum are conserved in particle decays and collisions. However, some quantum properties may not be conserved in certain particle reactions, (e.g., the strangeness property is not conserved in weak interactions.)

Table 1. Some elementary particles and their quantum properties. With the exception of gamma, each particle has an antiparticle with opposite quantum values. The antiparticles are indicated with a '/' in the text (e.g. as in /n for anti-neutron).


----------------------------------------------------------------
electrical lepton baryon spin strangeness
charge number number
----------------------------------------------------------------
gamma 0 0 0 1 0
nu 0 1 0 1/2 0
mu -1 1 0 1/2 0
tau -1 1 0 1/2 0
e -1 1 0 1/2 0
pi 1 0 0 0 0
pi0 0 0 0 0 0
k 1 0 0 0 1
k0 0 0 0 0 1
p 1 0 1 1/2 0
n 0 0 1 1/2 0
----------------------------------------------------------------

1.2 Theory Development in Particle Physics

The earliest known laws about elementary particle reactions were the energy and charge conservation laws. The law of the conservation of charge can be stated as follows: The sum of the charges of the initial particles entering a reaction is equal to the sum of the charges of the final particles. The following reactions conserve electrical charge and have been "observed" by physicists:

p + p --> p + n + pi
pi0 --> gamma + gamma

where p, n, pi, pi0, and gamma designate the proton, neutron, pion, pion-zero and gamma particles respectively. It has been known since early this century that the proton and electron have opposite and unit electrical charges. The neutron has been known to be unstable, decaying into a proton, an electron, and an antineutrino in what is called "beta decay", or

n --> p + e + /nu
but a proton decay has never been observed, and the stability of this particle had puzzled the physicists. Why does it not decay into lighter particles? Reactions such as

p --> pi + pi0
p --> /e + gamma

never happen despite the fact that they apparently obey the charge conservation law. A theoretical framework based only on the charge conservation law would not be capable of explaining the absence of these reactions. In other words, such a theory would be incomplete concerning particle reactions.
The discrepancy between the theoretically valid and physically observable reactions was a conflict that had to be resolved. Physicists resolved these conflicts by postulating new quantum properties and conservation laws, so that theoretically valid but physically unobservable reactions were rendered theoretically invalid by these laws (see, Omnes, 1970; Griffiths, 1987). In this way the absence of these reactions were explained by their violation of the conservation of the new quantum property. The next problem was to find the quantum value distribution of the new property over the elementary particles.

=====================================
To illustrate how such conflicts were resolved, let us consider a reaction which conserves electrical charge but has not been observed

p --> pi + pi0.

Let us assume that this reaction violates the conservation of a new property (e.g., the "protonic charge"). Now, if we arbitrarily assign the new charge value to the proton as one and assume that the other particles, pi and pi0, do not have this charge (i.e., they both have zero protonic charge), then the reaction would be unbalanced by the new charge (i.e., 1 =/= 0 + 0). This would explain why the reaction had never been observed. Nevertheless, the value set [1,0,0] is not the only one that makes the reaction unbalanced, as the values [0,1,1], [0,1,0], [0,0,1] and [1,1,1] produce the same effect.
On the other hand, the new quantum values make some observed reactions unbalanced, as in

p + p --> p + n + pi
p + /pi --> n + pi0

These reactions conserve electrical charge, but not the "known" values of the new charge. This can be seen by substituting the protonic charge values:

1 + 1 = 1 + n + 0
1 + /pi = n + 0

This suggests that some of the other particles in these reactions must have nonzero protonic charge. Here, if we assign the protonic charge value of one to the neutron and zero to /pi, the reactions would be balanced. However, other valid and observed reactions may conflict with the assigned values, and we may have to revise some of the assumptions about the protonic charge values of particles accordingly.

+++++++++++++++++++
TREV, like its predecessor BR-3 (Kocabas, 1991) rediscovers the quantum properties in the same way as explained above. As the program's goal is to achieve a consistent and complete knowledge state, it postulates new hypotheses, and revises its domain knowledge until it achieves its goal state. In this way TREV models the discoveries of the lepton, baryon, electron, and muon number properties in particle physics. Apart from its theory formation and theory revision capabilities, the program has also the ability of proposing experiments and providing explanations for its assumptions about its domain objects.

In the remaining part of this paper we first present an overview of the system, and describe its behaviour in modeling the discoveries of the quantum properties, in proposing experiments, and in providing explanations. This is followed by a comparative discussion on the system's research goals, knowledge representation, theory revision and search methods, and its generality. The paper concludes with a summary of the results.

2. The System's Knowledge Representation and Behavior
The program uses a structured knowledge representation similar to qualitative schemas as in AbE (O'Rorke et al, 1990) and the other recent discovery models. This structured representation facilitates the system's identification of problem states such as incompleteness and inconsistency. Therefore we begin with describing the knowledge representation methods of TREV in some detail.

2.1 Knowledge Representation

TREV's knowledge organization distinguishes descriptive and prescriptive knowledge. The former type of knowledge is represented as frames, and the latter as a series of operators and functions. The program has nine operators which are named as follows: 'evaluate', 'check-consistency', 'check-completeness', 'postulate-properties', 'revise-hypotheses', 'find-quantum-values', 'formulate-new-particles', 'formulate-virtual-particles', and 'formulate-reactions'. The program also has a similarity based learning (SBL) module.
The main data items of TREV are elementary particles and their reactions. Both are represented as frames in the system's knowledge base. Particle frames include the name of the particle, the quantum properties and their values. The general form of a particle frame is as follows:

frame: P
class = particle
q1 = v1
q2 = v2
...
qn = vn.

where P is the name of the particle, q1,...,qn the quantum properties, and v1,...,vn the corresponding quantum values, which can be -1, 0, or 1.
Particle reactions are represented in a similar way, this time containing information about the reactions, such as the particles involved, the reaction conditions, the physical status of the reaction, and its validity under the current theory. The general form of a particle reaction frame is as follows:

frame: reaction
class = physical event
actual status = A
logical status = L, logical-status(N,L)
reactants = R
products = P
active properties = Q, active-properties(N,Q)
reactants properties = Rp, reactants-properties(Q,Rp)
products properties = Pp, products-properties(Q,Pp)
conditions = (Rp = Pp) or (Rp =/= Pp)

where A indicates whether the reaction has been physically observed or unobserved, and L indicates whether the reaction is valid or invalid under the current theoretical knowledge of the system. R and P are the lists of the particles involved in the reaction as the reactants and the products respectively. Q indicates the vector of quantum properties that play an active role in the reaction, while Rp and Pp are the quantum value vectors of the reactants and the products. Normally, particle reactions are added to the program's knowledge base (e.g. for the reaction n -> p + e + /nu) as follows:

frame: r1
class = reaction
actual status = observed
reactants = [n]
products = [p, e, /nu].

Such input reaction frames are then transformed into the form below by inheritance from the parent frame:

frame: r1,
class = reaction
actual status = observed
logical status = valid
reactants = [n]
products = [p, e, /nu]
active properties = [q0, q1]
reactants properties = [1, 0]
products properties = [1, 0]
conditions = {[1,0] = [1,0]}.

The amended slots are added after their values are calculated by the 'evaluate' operator.
TREV has two operators, 'check-consistency' and 'check-completeness', which can identify the problem states (inconsistency and incompleteness) about reactions.

The 'check-consistency' operator can decide whether the information in a reaction frame is consistent or inconsistent with the system's knowledge, by the following rules:

If R is a reaction,
and its actual status is o b s e r v e d,
and its logical status is v a l i d,
then R is consistent with the system's knowledge base.

If R is a reaction,
and its actual status is o b s e r v e d,
and its logical status is i n v a l i d,
then R is inconsistent with the system's knowledge base.

The check-completeness operator on the other hand, can also decide whether a reaction is explainable within the system's current knowledge, i.e., why the reation is physically observable or unobservable. In other words, the program can decide whether its knowledge concerning a particle reaction is complete or incomplete. The completeness rules are as follows:

If R is a reaction,
and its actual status is u n o b s e r v e d,
and its logical status is i n v a l i d,
then the system's knowledge base is complete regarding R.

If R is a reaction,
and its actual status is u n o b s e r v e d,
and its logical status is v a l i d,
then the system's knowledge base is incomplete regarding R.

The program checks its knowledge about reactions for consistency and completeness every time it is presented with a new set of data, and tries to achieve a consistent and complete knowledge state. In this, TREV uses a a control structure employed by its predecessor, BR-3 (Kocabas, 1991). Figure 1 summarizes the system's control structure. Accordingly, TREV first checks for consistency by using the above rules over its reaction frames, and reports inconsistent reactions to a message list. Inconsistent reactions are observed reaction that do not conserve a certain quantum property in the program's knowledge base.

An inconsistency report in the message list activates the 'revise-hypotheses operator'. This operator modifies the system's knowledge about the particles' quantum property values by first turning the inconsistent reactions into algebraic equations and finding sets of alternative quantum values for the particles appearing in these reactions. Since there are only three possible quantum values, namely -1, 0 and 1, modifications alternate between these values. Each vallue set is tried until the consistency constraints are satisfied.

On the other hand, after consistency has been achieved, but TREV cannot explain why a certain unobserved particle reaction is impossible, the program posts an incompleteness message to the message list. This in turn, activates the 'postulate-property' operator, which postulates a new quantum property. The program adds the new quantum property to a new slot in the particle frames with the default values of zero.

The 'find-quantum-values' operator turns the unobserved reaction formula into an algebraic inequality, and finds a set of quantum values for the particles in the formula. E.g. for the unobserved reaction p --> /e + gamma, the inequalities

0 =/= 0 + 1
0 =/= -1 + 0
1 =/= 0 + 0
1 =/= -1 + 0
1 =/= -1 + 1

are generated by the program. Each of these inequalities represent a set of quantum values for the new property, which enable TREV to explain the absence of the reaction. The first quantum value set (p=0, /e=0, gamma=1) is assigned to the particles first. However, the new values must be consistent with the system's knowledge of elementary particles and their observed reactions. To secure this, the quantum values for the new property are assigned to other particles, such that its conservation is satisfied in the observed reactions. The check-consistency operator checks if the new values are consistent, and the revise-hypotheses operator revises them as necessary. This cycle continues until the system achieves a consistent and complete knowledge state.

inconsistent revise check
knowledge ---> hypotheses ---> consistency
state and completeness


incomplete postulate find check
knowledge ---> new ---> quantum ---> consistency
state properties values


consistent and
complete ---> stop.
knowledge state

Figure 1. TREV's general control structure in the discovery of
quantum properties.

2.2 Formulation of New Particles

The program can define new particles by making modifications on the values of quantum property slots of existing particle frames. For example, from the neutron's frame

frame: n (neutron)
class = particle
q1 = 0 (electrical charge)
q2 = 0 (lepton number)
q3 = 1 (baryon number)

a new particle can be defined by changing the q1 value to -1 to obtain
the particle

frame: p1 (proposed particle)
class = proposed particle
q1 = -1 (electrical charge)
q2 = 0 (lepton number)
q3 = 1 (baryon number)

which, incidentally corresponds to anti-proton. The program proposes to make observations to check whether such postulated particles exist in nature. The important point about this exercise is that certain quantum property combinations never exist (e.g. particles having nonzero baryon and lepton values at the same time.) In fact, this observation had led to the development of the quark theory in particle physics in the 1960s.

After observations, if the proposed particle has been decided not to exist in nature then it is recorded as nonexistent particle e.g. as

frame: np1
class = nonexistent particle
q1 = v1
q2 = v2
q3 = v3

From its accumulated knowledge about existing elementary particles, TREV can construct hypotheses about the nonexistence of certain quantum value combinations, by an inductive method called exclusion based learning (Kocabas, 1989). These hypotheses state that particles with certain quantum property value combinations cannot exist. TREV can modify its exclusion hypotheses in view of the new knowledge about elementary particles. As soon as a new particle frame is created, the program checks its exclusion hypotheses to decide if the quantum values of the particle contradicts a hypothesis. If it does, the individual hypothesis is removed.
The exclusion hypotheses are added to the system's knowledge base as frames:

frame: ep1,
class = excluded q-composition
q1 = v1
q2 = v2
q3 = #

which means that the quantum values v1 and v2 for the properties q1 and q2 respectively, cannot be possessed by an elementary particle.

2.3 Formulation of Virtual Particles and New Reactions

The program formulates particle decays and collisions by first defining a set of 'virtual' particles. These are formulated simply by adding the vectors of quantum property values of two or three particles. An example to such virtual particles is the one that is formulated by adding the quantum values of the proton [1,0,1] and electron [-1,1,0], resulting in a proton-electron virtual particle with the quantum values of [0,1,1].

proton electron (proton-electron)

[1,0,1] + [-1,1,0] = [0,1,1]

In this way, a virtual particle with zero electrical charge, and with lepton and baryon numbers of 1 is defined. Such virtual particles are used in constructing particle decay and collision reactions. One such possible construction can be a neutron decay:

n --> p + e

which, incidentally, is not a valid reaction, because it does not conserve the quantum values of lepton property, as quantum value vectors of the reactants and products are not equal, i.e., [0,1,0] =/= [0,1,1]. On the other hand, the reaction, which is obtained by using the neutron and the virtual particle proton-electron- antineutrino (p,e,/nu),

n --> p + e + /nu


is a valid and observed reaction as it conserves all the three quantum properties, electrical charge, lepton and baryon numbers with the quantum value vectors of both sides being equal, i.e [0,1,0] = [0,1,0].

Testing the reactions proposed by TREV may lead to the discovery of new quantum properties. If a proposed reaction is valid by the program's knowledge of quantum values, but cannot be observed, then this creates an incompleteness problem for the program. As has been described above, in such cases TREV postulates a new quantum property and tries to find a consistent and complete set of values for particles regarding the new property.

2.4 TREV's Methods of Explanation

The program uses its structured knowledge representation for producing explanations about the objects and events of its domain. Explanations are provided when the system is in a consistent and complete knowledge state.
The program can explain why a certain proposed particle reaction is consistent or inconsistent with the system's knowledge about particle physics. In this type of explanations, TREV uses the definition of consistency over the reaction in question.

The consistency (or validity) of a certain proposed reaction is explained by proving that the reaction conserves the quantum values that the program knows. If the reaction does not conserve these quantum values, then it is not inconsistent (or invalid). Consistency (or validity) of a reaction can easily be decided by checking its 'logical status' slot, or by calculating the quantum value vectors of the reactant and the resultant particles and by comparing them. For example, the reaction n --> p + e + /nu is consistent because the 'actual state' slot of the reaction's frame says that the reaction has been observed, and the 'logical status' slot says it is valid. If the reaction frame does not have such a slot, then the 'check-validity' operator fires, which in turn finds if the reaction conserves the known quantum properties.

TREV can explain why a certain reaction is not observable by proving that it violates the conservation of a quantum property that it knows. Also, by using its completeness constraints, the program can explain why the impossibility of a certain unobserved reaction is or is not explainable within the program's domain theory. When the program cannot explain the absence of such a reaction by its domain theory, then it concludes that its knowledge about elementary particles is incomplete concerning the unobserved reaction. As has been described above, TREV resolves such problem states by postulating a new quantum property.

On the other hand, the program can also explain why there can be no particles with a certain set of quantum properties, by using its exclusion hypotheses for such explanations. For example, the exclusion hypothesis

frame: ep1,
class = excluded q-composition
q1 = 1
q2 = 1
q3 = #

explains why there cannot be a particle with the quantum values of q1=1, q2=1, and q3=0.
The system's explanatory power increases as it discovers new quantum properties, and as the particle descriptions become more detailed by including new quantum property slots and values.

TREV can learn to explain consistency and completeness by its similarity based learning (SBL) module. In learning a concept (e.g. 'consistent'), the SBL module compares the positive instances of the concept (i.e. valid and observed reactions), and creates the definition of the concept. The system's consistency and completeness rules are created in this way.

3. Discussion on the System's Methods
TREV is a system that combines several features of a discovery model. Every discovery system, by definition, must have the ability to learn. The program has three distinct types of learning ability, namely inductive learning and learning by discovery. As described above, TREV learns its consistency and completeness constraints by similarity based learning, and its exclusion hypotheses, by exclusion based learning methods. The program also constructs its domain theory with its ability to learn by observation and by discovery. The former involves the formulation of new particles and reactions, and their subsequent comparison with the physical world. The latter takes place by postulating new quantum properties and assigning a set of corresponding quantum values to the particles.

An important feature of a discovery model is theory development, which itself can be divided in two tasks as theory formation and theory revision. TREV extends its domain theory by using its learning and discovery abilities, by adding exclusion hypotheses, by formulating its consistency and completeness constraints, and by postulating new quantum properties when faced with an incomplete knowledge state. When it is faced with an inconsistent knowledge state, the program revises its domain knowledge (i.e. knowledge about particles and their reactions) by using its consistency constraints together with general algebraic constraints.

In its theory development and theory revision activities based on the consistency and completeness constraints, the program works in a coordinated way. However, the system's other task operators work independently and in an uncoordinated way. For example, the 'evaluate', 'formulate-new-particles', 'formulate-virtual-particles', and 'formulate-reactions' operators are fired by an external agent (e.g. a user) independently. Similarly, explanation generating functions of the system are called on user demand and for specific purposes, such as in explaining why a particular is unobservable.

Also, the operators which formulate new particles and reactions are not constrained by domain dependent and general constrains. Hence, they operate in a relatively large search space. As a result, these operators can formulate uninteresting domain objects as well as the interesting ones.

TREV's explanation functions take advantage the system's structural knowledge representation. The explanations provided are simple, and do not go deeper into the system's domain theory. However, the program can be improved in this direction.

The program's ability to fromulate new objects means that it has the ability to propose observations to decide whether the formulated objects (i.e. elementary particles and reactions) exist in nature. Observation results are entered by the 'user'. There are a few discovery models, such as IDS (Nordhausen & langley, 1993) and FAHRENHEIT (Zytkow, 1987) that can directly receive data from their physical environment. However, experimental setup is rather complex for any direct data acquisition in the domain of TREV.

The program has two types of theory revision capability. One is based on using the consistency constraints, and the other is theory revision by observational evidence.

Another shortcoming of the program is that the theory formation and revision operators fired by a rule set whose conditions are determined by the message list. In other words, the control rules are hardwired, though an explanation based learning method could be used to learn such rules. We will address this problem in the future versions of the program.

4. Conclusions
One important problem in artificial intelligence is building models that integrate different methods of representation and learning. We have described a discovery system, directed by completeness and consistency constraints, with the capabilities of theory formation and theory revision, and with the ability of explaining its knowledge state by its domain constraints. The system is capable of formulating new elementary particles and particle reactions, and proposing observations to test their existence. The program has a certain degree of integration in its representation, learning and discovery methods, which can be further improved.

References
Griffiths, D. (1987). Introduction to Elementary Particles. John Wiley and Sons, N.Y.

Kocabas, S. (1989). Scientific Explanation by Exclusion. In Proceedings of the 12th Congress on Cybernetics, Namur, Belgium.

Kocabas, S. (1991). Conflict resolution as discovery in particle physics. Machine Learning, Vol 6, No 3, 277-309.

Kulkarni, D. and Simon, H. (1988). The processes of scientific discovery. Cognitive Science, 12, 139-175.

Langley, P., Simon, H., Bradshaw, G., and Zykow, J. (1987). Scientific discovery: Exploration of the creative processes. MIT Press.

Nordhausen, B. and Langley, P. (1993). An integrated framework for empirical discovery. Machine Learning, 12, 17-47.

Omnes, R. (1970). Intorduction to Particle Physics. Tr. by G. Barton. Wiley Interscience, London.

O'Rorke, P., Morris, S. and Schulenburg, D. (1990). Theory formation by abstraction. In Shrager, J., and Langley P. eds. Computational models of scientific discovery and theory formation. Morgan Kaufmann, San Mateo, CA.

Rajamoney, S.A. (1990). A computational approach to theory revision. In Shrager, J., and Langley P., eds., Computational models of scientific discovery and theory formation. Morgan Kaufmann, San Mateo, CA.

Rose, D. and Langley, P. (1986). Chemical discovery as belief revision. Machine Learning, 1, 423-452.

Thagard, P. and Nowak, G. (1990). The conceptual structure of the geological revolution. In Shrager, J., and Langley P., eds., Computational models of scientific discovery and theory formation. Morgan Kaufmann, San Mateo, CA.

Valdes-Perez, R. (1992). Theory driven discovery of reaction pathways in the MECHEM system. In Proceedings of the National Conference on Artificial Intelligence.

Zytkow, J.M. (1987). Combining many searches in the FAHRENHEIT discovery system. Proceedings of the Fourth International Workshop on Machine Learning, Morgan Kaufmann, 281-287, Los Altos, CA.

Zytkow, J.M. (1990). Deriving laws through analysis of processes and equations. In Shrager, J., and Langley P., eds., Computational models of scientific discovery and theory formation. Morgan Kaufmann, San Mateo, CA.

Zytkow, J.M. and Simon, H. (1986). A theory of historical discovery: The construction of componential models. Machine Learning, 1, 107-137.

Automated Formulation of Reactions and Reaction Chains in Nuclear Astrophysics

Automated Formulation of Reactions and Reaction Chains in Nuclear Astrophysics
Sakir Kocabas
Department of Space Sciences and Technology
ITU, 80626 Maslak, Istanbul, TURKEY

Pat Langley
(LANGLEY @ NEWATLANTIS.ISLE.ORG)
Institute for the Study of Learning and Expertise
2164 Staunton Court, Palo Alto, CA 94306 USA

Abstract
In this paper we describe ASTRA, a computational research aid for the formulation and analysis of process explanations in nuclear astrophysics. The system operates in two independent modules. The first module generates fusion and decay reactions for the light elements from hydrogen to oxygen by using knowledge of quantum theory, and from these reactions, the second module constructs all theoretically possible reaction chains as process explanations for the nucleosynthesis of helium, carbon and oxygen. ASTRA has found apparently novel reactions that involve proton, electron and neutron capture. Currently, there is a small number of reactions and pathways proposed by astrophysicists to explain the synthesis of these elements and their relative abundance in stellar systems. ASTRA also produces many alternative reaction pathways, some of which are of interest to scientists working in this domain.

1 Introduction
Computational modeling of scientific discovery has been a primary concern of a small number of research groups in artificial intelligence, and has made considerable advances in its short history. A number of models have been developed in the last two decades to simulate discoveries in fields such as mathematics, physics, chemistry, and biology. These models addressed different aspects of discovery in formal and experimental sciences, such as mathematical theory formation (Lenat, 1979), searching for quantitative relationships and hypothesis formation (Langley, Simon, Bradshaw & Zytkow, 1987), theory development through the discovery of componential models (Zytkow & Simon, 1986; Rose & Langley, 1986), scientific problem formulation and experiment design (Kulkarni & Simon, 1990), theory formation and theory revision (Kocabas, 1991), and theory formation (Valdes-Perez, 1994).

In recent years however, interest increased towards the computational discovery of new scientific knowledge by means of new models. One of the earliest computational tools used in producing new scientific knowledge was DENDRAL (Feigenbaum, Buchanan & Lederberg, 1971), which helped analytical chemists to build correct 2-d models of some complex chemical substances. Two recent examples are Hendrickson's (1995) SYNGEN which designs the synthesis of some organic compounds from initial and intermediate compounds, and Valdes-Perez's (1995; 1997) MECHEM which has found new reaction pathways in physical chemistry.

This paper focuses on the results of ASTRA, an astrophysical research aid designed to support scientists in explaining the nucleosynthesis of elements and their relative abundance in stars. The program is a successor of BR-4 (Kocabas & Langley, 1995) which was developed as an integrated model for studying the role of predictions in particle physics. The behavior and the results of ASTRA is described with an emphasis on the system's abilities as a research tool in astrophysics.

2. Research Problems in Astrophysics
Astrophysics is a curious field of study related with the tiniest and the largest objects in the universe, the elementary particles, and stars and galaxies. One of its subfields, nuclear astrophysics, mainly concerns with the formation of chemical elements from hydrogen (H) and helium (4He), thought to have emerged in the early history of the universe, through a series of fusion and decay reactions in stars. Another important problem concerns the relative abundances of elements, in particular the abundance carbon (12C), nitrogen (14N) and oxygen (16O) relative to lighter elements lile lithium (7Li), beryllium (9Be) and boron (11B).

According to the current astrophysical theories, stars go through several stages in their lifetimes. The first stage, which follows the star. s initial formation by the condensation of cosmic clouds and hydrogen gas, involves . hydrogen burning. . During this stage, stars radiate energy emitted by a series of exothermic fusion reactions in which hydrogen is transformed into helium. Astrophysicists propose three different pathways (Audouze & Vauclair, 1980, p. 52; Williams, 1991, p. 351) to account for hydrogen burning in stars the size of the sun and smaller. Later stages consist of more complex reactions, typically involving heavier elements.

In their attempt to explain nucleosyntheses, the theorists first select a stellar model in thermal equilibrium which makes certain assumptions about the mass, temperature, density, and the element distribution in the stellar plasma. They then identify the particle and nuclear reactions consistent with quantum physics by calculation. Finally, they calculate the rates of these reactions, by using experimental and theoretical knowledge about nuclear cross-sections and reactant abundances. In this way they obtain a set of valid reactions with their rate coefficients. Scientists use the reactions with high rates to construct the pathways, either by working forward from lighter elements to the final element, or backward from the final element, until reaching to the existing lighter elements.

Naturally, there are many possible reactions, and a great number of reaction pathways even for a small number of reactions to start with. Astrophysicists deal with this problem by focusing their attention on only a small set of reactions, relying on heuristics to constrain the generation of explanatory hypotheses. In this process, there seems to be at least two places where automation could be used: In the formulation of all possible reactions within any selected energy band, and in the construction of pathways from any selected set of reactions.

In developing ASTRA our main objective was to see its results on several important research topics in nuclear astrophysics. These were: 1) hydrogen-burning processes, 2) helium burning processes, 3) formation of heavier elements carbon, nitrogen and oxygen through hydrogen and helium burning, and other fusion chains, 4) the role of neutrons in such processes, and 5) the anomaly in the relative abundance of the light elements.

We have examined a number of books and journal papers on nuclear astrophysics, notably the following work : Audouze & Vauclair (1980); Clayton (1983); Fowler (1986); Fowler, et al., 1967; Fowler et al., 1975; Harris & Fowler, et al., 1983; Cujec & Fowler, 1980; Kippenhahn & Weigert (1994); Lang (1974); and Williams (1991).

In the next section, we describe ASTRA in terms of its inputs, outputs and operations. Section 3 describes the experimental results of ASTRA, and Section 4 discusses its results. The paper ends with a summary of the conclusions.

3. System Description of ASTRA
Before we describe our application of ASTRA to nuclear astrophysics, we should first describe its inputs, outputs and procedures, which include two main stages. The first generates all theoretically valid reactions, and the second produces reaction chains as process explanations for the nucleosynthesis of elements.

3.1 Generating Reactions
The first stage of ASTRA takes as input descriptions for a set of elements and isotopes. Each entity is characterized in terms of five quantum properties: rest mass (in MeV/c2), electric charge, spin counts, lepton counts, and baryon counts. We also give ASTRA theoretical knowledge about conservation relations over these quantum properties that hold in reactions among the elements and isotopes. Finally, we constrain the system to consider only the exothermic reactions, assuming that endothermic reactions play a relatively minor role in stellar nucleosynthesis.

Based on this information, ASTRA systematically generates all fusion and decay reactions among these elements that obey the conservation laws, together with their energy emissions, or Q-values, in terms of mega electron volts (MeV). The reactions generated by the program are in the form: Rm Pn , m = 1,2; n = 1,2,3 where Rm and Pn are the sets of the reacting and resulting elements respectively, and m and n are the number of elements in the sets. (For m = 1, the formula represents decay reactions). An example of the output of this module for the H + 6Li reactions is as follows:*

-------
* The reaction formulations of ASTRA are based on neutral atoms. For this reason, there appear minor differences with textbook notations, such as in the second reaction above whose textbook version is H + 6Li 7Li + n, instead of H + 6Li 7Li + /e + n.


reaction( [h,li6], [be7], 5.68 ),
reaction( [h,li6], [li7, nu], 6.48 ),
reaction( [h,li6], [he4, he3], 4.08 ).


where the first list shows the reacting elements, the second the resulting elements, and the figures before the right parantheses, the total Q-values.
For the runs described in this paper, we provided ASTRA with the elements from hydrogen to oxygen, their isotopes and a few elementary particles like the electron, proton, neutron and the neutrino with their antiparticles, giving a total of 36 distinct entities. From these, the system generated some 400 different reactions, but some were minor variations on one another such as 3He + 9Be 12C + e + /e and 3He + 9Be 12C + n + /n. We eliminated such near repetitions manually, leaving 276 reactions that included 262 fusion reactions and 14 decays.

3.2 Generating Reaction Chains
ASTRA. s second stage takes as input these primitive reactions, along with an element E whose syntheses we want explained and the basic elements/isotopes (E) that we assume as given (typically hydrogen and deuterium). In response, the system generates all reaction chains that lead from the starting elements to the final element through the various reactions identified in the first stage. The system uses a depth-first, backward chaining search to construct the reaction chains. On the first step, ASTRA finds those reactions that give as an output the final element E. Upon selecting one of these reactions, R, it recursively finds those reactions that give as an output one of more R. s input elements. The algorithm continues this process, halting its recursion when it finds a reaction chain for which all the reacting elements are in (E), or when it cannot find a reaction off which to chain. ASTRA generates all possible reaction chains in this systematic manner.

ASTRA produces all possible exothermic fusion reactions and decays including the ones given in the astrophysics literature that we have looked at. The program constructs a large number of reaction chains, most of which would be ruled out by physicists on grounds of low reaction rates. However, some of the pathways produced by the program seem to be viable alternatives to the currently proposed mechanisms, both on the account of the energy emissions and the existence of the elements in stars.

4. The Results of ASTRA
In this section we report the results of our tests with ASTRA, within the conceptual framework of research topics in nuclear astrophysics. We first address two broad classes of reactions that are believed to play an important role in stellar nucleosyntheses, then turn to reaction chains that explain the synthesis of heavier elements.

4.1 Proton, Electron and Neutron Captures
The synthesis of chemical elements from hydrogen and helium in successive steps in stellar systems are explained by astrophysicists by a series of fusion and decay reactions. Two main processes among these reactions are proton and neutron captures in which a nucleus reacts with a proton or a neutron, giving a heavier element or isotope. Electron captures in which an orbital electron is absorbed by the atomic nucleus with the emission of a neutrino, play a relatively minor role in the nucleosyntheses.

Proton captures are an important class of exothermic reactions that take part in hydrogen burning processes. We have found 33 examples of proton captures given in astrophysics literature (e.g., Fowler, et al., 1967, 1975, 1983) for elements from hydrogen to oxygen (16O).

ASTRA. s first stage predicts that all elements from hydrogen to nitrogen (15N), with the exception of 4He, participate in proton capture. The program produces 46 such reactions, including all 33 examples we have found in texts, but also 13 others which we have not seen in astrophysics texts that we examined. Some of these reactions are:

H + 6Li 7Be
H + 9Be 4He + 4He + D
H + 9Be 10B
H + 10B 7Be + 4He
H + 11B 12C.


Electron capture reactions are weak interactions in which an electron is absorbed by the atomic nucleus to be transformed into one with a smaller atomic number. An important example which also take place in what is called the pp2 chain given below, is (e + 7Be 7Li + n). ASTRA. s first stage produces 6 electron capture reactions of which only the one just given appears in astrophysics texts.
In fusion reactions that involve neutron capture, an element combines with a neutron to form a heavier isotope of the same element. We found 17 neutron captures for light elements in the literature, while ASTRA predicts 59 such reactions that are theoretically possible for the same elements. These include the following reactions that we did not see in the texts:

n + 6Li 7Be + n
n + 7Be 4He + 4He
n + 8Be 9Be
n + 10B 11B


The third reaction may play an important role in stellar reaction pathways, which we will consider shortly.

4.2 Neutron and Deuteron Production
Neutron capture requires a continuous supply of neutrons in the stellar plasma, so that it relies on some neutron producing reaction. Audouze & Vauclair (1980, p. 86) suggest that

D + D 3He + n ,

is the only reaction that releases neutrons in the hydrogen burning stage of main-sequence stars. Yet, ASTRA also predicts six additional reactions that produce neutrons, three of which are

D + T 4He + n
3He + 7Li 9B + n
D + 9Be 10B + n
4He + 9Be 12C + n.

The first reaction appears likely in main-sequence stars, as D and T exist in them. However, astrophysicists would ignore most of these reactions for their low reactant abundances in stellar plasma. Most of the neutron-producing reactions rely on a deuteron as one of their inputs. The best known deuteron source is the reaction:

H + H D + /e + n ,

and in astrophysics texts we have found two more reactions that produce deuterium (T + 3He 4He + D and H + 9Be 8Be + D ). However, the first stage of ASTRA predicts 15 other reactions of this sort. These include:

3He + 6Li 7Be + D
3He + 7Li 8Be + D
4He + 10B 12C + D
3He + 11B 12C + D
3He + 13C 14N + D.


The first two of these reactions should take place in main-sequence stars, as 6Li and 7Li are known to exist there, yet we have not found either reaction in the literature that we examined. Again, astrophysicists would presumably ignore most of these reactions for their low reactant abundances.

4.3 Helium Synthesis by Hyrogen Burning
The transformation of hydrogen into helium in a series of nuclear processes which take place in main sequence stars as the principal source of energy. The standard reaction chains given in astrophysics texts (e.g. Audouze & Vauclair, 1980, p. 52; Williams, 1991, p. 351) for helium synthesis in such stars are the hydrogen-burning processes called . proton-proton. or pp chains. The first of these chains, is given as:


a. H + H D + /e + n
b. D + H 3He
c. 3He + 3He 4He + H + H.


The net effect of this reaction chain when reaction (a) occurs twice, is 4 H 4He + 2 n + 26.72 MeV. Another pathway hypothesized, is called . alpha-catalyzed chain. , is

d. 3He + 4He 7Be
e. 7Be + e 7Li + n
f. H + 7Li 8Be
g. 8Be 4He + 4He


in which reactions b and c provide both the 3He and the 4He needed by reaction d. An alternative pathway, which also appears in texts, replaces reaction e with H + 7Be 8B and f with 8B 8Be + /e + n, which produce the 8Be needed by the final reaction through a different mechanism. Astrophysicists refer to these three pathways as the pp1, pp2 and pp3 chains, respectively.
When asked to generate reaction chains from hydrogen to helium, the ASTRA system finds all of these reaction chains including the CNO cycle. Yet, ASTRA formulates another reaction chain which involves an electron capture, as in pp2:


H + H D + n
D + H 3He
3He + 6Li 9B
e + 9B 9Be + n
H + 9Be 4He + 6Li.


which has the same net effect. We did not see any record of this chain in the literature we examined. The program also finds 44 other processes of helium synthesis that differ in their last link of the chains. Many of these would be disregarded by astrophysicists for their small cross sections, including the chain:

H + H D + n
D + 4He 6Li
H + 6Li 7Li + n
H + 7Li 4He + 4He ,


As D is believed to be quickly destroyed by the reaction D + H 3He. Both Cujec & Fowler (1980) and Harris, Fowler, Caughlan, and Zimmerman (1983) argue that the reactions involving D are unlikely due to their low abundance. However, Clayton (1983, pp. 371-2) notes that the density of deuterium in the interstellar medium and the sun remains unknown, and suggests that the substance might be more common that usually believed.

4.4 Generation of Carbon and Oxygen
The origin and the relative abundance of carbon and oxygen has been one of the main concerns of astrophysics. The standard account (e.g., Fowler, 1986, pp. 5-6) relies on the process of helium-burning, in which helium nuclei react to form carbon and oxygen in the following steps:


4He + 4He 8Be
4He + 8Be 12C
4He + 12C 16O .


However, there were theoretical problems with this account; the first reaction is endothermic and the lifetime of 8Be is very short (2x 10-16s). Later calculations showed that 8Be resonances were sufficiently stable to allow the reaction with an alpha particle to produce carbon as in the second reaction. ASTRA does not formulate the reaction 4He + 4He 8Be because it is slightly endothermic, but the system finds 20 other reactions that produce 8Be, such as

D + 6Li 8Be
3He + 7Li 8Be + D
n + 7Be 8Be .


Once 8Be is available, 4He + 8Be 12C can take place exothermically, so ASTRA formulates this reaction. The system produces 24 additional chains that differ in their final steps to 12C. These include:

n + 8Be 9Be
4He + 9Be 12C + n ,


which relies on one of the neutron capture reactions that we discussed earlier. An alternative and even more plausible pathway produced by ASTRA involves a proton capture:

H + 8Be 9Be + n
4He + 9Be 12C + n .
Briefly, if 8Be captures a neutron or proton before it decays, then it transforms into its stable isotope 9Be. This in turn produces carbon by reacting with 4He, where the emitted neutron from the latter reaction can combine with another 8Be. Once 12C is formed, in whatever manner, it can react with 4He exothermically to produce oxygen


4He + 12C 16O .

In summary, ASTRA finds a number of reaction chains to carbon and oxygen that do not appear in astrophysics literature, all of which are theoretically possible, but the final judgement about their scientific value requires further evaluation, as we discuss next.

5. Discussion of Results
We have carefully compared ASTRA. s outputs, at both the reaction and pathway level to those available in astrophysics texts (Clayton, 1983; Audouze & Vauclair, 1980; Kippenhahn & Weigert, 1994; Fowler et al., 1967, 1975, 1983; Cujec & Fowler, 1980). We have examined the results of the system only on exothermic reactions, but ASTRA can formulate reactions in any energy band.

Although ASTRA calculates the Q-values of all reactions that it formulates, the current version does not take into account the reaction rates which are used by astrophysicists in determining the more likely reactions and reaction chains. Due to this limitation, the current version cannot decide which reactions must be dominant in a given burning phase in the star. We are considering to implement this capability in the program's future versions, in such a way that, given a stellar model (e.g., a model of the sun), when the reaction rates are given, it should eliminate some of the low-rate reactions before constructing the reaction chains.

It would be impossible for astrophysicists even to formulate all theoretically possible reactions for an exhaustive research, without a computational aid like ASTRA. For example, Fowler, et al. 1967; 1970 and 1983 cite 88 reactions for elements from H to 16O in their research, while ASTRA uses 276 such reactions. In the 88 reactions, the same authors cite 33 H-capture, 17 n-capture and 8 D-fusion reactions, while ASTRA formulates 46 H-captures, 59 n-captures and 75 D-fusions for the same range of elements.

The ASTRA program can handle a very large volume of data for constructing reaction chains, and although the hydrogen and helium burning processes have been dealt with extensively in the current literature, there may still be room for research. We understand that there is even more room for research on the synthesis of the heavier elements. Therefore, a complete analysis on the reactions and pathways can only be carried out with the aid of a computational tool such as our program.

Despite its current limitations the program still has the potential of being useful in several subfields of nuclear astrophysics. Although we tested ASTRA on the exothermic reactions of the light elements from H to 16O, the system can be used in exploring the reactions of heavier elements from oxygen to iron and further, which take place in stellar and interstellar processes.

Our research is continuing in three strands: on the one, we are in the process of evaluating the current results of the system, while on the other, we plan to add to it, the ability to use the rates of the reactions to distinguish more likely mechanisms, and finally, we are improving its interface to make the system a more useful research aid for astrophysicists.

6. Conclusions
In this paper we described ASTRA, a computational tool which formulates nuclear reactions and pathways for researchers in astrophysics. Although we have been generally satisfied with ASTRA's performance to date, there clearly exists a number of directions in which we can extend our work. We are planning to present the program's predictions to domain experts to further evaluate the behavior of the current system in terms of the novelty and plausibility of its results. Our work is continuing to improve ASTRA, and to make it a more useful research tool for astrophysicists.

References
Audouze, J., & Vauclair, S. (1980). An introduction to nuclear astrophysics. Holland: D. Riedel.

Clayton, D.D. (1983). Principles of Stellar Evolution and Nucleosynthesis. Chicago: The University of Chicago Press.

Cujec, B. & Fowler, W.A. (1980). Neglect of D, T, and 3He in advanced stellar evolution. The Astropysical Journal, 236: 658-660.

Kippenhahn, R. and Weigert, A. (1994). Stellar Structure and Evolution. London: Springer-Verlag.

Feigenbaum, E. A., Buchanan, B.G., Lederberg, J. (1971). On generality and problem solving: A case study using the DENDRAL program. In Machine Intelligence (vol. 6). Edinburgh: Edinburgh University Press.

Fowler, W.A. (1986). The synthesis of the chemical elements carbon and oxygen. In S.L. Shapiro & S.A. Teukolsky (Eds.), Highlights of modern astrophysics. New York: John Wiley & Sons.

Fowler, W.A., Caughlan, G.R., and Zimmermann, B.A. (1967). Thermonuclear Reaction Rates. Ann. Rev. Astron. Astrphysics, 5, 525-570.

Fowler, W.A., Caughlan, G.R., and Zimmermann, B.A. (1975). Thermonuclear Reaction Rates. Ann. Rev. Astron. Astrphysics, 13, 69-112.

Harris, M.J., Fowler, W.A. Caughlan, G.R., and Zimmermann, B. (1983). Thermonuclear reaction rates. Ann. Rev. Astron. Astrophysics, 21: 165-176.

Hendrickson, J.B. (1995). Systematic synthesis design: The SYNGEN program. Working Notes of the AAAI Spring Symposium on Systematic Methods of Scientific Discovery (pp. 13-17). Stanford, CA: AAAI Press.

Kocabas, S. (1991). Conflict resolution as discovery in particle physics. Machine Learning, 6, 277-309.

Kocabas, S., & Langley, P. (1995). Integration of research tasks for modeling discoveries in particle physics. Working notes of the AAAI Spring Symposium on Systematic Methods of Scientific Discovery (pp. 87-92). Stanford, CA: AAAI Press.

Kulkarni, D., & Simon, H.A. (1990). Experimentation in machine discovery. In J. Shrager & P. Langley (Eds.), Computational models of scientific discovery and theory formation. San Mateo, CA: Morgan Kaufmann.

Lang, K.R. (1974). Astrophysical formulae: A compendium for physicists and astrophysicists. New York: Springer-Verlag.

Langley, P., Simon, H.A., Bradshaw, G.L., & Zytkow, J.M. (1987). Scientific Discovery: Computational explorations of the creative processes. Cambridge, MA: MIT Press.

Lenat, D. (1979). On automated scientific theory formation: a case study using the AM program. In: Hayes, J., Michie, D., and Mikulich, L.I., eds. Machine Intelligence 9, 251-283, Halstead: New York.

Rose, D., & Langley, P. (1986). Chemical discovery as belief revision. Machine Learning, 1, 423-451.

Valdes-Perez, R.E. (1994). Discovery of conserved properties in particle physics: A comparison of two models. Machine Learning, 17, 47-67.

Valdes-Perez, R.E. (1995). Machine discovery in chemistry: New results. Artificial Intelligence, 74, 191-201.

Valdes-Perez, R.E. (1997). Computer-aided mechanism elucidation of acetylene hydrocarboxylation to acrylic acid based on a novel union of empirical and formal methods. Organometallics, 16(14): 3114-3127.

Williams, W.S.C. (1991). Nuclear and Particle Physics. Oxford: Clarendon Press.

Zytkow, J.M., & Simon, H.A. (1986). A theory of historical discovery: The construction of componential models. Machine Learning, 1, 107-137.

Automated Formulation of Reactions and Pathways in Nuclear Astrophysics

AUTOMATED FORMULATION OF REACTIONS AND PATHWAYS IN NUCLEAR ASTROPHYSICS: NEW RESULTS

Sakir Kocabas

Department of Space Engineering


Abstract

This paper describes some of the new results from ASTRA, a knowledge based research aid for the formulation and analysis of process explanations in nuclear astrophysics. The program formulates valid fusion and decay reactions for the elements by using its knowledge of quantum theory, and from these reactions constructs all theoretically possible reaction chains as process explanations for the nucleosynthesis of elements. Earlier applications of ASTRA generated reactions of the elements and isotopes from hydrogen to oxygen, and found novel reactions that involve proton, electron and neutron capture for these elements, and a series of new reaction chains for hydrogen burning processes. We have recently extended the system’s knowledge base for elements from oxygen to sulphur. The new applications of ASTRA generated a series of reactions and pathways involving the heavier elements fluorine, neon, sodium, magnesium, aluminium, silicon and sulphur, some of which we did not see in the texts. The program also generated a complete series of carbon, nitrogen and oxygen burning reactions some of which may be of interest to astrophysicists.

Key words: Automated reasoning, scientific discovery.

1 Introduction

Computational design and construction of chemical and nuclear reaction processes have recently become an active area of research in computer aided scientific discovery. Three examples of such efforts are Hendrickson's (1995) SYNGEN which designs the synthesis of some organic compounds from initial and intermediate compounds, Valdes-Perez's (1995) MECHEM which has found new reaction pathways in physical chemistry (see, also Zeigarnik, et al., 1997), and Kocabas and Langley’s (1998) ASTRA system has found new reactions and pathways in nuclear astrophysics. This system was designed to support scientists in explaining various fusion processes, the nucleosynthesis of elements and their relative abundance in stars.

ASTRA differs from the earlier systems mainly in its focus on astrophysics, and in its ability to generate the basic reactions of the elements by using the principles of quantum physics. In this respect, the program is a successor of BR-4 (Kocabas & Langley, 1995), which carries out theory revision in particle physics much like its predecessor BR-3 (Kocabas, 1991). The BR-3 system in turn uses techniques and ideas from STAHL (Zytkow & Simon, 1986) and STAHLp (Rose & Langley, 1986), which modeled qualitative discovery in chemistry.

The behavior ASTRA was described elsewhere (see, Kocabas & Langley, 1998; 1999), so the focus here will be on some of the new results of this program with an emphasis on the system's abilities as a research tool in astrophysics.

2 Application Area: Nuclear Astrophysics

Nuclear astrophysics is a branch of astrophysics that mainly concerns with the formation of heavier elements such as carbon (12C), nitrogen (14N) and oxygen (16O) from hydrogen (H) and helium (4He), through a series of fusion and decay processes in stars. Exploration of the processes in which the heavier elements from oxygen (16O) to iron (56Fe) participate is another main topic in this field.

Current astrophysical theories suggest that, stars go through several stages in their lifetimes after their initial formation by the condensation of cosmic clouds and hydrogen gas. The first is called the “hydrogen burning stage” during which stars radiate energy emitted by a series of exothermic fusion reactions in which hydrogen is transformed into helium. Astrophysicists propose several different pathways (Audouze & Vauclair, 1980, p. 52; Williams, 1991, p. 351) to account for hydrogen burning in stars. Later stages, depending on the size of the star, involve processes such as helium burning, carbon, nitrogen and oxygen burning.

Astrophysicists explain nucleosyntheses by first adopting a stellar model in thermal equilibrium which makes certain assumptions about the mass, temperature, density, and the element distribution in the star. They then formulate the nuclear reactions using the constraints of quantum physics. They also calculate the rates of these reactions, by using experimental and theoretical knowledge about nuclear cross-sections and reactant abundances. Theoretically, there are many possible reactions, and a great number of reaction pathways even for a small number of elements. Astrophysicists deal with this problem by deleting the less likely reactions and focusing their attention on the reactions with high rates.

In our previous work with ASTRA (Kocabas & Langley, 1998; 1999; Kocabas & Langley, in press), we examined the resuts of the program on several research topics in nuclear astrophysics. These were: 1) hydrogen-burning processes, 2) helium burning processes, 3) formation of heavier elements carbon, nitrogen and oxygen through hydrogen and helium burning, and other fusion chains, 4) the role of neutrons in such processes, and 5) the anomaly in the relative abundance of the light elements.

In evaluating the results of ASTRA, we have examined a number of books and journal papers on nuclear astrophysics, notably the following work: Audouze & Vauclair (1980); Clayton (1983); Fowler (1986); Fowler, et al., 1967; Fowler et al., 1975; Harris & Fowler, et al., 1983; Cujec & Fowler, 1980; Kippenhahn & Weigert (1994); Lang (1974); Williams (1991); and Adelberger, E.G., et al. (1998). We have also discussed the results of the system with experts in astrophysics.

In the next section, we describe ASTRA in terms of its inputs, outputs and operations. Section 4 describes the experimental results of ASTRA, Section 5 discusses these results, and Section 6 discusses related reseach. The paper ends with a summary of the conclusions.

3 The ASTRA System

We first describe briefly, ASTRA’s inputs, outputs and operations, before we describe our application of the system to nuclear astrophysics with some of the earlier and the new results. A more detailed description can be found in Kocabas and Langley (1998). The program operates in two stages: the first generates all theoretically valid reactions, and the second produces reaction chains as process explanations for the nucleosynthesis of elements.

3.1 Formulating Reactions

The knowledge base of ASTRA includes descriptions for a set of elements and isotopes. The current version has information about 68 such entities. Each entity is characterized in terms of five quantum properties: rest mass (in MeV/c2), electric charge, spin, lepton, and baryon counts. ASTRA also has theoretical knowledge about conservation rules concerning the quantum values, which hold in reactions among the elements and isotopes. Typically, the exothermic reactions play the major role in stellar nucleosynthesis, but the program allows the selection of the energy band to assist more detailed study.

Based on this information, ASTRA systematically generates all collision and decay reactions among these elements that obey the conservation laws, together with their energy emissions, or Q-values, in terms of mega electron volts (MeV). The reactions generated by the program are in the form: Rm -> Pn , m = 1,2,3; n = 1,2,3 where Rm and Pn are the sets of the reacting and resulting elements respectively, and m and n are the number of elements in the sets. (For m = 1, m=2 and m=3 the formula represents decays, and double and triple collision reactions respectively). Examples of the runs of this module based on information about elements from hydrogen to oxygen can be found in Kocabas & Langley (1998).

Here we describe the new results of ASTRA with information about the elements from hydrogen to sulphur, their isotopes and a few elementary particles like the electron, proton, neutron and the neutrino with their antiparticles, giving a total of 68 distinct entities. From these, the system generated more than 600 different reactions, but some were minor variations on one another. We eliminated such variants manually, leaving 472 reactions that included 344 fusion reactions and 28 decays.

3.2 Constructing Reaction Chains

In order to construct reaction chains, ASTRA’s second stage takes as input the reactions generated by its first stage, an element E whose syntheses we want explained, and a starting element (typically hydrogen). The system generates all reaction chains that lead from the starting element to the final element through the various reactions identified in the first stage. ASTRA’s mechanisms for constructing reaction chains has been described elsewhere (see, Kocabas & Langley, 1998).

The program constructs a large number of reaction chains, most of which would be ruled out by physicists on grounds of low reaction rates. However, as a research aid, ASTRA provides a full range of possible reaction mechnisms to astrophysicists for more complete analysis of the the nuclear processes in their field of research.

4 The New Results of ASTRA

In this section we report the new results of our tests with ASTRA concerning hydrogen burning with heavier elements such as oxygen, fluor, neon, sodium, magnesium, aluminium, silicon and phosphorus. We begin with three classes of reactions that are believed to play an important role in stellar nucleosyntheses: proton, electron and neutron captures. We then turn to processes of helium, carbon and oxygen burning which explain the synthesis of heavier elements.

4.1 Proton, Electron and Neutron Captures

Three main processes for the nucleosynthesis of elements in stellar systems are proton, electron and neutron captures in which a nucleus reacts with a proton, electron or a neutron, giving a heavier element or isotope.

Proton captures are an important class of exothermic reactions that take part in hydrogen burning processes. In astrophysics literature we have found 33 examples of proton captures (e.g., Fowler, et al., 1967, 1975, 1983) for elements from hydrogen to oxygen (16O), and 20 more for elements from oxygen to sulphur (32S).

ASTRA’s first stage predicts that all elements from hydrogen to sulphur(32S), with the exception of 4He, participate in exothermic proton capture. The program produces 46 such reactions for elements from hydrogen to oxygen, including all 33 examples we have found in texts, but also 13 others which we have not seen in astrophysics texts that we examined. The program also finds 72 proton captures for elements from oxygen (16O) to sulphur (32S), including the 20 such reactions cited in the same literature.

In electron capture reactions, an electron is absorbed by the atomic nucleus to be transformed into one with a smaller atomic number. ASTRA’s first stage produces 6 electron capture reactions for elements from hydrogen to oxygen of which only one appears in astrophysics texts. The program also found 8 electron capture reactions for elements from oxygen to sulphur, none of which we have seen in the texts.

In neutron capture, an element combines with a neutron to form a heavier isotope of the same element. We found 17 neutron captures for elements from hydrogen to oxygen in the literature, while ASTRA predicts 59 such reactions that are theoretically possible for the same elements. Some examples of these reactions can be found in Kocabas and Langley (1998). Recent runs of the system generated 76 reactions for elements from oxygen to sulphur, none of which we have seen in the texts we have examined.

4.2 Hyrogen Burning Processes

In the main sequence stars hydogen is transformed into helium in a series of nuclear reaction chains called hydrogen burning processes. These processes are the main source of energy for such stars. The standard processes given in astrophysics texts (e.g. Audouze & Vauclair, 1980, p. 52; Williams, 1991, p. 351) for helium synthesis in such stars are called “proton-proton” or pp chains. Other hydrogen burning reactions that appear in texts involve heavier elements carbon, nitrogen and oxygen, and the pathway is called the CNO-chain.

When asked to generate reaction chains from hydrogen to helium, the ASTRA system finds all of these reaction chains including the CNO cycle. Yet, ASTRA also produces a viable variant of the CNO cycle using the electron capture of 13N (see, Kocabas & Langley, 1998).

Recently, we have run ASTRA on hydrogen burning reactions involving the elements heavier than oxygen. Such reactions are hypothesized to occur in stars larger than the sun. Some of the hydrogen burning chains that the program found, involving the elements fluorine, neon, sodium, magnesium, silicon, phosphorus and sulphur are:

H + 16O -> 17O + nu

H + 17O -> 18F

H + 18F -> 19Ne

19Ne + e -> 19F + e + nu (e-capture)

H + 19F -> 16O + 4He

------------------------------

Cumulative account: 4 H -> 4He + 2 nu



H + 23Na -> 24Mg

H + 24Mg -> 25Mg + nu

H + 25Mg -> 26Al

H + 26Al -> 27Si

27Si + e -> 27Al + e + nu

H + 27Al -> 24Mg + 4He

---------------------------------

H + 28Si -> 29Si + nu

H + 29Si -> 30P

H + 30P -> 31S

31S -> 31P + nu

H + 31P -> 28Si + 4He

----------------------------

ASTRA produces many more alternatives to these reactions, providing a complete framework to be examined by researchers in this field.

4.3 Helium Burning Processes

One of the main concerns of astrophysics has been the origin and the relative abundance of carbon and. The standard account (e.g., Fowler, 1986, pp. 5-6) assumes the reaction of helium nuclei to form carbon and oxygen. Earlier runs of ASTRA produced 24 additional chains that differ in their final steps to 12C. These include:

n + 8Be --> 9Be

4He + 9Be --> 12C + n ,

which relies on a neutron capture reaction. Astrophysicists qualified this process as one that can compete with the standard account in explosive stars that produce many neutrons. We have discussed ASTRA’s results on the nucleosynthesis of carbon and oxygen including the related helium burning processes elsewhere (Kocabas & Langley, 1998; Kocabas & Langley, in press) in more detail. So we now turn to the new results of the program on this issue.

ASTRA finds 24 helium burning reactions involving the range of elements from oxygen to silicon, including the 16 such reactions cited in the texts. Some of these are:

4He + 16O -> 20Ne + 5.16

4He + 20Ne -> 24Mg + 9.3

4He + 23Na -> 27Al + 10.2

4He + 24Mg -> 28Si + 10.1

4He + 28Si -> 32S + 6.9

where the figures on the right represent the energy emissions in MeV.

A comparison of the helium burning reactions produced by ASTRA with the natural abundances of the elements from oxygen to sulphur in the CRC Handbook (Weast, R.C. & Astle, M.J., 1981) reveals an interesting result: The elements fluorine, neon, sodium, magnesium, silicon, phosphorus and sulphur in the solar system must have been formed by helium burning processs, rather than neutron captures. This is because, the stable isotope abundances of these elements indicate a parallelism with the stepwise alpha-capture (helium burning) of the stable lighter isotopes of the elements in the series. This matter seemed to deserve further analysis.

4.4 Carbon, Nitrogen and Oxygen Burning

Carbon burning takes place after the helium burning stage in a star. ASTRA finds four carbon burning reactions which produce the elements neon, sodium, and magnesium:


12C + 12C -> 24Mg + 14.4

12C + 12C -> H + 23Na + 2.72

12C + 12C -> 4He + 20Ne + 5.1

In nitrogen burning, two nitrogen atoms fuse together to form elements ranging from oxygen to silicon. ASTRA finds 10 such reactions, two of which are:


14N + 14N -> 28Si + 27.82

14N + 14N -> 12C + 16O + 10.46


Finally, ASTRA formulates six oxygen burning reactions in which the elements magnesium, silicon, phosphorus and sulphur are generated. Three of these reactions are

16O + 16O -> 32S + 17.12

16O + 16O -> n + 31S + 2.05

16O + 16O -> 8Be + 24Mg + 0.02

Carbon, nitrogen and oxygen burning reactions happen only in massive stars as they require higher energies to initiate. The astrophysics texts that we examined mention only a few of these reactions, such as 12C + 12C -> 24Mg, 14N + 14N -> 28Si, and 16O + 16O -> 32S, while ASTRA provides a full account of such reactions.

5. Discussion of Results



We have dicussed some of the results of ASTRA with astrophysicists and carefully compared its outputs to those available in astrophysics texts (Clayton, 1983; Audouze & Vauclair, 1980; Kippenhahn & Weigert, 1994; Fowler et al., 1967, 1975, 1983; Cujec & Fowler, 1980; Adelberger, E.G., et al. (1998). We received some encouraging comments from domain experts about the results of the program but we need more detailed analysis before claims of originality.

Earlier we examined the results of ASTRA only on exothermic reactions. Following discussions with domain experts, we have improved the system to formulate reactions in any selected energy band. In certain stellar conditions, some endothermic reactions can contribute to speed up certain nuclear processes.

The current version of ASTRA does not calculate the reaction rates which are used by astrophysicists in determining the more likely reactions and reaction chains. Astrophysicists suggested that this feature would be very useful in a research tool like ASTRA. However, the current version can receive as input the rate values for each reaction it has formulated, and by deleting those with low rates, can effectively eliminate a large number of reaction chains for their slow rates. We are considering to fully implement this capability in the program's future versions with the help of domain experts.

On the other hand, without a computational aid like ASTRA, it would be impossible for astrophysicists even to formulate all the theoretically possible reactions for an exhaustive research. The program can handle a very large volume of data for constructing reaction chains, and although the hydrogen and helium burning processes have been dealt with extensively for lighter elements in the current literature, there is still much scope for research on the processes of the heavier elements. A complete analysis on the reactions and pathways can only be carried out with the aid of a computational tool such as our program.

Although we tested ASTRA on the reactions of the elements from hydrogen (H) to sulphur (32S), the system can be used in exploring the reactions of heavier elements from sulphur to iron (56Fe) and further, which take place in stellar and interstellar processes.

While we are in the process of evaluating the new results of the system, we are also planning to add the ability to use the rates of the reactions to distinguish more likely mechanisms, and finally, we are improving its interface to make the system a more useful research aid for astrophysicists.

6. Conclusions

In this paper we described the new results of ASTRA, a computational tool which formulates reactions and pathways for researchers in nuclear astrophysics. We received encouraging comments from astrophysicsts about the earlier results of the program, and suggestions on how to further imporve its features. We continue to collaborate with domain experts to evaluate the behavior of the current system in terms of the novelty and plausibility of its latest results, and to improve ASTRA in its functionalities to make it a more useful research tool for astrophysicists.

References

Adelberger, E.G., et al. (1998). Solar fusion cross sections. Reviews of Modern Physics, vol. 70, No. 4. Pp 1266-1291.

Audouze, J., & Vauclair, S. (1980). An introduction to nuclear astrophysics. Holland: D. Riedel.

Clayton, D.D. (1983). Principles of Stellar Evolution and Nucleosynthesis. Chicago: The University of Chicago Press.

Cujec, B. & Fowler, W.A. (1980). Neglect of D, T, and 3He in advanced stellar evolution. The Astropysical Journal, 236: 658-660.

Fowler, W.A. (1986). The synthesis of the chemical elements carbon and oxygen. In S.L. Shapiro & S.A. Teukolsky (Eds.), Highlights of modern astrophysics. New York: John Wiley & Sons.

Fowler, W.A., Caughlan, G.R., and Zimmermann, B.A. (1967). Thermonuclear Reaction Rates. Ann. Rev. Astron. Astrphysics, 5, 525-570.

Fowler, W.A., Caughlan, G.R., and Zimmermann, B.A. (1975). Thermonuclear Reaction Rates. Ann. Rev. Astron. Astrphysics, 13, 69-112.

Harris, M.J., Fowler, W.A. Caughlan, G.R., and Zimmermann, B. (1983). Thermonuclear reaction rates. Ann. Rev. Astron. Astrophysics, 21: 165-176.

Hendrickson, J.B. (1995). Systematic synthesis design: The SYNGEN program. Working Notes of the AAAI Spring Symposium on Systematic Methods of Scientific Discovery (pp. 13-17). Stanford, CA: AAAI Press.

Kippenhahn, R. and Weigert, A. (1994). Stellar Structure and Evolution. London: Springer-Verlag.

Kocabas, S. (1991). Conflict resolution as discovery in particle physics. Machine Learning, 6, 277-309.

Kocabas, S., & Langley, P. (1995). Integration of research tasks for modeling discoveries in particle physics. Working notes of the AAAI Spring Symposium on Systematic Methods of Scientific Discovery (pp. 87-92). Stanford, CA: AAAI Press.

Kocabas, S. & Langley, P. (1998). Generating process explanations in nuclear astrophysics. Proceedings of the ECAI-98 Workshop on Machine Discovery (pp.4-9), Brighton, UK.

Kocabas, S. & Langley, P. (1999). Automated formulation of Reactions and reaction chains in nuclear astrophysics. Proceedings of the 8th Turkish Symposium of Artificial Intelligence and Neural Networks. Boğaziçi University. pp. 247-256.

Kocabas, S. & Langley, P. (in press). Computer generation of process explanations in nuclear astrophysics. International Journal of Human-Computer Studies.

Lang, K.R. (1974). Astrophysical formulae: A compendium for physicists and astrophysicists. New York: Springer-Verlag.

Rose, D., & Langley, P. (1986). Chemical discovery as belief revision. Machine Learning, 1, 423-451.

Valdes-Perez, R.E. (1995). Machine discovery in chemistry: New results. Artificial Intelligence, 74, 191-201.

Zeigarnik, A.V., Valdes-Perez, R.E., Temkin, O.N., Bruk, L.G. & Shalgunov, S.I. (1997). Computer-aided mechanism elucidation of acetylene hydrocarboxylation to acrylic acid based on a novel union of empirical and formal methods. Organometallics, 16(14): 3114-3127.

Weast, R.C. & Astle, M.J. (Eds.). (1981). CRC handbook of chemistry and physics (62dn ed.). Florida: CRC Press.

Williams, W.S.C. (1991). Nuclear and Particle Physics. Oxford: Clarendon Press.

Zytkow, J.M., & Simon, H.A. (1986). A theory of historical discovery: The construction of componential models. Machine Learning, 1, 107-137.