Benchmark Problems On Production Rules

Part One: Facts and Rules

Data structures

Facts and rules are symbol structures. They make a distinction between a special kind of symbol called a variable and everything else. The skeleton code contains a function

(variable? s)

that returns true if and only if the input symbol s is a variable. It happens to be true if s begins with a question mark. So for example the symbol

?x

is a variable, while a symbol that does not begin with a question mark is not. This is a common convention in Artificial Intelligence.
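The skeleton's own definition of variable? is not reproduced in this handout. As a rough sketch of the convention just described, and assuming symbols are ordinary Scheme symbols, a test along the following lines would behave the same way. It is only an illustration, not the skeleton's code.

```scheme
;; Sketch only: the skeleton code supplies its own variable?.
;; A symbol counts as a variable when its printed name starts with "?".
(define (variable? s)
  (and (symbol? s)
       (let ((name (symbol->string s)))
         (and (> (string-length name) 0)
              (char=? (string-ref name 0) #\?)))))

(variable? '?x)      ; => #t
(variable? 'block1)  ; => #f
```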

A fact is determined by a list of symbols that contains no variables. Such structures are also known as ground terms in computational logic. Concretely, then, a fact has the form:

(fact list-of-symbols)

A very famous encoding of facts into a database concerns the "blocks world", which describes the arrangement of children's alphabet blocks in stacks on a work table. You have symbols to represent the blocks, which typically take the form blockn where n is a small number. You have the symbol table to represent the table. You have a relation symbol on which indicates that one block is immediately stacked on top of another in a tower, and a relation symbol above which indicates that one block is located in the same tower as another, anywhere higher up. In this setting, you might have sample facts like these to describe one complete tower:

((fact (on block1 table))
 (fact (on block2 block1))
 (fact (on block3 block2))
 (fact (above block1 table))
 (fact (above block2 table))
 (fact (above block3 table))
 (fact (above block2 block1))
 (fact (above block3 block1))
 (fact (above block3 block2)))

This situation describes a tower consisting of three blocks stacked on top of one another: block1, block2 and block3. You can see that the above relation is pretty cumbersome to specify and it can be determined automatically from the basic relation of what blocks are on what other blocks. That provides the basis for rules.

A rule has two parts: a pattern and an action. A pattern is a list of symbols that may contain variables. An action is a fact or a rule written with those same variables, so that it abstracts over the specific symbols the pattern might match. Note that rules are recursive! Rules can have rules as parts, just as the programs that we have considered so far in class have other programs as parts.

Intuitively, the variables in a pattern are placeholders for a range of specific symbols that the rule might apply to. To use the rule we find a fact that specifies particular values of the variables. Then we replace the variables in the action with these values. And we take the action.

Concretely, a rule has the form:

(rule list-of-symbols action)

Here are two example rules.

(rule (on ?a ?b) (fact (above ?a ?b)))

This says that if block ?a is on block ?b, then it must also be true that block ?a is above block ?b. This is a simple kind of rule that infers one fact from another.

(rule (on ?a ?b)
 (rule (above ?b ?c)
  (fact (above ?a ?c))))

This is more complicated. It says that if ?a is immediately on top of block ?b, and block ?b in turn is anywhere in a tower above another block ?c, then ?a must also be in that tower above ?c. This rule works recursively. Whenever we find one block on another, this rule "fires" and adds a new rule to the situation. That rule encodes our knowledge about the two blocks by looking to extend tower information we have from the lower block to the higher block.
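Because rules are plain lists, their parts can be picked out with ordinary list operations. The names rule-pattern, rule-action, rule?, simple-rule and nested-rule below are hypothetical helpers introduced for illustration; they are not part of the skeleton code. The sketch only shows how the two example rules are laid out as data, and that the action of the second rule is itself a rule.

```scheme
;; Hypothetical accessors; the names are illustrative, not from the skeleton.
(define (rule-pattern rule) (cadr rule))   ; (rule pattern action) -> pattern
(define (rule-action rule) (caddr rule))   ; (rule pattern action) -> action

;; An action is itself a fact or a rule, which is what makes rules recursive.
(define (rule? x) (and (pair? x) (eq? (car x) 'rule)))

(define simple-rule
  '(rule (on ?a ?b) (fact (above ?a ?b))))

(define nested-rule
  '(rule (on ?a ?b)
     (rule (above ?b ?c)
       (fact (above ?a ?c)))))

(rule-pattern simple-rule)                ; => (on ?a ?b)
(rule-action simple-rule)                 ; => (fact (above ?a ?b))
(rule? (rule-action nested-rule))         ; => #t, the action is another rule
(rule-pattern (rule-action nested-rule))  ; => (above ?b ?c)
```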

Examples of inference

Let's see how these rules would be instantiated in the particular situation described above. First off, we can match the first fact with the first rule:

(fact (on block1 table))
(rule (on ?a ?b) (fact (above ?a ?b)))

This requires taking ?a to be block1 and ?b to be table. Based on this match, we instantiate the fact which is the action of the rule:

(fact (above block1 table))

Similar inferences allow us to infer:

(fact (above block2 block1))
(fact (above block3 block2))

Now we can consider the second rule. Again it matches based on the on facts in the database. For example one match is:

(fact (on block1 table))
(rule (on ?a ?b) (rule (above ?b ?c) (fact (above ?a ?c))))

Again we take ?a to be block1 and ?b to be table. Based on this match, we have a new rule to add to the knowledge base which represents the action of the rule.

(rule (above table ?c) (fact (above block1 ?c)))

In other words, if we find out that something's under the table (so to speak), then we will also learn that it's under block1. This rule will never do anything, given how the blocks world works. But there are two analogous rules that we can also add in this situation that will start doing work for us. These are the matches with block2 and block3.

(rule (above block1 ?c) (fact (above block2 ?c)))
(rule (above block2 ?c) (fact (above block3 ?c)))

These new rules match the above facts we inferred by the first rule we considered - in what you might think of as a basically recursive way. From here:

(fact (above block1 table))
(rule (above block1 ?c) (fact (above block2 ?c)))

we get:

(fact (above block2 table))

and analogously

(fact (above block3 block1))

Processing has to continue with these new facts! We can now run the inference to match

(fact (above block2 table))
(rule (above block2 ?c) (fact (above block3 ?c)))

And we finally get

(fact (above block3 table))

The match operation

We start by building up in pieces a function that computes an action, if possible, from a rule and a fact.

The logic of match

(define (match fact rule) … )

For example, in this case:

(match 
 '(fact (above block2 table))
 '(rule (above block2 ?c) (fact (above block3 ?c)))
)

The result would be

'(true, (fact (above block3 table)))

However, in the case

(match
 '(fact (above block3 block1))
 '(rule (above block2 ?c) (fact (above block3 ?c)))
)

The result would be

'(false, empty)

To be precise, the result of match is a list with two elements. The first element is a boolean value which indicates whether the rule applies to the fact to yield a match. The second element is the instantiated action (a fact or a rule) if there was a match, or the empty list otherwise. This is a common way of returning a complex result for a computation in Scheme. It is especially useful in cases such as this where a function may not succeed, or it may not always make sense to apply the function to the available data.

Bindings

To implement match, you will need several auxiliary functions and data structures, all spelled out in this section. The key data structure is called a binding. It is a list of pairs of the form (variable, value). Aligning the pattern of the rule with the fact, if successful, creates a binding that indicates how the rule applies to this case. Substituting the binding's values into the action then yields the fact or rule that is the result of the rule. So the overall match operation boils down to two steps: align and substitute.

The first basic operation you will need is a lookup operation that takes a variable and a binding and gets the variable's value. You've written code like this before, but here you must be prepared for the variable to have no value in the binding, and the result must indicate whether a value was found. So you write a function

(define (lookup variable binding) …)

which returns a two-element list, as usual: the first element says whether the variable was defined in the binding, and the second gives its value if it was. By convention, the value of a variable that is not found in the binding is itself. For example:

>(lookup '?c '((?c table) (?d chris)))
='(true, table)

And

>(lookup '?c '((?d chris) (?e kim)))
='(false, ?c)
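One way the lookup just specified might be written is sketched below. It assumes a binding is a list of two-element lists such as (?c table), and it writes the booleans as #t and #f, which are the same values Racket also calls true and false. Where the handout prints '(true, table), this code produces the list (#t table).

```scheme
;; Sketch of lookup: a binding is a list of (variable value) entries.
;; Returns (found? value); an unbound variable is returned as its own value.
(define (lookup variable binding)
  (cond ((null? binding) (list #f variable))
        ((eq? (car (car binding)) variable)
         (list #t (cadr (car binding))))
        (else (lookup variable (cdr binding)))))

(lookup '?c '((?c table) (?d chris)))  ; => (#t table)
(lookup '?c '((?d chris) (?e kim)))    ; => (#f ?c)
```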

The logic of align

The specification for the align operation is to take a fact and a pattern and return an optional binding, if there is one, using the same convention with a two-part list as with match. So we write a definition

(define (align fact pattern) … )

And we'll get results like:

>(align '(fact (on block2 table)) '(on block2 ?c))
='(true, ((?c table)))

>(align '(fact (on block1 table)) '(on block2 ?c))
='(false, empty)

Align is an interface to a lower-level recursive helper function align-step which incrementally extends an input binding by stepping through the list of symbols associated with a fact - call it the nucleus of the fact - one element at a time. The idea of such a helper function should be familiar - it is the same kind of thing that we used in adding up the elements of a list or evaluating a UFO program. Concretely, we have a definition

(define (align-step nucleus-part pattern-part binding) … )

And we will get results like

>(align-step empty empty '((?c table)))
='(true, ((?c table)))

(And the same for every binding!) Or:

>(align-step '(table) '(?c) empty)
='(true, ((?c table)))

And so forth.

It also helps to write a further helper function align-term, defined this way:

(define (align-term nucleus-elt pattern-elt binding) …)

That just looks at two elements - one from the nucleus and one from the pattern - and handles separately the case where the element in the pattern is a variable and the case where the element in the pattern is a constant.
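The three alignment functions can be sketched as follows. This is one possible arrangement under stated assumptions, not the intended solution: variable? and lookup are repeated so the block runs on its own, new entries are appended so the binding keeps pattern order (an arbitrary choice), and a variable that appears twice in a pattern is assumed to require the same value both times, which the handout does not spell out.

```scheme
;; Helpers repeated so the block runs on its own.
(define (variable? s)
  (and (symbol? s)
       (let ((name (symbol->string s)))
         (and (> (string-length name) 0)
              (char=? (string-ref name 0) #\?)))))

(define (lookup variable binding)
  (cond ((null? binding) (list #f variable))
        ((eq? (car (car binding)) variable) (list #t (cadr (car binding))))
        (else (lookup variable (cdr binding)))))

;; Compare one nucleus element with one pattern element.
(define (align-term nucleus-elt pattern-elt binding)
  (cond ((not (variable? pattern-elt))
         ;; Constant: it must be identical to the fact's element.
         (if (equal? nucleus-elt pattern-elt)
             (list #t binding)
             (list #f '())))
        ((not (car (lookup pattern-elt binding)))
         ;; Unbound variable: record its value at the end of the binding.
         (list #t (append binding (list (list pattern-elt nucleus-elt)))))
        ((equal? (cadr (lookup pattern-elt binding)) nucleus-elt)
         ;; Assumption: a repeated variable must keep the same value.
         (list #t binding))
        (else (list #f '()))))

;; Walk the nucleus and the pattern in step, threading the binding through.
(define (align-step nucleus-part pattern-part binding)
  (cond ((and (null? nucleus-part) (null? pattern-part))
         (list #t binding))
        ((or (null? nucleus-part) (null? pattern-part))
         (list #f '()))                 ; different lengths never align
        (else
         (let ((result (align-term (car nucleus-part) (car pattern-part) binding)))
           (if (car result)
               (align-step (cdr nucleus-part) (cdr pattern-part) (cadr result))
               result)))))

;; Entry point: strip the fact wrapper and start from the empty binding.
(define (align fact pattern)
  (align-step (cadr fact) pattern '()))

(align '(fact (on block2 table)) '(on block2 ?c))  ; => (#t ((?c table)))
(align '(fact (on block1 table)) '(on block2 ?c))  ; => (#f ())
(align-step '(table) '(?c) '())                    ; => (#t ((?c table)))
```

Returning the failed result unchanged from align-step means a mismatch anywhere in the list stops the walk immediately.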

The logic of substitute

Substitute can be written directly by recursion. It is easiest to recurse on an arbitrary Scheme data structure: when a constituent is a list, substitute into each of its elements; when it is a variable, replace it with its value from the binding; otherwise leave it unchanged. So define

(define (substitute data bindings) … )

With results like:

>(substitute '(rule (above ?b ?c) (fact (above ?a ?c))) '((?a block2) (?b block1)))
='(rule (above block1 ?c) (fact (above block2 ?c)))
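A recursive substitute along these lines might look like the sketch below. To keep the block short and self-contained, it uses the standard assq in place of the handout's lookup; that swap is an assumption made here, and a solution built on lookup would behave the same way, since unbound variables are left as themselves. The second call shows the substitution half of a match: a binding of the kind align returns is applied to a rule's action.

```scheme
;; variable? is repeated so the block runs on its own.
(define (variable? s)
  (and (symbol? s)
       (let ((name (symbol->string s)))
         (and (> (string-length name) 0)
              (char=? (string-ref name 0) #\?)))))

;; Sketch of substitute: rebuild the data, replacing each bound variable.
;; assq stands in for the handout's lookup (an assumption of this sketch).
(define (substitute data bindings)
  (cond ((list? data)
         (map (lambda (part) (substitute part bindings)) data))
        ((variable? data)
         (let ((entry (assq data bindings)))
           (if entry (cadr entry) data)))  ; unbound variables stay as they are
        (else data)))

(substitute '(rule (above ?b ?c) (fact (above ?a ?c)))
            '((?a block2) (?b block1)))
; => (rule (above block1 ?c) (fact (above block2 ?c)))

(substitute '(fact (above block3 ?c)) '((?c table)))
; => (fact (above block3 table))
```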