First-order logic

First-order logic (FOL) is a formal logical language. It is among the most expressive logics that still has a reasonably practical automated inference procedure, called resolution. A first-order language is defined from the following components:

A set $V$ of variables. These represent objects in the domain.
A set $F$ of function symbols. These map objects to other objects (like father(John), which denotes an object rather than a truth value).
- 0-ary function symbols are called constant symbols.
A set $P$ of predicate (relation) symbols. These evaluate to true/false and are defined over terms.

From here, we make a few more definitions:

A term is any expression that denotes an element of the universe. Terms are built from variables and function symbols. For example: x, 5, f(g(x, y)).
Atomic formulas are predicate symbols applied to terms. They are the simplest well-formed formulas that can have a truth value, and they denote properties of, and relations between, elements of the universe.
- They are “atomic” because they contain no logical symbols (AND, OR) or quantifiers.
- $P (t_{1}, \dots, t_{n})$ is true in a structure $M$ if the objects assigned to $t_{1}, \dots, t_{n}$ are related to each other by $P^{M}$, the interpretation of $P$ in $M$.
Formulas are expressions that evaluate to true or false. These themselves can be atomic formulas or built from atomic formulas using logical operators.
- We can create complex assertions by composing atomic formulas.
- Their truth is dependent on the truth of the atomic formulas in them.

A set $L$ of function and predicate symbols is called a first-order vocabulary. Note that variables are not part of the vocabulary. The set of $L$-terms is defined recursively:

Base: Every variable is a term.
Recursion: If $f$ is an $n$-ary function symbol in $L$ and $t_{1}, t_{2}, \dots, t_{n}$ are $L$-terms, then $f (t_{1}, t_{2}, \dots, t_{n})$ is an $L$-term.
- As long as $L$ contains a function symbol of arity $n > 0$, the number of terms we can generate is infinite.
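
The recursive definition can be read as a procedure for enumerating terms. The Python sketch below is only an illustration, not part of the formal definition: the tuple encoding of terms, the name `terms_up_to_depth`, and the sample vocabulary (a constant `c` and a unary function `s`) are all assumptions made for the example.

```python
from itertools import product

def terms_up_to_depth(variables, functions, depth):
    """Enumerate the terms obtained by applying the recursive rule `depth` times.

    variables: iterable of variable names (strings).
    functions: dict mapping each function symbol to its arity (0 = constant).
    Encoding (an assumption of this sketch): a variable is a string and
    f(t1, ..., tn) is the tuple (f, t1, ..., tn).
    """
    terms = set(variables)                        # base case: every variable is a term
    for _ in range(depth):
        new_terms = set(terms)
        for f, arity in functions.items():
            for args in product(terms, repeat=arity):
                new_terms.add((f, *args))         # recursive case: f(t1, ..., tn)
        terms = new_terms
    return terms

# Hypothetical vocabulary: one variable x, one constant c, one unary function s.
for d in range(4):
    print(d, len(terms_up_to_depth({"x"}, {"c": 0, "s": 1}, d)))
```

With this vocabulary the loop prints the counts 1, 3, 5, 7: each extra round adds new terms such as s(s(c)), so the set of terms never stops growing.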

The set of first-order $L$-formulas (syntax) is defined recursively. Compared with propositional logic, there are two additional operators: the quantifiers $\exists$ and $\forall$.

Atomic formulas: $P (t_{1}, t_{2}, \dots, t_{n})$, where $P$ is an $n$-ary predicate symbol in $L$ and $t_{1}, t_{2}, \dots, t_{n}$ are $L$-terms.
Negation: $\neg f$, where $f$ is an $L$-formula.
Conjunction: $f_{1} \land f_{2} \land \dots \land f_{n}$, where $f_{1}, f_{2}, \dots, f_{n}$ are $L$-formulas.
Disjunction: $f_{1} \lor f_{2} \lor \dots \lor f_{n}$, where $f_{1}, f_{2}, \dots, f_{n}$ are $L$-formulas.
Implication: $f_{1} \to f_{2}$, where $f_{1}, f_{2}$ are $L$-formulas.
Existential: $\exists x f$, i.e., there exists some $x$ such that $f$ is true, where $x$ is a variable and $f$ is an $L$-formula.
Universal: $\forall x f$, i.e., $f$ is true for every $x$, where $x$ is a variable and $f$ is an $L$-formula.
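
The formation rules above translate directly into a recursive well-formedness check. The following Python sketch is one possible encoding, not a standard one: the tags (`"pred"`, `"not"`, `"and"`, `"or"`, `"implies"`, `"forall"`, `"exists"`), the function names, and the sample vocabulary are all assumptions chosen for illustration.

```python
def is_term(t, functions):
    """A variable is a string; f(t1, ..., tn) is the tuple (f, t1, ..., tn)."""
    if isinstance(t, str):
        return True
    return (isinstance(t, tuple) and len(t) >= 1
            and functions.get(t[0]) == len(t) - 1        # arity must match
            and all(is_term(a, functions) for a in t[1:]))

def is_formula(phi, functions, predicates):
    """Check phi against the recursive definition of L-formulas."""
    if not isinstance(phi, tuple) or not phi:
        return False
    op, *rest = phi
    sub = lambda a: is_formula(a, functions, predicates)
    if op == "pred" and rest:                             # P(t1, ..., tn)
        name, *args = rest
        return (predicates.get(name) == len(args)
                and all(is_term(a, functions) for a in args))
    if op == "not":
        return len(rest) == 1 and sub(rest[0])
    if op in ("and", "or"):                               # n-ary, as in the definition
        return len(rest) >= 2 and all(sub(a) for a in rest)
    if op == "implies":
        return len(rest) == 2 and all(sub(a) for a in rest)
    if op in ("forall", "exists"):                        # quantifier, variable, body
        return len(rest) == 2 and isinstance(rest[0], str) and sub(rest[1])
    return False

# Hypothetical vocabulary: constant tony, unary predicate S, binary predicate L.
FUNCS = {"tony": 0}
PREDS = {"S": 1, "L": 2}

# forall x (S(x) -> exists y L(x, y))
phi = ("forall", "x",
       ("implies", ("pred", "S", "x"),
                   ("exists", "y", ("pred", "L", "x", "y"))))
print(is_formula(phi, FUNCS, PREDS))                   # well-formed
print(is_formula(("pred", "L", "x"), FUNCS, PREDS))    # wrong arity for L
```

The first call reports a well-formed formula; the second fails because `L` is declared binary but is applied to a single term.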

Conversion from natural language

Some basic representations:

Individuals/constants/0-ary functions: rain, snow, names (Tony, Mike, Nick).
Types: unary predicates
- $S (x)$ might indicate that $x$ is a skier.
Relationships: binary predicates
- $L (x, y)$ might indicate that $x$ likes $y$ .

Some tips:

When a statement contains several clauses, translate each one separately, then combine them with a conjunction, disjunction, or implication.
If the statement says “all” or “whatever”, that’s a good sign to use the universal quantifier.
If the statement asks whether some object has a property (“is there an x such that ...?”), use $\exists$.
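
As a short worked illustration of these tips (the English sentences are made-up examples, reusing the predicates $S$ and $L$ and the constants from above):

“Every skier likes snow”: $\forall x (S (x) \to L (x, \text{snow}))$
“Some skier likes Tony”: $\exists x (S (x) \land L (x, \text{Tony}))$
- A universal statement about a type usually pairs $\forall$ with $\to$. Writing $\forall x (S (x) \land L (x, \text{snow}))$ would instead claim that everything is a skier and likes snow.
- An existential statement usually pairs $\exists$ with $\land$. Writing $\exists x (S (x) \to L (x, \text{Tony}))$ would already be true if some object is not a skier.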

Structures

Each FOL interpretation should:

Specify the objects in the world.
Settle which objects satisfy each predicate $P$.
Assign to each function symbol $f$ a mapping from objects to objects (assuming functions are always well-defined and single-valued).

Let $L$ be a first-order vocabulary. An $L$ -structure $M$ consists of the following:

A non-empty set $M$ called the universe (or domain) of discourse.
For each $n$ -ary predicate symbol $P \in L$ , an associated relation $P^{M} \subseteq M^{n}$ . $P^{M}$ is called the extension or interpretation of the predicate symbol $P$ in $M$ .
For each $n$ -ary function symbol $f \in L$ , an associated function $f^{M} : M^{n} \to M$ . $f^{M}$ is called the extension or interpretation of the function symbol $f$ in $M$ .
- If $n = 0$ , then $f$ is a constant symbol and $f^{M}$ is simply an element of $M$ .

$M$ is essentially a world configuration.
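
For a finite universe, a structure can be written down explicitly. The Python sketch below is a minimal illustration under assumed conventions: relations are sets of tuples, functions are lookup tables keyed by argument tuples, and `is_structure` is a hypothetical helper that checks the three conditions above.

```python
from itertools import product

def is_structure(universe, relations, functions, pred_arity, func_arity):
    """Check that (universe, relations, functions) interprets the vocabulary."""
    if not universe:                                  # the universe must be non-empty
        return False
    for P, n in pred_arity.items():                   # P^M must be a subset of M^n
        if P not in relations:
            return False
        if any(len(tup) != n or not set(tup) <= universe for tup in relations[P]):
            return False
    for f, n in func_arity.items():                   # f^M must be a total map M^n -> M
        table = functions.get(f, {})
        for args in product(universe, repeat=n):
            if args not in table or table[args] not in universe:
                return False
    return True

# Hypothetical vocabulary: S/1 (skier), L/2 (likes), constant symbol snow.
PRED_ARITY = {"S": 1, "L": 2}
FUNC_ARITY = {"snow": 0}

# One possible world; the strings in UNIVERSE are objects, not symbols.
UNIVERSE = {"tony", "mike", "snow"}
RELATIONS = {"S": {("tony",), ("mike",)},
             "L": {("tony", "snow"), ("mike", "snow")}}
FUNCTIONS = {"snow": {(): "snow"}}                  # a 0-ary function picks one element

print(is_structure(UNIVERSE, RELATIONS, FUNCTIONS, PRED_ARITY, FUNC_ARITY))
```

Changing `RELATIONS` or `FUNCTIONS` while keeping the vocabulary fixed gives a different structure, that is, a different world configuration for the same language.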

Whether a formula containing variables is satisfied can depend on which objects those variables stand for, so we must assign objects to the variables. Let $M$ be a structure and $X$ be a set of variables. An object assignment $σ$ for $M$ is a mapping from variables in $X$ to the universe of $M$.

To evaluate arbitrary terms, including nested ones such as $f (g (x))$, we must extend $σ$ from variables to all $L$-terms. The extension $\bar{σ}$ of $σ$ is defined recursively:

For every variable $x$, $\bar{σ} (x) = σ (x)$.
For every $n$-ary function symbol $f \in L$ and $L$-terms $t_{1}, \dots, t_{n}$, $\bar{σ} (f (t_{1}, \dots, t_{n})) = f^{M} (\bar{σ} (t_{1}), \dots, \bar{σ} (t_{n}))$.
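
The two clauses of the extension correspond to a two-case recursive function. The sketch below assumes a finite universe, terms encoded as strings (variables) or tuples `(f, t1, ..., tn)`, and function interpretations stored as lookup tables; the name `eval_term` and the arithmetic structure are illustrative choices only.

```python
def eval_term(term, functions, sigma):
    """Evaluate a term under the object assignment sigma.

    term:      a variable name (str) or a tuple (f, t1, ..., tn).
    functions: dict mapping each function symbol f to a table for f^M,
               keyed by argument tuples.
    sigma:     dict mapping variable names to elements of the universe.
    """
    if isinstance(term, str):                 # clause 1: variables use sigma directly
        return sigma[term]
    f, *args = term                           # clause 2: evaluate arguments, then apply f^M
    values = tuple(eval_term(a, functions, sigma) for a in args)
    return functions[f][values]

# Hypothetical structure on the universe {0, 1, 2}:
# f is "successor mod 3" and g is "addition mod 3".
FUNCTIONS = {
    "f": {(m,): (m + 1) % 3 for m in range(3)},
    "g": {(a, b): (a + b) % 3 for a in range(3) for b in range(3)},
}

sigma = {"x": 1, "y": 2}
print(eval_term(("f", ("g", "x", "y")), FUNCTIONS, sigma))   # f(g(1, 2)) = f(0) = 1
```

The nested term is evaluated inside out: the arguments are mapped to objects first, and only then is the interpretation of the outer function symbol applied.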

For an $L$ -formula $C$ , $M ⊨ C [σ]$ (i.e., $M$ satisfies $C$ under $σ$ , or $M$ is a model of $C$ under $σ$ ) is defined recursively on the structure of $C$ as follows, assuming $A, B$ are $L$ -formulas:

$M ⊨ P (t_{1}, \dots, t_{n}) [σ] ⟺ ⟨ \bar{σ} (t_{1}), \dots, \bar{σ} (t_{n}) ⟩ \in P^{M}$
- A predicate symbol $P$ applied to terms $t_{1}, \dots, t_{n}$ is satisfied if the tuple of values assigned to those terms belongs to the interpretation of $P$ in the model $M$.
$M ⊨ (s = t) [σ] ⟺ \bar{σ} (s) = \bar{σ} (t)$
- An equality is satisfied if both terms evaluate to the same value in the assignment.
$M ⊨ \neg A [σ] ⟺ M ⊭ A [σ]$
- Negation is satisfied if the formula being negated is not satisfied.
$M ⊨ (A \lor B) [σ] ⟺ M ⊨ A [σ] \text{ or } M ⊨ B [σ]$
- A disjunction (OR) is satisfied if at least one of the disjuncts is satisfied.
$M ⊨ (A \land B) [σ] ⟺ M ⊨ A [σ] \text{ and } M ⊨ B [σ]$
- A conjunction (AND) is satisfied if both conjuncts are satisfied.
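$M ⊨ (\forall x A) [σ] ⟺ M ⊨ A [σ (m / x)] \text{ for all } m \in M$
- A universal quantification is satisfied if the formula is satisfied for every possible value in the domain. Here $σ (m / x)$ denotes the assignment that agrees with $σ$ except that it maps $x$ to $m$.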
$M ⊨ (\exists x A) [σ] ⟺ M ⊨ A [σ (m / x)] \text{ for some } m \in M$
- An existential quantification is satisfied if the formula is satisfied for at least one value in the domain.
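
Over a finite universe, the satisfaction clauses can be executed directly, since the quantifier cases reduce to loops over the domain. The Python sketch below is an illustration under assumptions: formulas are tagged tuples, a structure is a triple `(universe, relations, functions)`, and the names `eval_term` and `satisfies` are hypothetical. The `"implies"` case is not among the clauses listed above; it uses the usual reading of $A \to B$ as $\neg A \lor B$.

```python
def eval_term(term, functions, sigma):
    """Value of a term: variables via sigma, f(t1, ..., tn) via the table for f."""
    if isinstance(term, str):
        return sigma[term]
    f, *args = term
    return functions[f][tuple(eval_term(a, functions, sigma) for a in args)]

def satisfies(M, phi, sigma):
    """Return True iff the structure M satisfies phi under the assignment sigma."""
    universe, relations, functions = M
    op = phi[0]
    if op == "pred":                                   # ("pred", P, t1, ..., tn)
        values = tuple(eval_term(t, functions, sigma) for t in phi[2:])
        return values in relations[phi[1]]
    if op == "eq":                                     # ("eq", s, t)
        return eval_term(phi[1], functions, sigma) == eval_term(phi[2], functions, sigma)
    if op == "not":
        return not satisfies(M, phi[1], sigma)
    if op == "or":
        return any(satisfies(M, a, sigma) for a in phi[1:])
    if op == "and":
        return all(satisfies(M, a, sigma) for a in phi[1:])
    if op == "implies":                                # read as (not A) or B
        return (not satisfies(M, phi[1], sigma)) or satisfies(M, phi[2], sigma)
    if op == "forall":                                 # try sigma(m/x) for every m
        return all(satisfies(M, phi[2], {**sigma, phi[1]: m}) for m in universe)
    if op == "exists":                                 # try sigma(m/x) for some m
        return any(satisfies(M, phi[2], {**sigma, phi[1]: m}) for m in universe)
    raise ValueError(f"unknown operator: {op!r}")

# Hypothetical structure: two skiers who both like snow.
M = ({"tony", "mike", "snow"},
     {"S": {("tony",), ("mike",)}, "L": {("tony", "snow"), ("mike", "snow")}},
     {"snow": {(): "snow"}})

every_skier_likes_snow = ("forall", "x",
    ("implies", ("pred", "S", "x"), ("pred", "L", "x", ("snow",))))
someone_likes_self = ("exists", "x", ("pred", "L", "x", "x"))

print(satisfies(M, every_skier_likes_snow, {}))   # True in this structure
print(satisfies(M, someone_likes_self, {}))       # False in this structure
```

The expression `{**sigma, phi[1]: m}` builds the updated assignment that maps the quantified variable to `m` and leaves every other variable unchanged. This brute-force approach only works because the universe is finite.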

A structure $M$ satisfies $Φ$ (a set of sentences), denoted by $M ⊨ Φ$, if $M ⊨ A$ for every sentence $A \in Φ$. If $M ⊨ Φ$, we say $M$ is a model of $Φ$. We say that $Φ$ is satisfiable if there is a structure $M$ such that $M ⊨ Φ$.

For a set of sentences $Φ$, a sentence $A$ is a logical consequence of $Φ$ (denoted by $Φ ⊨ A$) iff for every structure $M$, if $M ⊨ Φ$, then $M ⊨ A$. Equivalently, there is no $M$ such that $M ⊨ Φ \cup \{\neg A\}$, i.e., $Φ \cup \{\neg A\}$ is unsatisfiable.

An occurrence of $x$ in $A$ is bound (“quantified”) iff it lies in a sub-formula of $A$ of the form $\forall x B$ or $\exists x B$. Basically, a variable $x$ is bound if it appears within the scope of a quantifier ($\forall$ or $\exists$) that uses the same variable. The quantifier “binds” the variable, similar to how a parameter in a function definition binds the parameter name to values passed to the function. Otherwise the occurrence is free. For example, in the formula below, the $x$ in the leftmost $P (x)$ is free, but the occurrences of $x$ in the other $P (x)$ and in $Q (x)$ are bound by the existential quantifier $\exists x$:

$$P (x) \land \exists x [P (x) \lor Q (x)]$$

A formula $A$ is closed if it contains no free occurrence of a variable. A closed formula is called a sentence. If $σ$ and $σ^{'}$ agree on the free variables of $A$ , then $M ⊨ A [σ]$ iff $M ⊨ A [σ^{'}]$ . If $A$ is a sentence (no free variables), then $σ$ is irrelevant and we omit mention of $σ$ and simply write $M ⊨ A$ .
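
The set of variables with at least one free occurrence can be computed by recursion on the formula: a quantifier removes its own variable from the free variables of its body. The Python sketch below assumes formulas encoded as tagged tuples; the tags and the names `term_vars`, `free_vars`, and `is_sentence` are hypothetical.

```python
def term_vars(t):
    """Variables occurring in a term (variables are strings, f(...) is a tuple)."""
    if isinstance(t, str):
        return {t}
    return set().union(*(term_vars(a) for a in t[1:]))

def free_vars(phi):
    """Variables that have at least one free occurrence in phi."""
    op = phi[0]
    if op == "pred":                              # ("pred", P, t1, ..., tn)
        return set().union(*(term_vars(t) for t in phi[2:]))
    if op == "eq":                                # ("eq", s, t)
        return term_vars(phi[1]) | term_vars(phi[2])
    if op in ("forall", "exists"):                # the quantifier binds phi[1] in its body
        return free_vars(phi[2]) - {phi[1]}
    # "not", "and", "or", "implies": union over the sub-formulas
    return set().union(*(free_vars(a) for a in phi[1:]))

def is_sentence(phi):
    """A formula is closed (a sentence) if it has no free variables."""
    return not free_vars(phi)

# P(x) and exists x [P(x) or Q(x)]
inner = ("exists", "x", ("or", ("pred", "P", "x"), ("pred", "Q", "x")))
example = ("and", ("pred", "P", "x"), inner)

print(free_vars(example))     # {'x'}: the leftmost occurrence is free
print(is_sentence(example))   # False
print(is_sentence(inner))     # True: every occurrence of x is bound
```

Note that `free_vars` works per variable rather than per occurrence: `x` is reported as free in the example because one of its occurrences is free, even though two others are bound.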
