Do definitions have to fit axioms in logic?

Question

One thing I find confusing in propositional logic is that we have things like axioms and inference rules but then we seem to be able to define whatever we want in syntax that doesn't necessarily adhere to the axiom formats.

For example

https://en.wikipedia.org/wiki/Propositional_calculus#Example_1._Simple_axiom_system

This example system uses the modus ponens inference rule:

$P, P \to Q \vdash Q$

And the following axioms:

I. $(p \to (q \to p))$

II. $((p \to (q \to r)) \to ((p \to q) \to (p \to r)))$

III. $((\lnot p \to \lnot q) \to (q \to p))$

We have the $\lnot$ and $\to$ operators in the language, but then we define $a \land b = \lnot(a \to \lnot b)$ even though this format does not match any of the three axioms, nor have we defined equality.

Why is this permitted? What are we allowed to define? What are we even using modus ponens and the axioms for if we can just make up whatever?

see van Dalen, page 30: "As usual “$¬ϕ$” is used here as an abbreviation for “$ϕ →⊥$”." It is a convention adopted in the metalogic context. — Mauro ALLEGRANZA, Sep 05 '18 at 06:19
For formal definitions "inside" the calculus (FOL) see the post : Semantics and Logical structure in Definitions as well as the post Theory of definitions. — Mauro ALLEGRANZA, Sep 05 '18 at 06:57
@MauroALLEGRANZA: Many logic texts do not adopt that convention that you mentioned. Doing so is advantageous for treating intuitionistic logic, but from a classical logic viewpoint I find such an abbreviation to be unnatural. — user21820, Sep 05 '18 at 12:04
@user525966: By the way, feel free to come to the Logic chat-room for further inquiry. =) — user21820, Sep 05 '18 at 12:05

score 7 · Answer 1 · answered Sep 04 '18 at 23:43

You can define anything you want. However, the point of defining something is to make it easier to refer to, which means that the most useful definitions are for things that are:

(a) frequently referred to;

(b) not trivial; and often

(c) similar to something else

So, for example, we define $\wedge$ because it allows for a lot of shortcuts in writing the propositional logic, and it happens to align with the general understanding of the word "and". The "=" in the definition isn't really part of the logic, it's a part of the language surrounding it, and we know that there's a level at which we have to resort to shared understanding since you can only abstract things so far.

On the other hand, I probably wouldn't bother coming up with a definition for "the set of all even prime numbers in $\mathbb{N}$", because it's simple enough to just say $\{2\}$. Or if I did define it, it would only be for a very limited context (for example, one where I actually needed to prove that 2 is the only element in the set), so I could get away with a generic definition like $A$.

ConMan

I don't believe any of this directly addresses the question (outside of the equality symbol being used in a metalogical sense which is helpful). I mean I could define $a a a $ blorp $b +$ but nothing would tell us how this is supposed to work. – user525966 Sep 04 '18 at 23:58
@user525966 - The only thing that hasn't been mentioned, because I believe it was presumed clear, is that such definitions are just informal abbreviations. I hope it doesn't need explanation that using a shorthand for an expression doesn't require us to know about, and doesn't change anything about, whether that expression is true or not. – Malice Vidrine Sep 05 '18 at 00:38
(There are, of course, formal analogues of this kind of informal abbreviation, but we rarely care about such things.) – Malice Vidrine Sep 05 '18 at 00:44
@MaliceVidrine I understand that it's an abbreviation, but it's an abbreviation of something that hasn't been defined or outlined anywhere – user525966 Sep 05 '18 at 00:46
@user525966 Defining a new connective in terms of the existing connectives of the System tells us how it works through the axioms and rules of inference for the existing connectives. You use the System to prove useful theorems about the new connective (albeit these proofs may not be obvious). – Graham Kemp Sep 05 '18 at 00:50
@user525966 - In what sense? You already know that "$\neg(a\to\neg b)$" is a well formed string. "$a\wedge b$" is nothing more than a shorthand for that string of characters. I can use the abbreviation "KON" for the phrase "The King of Narnia" even though there's no such thing, and even if I've never heard of Narnia. If this much puzzles you, then you have a question about linguistics, not math. – Malice Vidrine Sep 05 '18 at 00:51
Where my confusion lies is that I would assume $\lnot(a \to \lnot b)$ would need to fit the format of one of our axioms or inference rules in order for it to be valid. – user525966 Sep 05 '18 at 00:52
Why would it? You can abbreviate a false statement as much as a true statement. – Malice Vidrine Sep 05 '18 at 00:52
To be clear, I understand 100% completely that we're using $a \land b$ as a definition / relabeling of $\lnot(a \to \lnot b)$ and we're saying "whenever we're using $a \land b$ it's the same as saying $\lnot(a \to \lnot b)$". My issue is not with that. My issue is that we'd then need to ask whether $\lnot(a \to \lnot b)$ is valid / makes sense, but then it doesn't seem to fit into any of our axiom definitions. i.e. how are we to understand or make sense of what $\lnot(a \to \lnot b)$ is? – user525966 Sep 05 '18 at 00:54
What is this "valid/makes sense" if it is not "is a well-formed formula of the language"? Why would an abbreviation need this extra condition to hold? – Malice Vidrine Sep 05 '18 at 00:56
I'm not talking about some extra condition, I'm asking why we're permitting something like $\lnot(a \to \lnot b)$ in the first place when it doesn't match any of our axioms or inference rules. – user525966 Sep 05 '18 at 00:58
@GrahamKemp Right, it defines something new in terms of something we already have axioms/inference rules for, yes? But then where do those come into play for something like $\lnot(a \to \lnot b)$? – user525966 Sep 05 '18 at 00:59
The axioms don't tell you what the well formed formulas are. They tell you which ones are true. The recursive definition of a well formed formula comes prior to even stating the axioms. – Malice Vidrine Sep 05 '18 at 00:59
Because the logical "and" concept is natural to work with, having a symbol for it is quite useful. So we look for a way to define it using the existing symbols of the language: $\to,\neg$. When $a$ and $b$ are both true, then $a$ does not imply not $b$. When $a$ does not materially imply not $b$, then $a$ and $b$ are both true. So it is sensible that $a\wedge b$ is equivalent to $\neg(a\to\neg b)$. – Graham Kemp Sep 05 '18 at 01:01
But even if $\lnot(a \to \lnot b)$ is a wff, how do we know if it's true or not, if all we know are that the axioms are true? And what aspect tells us the axioms must be true? What if we were working with a three-valued logic with arbitrary symbols instead of true/false/other? What determines what "value" is assigned to the axioms? – user525966 Sep 05 '18 at 01:02
Then we look for useful theorems: $$a\to(b\to \neg(a\to\neg b))\\ \neg(a\to\neg b)\to a\\ \neg(a\to\neg b)\to b$$ And although the proofs of these are not obvious, they do exist. $$a\to(b\to (a\land b))\\ (a\land b)\to a\\ (a\land b)\to b$$ – Graham Kemp Sep 05 '18 at 01:02
@GrahamKemp That isn't what I'm asking... I'm honestly getting a little frustrated in this comment chain. Again I understand completely, fully, 100%, without question, that $a \land b$ is a shorthand for $\lnot(a \to \lnot b)$. My question is not about that. – user525966 Sep 05 '18 at 01:02
We don't necessarily know if $\neg(a\to\neg b)$ is true or not! What does that have to do with abbreviating something? – Malice Vidrine Sep 05 '18 at 01:03
@user525966 - This is frustrating for us as well, because if you understand it's an abbreviation, and you understand the thing being abbreviated is a well formed formula, there is literally nothing else to understand. And you have yet to give a clear account of what it is you think has some bearing on whether an abbreviation is acceptable. – Malice Vidrine Sep 05 '18 at 01:10
If we can recursively define all wffs then why even have axioms / how do we know what we're assigning to any given wff? How are we using modus ponens in any of this? "true" and "false" make sense intuitively but I'm pretending I am an alien from another planet here, and instead of true/false maybe we have "bloop" and "blarp" instead. How are we supposed to know what these connectives actually do or how they behave based on the variables? What says what we're limited to assigning them and how? – user525966 Sep 05 '18 at 01:24
Because axioms aren't about telling us what formulas are well formed. That has never been their purpose. They tell us, combined with rules of inference, what we can prove (i.e. what will always be true if our logic is sound). This has nothing to do with defining things. This is also different than "how do we know what we're assigning to any given wff?", which also has nothing to do with defining things. How does modus ponens figure into defining things? Not at all. It figures into proving things. Your questions seem to be, instead, a great many questions about the semantics of logic. – Malice Vidrine Sep 05 '18 at 01:34
And the idea of introducing an alien into the story is just missing the point. We made decisions about what the semantics of logic should look like based on several ideas about what discourse involving true and false things should look like. There is no understanding it without drawing on some sense as a real human being. – Malice Vidrine Sep 05 '18 at 01:38
But aren't the axioms part of the recursively-defined wffs? Are axioms just a set of tautologies? e.g. $(p \to (q \to p)) = T$? – user525966 Sep 05 '18 at 02:06
Yes, the axioms are WFFs, but they're just a subset of the WFFs. They're a set chosen in this case because not only are they tautologies (in the sense of true under any assignment of truth values to their atomic sentences, i.e. valid in the intended semantics), but any other tautology in that sense can be proven from them using the modus ponens rule. – Malice Vidrine Sep 05 '18 at 02:40
More, they are well formed statements that are accepted to be tautologies from the understanding of what $\to,\neg$ should mean for a particular logic. – Graham Kemp Sep 05 '18 at 02:59
@MaliceVidrine How do we know modus ponens and those axioms can cover any/all possible tautologies? – user525966 Sep 05 '18 at 03:33
@GrahamKemp Not sure I understand the link. What do you mean by "assigning it meaning"? Do you mean making truth tables? Are truth tables the only way for us to give the connectives semantic meaning? – user525966 Sep 05 '18 at 03:35
@user525966 - Because we tried to arrange for this to be the case, and then employed some metamathematics to confirm that it is the case. It's not something we'd know a priori. – Malice Vidrine Sep 05 '18 at 03:46
@MaliceVidrine Are truth tables the only mechanism we have for describing how we wish to assign equivalences between wffs and T/F? E.g. $T \land T = T$. Even after the axioms and modus ponens and the recursive wffs I don't see where the truth values are derivable. Or are the truth tables derivable from the axioms? – user525966 Sep 05 '18 at 03:58
@user525966: Truth-values are completely separate from the well-formed formulae. Please see my answer first to understand exactly what wffs are, and then you will see that by themselves they are just meaningless strings. They become meaningful only after you interpret them, and one way of doing so is that, given any assignment of truth-values to the propositional variables, you can recursively assign a truth-value to every wff according to the truth tables. – user21820 Sep 05 '18 at 05:37
[cont] This is not the only possible way of imbuing meaning. Given a set $X$, any assignment $i$ that maps each propositional variable to a subset of $X$ can be extended recursively to an assignment that maps each wff to a subset of $X$ according to the following: $i(¬A) = X∖i(A)$; $i(A∧B) = i(A)∩i(B)$; $i(A∨B) = i(A)∪i(B)$; $i(A⇒B) = i(¬A∨B)$. And then tautologies are the wffs that always get mapped to $X$ by the extended assignment regardless of the given $X$ and $i$. Truth-tables are not derivable, but they capture our logical intuition and motivation for logic. – user21820 Sep 05 '18 at 05:46

score 6 · Accepted Answer · answered Sep 05 '18 at 04:52

Your question arises from the failure of many texts to properly distinguish between the meta-system and the actual formal system under study. At all times you are doing mathematics in the meta-system, and in the field of mathematical logic you are studying some formal system (such as the one you have here, with some syntactic rules for forming well-formed formulae (wffs), one deductive rule, and three axioms). So let us express them precisely, and you will see. $ \def\quote#1{{``}#1{"}} \def\meta#1{\mathbin{\dot#1}} $

Syntactic rules

Note that wffs are strings. Given any two strings $x,y$ we shall use "$x+y$" to denote the concatenation of $x$ followed by $y$. We shall also use quotes to specify literal strings. For example, you are a person but "you" is a string.

Closure under negation: Given any wff $A$, the string $\quote\neg+A$ is also a wff.

Closure under implication: Given any wffs $A,B$, the string $\quote(+A+\quote\to+B+\quote)$ is also a wff.

Note how I used quote-marks above. It would be technically incorrect to write:

... the string $(A \to B)$ is also a wff. (technically incorrect)

Because the "$\to$" and the brackets are symbols in the formal system under study, not symbols in the meta-system we are using!
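
The two closure rules can be sketched in Python, where ordinary string concatenation plays the role of the meta-level "+". This is only an illustration: the function names `neg` and `imp` are made up here, and the base case (that single letters such as "p" count as wffs) is an assumption, since it is not spelled out above.

```python
# Symbols of the formal system live inside Python string literals;
# Python's + on strings stands in for the meta-level concatenation.

def neg(a: str) -> str:
    """Closure under negation: the string "¬" + A."""
    return "¬" + a

def imp(a: str, b: str) -> str:
    """Closure under implication: the string "(" + A + "→" + B + ")"."""
    return "(" + a + "→" + b + ")"

# Assumed base case: single letters are propositional variables, hence wffs.
p, q = "p", "q"

axiom_1 = imp(p, imp(q, p))
print(axiom_1)                # (p→(q→p))
print(neg(imp(p, neg(q))))    # ¬(p→¬q)
```

Here `neg` and `imp` are meta-level operations (Python functions), while "¬", "→" and the brackets only ever occur inside quoted strings, which mirrors the quoting discipline described above.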

Deductive rules

The system under study has only one deductive rule:

Given any wffs $P,Q$, if you have deduced $P$ and $\quote(+P+\quote\to+Q+\quote)$, then you can deduce $Q$.

Again, note how I used quote-marks.
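
To see what the deductive rule and the axioms are for, here is a rough Python sketch that treats deduction as pure string manipulation and derives "(p→p)" from axioms I and II of the question. The helper names (`imp`, `axiom_1`, `axiom_2`, `modus_ponens`) are hypothetical, and the check in `modus_ponens` assumes both inputs are already wffs, so that unique parsing makes the split unambiguous.

```python
def imp(a: str, b: str) -> str:
    return "(" + a + "→" + b + ")"

def axiom_1(p: str, q: str) -> str:
    """Instance of axiom I: (p→(q→p))."""
    return imp(p, imp(q, p))

def axiom_2(p: str, q: str, r: str) -> str:
    """Instance of axiom II: ((p→(q→r))→((p→q)→(p→r)))."""
    return imp(imp(p, imp(q, r)), imp(imp(p, q), imp(p, r)))

def modus_ponens(p: str, implication: str) -> str:
    """From P and "(" + P + "→" + Q + ")", return Q (inputs assumed to be wffs)."""
    prefix = "(" + p + "→"
    if not (implication.startswith(prefix) and implication.endswith(")")):
        raise ValueError("second premise is not of the form (P→Q)")
    return implication[len(prefix):-1]

pp = imp("p", "p")
s1 = axiom_1("p", pp)        # (p→((p→p)→p))
s2 = axiom_2("p", pp, "p")   # ((p→((p→p)→p))→((p→(p→p))→(p→p)))
s3 = modus_ponens(s1, s2)    # ((p→(p→p))→(p→p))
s4 = axiom_1("p", "p")       # (p→(p→p))
s5 = modus_ponens(s4, s3)    # (p→p)
print(s5)
```

Nothing in this derivation refers to truth values: the axioms and the rule only determine which strings can be deduced, which is a separate matter from which strings are wffs.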

Abbreviative definitions

Now we come to the so-called 'definition' of "$\land$":

Take any strings $A,B$. The string $\quote(+A+\quote\land+B+\quote)$ is not a wff in the formal system under study, simply because "$\land$" is not a symbol in its language. However, we wish to use that string to stand for $\quote{\neg(}+A+\quote{\to\neg}+B+\quote)$.

This wish is not trivial to fulfill rigorously. The easiest way to do it correctly is to add a syntactic rule for closure of wffs under $\quote\land$:

Closure under conjunction: Given any wffs $A,B$, the string $\quote(+A+\quote\land+B+\quote)$ is also a wff.

and then check that you can still uniquely parse (interpret) a wff, so that it makes sense to stipulate that $\quote(+A+\quote\land+B+\quote)$ is rewritten as $\quote{\neg(}+A+\quote{\to\neg}+B+\quote)$ before parsing, to obtain our wish.

As you observed, such a rewrite-rule is not an axiom.
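
As an illustration of "rewrite before parsing", the following Python sketch reads a string in the extended syntax and returns the string it stands for in the original language. It is a sketch under assumptions: variables are taken to be single lowercase ASCII letters, and the names `parse` and `expand` are invented for this example.

```python
def parse(s: str, i: int = 0):
    """Read one wff of the extended syntax starting at s[i].

    Returns (string in the original language, index just past the wff).
    """
    if i >= len(s):
        raise ValueError("unexpected end of input")
    c = s[i]
    if "a" <= c <= "z":                      # assumed: one-letter variables
        return c, i + 1
    if c == "¬":
        a, j = parse(s, i + 1)
        return "¬" + a, j
    if c == "(":
        a, j = parse(s, i + 1)
        if j >= len(s) or s[j] not in "→∧":
            raise ValueError(f"expected a connective at position {j}")
        op = s[j]
        b, k = parse(s, j + 1)
        if k >= len(s) or s[k] != ")":
            raise ValueError(f"expected ')' at position {k}")
        if op == "→":
            return "(" + a + "→" + b + ")", k + 1
        # Rewrite rule: "(" + A + "∧" + B + ")" stands for "¬(" + A + "→¬" + B + ")".
        return "¬(" + a + "→¬" + b + ")", k + 1
    raise ValueError(f"unexpected symbol {c!r} at position {i}")

def expand(s: str) -> str:
    core, end = parse(s)
    if end != len(s):
        raise ValueError("trailing symbols after the wff")
    return core

print(expand("(p∧q)"))        # ¬(p→¬q)
print(expand("((p∧q)→p)"))    # (¬(p→¬q)→p)
```

Because every compound wff announces itself by its first symbol, the parser never has to guess, which is the unique-parsing property the rewrite rule relies on.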

What is that 'equality'?

Note that I did not say that $\quote(+A+\quote\land+B+\quote)$ is the same string as $\quote{\neg(}+A+\quote{\to\neg}+B+\quote)$, because it is of course false. We are only using a rewrite-rule; the strings themselves are not equal.

You are equally free to 'define' any other notation in the same fashion, using rewrite-rules, and you would have to deal with the same issue of unique parsing. This happens in mathematics itself as well. When you define a new notation it is important that there is still only one way to read things.

So while it is technically wrong to state this rewrite-rule as an equality, it is intuitively 'equal' in the sense of being logically equivalent, since the final parsing is the same.

I hope that this addresses your inquiry. If everything is clear, you can continue reading. There is a different way to go about logic that would actually make what is technically wrong above correct, but it may be confusing unless you fully understand the more concrete way above.

Meta-operators

First let us see how we can abstract out the wff formation:

Given any string $A$, define $\meta\neg A = \quote\neg+A$.

Given any strings $A,B$, define $A \meta\to B = \quote(+A+\quote\to+B+\quote)$.

Note that unlike the strings $\quote\neg$ and $\quote\to$, $\meta\neg$ and $\meta\to$ are operations on strings (in the meta-system). So we can in fact do the following:

Given any strings $A,B$, define $A \meta\land B = \meta\neg( A \meta\to (\meta\neg B) )$.

Note that the brackets here are in the meta-system, used so that we know which string operation to perform first. If we use the typical precedence rules, namely that $\meta\neg$ is higher precedence than $\meta\to$, then we could have done the following:

Given any strings $A,B$, define $A \meta\land B = \meta\neg( A \meta\to \meta\neg B )$.

A more abstract way to conceptualize this is that $\meta\to$ and $\meta\neg$ are actually operations on parse trees rather than strings, and so the above definition of $\meta\land$ is just a definition of a new operation on parse trees in terms of previously defined ones.

The question that may arise at this point is: Why don't we do it this way and not use strings at all? The simple answer is that the only way to completely formalize a formal system is to be able to encode it into some linear representation such as strings, so you are still going to have to decide on how exactly to encode wffs as strings. Similarly when you use logic on paper. Hence the concrete first approach is ultimately the practical way.
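
The parse-tree view can be sketched in Python as follows. The encoding of trees as nested tuples and the names `t_neg`, `t_to`, `t_and` and `render` are assumptions made for illustration only; `render` is one possible choice of linear encoding back into strings.

```python
# Parse trees as nested tuples; a bare string is a propositional variable.

def t_neg(a):
    return ("¬", a)

def t_to(a, b):
    return ("→", a, b)

def t_and(a, b):
    # A new tree operation defined purely from the earlier ones.
    return t_neg(t_to(a, t_neg(b)))

def render(t) -> str:
    """Encode a parse tree as a string of the formal system."""
    if isinstance(t, str):
        return t
    if t[0] == "¬":
        return "¬" + render(t[1])
    return "(" + render(t[1]) + "→" + render(t[2]) + ")"

tree = t_and("p", "q")
print(tree)            # ('¬', ('→', 'p', ('¬', 'q')))
print(render(tree))    # ¬(p→¬q)
```

No "∧" node exists anywhere in the tree: `t_and` is an operation in the meta-system, and the string encoding is only fixed at the point where `render` is chosen.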

score 4 · Answer 3 · answered Sep 05 '18 at 01:43

I'm going to guess that you're conflating two different notions, namely "well-formed" and "logically valid". (My guess is admittedly based on just one little piece of one of your comments, namely "valid / makes sense".)

Of those two notions, only "well-formed" is relevant to definitions. You can define new symbols to abbreviate any well-formed formula, for example $\neg(a\to\neg b)$. The well-formed formulas are the ones that "make sense", i.e., have a truth value once you specify truth values for the variables in them. For example (check this with a truth table if you haven't already done so), $\neg(a\to\neg b)$ is true if both $a$ and $b$ are true, but $\neg(a\to\neg b)$ is false in all other circumstances.

Of the two notions, only "logically valid" is governed by the axioms and inference rules. The axioms are certain, selected, logically valid formulas, and the inference rules enable us to produce additional logically valid formulas from the axioms. We'll never produce $\neg(a\to\neg b)$ that way, because it's not logically valid. As indicated above, it's false sometimes (whenever at least one of $a$ and $b$ is false).

So $\neg(a\to\neg b)$ is not valid, but it is well-formed. In other words, it's not always true, but it always makes sense, it always has a truth value (when $a$ and $b$ have truth values). And the latter is what's needed for the defined expression $a\land b$ to make sense.
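
The truth-table check suggested above can be done mechanically. This is a small Python sketch using the usual two-valued reading of $\to$ and $\neg$; the function names are hypothetical.

```python
from itertools import product

def implies(x: bool, y: bool) -> bool:
    """Material implication under the usual two-valued semantics."""
    return (not x) or y

def conj_via_imp(a: bool, b: bool) -> bool:
    """Truth value of ¬(a→¬b)."""
    return not implies(a, not b)

rows = [(a, b, conj_via_imp(a, b)) for a, b in product([True, False], repeat=2)]
for row in rows:
    print(row)

# True in exactly one row, so the formula is well-formed but not logically valid.
print(all(value for _, _, value in rows))   # False
```

The only row with value `True` is the one where both `a` and `b` are `True`, which matches the intended meaning of $a\land b$.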

"Of those two notions, only "well-formed" is relevant to definitions." So, would you assert that if we defined (a$\lor$b), where $\lor$ gets understood as logical disjunction, as ¬(a→¬b) that would make sense as a definition? At the very least, that's different from conventional definitions, since conventional definitions usually preserve truth if you replace one side with another when one side appears as a subformula in some formula. — Doug Spoonwood, Sep 05 '18 at 03:20
@DougSpoonwood If we made that definition, we'd be introducing the symbol $\lor$ to mean "and". Not a good idea, since people are accustomed to a different definition and are likely to be confused. But, in and of itself, it's a legitimate definition. (Note that people have used $\sim$ to mean negation and other people have used the same symbol for biconditional. Either definition by itself is legitimate.) — Andreas Blass, Sep 05 '18 at 03:26
@DougSpoonwood I should also say something about "where $\lor$ gets understood as logical disjunction" in your comment. If we defined $\lor$ in the way you described, then to understand $\lor$ as logical disjunction would be to misunderstand it. — Andreas Blass, Sep 05 '18 at 03:29

score 0 · Answer 4 · answered Sep 05 '18 at 03:15

The key issue here is soundness.

The purpose of definitions in propositional calculus lies in converting notions not using the primitive connectives into well-formed formulas using only the primitive connectives, and doing the converse. In other words, definitions exist to translate between connectives.

There's an alternative way of expressing definitions by having a propositional calculus with functorial variables instead of a propositional calculus without functorial variables. Basically, it turns out that definitions which define connectives convert into tautologies with functorial variables of the form (in Polish notation)

C $\delta$x $\delta$y

where x is one side of the definition and y is the other side. It also turns out that if C $\delta$x $\delta$y, then Exy, and if Exy, then C $\delta$x $\delta$y. Correspondingly, every definition has the property that one side is logically equivalent to the other. Thus, for any definition of a connective, if the defined formula is written in Polish notation, the connective should appear as its first symbol and appear only once in it. If some other formula is logically equivalent to it, then one can reasonably define that connective by that other formula.

For example, one common definition (again in Polish notation) is:

Apq := CNpq

which defines logical disjunction in terms of implication and negation. But, since

E Apq CCpqq

is also a tautology, and A appears as the first symbol in 'Apq' and only appears once in 'Apq', one could use 'CCpqq' to define 'A' instead of using 'CNpq'.
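
Both equivalences can be checked by brute force over truth assignments. The Python sketch below evaluates Polish-notation strings directly; it assumes the usual two-valued semantics with N for negation, C for implication, A for disjunction and E for the biconditional, and the names `evaluate` and `is_tautology` are hypothetical.

```python
from itertools import product

BINARY = {
    "C": lambda x, y: (not x) or y,   # implication
    "A": lambda x, y: x or y,         # disjunction
    "E": lambda x, y: x == y,         # biconditional
}

def evaluate(formula: str, env: dict) -> bool:
    """Evaluate a Polish-notation formula under the assignment env."""
    def go(i: int):
        c = formula[i]
        if c == "N":
            v, j = go(i + 1)
            return (not v), j
        if c in BINARY:
            x, j = go(i + 1)
            y, k = go(j)
            return BINARY[c](x, y), k
        return env[c], i + 1          # anything else is read as a variable
    value, end = go(0)
    if end != len(formula):
        raise ValueError("trailing symbols after the formula")
    return value

def is_tautology(formula: str, variables: str = "pq") -> bool:
    return all(
        evaluate(formula, dict(zip(variables, values)))
        for values in product([True, False], repeat=len(variables))
    )

print(is_tautology("EApqCNpq"))    # True
print(is_tautology("EApqCCpqq"))   # True
print(is_tautology("EApqCpq"))     # False
```

The last line shows that not every formula will do: 'Cpq' is not equivalent to 'Apq', so it could not serve as a definition of A in this sense.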

To go over your questions one by one:

"Why is this permitted?"

Because any time an instance of the formula on one side of a definition (once parentheses are restored) appears within a well-formed formula W, it can be replaced by the formula on the other side (once parentheses are restored) without the resulting formula W' changing from true to false, or from false to true. In short, definitional replacement preserves truth (this property is immediately evident for a formula like C $\delta$x $\delta$y once you understand how substitution for $\delta$ works). Thus, it doesn't result in an invalidity. So, if the axioms are sound, definitional replacement preserves soundness.

"What are we allowed to define?"

Any connective can get defined in terms of formulas only having the primitive connectives of the system. This gets done to ensure that the system is adequate.

"What are we even using modus ponens and the axioms for if we can just make up whatever?"

Because only a subset or subclass of "whatever" will qualify as logically sound. Modus ponens is sound. So are the axioms. The definitions also either are sound, or work out as consistent with soundness. So again, the key issue here is soundness.