3

Create a set $S$ by adding each point of $\mathbb{R}^2$ with 50% probability (independently).

What is the probability that $S$ is connected? (and is this even a valid thing to ask?)

  • What do you mean by adding each point with $1/2$ probability? – rubikscube09 Sep 21 '21 at 17:14
  • Create a bunch of IIRDV Bernoulli random variables, one for each point. Let S be the set of points where the corresponding random variable is equal to 1. Is this well-defined? – Thomas Browning Sep 21 '21 at 17:17
  • 1
    I don't think you can really define this with an uncountable number of variables. If you can, though, you should be able to apply some 0-1 law and conclude that the probability is 0. – Trebor Sep 21 '21 at 17:19
  • 4
    There are enough random choices here that things are difficult to formalize. I think the language of measure, rather than probability, makes this clearer: we have the set $\mathcal{P}(\mathbb{R}^2)$ of all sets in the plane and we want to know the "measure" in some appropriate sense of the subset $C$ consisting of all connected sets. Both $C$ and $\mathcal{P}(\mathbb{R}^2)$ are perfectly nicely defined; the issue is that $\mathcal{P}(\mathbb{R}^2)$ is so big that we don't have an obvious measure on it which is appropriate to the problem. – Noah Schweber Sep 21 '21 at 17:23
  • Weird. So apparently you cannot construct uncountably many independent random variables. And I agree that there isn't an obvious measure to work with here. – Thomas Browning Sep 21 '21 at 17:29
  • I am pretty sure you absolutely can define the product probability measure on $2^{\mathbb{R}^2}$ as you wanted. See e.g. here. My guess is that the collection of connected subset will not be measurable. – Cronus Sep 21 '21 at 17:52
  • 3
    If we consider the hyperspace of compact sets $K(\Bbb R^2)$ with the Vietoris topology then the connected sets are a closed nowhere dense subspace, so at least topologically most compact sets are disconnected. This suggests that for reasonable ways to make the question precise the connected sets should have measure 0 – Alessandro Codenotti Sep 21 '21 at 18:09
  • 1
    @AlessandroCodenotti Interesting. However, my mental picture of a random subset of $\mathbb{R}^2$ is a dense set of points, which is very far from being compact, and seems like it should be connected. – Thomas Browning Sep 21 '21 at 18:53
  • I was going to mention what @AlessandroCodenotti mentioned about the topological category of connected sets. As an addition, however, category and measure are generally orthogonal notions (see https://mathoverflow.net/a/43502). So the preference of topological category is somewhat subjective (though it is my preference). – C. Caruvana Sep 21 '21 at 20:21
  • 1
    @ThomasBrowning I like the "picture a random subset" approach. Unfortunately in my head it's completely unclear whether the random dense subset of $\mathbb R^2$ is connected or totally disconnected – jackson Sep 22 '21 at 21:19

3 Answers3

7

To really answer this question, we have to go back to the Kolmogorov axioms.

The Kolmogorov axioms state that a probability space is a triple $(A, E, P)$ such that

  • $A$ is a set
  • $E$ is a collection of subsets of $A$ - that is, $E \subseteq \mathcal{P}(A)$
  • $P$ is a function $P : E \to [0, 1]$

such that the following rules hold:

$\sigma$-algebra axioms:

  • $A \in E$
  • If $B, C \in E$ then $B \setminus C \in E$
  • If $B_i$ is in $E$ for all $i \in \mathbb{N}$ then $\bigcup\limits_{i \in \mathbb{N}} B_i \in E$.

Measure axioms:

  • $P(\emptyset) = 0$
  • If $B_i$ is in $E$ for all $i \in \mathbb{N}$, and if $B_i \cap B_j = \emptyset$ for all $i \neq j$, then $P(\bigcup\limits_{i \in \mathbb{N}} B_i) = \sum\limits_{i \in \mathbb{N}} P(B_i)$

Probability axiom:

  • $P(A) = 1$

$A$ is known as "the sample space". $E$ is known as "the set of events", and elements of $E$ are called "events". And $P$ is the probability function.

Let's say we want to discuss the outcome of a Bernoulli experiment with $n$ independent identically distributed indicator variables $X_1, ..., X_n$, each of which takes value $1$ with probability $p$ and value $0$ with probability $1 - p$.

The obvious probability space for this discussion would be $A = \{0, 1\}^n$, the set of all $n$-tuples of either $0$ or $1$. We would take $E = \mathcal{P}(A)$ and define $P(S) = \sum\limits_{s \in S} \prod\limits_{i = 1}^n (1 - p) + (2p - 1) s_i$. The $(1 - p) + (2p - 1) s_i$ term looks a bit weird, but it's just designed to be $p$ when $s_i = 1$ and $1 - p$ when $s_i = 0$.

It's easy to verify that this is in fact a probability distribution, that the random variable $X_i(s) = s_i$ takes value $0$ with probability $1 - p$ and value $1$ with probability $p$, and that the $X_i$ are mutually independent.

What if we want to do infinitely many variables? It turns out that this is still possible. I won't go into the exact details of how it's done, but we can come up with probability space built from the sample space $\{0, 1\}^S$, where $S$ is some possibly infinite set, using something called the "Borel $\sigma$-algebra" as our event space. Basically, we only allow events that can be "built up" from the basic events of $X_i = 0$ and $X_i = 1$ using the processes of countable union and complementation. We can then define the probability measure $P$ using Caratheodory's Criterion and an outer measure. This is all rather technical and would require a good course in measure theory to introduce, but it can be done perfectly well.

So it's perfectly valid to take $|\mathbb{R}^2|$ different random variables and form a probability distribution out of them.

The problem here is that you would need to prove that $\{s \in \{0, 1\}^{\mathbb{R}^2} \mid \{x \in \mathbb{R}^2 \mid s_x = 1\}$ is connected$\}$ is actually part of the $\sigma$-algebra of events. Only events can have their probability taken.

I strongly suspect (but do not yet have a proof) that it will turn out this is not a measurable set. Therefore, we will be unable to ask the question of its probability.

Edit: if we're using the Borel $\sigma$-algebra, then I do in fact have a proof.

Theorem: Let $s$ be a set in the Borel $\sigma$-algebra on $\{0, 1\}^B$ where $B$ is a set. There must be some countable set $V \subseteq B$ such that for all $y \in S$, for all $z \in \{0, 1\}^B$, if for all $v \in V$, $y_v = z_v$, then $z \in S$.

Proof: we proceed by induction on the definition of the Borel $\sigma$-algebra.

Base case 1: $\{0, 1\}^B$. This one is immediate - simply take $V = \emptyset$.

Base case 2: $\{x \in \{0, 1\}^B \mid x_b = q\}$. This one is also immediate: take $V = \{b\}$.

Inductive step 1: Suppose $C, D$ are Borel sets satisfying the property. Pick $V_C$ and $V_D$ respectively. Then $V_C \cup V_D$ is countable and works for $C \setminus D$.

Inductive step 2: Suppose $C_i$ is a Borel set satisfying the property for all $1 \leq i \leq n$. Then for each $i$, take $V_{C_i}$ which works for $C_i$. Then $\bigcup\limits_{i \in \mathbb{N}} V_{C_i}$ is countable and works for $\bigcup\limits_{i \in \mathbb{N}} C_i$.

So the proof is complete. Now consider that there is no such $V$ which works for the set of connected sets.

Mark Saving
  • 33,541
3

I'll take a bit of a different approach to Mark's answer, using group invariance on the measure space. I think it's obvious that the probability of producing a connected set should be $0$, so I'll argue that there is at least one way of making $\mathcal{P}(\mathbb R^2)$ a probability measure space such that every point independently has an equal probability of being included. Unfortunately this approach doesn't clarify whether each point's probability is 50%.

Consider the group $\mathop{Sym}(\mathbb{R}^2)$ of self-bijections $\mathbb{R}^2 \to \mathbb R^2$. We can phrase the requirement that each point's inclusion is equally likely and independent of all other points in terms of the action of $\mathop{Sym}(\mathbb{R}^2)$ on $\mathcal{P}(\mathbb{R}^2)$ and $\mathcal{P}(\mathcal{P}(\mathbb{R}^2))$. That is, if $\phi$ is a bijection on $\mathbb{R}^2$, then $\phi$ acts on $\mathcal{P}(\mathbb{R}^2)$ via $\phi(A) = \{\phi(x) \mid x \in A\}$. Furthermore, given any $\mathfrak{A} \subset \mathcal{P}(\mathbb{R}^2)$, we can let $\phi(\mathfrak{A}) = \{\phi(A) \mid A \in \mathfrak{A}\}$.

What does it mean for each point to have equal and independent probability of inclusion? It means that if we have a probability measure space $(\mathcal{P}(\mathbb{R}^2), \Sigma, \mu)$, then for all $\mathfrak{A} \in \Sigma$, and any bijection $\phi \in \mathop{Sym}(\mathbb{R}^2)$, we have $\phi(\mathfrak{A}) \in \Sigma$, and $\mu(\phi(\mathfrak{A})) = \mu(\mathfrak{A})$. So the question is, if $\mathfrak{C} = \{A \subset \mathbb{R}^2 \mid A \text{ is connected}\}$, is there a $\mathop{Sym}(\mathbb R^2)$-invariant probability measure on $\mathcal P (\mathbb R^2)$ such that $\mathfrak{C}$ is measurable? If so, what is $\mu(\mathfrak C)$?

For any cardinality $\kappa < 2^{2^{\aleph_0}}$, the co-$\kappa$ probability measure on $\mathcal{P}(\mathbb{R}^2)$ is $\mathop{Sym}(\mathbb{R}^2)$-invariant (in fact it's $\mathop{Sym}(\mathcal P(\mathbb R^2))$-invariant), so such measures certainly exist.

And there are as many disconnected as connected subsets of $\mathbb{R}^2$. That is, $$ |\mathfrak{C}| = |\mathcal{P}(\mathbb{R}^2) \setminus \mathfrak{C}| = |\mathcal P(\mathbb R)| = 2^{2^{\aleph_0}}. $$ Indeed, let $A \subset \mathbb{R}$. For all such $A$, we can construct a distinct connected $C_A \subset \mathbb{R}^2$ and disconnected $D_A \subset \mathbb R^2$. Given $A \subset \mathbb{R}$, let $A'$ be the subset of $\mathbb R$ where all nonnegative elements are shifted up by $1$: $$ A' = (A \cap (-\infty, 0)) \cup ((A \cap [0, \infty)) + 1). $$ The $A \mapsto A'$ is an injective map $\mathcal P(\mathbb R) \to \mathcal P(\mathbb R)$. Now let $E_A = \{(x, y) \mid x \in A'\}$. Then $C_A := E_A \cup \{(\frac{1}{2}, \frac{1}{2}), (\frac{1}{2}, -\frac{1}{2})\}$ is disconnected, and $D_A := E_A \cup \{(x, y) \mid y=0\}$ is connected.

So $\mathfrak{C}$ is not co-$\kappa$ for any $\kappa < 2^{2^{\aleph_0}}$. In particular, the co-$2^{\aleph_0}$ probability measure is $\mathop{Sym}(\mathbb{R}^2)$-invariant and finds that $\mu(\mathfrak{C}) = 0$*.

Unfortunately, I doubt if if there are any $\mathop{Sym}(\mathbb{R}^2)$-invariant probability measures on $\mathcal{P}(\mathbb{R}^2)$ besides co-$\kappa$ measures. Besides, $\mathop{Sym}(\mathbb R^2)$-invariance is ridiculously strong requirement—for instance, I believe the Lebesgue completion of a $\mathop{Sym}(X)$-invariant measure on $\mathcal P (X)$ always has $\mathcal P(\mathcal P(X))$ as its $\sigma$-algebra. So I am very sympathetic to the answer "$\mathfrak C$ should not be measurable." But there is at least one way it can be measurable, and in that way its measure is $0$.

Edit: I believe I missed a detail about the co-$\kappa$ measure. I was assuming that for $A \subset \mathcal P (\mathbb R^2)$, if $A$ is not co-$\kappa$, then $\mu(A) = 0$. But this is not a measure at all, as any partition of $\mathcal P (\mathbb R^2)$ into $2$ disjoint and equal-cardinality subsets violates the disjoint-sum axiom of a measure. To fix this, we can only consider sets that are measurable in the co-$\kappa$ topology's Borel sigma algebra. Then $\mathfrak C$ is unmeasurable, and $\mu(\mathfrak C)$ is undefined.

jackson
  • 1,017
  • I find this answer very interesting, but I have some trouble following the details. What is the co-$\kappa$ probability measure? (I couldn't find anything about it in my measure theory textbooks, and on the internet I only got completely unrelated search results.) And can you clarify why you think that "the point inclusions are independent and equally likely" is equivalent to "$\mu$ is permutation-invariant"? (I think that examples such as this one show that dependent random variables can also be permutation-invariant?) – Josse van Dobben de Bruyn Dec 16 '21 at 19:55
  • I'm not certain if the phrase "co-$\kappa$" is standard but it's modeled off of "cofinite," as in cofinite topology and cofinite measure. The specific measure I'm talking about is one that finds $\mu(A) = 1$ if $A$ is co-$\kappa$ in $X$, meaning $|X \setminus A| \le \kappa$, $\mu(A) = 0$ if $|A| \le \kappa$, and $A$ is unmeasurable otherwise. Looking at the definition here (I would call this cocountable rather than cofinite) https://math.stackexchange.com/questions/1464801/cofinite-sigma-algebra-not-a-measure-space it seems possible I'm using it wrong and my conclusion doesn't hold at all... – jackson Dec 16 '21 at 21:27
  • 1
  • I may have been careless in my phrasing—I don't think I would say equivalent. But I do feel "independent + equally likely" implies that we consider points in $\mathbb R^2$ undifferentiated, and I feel an object has undifferentiated parts only if the object is equivalent after a group permutes those parts. If each point is independent, then the group in question should not respect any structure of $\mathbb R^2$ except the set-theoretic structure. In particular, if $\phi$ is any bijection on $\mathbb R^2$, then $\mu(A)$ should equal $\mu(\phi(A))$. As a consequence, rather than a restatement
  • – jackson Dec 16 '21 at 21:49
  • Thank you for clarifying! It seems that the point probabilities $\mathbb{P}[p \in S]$ are not even defined (let alone independent) in your model, so this does not strictly answer the literal question. Nevertheless, your broader symmetry-invariant approach is an interesting addition, and provides another example of a “reasonable” probability measure on $\mathcal{P}(\mathbb{R}^2)$, so it does (partially) answer the question in spirit. – Josse van Dobben de Bruyn Dec 19 '21 at 20:26