Any private-coin randomized protocol
that computes
with an error probability of at most
cannot use less than some multiple of
bits in the worst case.
Invoke Newman’s Theorem
.
Since
asymptotically, it suffices to show that
, for 
Now recall Yao’s Lemma:

which allows us to reduce the problem to coming up with a probability distribution
over
such that any deterministic protocol which errs with probability of at most
on any input according to
communicates at least some multiple of
bits in the worst case.
One initial thought worth investigating is what happens when if
uniform. Suppose Alice and Bob both choose any subset of
uniformly. Then the probability that they choose disjoint subsets is
, which goes to 0 as
grows large. So a protocol that just outputs 0 (they are NOT disjoint) can be made to have an arbitrarily small error asymptotically, which means that we only get an
bound by using the uniform distribution. We are not satisfied.
Instead, consider the following product distribution: Alice and Bob cannot possibly pick a subset of
whose size is different that
and they pick subsets of size
with uniform distribution. So
, where:

In this case, the probability that Alice and Bob choose disjoint subsets is:

We will show that any deterministic protocol that, under
, errs with a probability of at most
in calculating
, uses at least some factor of
in the worst case.
The intuition is as follows: Let
be the
protocol with the minimum communication complexity.
partitions
into rectangles, corresponding to its leaves. For each rectangle,
answers in the same way in the question of whether the two input subsets are disjoint or not.
However, some of these rectangles may be corrupted. In other words, the protocol may output that the input sets are disjoint, when in fact they are not. Let’s look at 1-rectangles. That is, rectangles in which the protocol answers YES, they are disjoint (1). We know that
errs with a probability of at most
. Therefore, all of its rectangles have small corruption. Specifically, if
is an 1-rectangle, then

The figure below illustrates why that is.

We show that exactly because the rectangles of
cannot be too corrupted, they have to be small in size (their measure according to
is small). This will imply that there are many of them and so we will have a lower bound to the communication complexity.
To elaborate further on this, let
be the maximum
-measure of any 1-rectangle of
. If we suppose for the purposes of contradiction that
for some
, then the number of 1-rectangles is bounded above by the number of leaves of
, which in turn is bounded above by
.

And so
. In other words, if the communication complexity of
is small, there must be a big 1-rectangle. And that rectangle has low corruption. We prove that this is impossible, acquiring a contradiction.
We prove the following Lemma (I):
“If
is a 1-rectangle of
,
then either
or
has to be at most
for some constant
.”
After establishing this, if we let
be a 1-rectangle of
and assume WLOG that
, then
by Lemma I, and so
,
which implies that every 1-rectangle
must be small, thus giving us our contradiction.
Heading over to the proof of Lemma I, we first two simplifying assumptions:
- S is large (we will formalize this below but if S were small then our lemma holds vacuously).
- All the subsets of
lying in
and
have size
. There is 0 probability of getting subsets of size
as inputs, so Alice and Bob can modify
to immediately reject such input cases so that we don’t have to worry about its behavior on them.
Intuition: If
is large, then its sets must span a large fraction of
. There cannot exist too many sets disjoint with the sets of
then. So
is small. This is illustrated in the figure below:

We shall start by proving that the span of the elements of
has to be big. This is the following Lemma, which could be stated without any relation to our communication complexity investigations:
LEMMA II: Let
be a collection of subsets of size
of
. Then there exists some
such that if
, then for
, there exist sets
such that
it holds that

Proof
We show that if we pick any
sets
, we can find another set
that shares less that
elements with the
sets we picked.
In other words:

This gives us an incremental procedure of finding the collection
: Every new set we add has less than
common elements with all the previous sets chosen and this is exactly what we want to prove. This is shown in the figure below:


for large enough
, where the constant
is obtained through Stirling’s approximation
. This concludes the proof of Lemma (II) because we have found that there must exist some
such that
.
Note that this Lemma implies that
spans a large portion of
. Since each one of the
-s adds at least another
new elements, we have that
, meaning that
spans at least one sixth of
, which is a good fraction for our purposes.
Let’s now see exactly how
having a large span forces
to be small. The main idea lies in the fact that
has low corruption, so we seek sets that are disjoint with most sets in
and that number is bounded above because of the large span property.
Let
to be the set of elements in
which are such that they intersect with a fraction of at most
of the sets of
. We must have
, as if that wasn’t the case, there would be more than
s of
in
and, because both
and
consist of rectangles of size
which we uniformly sample, that would violate our low-corruption hypothesis.
Applying Lemma II to
, we get that there exists some constant
such that if
– implying the largeness of
:
) – then there exists a sub-collection
of
with
whose span over
is big as we saw before.
So let’s assume that
as dictated above and let us take
. Now also consider the sub-collection of
consisting of sets that have an intersection with a fraction of at most
elements of
. Then
would have to contain at least half of the elements of
–
– as otherwise there would be more than
in
, contradicting once again our low-corruption hypothesis.
It is important to conceptually notice here how the low-corruption hypothesis forces large parts of
and
to not overlap with each other too much! So we seek sets that are mostly disjoint among the two groups. And the big span of
constrains our options greatly. We show that there cannot be that many sets by bounding
from above.
For each set
in
, there are at most
sets in
which intersect it. So there must exist
sets
in
that all don’t intersect it, which limits how many elements
can possibly have. That is we must have that
and knowing that each
brings in at least
new elements, we must have
assuming that
. That doesn’t seem like a lot: all we said is that
must be drawn from a fraction of
-ths of the universe. But it provides us with the bound we needed.
The number of possible
-s is at most:
,
where
is a constant given again by Stirling’s approximation.
So
. Taking
concludes our proof.