Any private-coin randomized protocol that computes with an error probability of at most cannot use less than some multiple of bits in the worst case.
Invoke Newman’s Theorem
.
Since asymptotically, it suffices to show that
, for
Now recall Yao’s Lemma:
which allows us to reduce the problem to coming up with a probability distribution over such that any deterministic protocol which errs with probability of at most on any input according to communicates at least some multiple of bits in the worst case.
One initial thought worth investigating is what happens when if uniform. Suppose Alice and Bob both choose any subset of uniformly. Then the probability that they choose disjoint subsets is , which goes to 0 as grows large. So a protocol that just outputs 0 (they are NOT disjoint) can be made to have an arbitrarily small error asymptotically, which means that we only get an bound by using the uniform distribution. We are not satisfied.
Instead, consider the following product distribution: Alice and Bob cannot possibly pick a subset of whose size is different that and they pick subsets of size with uniform distribution. So , where:
In this case, the probability that Alice and Bob choose disjoint subsets is:
We will show that any deterministic protocol that, under , errs with a probability of at most in calculating , uses at least some factor of in the worst case.
The intuition is as follows: Let be the protocol with the minimum communication complexity. partitions into rectangles, corresponding to its leaves. For each rectangle, answers in the same way in the question of whether the two input subsets are disjoint or not.
However, some of these rectangles may be corrupted. In other words, the protocol may output that the input sets are disjoint, when in fact they are not. Let’s look at 1-rectangles. That is, rectangles in which the protocol answers YES, they are disjoint (1). We know that errs with a probability of at most . Therefore, all of its rectangles have small corruption. Specifically, if is an 1-rectangle, then
The figure below illustrates why that is.
We show that exactly because the rectangles of cannot be too corrupted, they have to be small in size (their measure according to is small). This will imply that there are many of them and so we will have a lower bound to the communication complexity.
To elaborate further on this, let be the maximum -measure of any 1-rectangle of . If we suppose for the purposes of contradiction that for some , then the number of 1-rectangles is bounded above by the number of leaves of , which in turn is bounded above by .
And so . In other words, if the communication complexity of is small, there must be a big 1-rectangle. And that rectangle has low corruption. We prove that this is impossible, acquiring a contradiction.
We prove the following Lemma (I):
“If is a 1-rectangle of ,
then either or has to be at most for some constant .”
After establishing this, if we let be a 1-rectangle of and assume WLOG that , then by Lemma I, and so
,
which implies that every 1-rectangle must be small, thus giving us our contradiction.
Heading over to the proof of Lemma I, we first two simplifying assumptions:
- S is large (we will formalize this below but if S were small then our lemma holds vacuously).
- All the subsets of lying in and have size . There is 0 probability of getting subsets of size as inputs, so Alice and Bob can modify to immediately reject such input cases so that we don’t have to worry about its behavior on them.
Intuition: If is large, then its sets must span a large fraction of . There cannot exist too many sets disjoint with the sets of then. So is small. This is illustrated in the figure below:
We shall start by proving that the span of the elements of has to be big. This is the following Lemma, which could be stated without any relation to our communication complexity investigations:
LEMMA II: Let be a collection of subsets of size of . Then there exists some such that if , then for , there exist sets such that it holds that
Proof
We show that if we pick any sets , we can find another set that shares less that elements with the sets we picked.
In other words:
This gives us an incremental procedure of finding the collection : Every new set we add has less than common elements with all the previous sets chosen and this is exactly what we want to prove. This is shown in the figure below:
for large enough , where the constant is obtained through Stirling’s approximation . This concludes the proof of Lemma (II) because we have found that there must exist some such that .
Note that this Lemma implies that spans a large portion of . Since each one of the -s adds at least another new elements, we have that , meaning that spans at least one sixth of , which is a good fraction for our purposes.
Let’s now see exactly how having a large span forces to be small. The main idea lies in the fact that has low corruption, so we seek sets that are disjoint with most sets in and that number is bounded above because of the large span property.
Let to be the set of elements in which are such that they intersect with a fraction of at most of the sets of . We must have , as if that wasn’t the case, there would be more than s of in and, because both and consist of rectangles of size which we uniformly sample, that would violate our low-corruption hypothesis.
Applying Lemma II to , we get that there exists some constant such that if – implying the largeness of : ) – then there exists a sub-collection of with whose span over is big as we saw before.
So let’s assume that as dictated above and let us take . Now also consider the sub-collection of consisting of sets that have an intersection with a fraction of at most elements of . Then would have to contain at least half of the elements of – – as otherwise there would be more than in , contradicting once again our low-corruption hypothesis.
It is important to conceptually notice here how the low-corruption hypothesis forces large parts of and to not overlap with each other too much! So we seek sets that are mostly disjoint among the two groups. And the big span of constrains our options greatly. We show that there cannot be that many sets by bounding from above.
For each set in , there are at most sets in which intersect it. So there must exist sets in that all don’t intersect it, which limits how many elements can possibly have. That is we must have that and knowing that each brings in at least new elements, we must have assuming that . That doesn’t seem like a lot: all we said is that must be drawn from a fraction of -ths of the universe. But it provides us with the bound we needed.
The number of possible -s is at most:
,
where is a constant given again by Stirling’s approximation.
So . Taking concludes our proof.