The direct sum problem concerns the following question: “If a task cannot be completed using less than
steps, then do
independent instances of task
require at least
steps?”
In the theory of communication complexity, the task is calculating a certain function through some form of communication.
DEFINITION
Let be a function. We understand
,
defined as:
to represent the task of performing different communications with the goal of calculating
on
different inputs.
QUESTION: Is ?
An easy example of a relation for which a protocol exists that computes
with less communication than
times the minimum possible communication required to compute
itself is the following:
Consider the following function: Let . Alice has a set
of size
. Bob has no input. The goal is Bob to produce an element of
.
One strategy we can play with is Alice sending simply her smallest element to Bob. That requires bits, because the elements
can never be the minimum of
.
On the other hand, we require at least bits, because if a protocol existed that somehow made Bob aware of some element in
using less than
bits for any
, then Bob would not be able to encode numbers outside of some set
whose cardinality is at most
. Thus, letting
breaks the protocol.
So we need exactly bits to communicate some element of
to Bob.
Now we shall show that we can have a protocol which, given any sequence , can output a
-tuple
such that
. The way that protocol works is the following:
We can prove that there exists a set of
-tuples
with size
which is such that for any sequence
, there exists an element
which is such that
. Let us call this statement Lemma I and we will prove it below. Alice finds
exhaustively. Alice sends Bob the name of the element of
which works for the particular input. Overall, the cost of this communication is
bits of communication.
PROOF OF LEMMA
We rely on the probabilistic method. We make by repeating the following random experiment
times: “Choose
elements from
by sampling each element uniformly at random.”
Given any sequence , what is the probability that for all
, there exists some
such that
?
The probability that for some given , we have that
for all
is
. Therefore, the probability that for some
, at least some element misses its respective
-set is
. So the probability that this happens for all
is
So the probability that for all sequences , the $latex Q$ we constructed misses its mark is at most
and so our proof concludes.
Next, we shall show that there is actually a lower bound to the deterministic communication cost , based on the fact that we can extract a monochromatic cover for
through a monochromatic cover for
.
THEOREM
If requires
bits of communication, then
requires at least
bits of communication.
PROOF
We will use the following Lemma II: “If the inputs to can be covered with
monochromatic rectangles, then the inputs to
can be covered with
monochromatic rectangles.”
If we let be the communication cost for
, then the inputs to
can be covered by
monochromatic rectangles and therefore, by the Lemma above, the inputs to
can be covered by
rectangles, implying (*) that the computing
requires
bits communication, which finally yields that
.
(*) This implication follows from a fundamental Theorem on rectangle covers, which we will not cover here for brevity, but in another post.
Now for the proof of Lemma II:
To produce a cover of with the necessary amount of rectangles, we show that given any
, there exists a rectangle
that is monochromatic under
and that covers at least
of the inputs from
. Applying this fact repeatedly $\lceil 2n\cdot 2^{l/k}/rceil$ times, we have left at most:
elements which are uncovered and so our statement is proved.
For the proof of the key incremental claim, we exploit the fact that can be covered by
monochromatic rectangles, so that at least one “projection-rectangle” into the space
has a lot of elements.
Specifically, there must exist a rectangle covering at least
inputs from
. Now, project
into its
-th coordinate by retaining only the
-th pair of inputs of Alice and Bob:
is a rectangle because
is a rectangle, by invoking the basic rectangle-recognizability lemma and
is also monochromatic under
, since
is monochromatic under
.
Now, we have that , and so
Thus, for some , we much have that
, which proves our claim and finishes our theorem.