Adapted from “The story of set disjointness” by Arkadev Chattopadhyay and Toni Pitassi.

An Ω(n) lower bound on the randomized communication complexity of set disjointness.1

Theorem

Let f:2[n]×2[n]{0,1} be the set disjointness function: that is, given any sets A,B[n], f returns true if A and B are disjoint: f(A,B):=1[AB=]. Then the randomized2 communication complexity of f is Ω(n).3

More precisely, there is some constant probability of failure ϵ such that any randomized communication protocol that succeeds with probability 1ϵ on each input must use Ω(n) bits of communication.

Proof

Plan

By Algorithmic minimax principle, it is enough to show a distribution over inputs on which any deterministic protocol needs Ω(n) communication in order to succed with probability 1ϵ on average. Here, we will consider the (product) distribution where both A and B are chosen uniformly at random in ([n]n).

We’ll show that the corresponding (nn)×(nn) communication matrix has no large “nearly 0-monochromatic” rectangles. That is, for any subsets F,G([n]n), either

  • F or G only represents a 2Ω(n) fraction of ([n]n);
  • or a Ω(1) fraction of the pairs (A,B)F×G intersect.

Intuitively, we want to show that if F and G are large, then they must both be pretty equally spread over [n], and if that’s the case, they must intersect a lot. However, there doesn’t seem to be a clean notion of “equally spread” that you can prove independently for F and G, so instead we’ll:

  • assume that and F is large, and F and G don’t intersect much;
  • find a small number of sets A in F that cover much of [n] and yet don’t intersect G too much;
  • show that this limits the size of G.

Execution

Suppose that |F|2o(n)(nn), and that only an ϵ fraction of the pairs (A,B)F×G intersect. Some sets AF might be unusual in that they intersect with much more than an ϵ fraction of the sets in G, but at least half of them intersect with at most a 2ϵ fraction of G. Let FF be the family of such good sets; we have |F||F|/22o(n)(nn).

Then, find kO(n) sets A1,,AkF such that |A1Ak|n/3. This is always possible given how big F is: Just choose those sets greedily to increase the size of the union as quickly as possible. If at some point |A1Ai|<n/3 and you can’t find some Ai+1F that extends the size of the union by at least n/2, that would show that there are at most

j=0n/21(n/3nj)inside A1Ai(2n/3j)outside A1AiO((n/3n/2)(2n/3n/2))2Ω(n)(nn)

elements left in F.

Now (using the same trick as earlier), the average set in G intersects only a 2ϵ fraction of the sets A1,,Ak, so at least half of them intersect at most 4kϵ of them. Let GG be the family of such good sets; we have |G||G|/2.

Each set BG can only intersect the union A1Ak at 4kϵ points, and the rest of B must lie entirely in the remaining nΩ(n) elements of [n]. Therefore,

|G|(k4kϵ)(nΩ(n)n).

We can upper bound (nΩ(n)n) by 2Ω(n)(nn), and by choosing ϵ small enough, we can upper bound (k4kϵ) by 2cn for any c>0 we want, so we get

|G|2|G|2Ω(n)(nn),

as desired.

  1. László Babai, Péter Frankl, János Simon, “Complexity classes in communication complexity theory”. 

  2. It’s very easy to prove that its deterministic communication complexity is Ω(n), see Monochromatic rectangles

  3. We actually know that it’s Ω(n), but this is a much more complicated result.