Would decoupling be useful for this?
A colleague and I are now looking at a generalization of the problem from the previous post: Let
be a collection of rank one sign matrices. How large does
need to be for
to span a
-dimensional subspace of 
Our idea was to note this is equivalent to asking when
has rank
, then use the relation 
Accordingly, we need to know what
needs to be so it is extremely unlikely that
From two different directions we can get an idea of how large
should be:
- If we approximated
with a matrix of Rademachers, we know that
because it’s a subgaussian matrix in
. Thus we need
so
That is,
should be larger than
- Note that if we take a unit length vector
and partition it as
and furthermore write
where
and
are Rademacher vectors of length
, then
The terms being indexed by
are i.i.d., so consider the behavior of
Let’s say that by the CLT,
is distributed
, then this sum is distributed
. Thus
is a
r.v., so a subexponential r.v. concentrated around
. From a covering set argument, it looks like the deviations are on the order of
, so we’d need
or
to be larger than
The second intuitive argument seems to offer a route to actually finding a concrete relation
must satisfy: we need to show that
is subgaussian with explicit constants, then use that to show that
is subexponential with explicit constants.
The big hurdle here is that the entries in
are not independent, so we can’t just use Hoeffding’s inequality. What we can say is that
where each term in the sum is subgaussian with tails like
, but the fact that the terms aren’t independent prevents us from saying that the whole thing is subgaussian with a tail like 
So I’m wondering if there’s some kind of decoupling result I can use in this case? I know nothing about decoupling …
Update:
where the
are Rademacher sequences and I’m considering
as a matrix in the natural manner. You can relate the moments of this to the moments of
where
. I found a nasty moment bound for this in the literature, without explicit constants. See Tail and moment estimates for some types of chaos. Good ol’ Latala. I refuse to believe that this problem requires so much complication, though.
) you need to ensure that the projection of a fixed matrix onto their span,
satisfies
with high probability for some
fixed.
needs to be for
to be full rank; equivalently, take
Then I noticed that since
(by the SVD), this is equivalent to asking when
is full rank.
using symmetrization and the trace method to reduce to estimation of
, then Markov's inequality to get a tail bound, was promising at first. Since each
where
be independent zero-mean random matrices of dimension
. Suppose
and
almost surely for all
Then for any 
. Then
and by plugging through the calculations, when 
is full rank with high probability.
could be is
factor seems like a reasonable price to pay; in fact, because of the coupon collector phenomena, maybe it’s the same price you’d need to pay for a collection of random matrices selected from the standard euclidean basis matrices. In that light, this is an unexpectedly good result.
norm of a useful class of matrices
, the sum of the top
, as the SDP
is convex (since a function of the form
, i.e. the supremum of linear functions, is convex). Since a convex function on an open set is continuous, we see that
is continuous on the positive semidefinite cone, so
is continuous on the same set also.
norm of a PSD matrix
of having a new roommate, so naturally I’m planning what I’ll do with the extra $625/month I’ll be saving. Guffaw. The first two months are spoken for already— it’s past time for new glasses, and I haven’t paid off my Caltech dining account in several months—, then I’ll start saving up to get a car, since it looks like I’m going to be here another 3 years or so. After that, I dream of actually adding to my savings account for the first time in over a year.
which realizes
and then round the entries of
to signs somehow, take
to be an appropriate scaling matrix, and use
as your approximate sign decomposition of
)– I implemented this method, simply replacing the entries of
with reasonable probability at each step (take this with a grain of salt: I think this is easy to show, but I've not checked that it's true), which is all that you can hope to say even if you use the
decomposition. That is to say, there's nothing inherently special or useful about the
, so
. Therefore we can find a wide range of factorizations achieving
: take
to be any orthonormal matrix. So there's definitely not anything special about the factors, or the signs you'd round them to, in the general case.
can be far from sparse), although the factorizations I've seen returned with my particular greedy scheme have reasonably many terms, there's no kind of guarantee of sparseness.
norm ball
can be more simply written as
subject to
,
which solves this seems to be in the
1/2-ball. It’s easy to see that this ball is included in the feasible points, but no reason to think that it is the entire set of feasible points.
s.t.
,
.
is the SDP
s.t.
.
,

,
, then
and its components break down as follows:
,
,
,
,
, and 
is a matrix in the unit
norm