Moment generating function?

I’m trying to upper bound the norm of the pseudoinverse of a product of a fat matrix with orthogonal rows with a tall random matrix with independent entries: \|(V^T \Omega)^\dagger\|.

Since \|(V^T \Omega)^\dagger\| = 1/\sigma_{\text{min}}(V^T \Omega) , this is equivalent to lower bounding \lambda_{\text{min}} of the symmetric matrix V^T \Omega \Omega^T V. You can write this positive matrix in the form \sum_{i} u_i u_i^t where u_i is the ith column of V^T\Omega and the u_i are independent of each other because the entries of \Omega are independent.

Since \lambda_{\text{min}}(A) = -\lambda_{\text{max}}(-A) for a symmetric matrix A,


\displaystyle
\begin{aligned}
\mathbb{P}\left(\left\|(V^T \Omega)^\dagger\right\| > \sqrt{t}\right) & = \mathbb{P}\left(\lambda_{\text{min}}\left(V^T \Omega \Omega^T V\right) < 1/t\right) \\
 & = \mathbb{P}\left(\lambda_{\text{max}}\left(\sum_i - u_i u_i^t\right) > -1/t\right).
\end{aligned}

If you had an almost sure upper bound on the norm of the u_i, then you could directly use a Chernoff bound from Tropp’s paper to quickly get an upper bound on this probability, but I don’t have such an upper bound. In particular, I’d like to consider the case where \Omega is a Gaussian matrix.

However, perhaps fortunately, his general Laplace transform for random matrices still applies: if you have self-adjoint matrix functions A_i(\theta) which are semidefinite bounds on the cumulant generating functions of -u_i u_i^t— that is, \log(\mathbb{E} e^{- \theta u_i u_i^t}) \preceq A_i(\theta)—, then


\displaystyle
\mathbb{P}\left(\lambda_{\text{max}}\left(\sum_i -u_i u_i^t \right) > -1/t\right) \leq \inf_{\theta >0} \left\{ e^{\theta/t} \cdot \text{tr } \exp\left(\sum_i A_i(\theta)\right)\right\}.

To use this result, we must find nonvacuous A_i(\theta). Expand the moment generating function of -u_i u_i^t :


\displaystyle
\begin{aligned}
\mathbb{E} e^{-\theta u_i u_i^t} & = I + \mathbb{E} \sum_{k=1}^\infty \frac{(-\theta u_i u_i^t)^k}{k!} \\
 & = I + \mathbb{E} \left[ (e^{-\theta \|u_i\|^2}  - 1) \frac{u_iu_i^t}{\|u_i\|^2} \right] = \mathbb{E} \left[ e^{-\theta \|u_i\|^2} P_{u_i} + P_{{u_i}^\perp} \right].
\end{aligned}

This is where I’m currently stuck! If there was no negative sign (bear with me), then I could bound this final quantity with


\displaystyle
e^{\theta \|u_i\|^2} P_{V^T} + P_{(V^T)^\perp} = Q \begin{pmatrix} \mathbb{E} e^{\theta \|u_i\|^2} I_k & 0 \\ 0 & I_{n-k} \end{pmatrix} Q^t,

where k = \text{rank}(V^T) , n is the height of V and \Omega, and Q is an orthogonal matrix which doesn’t depend on i, and whose first k columns are V.
Then A_i(\theta) would be the log of this, so my problem would reduce to finding a nice expression or upper bound for the cumulant generating function of \|u_i\|^2, which turns out to be  \log \mathbb{E} \exp(\theta \omega_i^t VV^T \omega_i) , where \omega_i is the ith column of \Omega.

Unfortunately, eliminating the negative sign means I would be finding an upper bound on the norm of V^T \Omega, which although interesting, isn’t the problem I’m concerned with now.

The reason for this post, other than for me to get the details down somewhere, is to ask about that moment generating function. Say if \omega_i is a standard Gaussian vector, what’s a good bound on
\mathbb{E} \exp(\theta \omega_i^t VV^T \omega_i) ? Does it suffice to know that VV^T is a rank k projection, or do I need to know more about the entries of VV^T? I don’t care about this case, since it’s trivial to bound \|V^T \Omega\| when \Omega is a Gaussian matrix, but it’d be informative to see how the results from the sharp trivial bound compare with the results I can get using this Laplace method.

I fear that getting a good bound on \mathbb{E} \exp(\theta \omega_i^t VV^T \omega_i) for more general \omega_i is hard, because iirc, dealing with double sums of even gaussians is difficult to do sharply.

Update: A colleague pointed out that when \omega_i is Gaussian, then that’s just the moment generating function of a \chi^2 random variable with k degrees of freedom, which apparently has a known expression.

Possibly relevant posts:

Tags: , , , ,
No comments yet.

Leave a comment

XHTML: You can use these tags: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong> <pre lang="" line="" escaped="">
CommentLuv Enabled