東京大学 情報理工学系研究科 コンピュータ科学専攻 2018年2月実施 問題1
Author
Description
In this problem, \(\mathbb{R}\) represents the set of real numbers, and \(\mathbb{R}^N\) represents the set of real column vectors of length \(N\). For \(\boldsymbol{v} \in \mathbb{R}^N\), \(\boldsymbol{v}^\top\) denotes its transpose. Let \(I\) be the \(N \times N\) identity matrix.
Consider an eigensystem of a real \(N \times N\) symmetric matrix \(A\),
where \(\lambda\) and \(\boldsymbol{x}\) are an eigenvalue and a corresponding eigenvector, respectively.
Let \(\lambda_{\text{max}}(M)\) be the maximum of eigenvalues of matrix \(M\).
You may use the following facts on the eigenvalues and the eigenvectors of a real \(N \times N\) symmetric matrix without proofs:
- There are \(N\) independent eigenvectors that form an orthogonal basis.
- Every eigenvalue is a real number.
Answer the following questions.
(1) Prove that if \(\boldsymbol{x}\) is an eigenvector of \(A\), it is also an eigenvector of \(A + \mu I\) for any \(\mu \in \mathbb{R}\).
(2) Prove that
(3) Prove that
for any \(\boldsymbol{v} \in \mathbb{R}^N\).
(4) Suppose that matrix \(B\) is also an \(N \times N\) real symmetric matrix. Prove that
Kai
(1)
Let \(x\) be an eigenvector with corresponding eigenvalue \(\lambda\): \(Ax = \lambda x\). Let \(\mu \in \mathbb{R}\).
So \(x\) is an eigenvector of \((A + \mu I)\) with corresponding eigenvalue of \(\lambda + \mu\).
(2)
Let \(v \in \mathbb{R^N}\) be a unit vector which maximizes \(v^T A v\). Because \(A\) is real and symmetric, then it can be diagonalized \(A = Q^T \Lambda Q\), where \(Q\) is orthogonal matrix, which columns form orthogonal basis. In following equation, let's substitute \(y = Qv\):
On the other hand, let \(x_{max}\) be eigenvector corresponding to \(\lambda_{max}(A)\). Because \(v\) maximizes \(v^T A v\), then:
Here we got:
For vector \(v\) which maximizes \(v^T A v\). Hence,
and the maximum is reached for eigenvector \(x_{max}\).
(3)
Let \(v\in\mathbb{R}^N\). From question (2) we know that \(v^T A v\leq \lambda_{max}(A) v^T v\) for arbitrary \(v\): look at second to the last line of Q2 calculations. Now if we know that fact, we get the inequality trivially:
(4)
Since \(A, B\) are symmetric and real, then \(A = Q^T\Lambda_A Q\) and \(B = P^T\Lambda_B P\) for some orthogonal \(Q\)and\(P\). From what we obtained in (Q2): let \(w\) be a~unit vector which maximizes \(w^T(A+B)w\). Then:
Substitute \(y = Qw\) and \(z = Pw\). Of course \(y^T y = z^T z = 1\), because \(y^T y = w^TQ^TQw = w^T w = 1\) (the same for \(z\)). Combining this and our knowledge form previous questions: