Featured Post

95 Fundamentals -- eigenvalue problem, Cayley-Hamilton theorem and the invariants of tensors

After previous episodes, I believe our readers have more understanding about what a tensor is. The concept of invariant is built in tensor itself -- a tensor is invariant to changes in coordination systems. However, the decomposition of a tensor T is not invariant. TijTij unless it is an isotropic tensor. Is it possible that there are something associated with a tensor that are also invariant to changes in coordination systems? That's what we are going to find out today!

Eigenvalues and Eigenvectors


From previous episode, we know that a second-order Cartesian tensor can a linear mapping of vectors. Usually, these vectors will change their directions after the mapping. However, there are some vectors that are kind of "intrinsic" to the tensor. These "intrinsic" vectors will not change their directions after the mapping of the tensor. We call these vectors "eigenvectors," where "eigen" means "own." The relative length change after the mapping of the eigenvector is its associated "eigenvalue," which we denote as λ.

It is quite easy to solve eigenvalues and eigenvectors. From its definition:
Tv=λv,(TλI)v=0
In order to have a nontrivial solution of v, the matrix (TλI) has to be singular (i.e., its determinant has to be 0, and thus cannot be inverted). Thus for a second-order Cartesian tensor with eigenvalues λ1,λ2,...,λn, we can define the characteristic polynomial of tensor T to be:
pT(λ)=det(λIT)=(λλ1)(λλ2)...(λλn)

The Characteristic Polynomial

For a second-order Cartesian tensor in a 3-dimensional space, we can expand every terms in the characteristic polynomial:
pT(λ)=det(λIT)=λ3(T11+T22+T33)λ2+(T11T22T12T21+T11T33T13T31+T22T33T23T32)λ(T11T22T33+T21T32T13+T12T23T31T13T31T22T23T32T11T33T12T21)
We can simplify it into:
pT(λ)=λ3tr(T)λ2+(|T11T12T21T22|+|T22T23T23T33|+|T11T13T31T33|)λdet(T)
Since the solution of pT(λ)=0 is λ1,λ2,λ3, the characteristic polynomial can also be expressed as:
pT(λ)=λ3(λ1+λ2+λ3)λ2+(λ1λ2+λ2λ3+λ1λ3)λλ1λ2λ3
From direct comparison, we can thus define 3 indices for tensor T:
I1(T)=tr(T)=(λ1+λ2+λ3)
I2(T)=|T11T12T21T22|+|T22T23T23T33|+|T11T13T31T33|=12((tr(T))2tr(T2))=λ1λ2+λ2λ3+λ1λ3
I3(T)=det(T)=λ1λ2λ3

However, what will happen to these polynomial and indices if we change our coordinate system? From our tensor transformation rule, we know that:
pT(λ)=det(λIT)=det(λδijTij)=det(λLILTLTLT)=det(L)det(λIT)det(LT)=det(λIT)=det(λδijTij)
Here we used the fact that (det(L))2=1. We show that the characteristic polynomial is invariant to coordination changes. Since the three indices we just defined previously are the coefficients in the characteristic polynomial, all three indices are invariant to coordination changes as well. These 3 special indices are called the "principal invariants of tensors."

We should pay some extra attention to our second principal invariant:
I2(T)=12((trT)2tr(T2))=12(T2iiTijTji)
Since Tii is simply the first principal invariant, we can deduce that TijTji is also an invariant. Similarly, TijTij=T:T is also an invariant.

Cayley-Hamilton theorem

From previous definition of characteristic polynomial, we know that if we substitute λ with any eigenvalue λi, the characteristic polynomial will vanish. Since the eigenvalues are kind of "intrinsic" to our original tensor, what will happen if we substitute λ with our tensor T?
pT(T)=T3I1(T)T2+I2(T)Tdet(T)I=?
Since pT(T) is a tensor, we will see how it will map our eigenvector vi with eigenvalue λi:
pT(T)vi=T3viI1(T)T2vi+I2(T)Tvidet(T)Ivi
=λ3iviI1(T)λ2ivi+I2(T)λividet(T)vi
=(λ3iI1(T)λ2i+I2(T)λidet(T))vi=0
From theorem we are not going to prove here: if an nxn matrix has n distinct eigenvalues, the matrix has a basis for eigenvectors for n. That means for any vector u, we can express it in terms of the linear combination of the eigenvectors (i.e., u=αivi). Therefore, for any vector u,
pT(T)u=0
That means:
pT(T)=T3I1(T)T2+I2(T)Tdet(T)I=0
And this is the Caylay-Hamilton theorem. (This theorem also holds for degenerate tensors, but the proof requires a few more steps, which we will not go through here.)

From this theorem we can make the calculation of determinant and matrix inverse simpler. If we take the trace of pT(T), we will get:
tr(T3)tr(T)tr(T2)+12(tr(T)2tr(T2))tr(T)3det(T)=0
det(T)=13(tr(T3)+12tr(T)332tr(T)tr(T2))
By similar means, if we multiply pT(T) to the tensor inverse of T, we will get:
T2tr(T)T+12(tr(T)2tr(T2))Idet(T)T1=0
T1=1det(T)(T2tr(T)T+12(tr(T)2tr(T2))I)
And we now have the inverse of a tensor!

Some Wrap Up

So in this episode, we first introduce the basic concepts of eigenvalue problems. We introduced the characteristic polynomial, and we showed that this polynomial is linked to the 3 principal invariants of a tensor. We also talked about Cayley-Hamilton theorem, and how this theorem can help us determine the determinant and the inverse of a tensor. That's a lot for a single episode, and I hope you enjoy so far. In our next episode, we will finally talk about something about stress and strain, so stay tuned!

Comments