Linear Operators and Adjoints

In an inner-product space, the vectors are the things — states, functions, arrows. The operators are the verbs: the machines that act on those things and turn one vector into another. Rotate a state, differentiate a function, multiply by position, measure an energy — in the language of Hilbert space every one of these is a linear operator. If vectors are the nouns of quantum mechanics, operators are its physics: every observable quantity you can measure is an operator, and this page builds the one property that makes an operator measurable.

That property hinges on a beautiful partner construction. Every operator A has a shadow twin called its adjoint A^\dagger ("A-dagger"), defined entirely through the inner product. When an operator equals its own adjoint — A = A^\dagger, called self-adjoint or Hermitian — something magical happens: its eigenvalues are forced to be real numbers. Since a measurement in the lab always reads out a real number, this is exactly why the observables of quantum mechanics are Hermitian operators. Let us build it.

What a linear operator is

A linear operator A on a space V is a map V \to V that respects the vector-space structure — it commutes with adding and scaling:

A(\alpha u + \beta v) = \alpha\,Au + \beta\,Av.

In finite dimensions, once you fix a basis, an operator is just a matrix and Av is matrix-times-vector — nothing new. The word "operator" is what we use when we want to keep the coordinate-free viewpoint, or when the space is infinite-dimensional and no finite matrix exists. Two operators worth meeting from quantum mechanics: the position operator \hat x that multiplies a wavefunction by x, and the momentum operator \hat p = -i\hbar\,\tfrac{d}{dx} that differentiates it. Both are linear; neither is a finite matrix.

The adjoint: sliding the operator across the inner product

Here is the key move, and it is worth reading slowly. Given an operator A, its adjoint A^\dagger is the unique operator that lets you move A from one side of an inner product to the other:

\langle A u,\, v\rangle = \langle u,\, A^\dagger v\rangle \quad\text{for all } u, v.

Read it as a rule for "unloading" A: whatever A does to the left slot, its adjoint A^\dagger does the equivalent thing from the right slot. The inner product is the referee that defines what "equivalent" means. In finite dimensions with the complex dot product, this abstract definition has a wonderfully concrete answer: the adjoint is the conjugate transpose — transpose the matrix and conjugate every entry,

A^\dagger = \overline{A^{\mathsf T}} = (\bar A)^{\mathsf T}.

(For a real matrix there is no conjugation to do, so the adjoint is simply the transpose A^{\mathsf T}.) The dagger just packages "transpose and conjugate" into one symbol, and the inner-product equation above is the reason those two operations belong together.

Self-adjoint (Hermitian) operators

An operator that is its own adjoint is the star of the show:

The 2×2 grid shows a matrix and its conjugate transpose side by side. Toggle the entries and watch the reflect-and-conjugate operation. When the "before" and "after" grids match, the matrix is Hermitian — its diagonal is real, and its off-diagonal entries are conjugate mirror partners.

Left: a 2×2 complex matrix A. Right: its adjoint A^\dagger, the transpose with every entry conjugated. The matrix is Hermitian exactly when the two grids are identical — real diagonal, mirror-conjugate off-diagonal.

The punchline: Hermitian ⟹ real eigenvalues

Why do physicists insist that observables be Hermitian? Because of a short, elegant argument that forces the eigenvalues to be real. Suppose A is Hermitian and Av = \lambda v for some non-zero eigenvector v. Compute \langle A v, v\rangle two ways.

Down the left slot, using Av = \lambda v and that the first slot is conjugate-linear: \langle Av, v\rangle = \langle \lambda v, v\rangle = \overline{\lambda}\,\langle v, v\rangle.

Down the right slot, using self-adjointness \langle Av, v\rangle = \langle v, Av\rangle and then Av = \lambda v: \langle v, Av\rangle = \langle v, \lambda v\rangle = \lambda\,\langle v, v\rangle.

The two results must agree, and since \langle v, v\rangle = \lVert v\rVert^2 > 0 (the eigenvector is non-zero), we can divide it out:

\overline{\lambda} = \lambda \quad\Longrightarrow\quad \lambda \text{ is real.}

A number equal to its own conjugate has no imaginary part. So every eigenvalue of a Hermitian operator is a real number — which is precisely the guarantee a measurement needs. There is a companion fact, equally important, that comes from the same style of argument: eigenvectors of a Hermitian operator belonging to different eigenvalues are automatically orthogonal. Put the two together — real eigenvalues, orthogonal eigenvectors — and you have the spectral theorem, the crown jewel that says a Hermitian operator can be diagonalised in an orthonormal basis of its own eigenvectors.

Worked example — a Hermitian 2×2 and its real spectrum

Consider the matrix

A = \begin{bmatrix} 2 & 3 - i \\ 3 + i & 5 \end{bmatrix}.

Is it Hermitian? Take the transpose (swap off-diagonals) and conjugate every entry. The diagonal 2, 5 is real, and the off-diagonal 3 - i conjugates to 3 + i, which is exactly the mirror entry — so A^\dagger = A. Yes, Hermitian. Its eigenvalues solve \det(A - \lambda I) = 0:

(2-\lambda)(5-\lambda) - (3-i)(3+i) = 0.

Now (3-i)(3+i) = 9 - i^2 = 9 + 1 = 10 (a difference of squares — the conjugate pair multiplies to a real number, no surprise). So \lambda^2 - 7\lambda + 10 - 10 = \lambda^2 - 7\lambda = 0, giving \lambda = 0 and \lambda = 7 — both real, exactly as the theorem promised. Had the matrix not been Hermitian, the eigenvalues could easily have come out complex.

The momentum operator \hat p = -i\hbar\,\tfrac{d}{dx} looks alarming — it has an explicit i out front, so how can it possibly have real eigenvalues? The answer is a lovely use of the adjoint. Compute \langle \hat p f, g\rangle = \int \overline{(-i\hbar f')}\,g\,dx = i\hbar\int \overline{f'}\,g\,dx and integrate by parts. The boundary terms vanish for wavefunctions that decay at infinity, and the derivative moves onto g — but integration by parts flips the sign, and the conjugate on the leading i flips it back. The two sign flips conspire so that \langle \hat p f, g\rangle = \langle f, \hat p g\rangle. The -i is there precisely to make the derivative self-adjoint; without it, \tfrac{d}{dx} alone is anti-Hermitian (A^\dagger = -A) and would have imaginary eigenvalues. The factor of i is not decoration — it is what makes momentum a measurable quantity.

The adjoint of a product reverses the order. Just like the transpose and the matrix inverse, (AB)^\dagger = B^\dagger A^\dagger, not A^\dagger B^\dagger. A direct consequence bites in quantum mechanics: even when A and B are each Hermitian, their product AB is Hermitian only if they commute (because (AB)^\dagger = B^\dagger A^\dagger = BA, which equals AB only when AB = BA). Since position and momentum famously do not commute, \hat x\hat p is not itself an observable — you have to symmetrise it.

And do not confuse Hermitian (A^\dagger = A, real eigenvalues, an observable) with unitary (A^\dagger = A^{-1}, eigenvalues of modulus 1, a rotation/time-evolution). They are different conditions that happen to share the dagger. Hermitian operators are the measurements; unitary operators are the motions.