Partial Trace and Mixed States

Two qubits, A and B, are entangled and share a single joint state \rho_{AB}. Now B is carried off to another galaxy and you keep only A. Here is the question that runs this whole page: what is the state of the qubit in your hand? It must have some description — every measurement you make on A alone has to come out somehow — but a plain ket \alpha|0\rangle + \beta|1\rangle can't be it, because A is tangled up with a qubit you no longer hold.

The answer is a \rho_A called the reduced density matrix, and it is built by an operation called the partial trace that mathematically "discards" B:

\rho_A = \operatorname{Tr}_B(\rho_{AB}).

It is the honest local description of A: it reproduces the statistics of every measurement you could perform on your qubit, using only your qubit.

What the partial trace does

Recall that an ordinary density matrix lives on the full two-qubit space, spanned by |00\rangle, |01\rangle, |10\rangle, |11\rangle. The full trace \operatorname{Tr}(\rho) sums the diagonal and collapses everything to a single number (always 1). The partial trace over B is the same idea done to only B's degrees of freedom: it traces out B and leaves a matrix that still acts on A.

The whole operation is fixed by what it does to a basis operator. Writing a two-system operator as a tensor product |a\rangle\langle a'| \otimes |b\rangle\langle b'| (the A part times the B part), the rule is

\operatorname{Tr}_B\big(|a\rangle\langle a'| \otimes |b\rangle\langle b'|\big) = \langle b'|b\rangle\,|a\rangle\langle a'|.

Read it as: leave the A part alone (|a\rangle\langle a'| passes straight through), and trace the B part into the number \langle b'|b\rangle. Because the computational basis is orthonormal, \langle b'|b\rangle is 1 when b'=b and 0 otherwise — so only the terms where B's bra and ket match survive. Extend it to a general \rho_{AB} by linearity, term by term.

Worked example: one half of a Bell pair

Take the maximally entangled Bell state |\Phi^+\rangle = \tfrac{1}{\sqrt2}\big(|00\rangle + |11\rangle\big), whose density matrix is \rho_{AB} = |\Phi^+\rangle\langle\Phi^+|. Multiplying out,

\rho_{AB} = \tfrac12\big(|00\rangle\langle 00| + |00\rangle\langle 11| + |11\rangle\langle 00| + |11\rangle\langle 11|\big).

Now trace out B, one term at a time, using the rule above (each ket |xy\rangle = |x\rangle_A \otimes |y\rangle_B):

Only the two diagonal terms live. Adding them with the \tfrac12 out front,

\rho_A = \tfrac12\big(|0\rangle\langle 0| + |1\rangle\langle 1|\big) = \tfrac12 I = \begin{pmatrix} \tfrac12 & 0 \\[2pt] 0 & \tfrac12 \end{pmatrix}.

This is the maximally mixed qubit — a perfect coin flip. Locally your half of a Bell pair is indistinguishable from a qubit that is truly random: measure it in any basis and you get 0 or 1 with probability \tfrac12 each, with no bias to exploit. And yet the global state |\Phi^+\rangle is pure and perfectly definite. The purity went somewhere — into the correlations between A and B, which is exactly what the partial trace threw away.

Worked example: a product state stays pure

Contrast an unentangled pair. Prepare |\psi\rangle = |0\rangle \otimes |{+}\rangle, so \rho_{AB} = |0\rangle\langle 0| \otimes |{+}\rangle\langle{+}| factors cleanly into an A part and a B part. The partial trace acts only on the B factor:

\rho_A = \operatorname{Tr}_B\big(|0\rangle\langle 0| \otimes |{+}\rangle\langle{+}|\big) = |0\rangle\langle 0| \cdot \operatorname{Tr}\big(|{+}\rangle\langle{+}|\big) = |0\rangle\langle 0|,

since \operatorname{Tr}(|{+}\rangle\langle{+}|) = 1. The reduced state is just |0\rangle\langle 0| — the pure state |0\rangle. That makes sense: nothing about A ever depended on B, so losing B costs you nothing. Whatever B is doing, A is cleanly, definitely |0\rangle.

Mixed reduced state = the fingerprint of entanglement

These two examples are not a coincidence — they are the general law. For a pure joint state |\psi\rangle_{AB}:

|\psi\rangle_{AB} \text{ is entangled} \iff \rho_A \text{ is mixed}.

A clean, computable test for "mixed" is the purity \operatorname{Tr}(\rho_A^2): it equals 1 exactly when \rho_A is pure, and drops below 1 when it is mixed. So the operational signature of entanglement is \operatorname{Tr}(\rho_A^2) < 1. Check it on our two cases:

The bars below plot the two eigenvalues of \rho_A in each case. A pure state puts all its weight on one eigenvalue (a spike at 1); a mixed state spreads the weight out. The flatter the spread, the more entangled the original pair — and nothing is flatter than \tfrac12, \tfrac12.

Imagine you and a friend each receive one qubit of a freshly minted Bell pair. You measure yours a thousand times (on fresh copies): 50% zeros, 50% ones, every basis, no pattern, no bias — statistically it is white noise. You could be forgiven for thinking your qubit carries no information at all. And locally, it doesn't: \rho_A = \tfrac12 I is as featureless as a state can be. The information isn't in your qubit or in theirs — it lives in the correlation between them, invisible until the two of you compare notes. This is the strange heart of entanglement: a global state can be perfectly sharp while each of its parts is a shrug. It is also why entanglement makes a good cryptographic resource — an eavesdropper who steals just your qubit steals only noise.

The reduced state \rho_A is a lossy summary. It knows everything about measurements on A alone — and nothing about how those outcomes relate to B. That is precisely the information the partial trace discards. So you cannot rebuild the joint state from the two reduced states: both |\Phi^+\rangle = \tfrac{1}{\sqrt2}(|00\rangle + |11\rangle) and the boring classical mixture "coin-flip |00\rangle or |11\rangle" give the same \rho_A = \tfrac12 I (and the same \rho_B), yet they are wildly different states — one is entangled, one isn't. The moral: don't confuse "I know \rho_A and \rho_B" with "I know \rho_{AB}." The whole is more than the parts, and the missing piece is the correlation.