Zhengfeng Ji
Seminars on Quantum Computing is a quantum computing course designed for undergraduate students in the computer science.
This is an incomplete set of notes prepared by Zhengfeng Ji for the course.
What is quantum computing, or more generally, what is quantum information science?
The first example question in ChatGPT: Explain quantum computing in simple terms.
The answer I got was:
Quantum computing is a type of computing that uses the principles of quantum mechanics, the science of the very small, to process information. Instead of using classical bits, which can be either 0 or 1, quantum computers use quantum bits or qubits, which can be in multiple states at the same time.
This ability to exist in multiple states simultaneously is called superposition and allows quantum computers to perform certain calculations much faster than classical computers. In addition, quantum computers can also use a phenomenon called entanglement, where the state of one qubit is correlated with the state of another, to further enhance their computational power.
Quantum computing has the potential to solve some of the world's most challenging problems in fields such as cryptography, drug discovery, and climate modeling. However, building a fully functional quantum computer remains a major scientific and engineering challenge, and many of the applications of quantum computing are still in the experimental stage.
You can try DeepSeek V3 and you will get something similar or better.
They all describe the outline and motivation of this course well!
Quantum + Information: The second quantum revolution
The success of quantum mechanics. Laser, GPS, etc.
From passive understanding of the collective effects to active manipulation of individual particles.
History and highlights
a. Early history: Feynman 1982, Deutsch 1985, Deutsch-Jozsa 1992 b. Shor's factoring algorithm 1994 (breaks RSA) c. Quantum cryptography (QKD, Bennett Brassard 84) d. Quantum teleportation (Bennett et al., 1993) e. Simulation of quantum systems (Feynman's dream)
Current status
Primitive quantum computers are being built. Many players are racing to build quantum computers: Google, IBM, China, …
Different labs/companies are betting on different hardware routes: Superconducting qubits, ion traps, photonic systems, topological qubits, and more recently, neutral atom platforms.
Nowadays, there are programmable quantum computers with 50-1000 qubits.
Major problem: devices are noisy! The noise level is around \(10^{-3}\).
Supremacy and simulation of Google and USTC
Sycamore, 53 qubits
Zhuchongzhi, 66 qubits
The biggest competitor to quantum computers are clever classical algorithms: In Nov 2021, a group from Chinese Academy of Sciences used tensor network algorithms to reproduce the statistics of the Google supremacy experiment using 512 GPUs in 15 hours.
Both Google and USTC have upgraded the system to battle against better classical simulation methods and hardware.
On the theory side: new algorithms, new protocols, fault-tolerant quantum computing, new insights on the power of quantum computers
More computer scientists get involved
Hype?
a. TIME's new cover: Quantum computers will transform our world—and create a 21st century "space race"
b. Quantum computing has a hype problem, MIT Technology Review, By Sankar Das Sarma.
Quantum computing startups are all the rage, but it's unclear if they'll be able to produce anything of use in the near future.
What you will learn in this subject
Is quantum computing hard to learn?
The mathematics is easy, the physics is hard.
There is a simple mathematical framework describing quantum mechanics with merely four postulates, about state space, evolution, measurement, and joint system, respectively, in both the pure state and mixed state settings.
The postulates are simple, but the truly amazing thing to note is how far these four simple postulates can bring us.
"Shut up and calculate!"
Reiterate over the four postulates from different perspectives.
Zhengfeng Ji, Yuan Feng, and Jianxin Chen
Office: 1-707, Ziqiang S&T Building
Research direction of Zhengfeng
Quantum computing, theory of computing.
Research direction of Yuan
Quantum computing, theory of programming, mathematical logic.
Research direction of Jianxin
Quantum computing, computer architecture.
Online: Wechat group, learn.tsinghua.edu.cn, email
Office hour: after each class
Homework on the quantum information basics. (40%)
Work on projects in groups and take an idea to the extreme (Given projects, or projects of your choice approved by your lecturers), write an essay, present your findings.
We will grade your project by its writing and format (10%), originality (20%), completion (10%), and presentation (20%).
Extra rule: If your project yields results deemed near-publishable by the teachers' evaluation, you will automatically receive an A+.
a. Quantum Computation and Quantum Information (Mike & Ike)
d. Foundations of Quantum Programming by Mingsheng (Tsinghua Online Access)
Computer science
Mathematics
Physics
Three basic statements about the physical world
Probability
\(\sum_j P_j = 1, P_j \ge 0\)
Monotonicity
Single-slit case
Locality
Things can only propagate through the universe at a certain speed.
In physics, locality naturally emerges from Einstein's special theory of relativity, which implies that no signal can propagate faster than the speed of light.
Local Realism
Things have predetermined value, which depends on parameters in a local region.
Einstein: I like to think the moon is there even if I am not looking at it.
Church-Turing thesis
What is Church-Turing thesis?
TCS is mathematics but the Church-Turing thesis connects it to physics.
Information is physical.
Conversely: It from qubit.
Quantum mechanics and the above principles
Probability stays, yet monotonicity goes away.
Locality stays but realism is abandoned.
EPR and Bell's inequality.
Quantum computing does not falsify Church-Turing, but seriously challenges the extended Church-Turing thesis.
Watch this video.
Double-slit with an observer: which path did the electron take?
If there is a conscious observer, the distribution goes back to classical! That is the same as decoherence in a quantum system interacting with an environment!
Quantum mechanics is an (unusual) theory using probability.
A cat isn't found in a superposition of alive and dead states, because it interacts constantly with its environment. These interactions essentially leak information about the 'cat system' out.
This explains the difficulty of building a quantum computer. Quantum computers need to be isolated well from the environment yet still easy to manipulate.
Shor's factoring algorithm can be thought of as a large-scale interference experiment.
Born's rule
Probability stays but monotonicity goes.
God does play dice, and an unusual one.
Einstein liked inventing phrases such as "God does not play dice," "The Lord is subtle but not malicious." On one occasion Bohr answered, "Einstein, stop telling God what to do."
Instead of probabilities, quantum mechanics uses amplitudes.
\(p = \abs{\alpha}^2\).
Let \(\alpha_i\) be the amplitude that it lands on a position through slit \(i\).
\(\alpha=\alpha_1 + \alpha_2\).
Example \(\alpha_1 = 1/2\) and \(\alpha_2 = -1/2\). What is the probability \(p\)?
Cancellation!
Consider the problem of flipping a coin.
\(\Pr(\text{heads}) = p, \Pr(\text{tails}) = q\).
Apply a transform \(X\).
In general, the transform is a matrix of transition probabilities \(p(a|b)\).
Markov chain and stochastic matrix.
Now consider two independent coins. This leads to the tensor product very naturally.
More generally, the tensor product of two matrices \(A, B\) is defined as a block matrix
\[\begin{equation*} A \otimes B = \begin{pmatrix} A_{1,1} B & A_{1,2} B & \cdots & A_{1,n} B\\ A_{2,1} B & A_{2,2} B & \cdots & A_{2,n} B\\ \vdots & \vdots & \ddots & \vdots \\ A_{m,1} B & A_{m,2} B & \cdots & A_{m,n} B\\ \end{pmatrix}. \end{equation*} \]Consider the CNOT matrix.
It creates a correlation between the two coins.
Quantum mechanics is similar, except that it uses amplitudes.
A quantum state is a unit vector of \(\complex^N\).
Finite dimensional spaces suffice for this course.
A qubit is a superposition of 0 and 1.
Quantum states are superposition of possibilities.
Dirac notation.
ket, bra (\(\ket{0}, \ket{1}, \ket{+}, \ket{-}, \ket{\psi}\)).
Unitary. \(U^\dagger U = I\).
Examples: \(I, X, Y, Z, H, S, T\), CNOT, rotations.
\[\begin{equation*} H = \frac{1}{\sqrt{2}} \begin{bmatrix} 1 & 1\\ 1 & -1 \end{bmatrix}. \end{equation*} \] \[\begin{equation*} H \ket{0} = \ket{+}, H \ket{1} = \ket{-}. \end{equation*} \]Why unitary? Unitary operators are the only operators that preserve the length of vectors.
一场从根上开始的革命 (by Mingsheng Ying)
Simulating physics with computers (by Richard P. Feynman)
18:01 Charlie Bennett - 1981 and the Beginning of Quantum Information
49:20 Peter Shor - Development of Quantum Algorithms and Error Correction
Probability, amplitude, and Born's rule
Two-slit experiment
A (classical) random bit the mixture of \(0\) or \(1\) described by probabilities.
A quantum bit (qubit) is a superposition state described by amplitudes for \(0\) and \(1\).
1-norm vs. 2-norm theory.
Linear algebra for probability
A (random) bit is represented as a vector \(\begin{pmatrix} p_0 \\ p_1 \end{pmatrix}\).
\(X\) and CNOT as operations on the probability vector.
\(X\) is the NOT gate in matrix form!
CNOT says if the control bit is \(1\), flip the target bit.
Tensor product arises naturally.
Binary symmetric channel as a matrix.
Linear algebra for quantum mechanics
Quantum gates
Unitary gates
Pauli \(X, Y, Z\)
The Hadamard gate \(H\)
In some sense, the most important quantum gate.
The Hadamard gate \(H\) has the matrix form
\[\begin{equation} H = \frac{1}{\sqrt{2}} \begin{bmatrix} 1 & 1\\ 1 & -1 \end{bmatrix}. \end{equation} \]It maps \(\ket{0}, \ket{1}\) to \(\ket{+}, \ket{-}\), and vice versa.
\(H\) and \(H^2\).
Destructive interference (in the tree for \(H^2 \ket{0}\)).
All quantum gates are changes of basis.
We can expand \(\ket{0}\) in the basis of \(\ket{+}\) and \(\ket{-}\).
\(\ket{0}, \ket{1}\) is not special.
There are different ways to measure in quantum!
What if we measure along another choice of basis? Say, \(\ket{+}, \ket{-}\)?
Decompose the state \(\ket{\psi}\) as a combination of \(\ket{+}\) and \(\ket{-}\).
Two views:
The projection picture on the circle.
Projective measurements.
Like calculating the probability of two independent random bits, we can use the tensor product to represent the state of multiple qubits.
It is now the tensor product of quantum states (vectors of amplitudes instead of vectors of probabilities).
Two-qubit states
A general two-qubit state is
\[\begin{equation} \ket{\psi} = \alpha \ket{00} + \beta \ket{01} + \gamma \ket{10} + \delta \ket{11} \end{equation} \]It is the superposition of four possibilities.
Born's rule gives the probability of four outcomes.
In principle there's no distance limitation between the two qubits.
Let's say the first qubit is in Alice's hand. What happens if Alice measures it?
If Alice obtained \(0\) outcome, then the state is
\[\begin{equation} \frac{\alpha \ket{00} + \beta \ket{11}}{\sqrt{\abs{\alpha}^2 + \abs{\beta}^2}}. \end{equation} \]This is called the Partial Measurement Rule, which follows from the measurement postulate.
\[\begin{equation*} \begin{split} M_0 & = \ket{0}\bra{0} \otimes I\\ M_1 & = \ket{1}\bra{1} \otimes I \end{split} \end{equation*} \]Two-qubit gates
Single-qubit gates as a two-qubit gate: \(X \otimes I\) and \(I \otimes X\).
Applying the same single-qubit gate on the two qubits: \(H \otimes H\).
CNOT gate.
We need some facts from linear algebra to take our grasp of quantum mechanics to the next level.
\(A (\sum_j a_j \ket{v_j}) = \sum_j a_j A \ket{v_j}\).
Matrix entries \(A_{ij} = \bra{i} A \ket{j}\).
Hilbert space: vector space with an inner product.
What is an inner product \((\,\cdot\,, \,\cdot\,)\)?
A concrete example is
\[\begin{equation*} (\ket{u}, \ket{v}) = \sum_{j=1}^d {u_j}^* v_j. \end{equation*} \]Question: is this the only possibility?
Dirac notation \(\langle u | v \rangle\).
The norm is induced by the inner product (of the Hilbert space).
\[\begin{equation*} \norm{\ket{u}} = \sqrt{\langle u | u \rangle}. \end{equation*} \]The normalization of \(\ket{u}\) is \(\ket{u} / \norm{\ket{u}}\).
The decomposition of a state into basis
\[\begin{equation*} \ket{\psi} = \sum_j a_j \ket{\lambda_j}. \end{equation*} \]Then
\[\begin{equation*} a_j = \langle \lambda_j | \psi \rangle. \end{equation*} \]Orthonormal basis and completeness relation:
If \(\{ \ket{\lambda_i} \}\) form an orthonormal basis, then we have
\[\begin{equation*} \sum_j \ket{\lambda_j} \bra{\lambda_j} = I. \end{equation*} \]Adjoints of operators
Let \(A\) be a linear operator on a Hilbert space. There exists a unique operator \(A^\dagger\) such that for all \(\ket{v}, \ket{w}\),
\[\begin{equation*} (\ket{v}, A \ket{w}) = (A^\dagger \ket{v}, \ket{w}). \end{equation*} \]In the matrix form, \(A^\dagger\) is the conjugate transpose of \(A\).
\(A \ket{\lambda} = \lambda \ket{\lambda}\).
Matrix \(A\) is diagonalizable if \(A = \sum_j \lambda_j \ket{\lambda_j} \bra{\lambda_j}\) where \(\ket{\lambda_j}\) form an orthonormal set of eigenvectors of \(A\).
Example: What is the eigenvalues and eigenvectors of \(X\)?
Spectral decomposition theorem
Theorem. Any normal operator \(A\) on space \(V\) is diagonalizable with respect to some orthonormal basis of \(V\), and vice versa.
Families of diagonalizable matrices
Matrix | Condition | Eigenvalues |
---|---|---|
Normal | \(A\) and \(A^\dagger\) commute | Complex |
Hermitian | \(H = H^\dagger\) | Real |
Unitary | \(U^\dagger U = I\) | Phase |
Projection | \(P^2 = P, P = P^\dagger\) | \(\{0,1\}\) |
Reflection | \(R^2 = I, R = R^\dagger\) | \(\{\pm 1\}\) |
\(A \otimes B\) is a block matrix whose \(i,j\)-th block is \(a_{i,j} B\).
\[\begin{equation*} X \otimes Y = \begin{bmatrix} 0 & 0 & 0 & -i\\ 0 & 0 & i & 0\\ 0 & -i & 0 & 0\\ i & 0 & 0 & 0 \end{bmatrix} \end{equation*} \]For \(A = \sum_a a \ket{a}\bra{a}\), define \(f(A) = \sum_a f(a) \ket{a}\bra{a}\).
Example: \(\exp(A)\).
State
The state of a closed quantum system can be described by a unit vector in a Hilbert space (called the state space).
Closed: no observer, no leak of information to the environment!
If a "particle" can be in one of the \(d\) possibilities from a finite set \(\Gamma\), the state space is \(\mathcal{H} = \complex^\Gamma\). That is, to each possibility in \(\Gamma\), a complex number (the amplitude) is assigned.
\[\begin{equation*} \ket{\psi} = \begin{bmatrix} \psi_1 \\ \psi_2 \\ \vdots \\ \psi_d \end{bmatrix} \end{equation*} \]such that \(\norm{\ket{\psi}} = \abs{\psi_1}^2 + \cdots + \abs{\psi_d}^2 = 1\).
Evolution
The discrete time evolution of the closed system can be described by a unitary operator. If the state at time \(t_0\) is \(\ket{\psi_0}\), the state at time \(t_1\) can be obtained as
\[\begin{equation*} \ket{\psi_1} = U \ket{\psi_0}. \end{equation*} \]It also follows from the Schrödinger equation
\[\begin{equation*} i \hbar \frac{\partial}{\partial t} \ket{\psi(t)} = H \ket{\psi(t)}. \end{equation*} \]Take \(\hbar = 1\), the evolution from \(t_0\) to \(t_1\) is
\[\begin{equation*} U = \exp \bigl(-i H (t_1 - t_0) \bigr). \end{equation*} \]Measurement
Composite system
The state of a composite system whose components have states \(\ket{\psi_1}, \ket{\psi_2}, \ldots, \ket{\psi_m}\) is the tensor product state
\[\begin{equation*} \ket{\psi_1} \otimes \ket{\psi_2} \otimes \cdots \otimes \ket{\psi_m}. \end{equation*} \]The measurement postulate
Quantum measurements are described by a collection \(\{M_m\}\) of measurement operators. These are operators acting on the system being measured and must satisfy
\[\begin{equation*} \sum_m M_m^\dagger M_m = I. \end{equation*} \]The index \(m\) refers to the measurement outcomes that may occur.
(Compare the \(U^\dagger U = I\) condition)
If the state of the system is \(\ket{\psi}\) immediately before the measurement then the probability that result \(m\) occurs is given by
\[\begin{equation*} p(m) = \bra{\psi} M_m^\dagger M_m \ket{\psi}. \end{equation*} \]The state after the measurement is
\[\begin{equation*} M_m \ket{\psi} \big/ \sqrt{p(m)}. \end{equation*} \]Why this general form?
It is essentially Born's rule when we use ancillary systems and apply a joint unitary operation.
It is known as the Naimark theorem.
Hint: Consider a unitary
\[\begin{equation*} U = \begin{pmatrix} M_1 & \cdots & \cdots\\ \vdots & \ddots & \vdots\\ M_m & \cdots & \cdots \end{pmatrix} \end{equation*} \]Special case: projective measurement if \(M_m = \ket{\psi_m}\bra{\psi_m}\) and this will recover the Born's rule.
Observable: an operator that models the observation of a physical quantity.
For any Hermitian operator \(O\), the spectral decomposition theorem tells us that it has the form
\[\begin{equation*} O = \sum_m m P_m, \end{equation*} \]where \(m\) is from a finite subset of \(\real\).
\[\begin{equation*} \E (O) = \sum_m m\, p(m) = \bra{\psi} O \ket{\psi} = \langle O \rangle. \end{equation*} \]So the average value \(\langle O \rangle\) on state \(\ket{\psi}\) is the expectation of outcome value \(m\).
Measurement collapses the state.
The three polarizer "paradox".
Gates are:
Invertible
This means that unitary gates never erase information. Compare this with classical gates!
Deterministic
Nothing is random in the application of a quantum gate.
Continuous
There is a unitary operation that represents the evolution of a smaller period of time.
Explains the necessity of using complex numbers.
What is the square root of the \(Z\) gate?
Measurements are:
Irreversible
We cannot go back to the state before measurement in general!
The superposition state (wave function) collapses.
Probabilistic
Born's rule.
Discontinuous
The collapse of the wave function in measurement is considered instantaneous!
How does quantum mechanics reconcile these two conflicting operations? Born's rule!
Gates preserve the 2-norm, and measurements give the probabilities determined by the 2-norm.
If you want to think more about superposition, the lecture by Allan Adams on superposition is highly recommended.
Quantum measurements are described by measurement operators \(\{M_m\}\) satisfying
\[\begin{equation*} \sum_m M_m^\dagger M_m = I. \end{equation*} \]The probability that \(m\) occurs is \(\bra{\psi} M_m^\dagger M_m \ket{\psi}\).
The post-measurement state given outcome \(m\) is proportional to \(M_m \ket{\psi}\).
An observable is a Hermitian operator whose eigenprojectors form a set of measurement operators indexed by the eigenvalues.
\(Z\) is an observable, which corresponds to the computational measurement \(\{\ket{a}\bra{a}\}\) with outcomes \((-1)^a\).
How do we process information using the four postulates?
Boolean functions, Boolean circuits, and Boolean formulas
A Boolean function is function \(f : \{0,1\}^n \to \{0,1\}\).
There are \(2^{2^n}\) of them!
A Boolean circuit is a model of computation using gates and wires.
A Boolean formula is a Boolean circuit where all gates have fan-out at most \(1\).
Functional completeness
A set of Boolean functions \(f_i : \{0,1\}^{n_i} \to \{0,1\}\) is functionally complete if the functions "generate" every \(f : \{0, 1\}^n \to \{0, 1\}\) for all \(n \ge 1\).
\(\{\text{AND}, \text{OR}, \text{NOT}\}\) is functionally complete.
Shannon formula:
\[\begin{equation*} \begin{split} f(x_1,...,x_n) & = ((\neg x_1) \land f(0,x_2,...,x_n)) \\ & \qquad \lor (x_1 \land f(1,x_2,...,x_n)). \end{split} \end{equation*} \]\(\{\text{NAND}\}\) is functionally complete.
Disjunctive normal form (sum of minterms):
\[\begin{equation*} \bigvee_{f(a_1,...,a_n)=1} \bigwedge_{i=1}^{n}x_i^{a_i}, \text{ where } x_i^{a_i} = \begin{cases} \phantom{\neg} x_i & \text{if } a_i = 1,\\ \neg x_i & \text{if } a_i=0. \end{cases} \end{equation*} \]Quantum circuit is our new language for describing quantum information processing.
Reversibility implies that the quantum circuit looks more regular (the same number of input and output qubits).
From left to right: initial state \(\ket{\psi_0}\), the first gate \(U_1\), the second gate \(U_2\), …, the last gate \(U_T\).
Reverse the order when writing them in an equation:
\[\begin{equation*} U_T \cdots U_2 U_1 \ket{\psi_0}. \end{equation*} \]Multi-qubit gates CNOT, C-U, CCNOT (Toffoli) in circuit diagrams.
What does it mean to apply a single qubit \(U\) to a multi-qubit states?
Prepare a classical bit string \(x\) from the all-zero string.
\(H \ket{0}\) and measure.
Bell state preparation.
Qubit is the quantum analog of bit and is, therefore, the simplest yet most important concept in quantum information processing.
Some noteworthy examples of states are:
\[\begin{equation*} \begin{split} & \ket{0} = \begin{pmatrix} 1 \\ 0 \end{pmatrix},\quad \ket{1} = \begin{pmatrix} 0 \\ 1 \end{pmatrix},\\ & \ket{+} = \frac{\ket{0} + \ket{1}}{\sqrt{2}} = \frac{1}{\sqrt{2}} \begin{pmatrix} 1 \\ 1 \end{pmatrix},\\ & \ket{-} = \frac{\ket{0} - \ket{1}}{\sqrt{2}} = \frac{1}{\sqrt{2}} \begin{pmatrix} 1 \\ -1 \end{pmatrix}. \end{split} \end{equation*} \]Single-qubit states
Examples of single-qubit states \(\ket{0}, \ket{1}, \ket{+}, \ket{-}\).
\(\alpha\) and \(\beta\) are complex numbers.
Quantum theory is a \(2\)-norm theory: \(\abs{\alpha}^2 + \abs{\beta}^2 = 1\).
Global phases do not matter and so we can write
\[\begin{equation*} \ket{\psi} = \cos\frac{\theta}{2} \ket{0} + \sin\frac{\theta}{2}\, e^{i\varphi} \ket{1} \end{equation*} \]Pure single-qubit states are on the Bloch sphere.
Before discussing mixed states in the Bloch sphere, we review the Pauli operators.
Pauli operators are unitary, Hermitian, and traceless.
What are their eigenvalues and eigenvectors?
Pauli operators as observables.
Commutation and anti-commutation relations of Pauli operators
\[\begin{align} [\sigma_x, \sigma_y] & = 2 i \sigma_z, & \{\sigma_x, \sigma_y\} & = 0,\\ [\sigma_y, \sigma_z] & = 2 i \sigma_x, & \{\sigma_y, \sigma_z\} & = 0,\\ [\sigma_z, \sigma_x] & = 2 i \sigma_y, & \{\sigma_z, \sigma_x\} & = 0. \end{align} \]For any unit vector \(\vec{n} = (n_x, n_y, n_z) \in \real^3\), define
\[\begin{equation*} \vec{n} \cdot \vec{\sigma} = \sum_{k \in \{x,y, z\}} n_k\, \sigma_k. \end{equation*} \]It is unitary, Hermitian, and traceless
\[\begin{equation*} (\vec{n} \cdot \vec{\sigma})^2 = I. \end{equation*} \]For pure state \(\ket{\psi} = \cos\frac{\theta}{2} \ket{0} + \sin\frac{\theta}{2} e^{i\varphi} \ket{1}\) and its coordinate \(\vec{n}_\psi\), we have
\[\begin{equation*} \vec{n}_\psi \cdot \vec{\sigma} = \begin{pmatrix} \cos\theta & \sin\theta\, e^{-i\varphi}\\ \sin\theta\, e^{i\varphi} & -\cos\theta \end{pmatrix}. \end{equation*} \]What are the eigenvectors of the above operator?
The coordinate of \(\ket{\psi}\) in \(\real^3\) is
\[\begin{equation*} (n_x, n_y, n_z) = (\sin\theta \cos\varphi, \sin\theta \sin\varphi, \cos\theta). \end{equation*} \]It would be more natural first to define the \(\vec{n} \cdot \vec{\sigma}\) operator for any coordinate and define the pure state as its eigenvalue-\(1\) eigenstate.
It explains the \(\frac{\theta}{2}\) used to define the state \(\ket{\psi}\).
Pauli \(X\), \(Y\), \(Z\) rotate the Bloch sphere by angle \(\pi\) along the \(x\), \(y\), \(z\) axis, respectively.
The example of the \(Z\) action:
\[\begin{equation*} \cos\frac{\theta}{2} \ket{0} + \sin\frac{\theta}{2} e^{i \varphi} \ket{1} \\ \mapsto \cos\frac{\theta}{2} \ket{0} - \sin\frac{\theta}{2} e^{i \varphi} \ket{1}. \end{equation*} \]How about \(\vec{n} \cdot \vec{\sigma}\)?
We do quantum information processing simply by rotating the qubits in our system!
Hadamard \(H\), phase gate \(S\), \(\pi/8\) gate \(T\)
\[\begin{equation*} \begin{split} & H = \frac{1}{\sqrt{2}} \begin{pmatrix} 1 & 1 \\ 1 & -1 \end{pmatrix},\quad S = \begin{pmatrix} 1 & 0\\ 0 & i \end{pmatrix},\\ & T = \begin{pmatrix} 1 & 0 \\ 0 & e^{\pi i / 4} \end{pmatrix}. \end{split} \end{equation*} \]Rotations through \(a = x, y, z\) axis
\[\begin{equation*} R_a(\phi) = \exp\Bigl( - \frac{\phi}{2} i \sigma_a \Bigr) = \cos\frac{\phi}{2} I - i\sin\frac{\phi}{2} \sigma_a. \end{equation*} \]In particular, \(R_z(\phi) = \begin{pmatrix} e^{-\frac{\phi}{2} i} & 0 \\ 0 & e^{\frac{\phi}{2} i} \end{pmatrix}\), and \(T\) is \(R_z(\pi / 4)\) up to a global phase.
Rotations along \(\vec{n}\)
\[\begin{equation*} \begin{split} R_{\vec{n}}(\phi) & = \exp\bigl( - \frac{\phi}{2} i\, \vec{n} \cdot \vec{\sigma}\bigr)\\ & = \cos\frac{\phi}{2} I - i \sin\frac{\phi}{2} \vec{n} \cdot \vec{\sigma}. \end{split} \end{equation*} \]Any single-qubit unitary operator \(U\) can be written as
\[\begin{equation*} U = e^{i\alpha} R_{\vec{n}}(\phi), \end{equation*} \]for some \(\alpha\), \(\phi\), and \(\vec{n}\).
(Z-Y decomposition) Any single-qubit unitary \(U\) has the form
\[\begin{equation*} U = e^{i\alpha} R_z(\beta) R_y(\gamma) R_z(\delta). \end{equation*} \]The computational (standard) basis measurement (\(Z\) measurement)
Is the state \(\ket{0}\) or \(\ket{1}\)?
(Born's rule) When we measure \(\ket{\psi} = \alpha \ket{0} + \beta \ket{1}\):
With probability \(\abs{\alpha}^2\), the measurement outcome is \(0\) and the state collapse to \(\ket{0}\);
With probability \(\abs{\beta}^2\), the measurement outcome is \(1\) and the state collapse to \(\ket{1}\).
Hadamard basis measurement (\(X\) measurement):
Is the state \(\ket{+}\) or \(\ket{-}\)?
When we measure \(\ket{\psi} = \alpha \ket{0} + \beta \ket{1}\):
With probability \(\abs{\alpha + \beta}^2/2\), the outcome is \(+\) and the state collapse to \(\ket{+}\);
With probability \(\abs{\alpha - \beta}^2/2\), the outcome is \(-\) and the state collapse to \(\ket{-}\).
Measurement operators
\[\begin{equation*} M_{+} = \frac{I + X}{2} = \ket{+}\bra{+},\quad M_{-} = \frac{I - X}{2} = \ket{-}\bra{-}. \end{equation*} \]\(X\) as an observable.
Why is it called the Hadamard basis measurement?
A projective measurement in the direction of \(\vec{n}\)
Note that \(\vec{n} \cdot \vec{\sigma}\) is an observable.
Measurement operators
\[\begin{equation*} M_b = \frac{I + (-1)^b\, \vec{n} \cdot \vec{\sigma}}{2}, \text{ for } b = 0, 1. \end{equation*} \]Check that \(M_b\) is indeed projective and \(M_0 + M_1 = I\).
Someone sends you a box, which is either a dud (empty box) or a bomb attached to a quantum device that will trigger the explosion if it measures a \(\ket{1}\) on the photon sent in. Your task is to detect if there is a bomb inside.
Not very interesting
Dud: No explosion, measures \(\ket{+}\).
Bomb: 50% explosion, 25% measures \(\ket{+}\), 25% measures \(\ket{-}\).
a. Start with \(\ket{0}\), rotate \(\epsilon\) angle, and send it in.
Use \(R_y(\epsilon)\) to implement the \(\epsilon\) rotation.
b. Repeat \(N = O(1/\epsilon)\) times without measuring. c. Measure in the standard basis.
Dud: No explosion, measures \(\ket{1}\) if \(N = \pi/\epsilon\).
Bomb: \(O(\epsilon)\) probability of explosion, measures \(\ket{0}\).
\[\begin{equation*} \Pr(\text{explodes}) \le N \sin^2 \Bigl( \frac{\epsilon}{2} \Bigr) = O(\epsilon). \end{equation*} \]This is known as the quantum Zeno effect.
Entanglement is yet another peculiar quantum phenomenon along with superposition and measurement.
Superpositions of 00, 01, 10, 11
\[\begin{equation*} \ket{\psi_{AB}} = \alpha_{00} \ket{00} + \alpha_{01} \ket{01} + \alpha_{10} \ket{10} + \alpha_{11} \ket{11}. \end{equation*} \]Tensor products \(\ket{ab} = \ket{a} \otimes \ket{b}, \text{ for } a, b \in \{0, 1\}\).
Tensor product of two matrices
Vector representation \((\alpha_{ij})\)
Normalization condition \(\sum_{a,b} \abs{\alpha_{ab}}^2 = 1\).
CNOT, SWAP, Controlled-\(U\), …
\[\begin{equation*} \begin{split} \text{CNOT} :\; & \ket{a, b} \mapsto \ket{a, a\oplus b}, \\ \text{SWAP} :\; & \ket{a, b} \mapsto \ket{b, a}, \\ \text{Controlled-}U :\; & \ket{a, \psi} \mapsto \ket{a} \otimes U^a \ket{\psi}. \end{split} \end{equation*} \]Circuit identities
Control vs. target
CNOT and single-qubit gates are all you need for universal quantum computing.
…, then they can no longer be described in the same way as before, viz. by endowing each of them with a representative of its own. I would not call that one but rather the characteristic trait of quantum mechanics, the one that enforces its entire departure from classical lines of thought. By the interaction the two representatives [the quantum states] have become entangled.
Four Bell states
\[\begin{alignat*}{3} & \ket{\beta_{00}} & = \frac{\ket{00} + \ket{11}}{\sqrt{2}},\quad && \ket{\beta_{01}} & = \frac{\ket{01} + \ket{10}}{\sqrt{2}},\\ & \ket{\beta_{10}} & = \frac{\ket{00} - \ket{11}}{\sqrt{2}},\quad && \ket{\beta_{11}} & = \frac{\ket{01} - \ket{10}}{\sqrt{2}}. \end{alignat*} \]The first index determines the sign; the second determines the parity.
Prepare the Bell states
The Bell states are also known as the EPR states, and we often use \(\ket{\text{EPR}}\) to denote \(\ket{\beta_{00}}\).
Singlet state \(\ket{\beta_{11}}\) is an anti-symmetric state.
The other three states are all symmetric and span the symmetric subspace of the two-qubit state space.
It is easy to verify that this state is not a product of single-qubit states!
A picture of entanglement.
Basic rules of quantum mechanics imply the existence of EPR, arguably what troubled Einstein the most in quantum mechanics! It is what he called the 'spooky action at a distance'.
Consider two players Alice and Bob sharing an EPR state. Alice brings her particle to the moon while Bob stays on earth.
Fact: If Alice measures her particle, she instantaneously knows whether Bob's state is \(\ket{0}\) or \(\ket{1}\) (which Bob can confirm if he performs the standard basis measurement).
Is it a big deal?
It is not very different from a pair of two perfectly correlated classical bits.
A pair of gloves in two boxes.
EPR considers other possibilities: Alice can measure in a different basis.
There is another way to open the box.
We have measured \(Z\) basis, what about \(X\) measurement? That is, what happens if Alice measures \(\ket{\pm}\)?
Method I. \(M_0 = \ket{+}\bra{+} \otimes I\) and \(M_1 = \ket{-}\bra{-} \otimes I\).
Method II. Write EPR state
\[\begin{equation*} \ket{\text{EPR}} = \frac{\ket{{+}{+}} + \ket{{-}{-}}}{\sqrt{2}}. \end{equation*} \]The conclusion is that Alice will instantaneously know whether Bob's post-measurement state is \(\ket{+}\) or \(\ket{-}\).
Consider two situations. In the first, Alice measures in the standard basis, she will obtain \(a=0,1\) with equal probability and Bob's qubit collapses to \(\ket{a}\). In the second, she measures in the Hadamard basis, she will obtain \(a={+},{-}\), with equal probability and Bob's qubit collapses to \(\ket{a}\).
It looks as if Alice's measurement choice affects Bob's final state! Would it allow Alice to send information instantaneously to Bob?
What are the states of Bob in the two situations?
Bob's view and Alice's view.
The mixed state framework will help us to answer the question and see why this won't allow superluminal communication.
In the first situation, Bob's state is in an equal distribution over \(\ket{0}, \ket{1}\). In the second, it is an equal distribution over \(\ket{+}\) and \(\ket{-}\). Can we distinguish these two cases in quantum mechanics?
Let's generalize a little bit and consider a distribution of pure states described by an ensemble \(\{(p_i, \ket{\psi_i})\}\). It means that our knowledge of the quantum system says that it is in state \(\ket{\psi_i}\) with probability \(p_i\).
Consider a quantum measurement \(\{M_0, M_1\}\) measuring the state ensemble. The probability that \(a\) occurs is
\[\begin{equation*} \Pr(a) = \sum_i p_i \bra{\psi_i} M_a^\dagger M_a \ket{\psi_i}. \end{equation*} \]We can rewrite it using cyclic property of the trace as
\[\begin{equation*} \Pr(a) = \tr \Bigl(\Bigl(\sum_i p_i \ket{\psi_i}\bra{\psi_i} \Bigr) \, M_a^\dagger M_a \Bigr). \end{equation*} \]We call
\[\begin{equation*} \rho = \sum_i p_i \ket{\psi_i}\bra{\psi_i} \end{equation*} \]the density matrix of the ensemble. The probability is determined by the density matrix, so it makes sense to use the density matrix to describe the state of the system.
We now go back to the EPR paradox. When Alice measures \(\ket{0}, \ket{1}\), Bob's state is also \(\ket{0}\) or \(\ket{1}\), which Alice knows for sure as it will be the same as her measurement outcome. But Bob does not know the measurement outcome, so before Alice send the outcome to him, his knowledge of the system is that it is \(\ket{0}\) or \(\ket{1}\) with equal probability. The density matrix is
\[\begin{equation*} \rho = \frac{1}{2} (\ket{0}\bra{0} + \ket{1}\bra{1}) = \frac{I}{2}. \end{equation*} \]If Alice measured \(\ket{+}, \ket{-}\). Bob's qubit is \(\ket{+}\) or \(\ket{-}\) with equal probability. Hence, the density matrix is
\[\begin{equation*} \rho' = \frac{1}{2} (\ket{+}\bra{+} + \ket{-}\bra{-}) = \frac{I}{2}. \end{equation*} \]So even though the two ensembles look very different, the density matrices in the two situations are the same. It means that no measurement on Bob's system can tell the two cases apart. This makes perfect sense as the state of Bob should not change when Bob has done nothing to his system.
It explains why density matrix is more fundamental then ensemble of quantum states.
No superluminal communication is possible using the strategy above!
Mathematically, a density matrix is positive semidefinite and of trace \(1\) (iff there is a corresponding ensemble). It is a description of the state of a quantum system that is more general than state vectors for pure states.
Any pure state \(\ket{\psi}\) has a density matrix representation \(\rho = \ket{\psi}\bra{\psi}\).
Example: The density matrix for \(\ket{+}\) is
\[\begin{equation*} \ket{+}\bra{+} = \frac{1}{2} \begin{bmatrix} 1 & 1\\ 1 & 1 \end{bmatrix}. \end{equation*} \]The spectral decomposition of a density matrix
\[\begin{equation*} \rho = \sum_j \lambda_j \ket{\psi_j}\bra{\psi_j} \end{equation*} \]gives a possible ensemble \(\{(\lambda_j, \ket{\psi_j})\}\) for the density matrix.
Probability can be considered a special case where the density matrix is diagonal.
The state is now a density matrix.
Evolution: \(\rho \mapsto U\rho U^\dagger\).
Measurement: \(\Pr(m) = \tr \bigl( M_m^\dagger M_m \rho \bigr)\) and the post-measurement state is proportional to
\[\begin{equation*} M_m \rho M_m^\dagger. \end{equation*} \]Tensor: \(\rho = \rho_1 \otimes \cdots \otimes \rho_n\).
Define an inner product on the \(2\) by \(2\) matrix space
\[\begin{equation*} (A, B) = \frac{\tr(A^\dagger B)}{2}. \end{equation*} \]Together with \(I\), Pauli matrices form an orthonormal basis!
For all \(\rho\), we can write
\[\begin{equation*} \rho = \frac{I + \vec{n} \cdot \vec{\sigma}}{2}, \text{ where } n_a = \tr(\rho \sigma_a). \end{equation*} \]The positive semidefinite condition is equivalent to \(\norm{\vec{n}} \le 1\).
The Bloch sphere has all valid single-qubit states.
What are the states of the north pole, the origin, and \((1, 0, 0)\)?
From coordinate \(\vec{n}\), we can compute the state as
\[\begin{equation*} \rho = \frac{I + \vec{n} \cdot \vec{\sigma}}{2}. \end{equation*} \]Conversely, given the density matrix \(\rho\), the expansion in the Pauli operators basis gives a vector \(\vec{n} = (n_x, n_y, n_z)\)
\[\begin{equation*} n_a = \tr(\rho \sigma_a), \text{ for } a = x, y, z. \end{equation*} \]Classical probability over a bit can be represented as diagonal density matrices corresponding to the line segment between the north and south poles.
Summarize: Things that can be identified with point \(\vec{n}\) on the sphere.
When we measure \(\rho = (I + \vec{n} \cdot \vec{\sigma})/2\):
With probability \(\tr(\rho\, \ket{0}\bra{0}) = (1 + n_z) / 2\), the measurement outcome is \(0\) and the state collapse to \(\ket{0}\bra{0}\);
With probability \(\tr(\rho\, \ket{1}\bra{1}) = (1 - n_z) / 2\), the measurement outcome is \(1\) and the state collapse to \(\ket{1}\bra{1}\).
What is the state of \(A\) given the state of \(AB\)? We cannot answer the question in the pure state framework, and we can now with density matrices.
Let \(\rho_{AB}\) the joint state of \(A\) and \(B\) systems. What is the state of \(B\) only? Is there a density matrix associated with system \(B\) given that we know the state of both \(A\) and \(B\)?
We need an effective representation of the state on \(B\) so that measurement only depends on this reduced state (not the joint state).
Let \(M\) be a measurement operator; the probability its outcome occurs is
\[\begin{equation*} \tr \bigl( \rho_{AB} (I_A \otimes M^\dagger M) \bigr), \end{equation*} \]which is a linear functional of \(M^\dagger M\). So there is a \(\rho_B\) such that
\[\begin{equation*} \tr \bigl( \rho_{AB} (I_A \otimes M^\dagger M) \bigr) = \tr (\rho_B\, M^\dagger M). \end{equation*} \]The partial trace is defined by
\[\begin{equation*} \tr_A : \ket{i,j}\bra{k,l}_{A,B} = \tr(\ket{i}\bra{k}_A) \ket{j}\bra{l}_B. \end{equation*} \]Extend by linearity.
\[\begin{equation*} \tr \Bigl( \rho_{AB} (I_A \otimes M^\dagger M) \bigr) = \tr \bigl( \tr_A(\rho_{AB}) M^\dagger M \bigr). \end{equation*} \]The reduced density matrix \(\rho_B = \tr_A (\rho_{AB})\).
What is the partial trace of the EPR state?
The reduced density matrices of \(\rho_{AB} = \rho_A \otimes \rho_B\) is \(\rho_A\) and \(\rho_B\). The converse is usually not true!
For any mixed state \(\rho_B\), there is a system \(A\) of the same dimension as \(B\) and a pure state \(\ket{\psi}_{AB}\) such that
\[\begin{equation*} \tr_A \bigl( \ket{\psi}\bra{\psi}_{AB} \bigr) = \rho_B. \end{equation*} \]Use the spectral decomposition theorem again!
Example: The EPR state is a purification of \(I/2\).
Mixed states arise when we lose control of a system!
Homework I
Problem 3 about computing
\[\begin{equation*} \bra{x} H^{\otimes n} \ket{y}. \end{equation*} \]Problem 4
Spectral decomposition
Consider a special case where \(A\) is Hermitian.
Let \(\lambda_1\) be an eigenvalue of \(A\) and \(\ket{e_1}\) a corresponding eigenvalue.
\[\begin{equation*} \begin{split} & \lambda_1 (\ket{e_1}, \ket{e_1}) = (\ket{e_1}, A\ket{e_1}) \\ = \, & (A\ket{e_1}, \ket{e_1}) = \lambda_1^* (\ket{e_1}, \ket{e_1}). \end{split} \end{equation*} \]This means that \(\lambda_1\) is real.
Consider subspace \(K\) spanned by all vectors orthogonal to \(\ket{e_1}\).
We prove that \(K\) is an invariant subspace of \(A\). Suppose \(\ket{\psi} \in K\), we have \(\bra{e_1} A \ket{\psi} = \lambda_1 \langle e_1 | \psi \rangle = 0\). Use induction to complete the proof.
In standard textbooks, it says that \(A = UDU^\dagger\). Using Dirac notation, we write it as
\[\begin{equation*} A = U \bigl( \sum_j \lambda_j \ket{j}\bra{j} \bigr) U^\dagger. \end{equation*} \]In this form, \(\ket{e_j} = U\ket{j}\) will be the eigenvectors of \(A\), and we write
\[\begin{equation*} A = \sum_j \lambda_j \ket{e_j}\bra{e_j}, \end{equation*} \]and \(\ket{e_j}\)'s are orthonormal.
Measurement
Projective measurements
Measurement operators are projectors such as \(\ket{0}\bra{0}\) and \(\ket{1}\bra{1}\).
Note that \(\ket{\psi}\bra{\psi}\) is the projector to the span of \(\ket{\psi}\).
A set of projectors \(\{P_j\}\) form a projective measurement if \(\sum_j P_j = I\).
General measurement satisfies \(\sum_j M_j^\dagger M_j = I\)
When we don't care about the post-measurement state, we consider measurement operators \(N_j = M_j^\dagger M_j \ge 0\).
The completeness condition is \(\sum_j N_j = I\).
The probability that \(j\) occurs is \(\tr(N_j \rho)\).
They are called positive-operator valued measure (POVM).
(Schmidt Decomposition). For a bipartite pure quantum state \(\ket{\psi_{AB}}\), we can write
\[\begin{equation*} \ket{\psi_{AB}} = \sum_{j=1}^k \sqrt{\lambda_j} \, \ket{\psi_A^{(i)}} \otimes \ket{\psi_B^{(i)}}, \end{equation*} \]where \(k \le \min(\dim_A, \dim_B)\), \(\lambda_j > 0\), and both \(\{\ket{\psi_A^{(i)}}\}\) and \(\{\ket{\psi_B^{(i)}}\}\) are orthonormal.
The meaning of the theorem says that, up to a change of local basis, an entangled state can be written as
\[\begin{equation*} \sum_j \sqrt{\lambda_j} \ket{j}_A \otimes \ket{j}_B. \end{equation*} \]Hint: Consider the singular value decomposition of matrix \((\psi_{j,k})\).
EPR as a mirror!
For any matrix \(U\):
\[\begin{equation*} (U \otimes I ) \sum_j \ket{j,j} = (I \otimes U^T) \sum_j \ket{j,j} = \sum_{i,j} U_{i,j} \ket{i,j} \end{equation*} \]\(\lambda_j\)'s are called the Schmidt coefficients
What are the Schmidt coefficients of the Bell states?
How about product states?
Thanks to this theorem, the theory of pure bipartite entanglement is well established.
Local operation and classical communication (LOCC)
Nielsen's Theorem
Let \(\ket{\psi}\) and \(\ket{\phi}\) be two entangled states on systems \(A,B\). Their Schmidt coefficients are \(\lambda = (\lambda_j)\) and \(\gamma = (\gamma_j)\) respectively.
Theorem (Nielsen). There is an LOCC transforming \(\ket{\psi}\) to \(\ket{\phi}\) if and only if the majorization condition \(\lambda \prec \gamma\) holds.
The condition \(\lambda \prec \gamma\) is that for all \(k\)
\[\begin{equation*} \sum_{j=1}^k \lambda_j^{\downarrow} \le \sum_{j=1}^k \gamma_j^{\downarrow}. \end{equation*} \]A simple corollary is that there is an LOCC transforming an EPR pair to an arbitrary entangled pair of qubits.
EPR: Maximally entangled states.
Superpositions of 000, 001, 010, \(\ldots\), 111
GHZ state
\[\begin{equation*} \ket{\text{GHZ}} = \frac{\ket{000} + \ket{111}}{\sqrt{2}}. \end{equation*} \]W state
\[\begin{equation*} \ket{\text{W}} = \frac{\ket{001} + \ket{010} + \ket{100}}{\sqrt{3}}. \end{equation*} \]Entanglement is a resource for many quantum information processing tasks
Now utilized for good at a distance of at least 1200km (Mozi satellite).
Quantum entanglement—physics at its strangest—has moved out of this world and into space. In a study that shows China's growing mastery of both the quantum world and space science, a team of physicists reports that it sent eerily intertwined quantum particles from a satellite to ground stations separated by 1200 kilometers, smashing the previous world record. (Science, 15 JUN 2017)
A qubit is a unit of quantum information storing the quantum state of a two-level physical system.
Quantum information is more general than classical information. We can store a bit in a qubit. But to store a qubit, it may require an infinite number of bits.
Consider an encoding \(a \mapsto \rho_a\) for \(a=0, 1\).
Promise: The system is in one of the two states \(\rho_0\) and \(\rho_1\),
Problem: Decide which is the case.
Measurement is the only bridge between the quantum and classical worlds. So we use a measurement with measurement operators \(M_0\) and \(M_1\)
\[\begin{equation*} \begin{split} \text{Minimize: } & \Tr(M_0 \rho_1) + \Tr(M_1 \rho_0)\\ \text{Subject to: } & M_0 \ge 0, M_1 \ge 0, M_0 + M_1 = I. \end{split} \end{equation*} \]The optimal \(M_0\) minimizes \(\Tr(M_0 (\rho_1 - \rho_0))\)
In the qubit case, we write \(\rho_b = (I + \vec{v}_b \cdot \vec{\sigma}) / 2\), and \(\rho_0 - \rho_1 = \vec{v} \cdot \vec{\sigma}\) for \(\vec{v} = (\vec{v}_0 - \vec{v}_1)/2\). The optimal \(M_0\) should be
\[\begin{equation*} \frac{1}{2} \left( I + \frac{\vec{v}}{\lVert \vec{v} \rVert} \cdot \vec{\sigma} \right) \end{equation*} \]The minimal error probability is therefore \((2 - \lVert \vec{v}_0 - \vec{v}_1 \rVert )/ 4\).
The amount of information one can store and reliably read from a qubit is only a single bit.
The states \(\ket{0}\) and \(\ket{+}\) are different but not perfectly distinguishable.
Non-orthogonality is the key issue.
What is information? What is quantum information?
We can store the whole Internet in a single qubit, but we cannot reliably read information out.
For a quantum state \(\rho\), its von Neumann entropy is \(S(\rho) = -\tr (\rho \log(\rho))\).
The eigenvalues of \(\rho\) form a probability distribution whose Shannon entropy equals the von Neumann entropy \(S(\rho)\).
Mutual information \(H(X{\,:\,}Y) = H(X) + H(Y) - H(XY)\).
Theorem. Let \(\{(p_x, \rho_x)\}\) be an ensemble of states. Alice prepares \(\rho_x\) with probability \(p_x\) and sends it to Bob. Bob measures the state and gets outcome \(y\). Then the mutual information (accessible information) between \(X\) and \(Y\) satisfies
\[\begin{equation*} H(X{\,:\,}Y) \le S \Bigl( \sum_x p_x \rho_x \Bigr) - \sum_x p_x S(\rho_x). \end{equation*} \]The right-hand side is called Holevo \(\chi\).
We will prove a poor man's version of Holevo's theorem.
Let \(X\) be a uniformly distributed random variable taking values in \([N]\). Consider an encoding of classical information \(x \mapsto \rho_x\) into quantum systems of \(n\) levels. For all measurement (POVM) \(\{M_m\}\), the probability that the measurement correctly guesses \(x\) is
\[\begin{equation*} p_x = \tr(M_x \rho_x) \le \tr(M_x). \end{equation*} \]In expectation the success probability is therefore
\[\begin{equation*} \E_x p_x \le \frac{1}{N} \sum_x \tr(M_x) = \frac{n}{N}. \end{equation*} \]Let \(\rho\) be an arbitrary qubit state, say \(\ket{0}\bra{0}\).
The quantum one-time pad works as follows:
For a player who does not know \(a,b\), the state is
\[\begin{equation*} \frac{1}{4} \sum_{a,b} U_{a,b} \rho U_{a,b}^\dagger = \frac{I}{2}. \end{equation*} \]Unknown quantum states cannot be copied.
There is no unitary transformation \(U\) that can map any \(\ket{\psi} \ket{0}\) to \(\ket{\psi} \ket{\psi}\).
Compute the inner product.
Note: \(\ket{0}\) and \(\ket{1}\) can be cloned using the CNOT gate.
Given a large number of i.i.d samples of quantum state \(\rho\), find the density matrix for \(\rho\).
In the single-qubit case, \(I\), Pauli \(X\), \(Y\), \(Z\) form an orthonormal basis, so
\[\begin{equation*} \rho = \frac{\tr(\rho)I + \tr(X \rho) X + \tr(Y \rho) Y + \tr(Z \rho) Z}{2}. \end{equation*} \]For example, to estimate \(\tr(Z \rho)\), we need to measure \(\rho\) in the \(Z\) basis multiple times, obtaining \(z_1, z_2, \ldots, z_m \in \{\pm 1\}\). Then \((\sum_i z_i) / m\) is an empirical estimate of \(\tr(Z \rho)\).
Note that the standard deviation is \(1/\sqrt{m}\) and so we need to repeat about \(O(1/\eps^2)\) times to have \(\eps\) precision.
Multiple qubit case is similar:
\[\begin{equation*} \rho = \frac{\sum_{P} \tr(\rho P) P}{2^n}, \end{equation*} \]where the summation goes over \(n\)-qubit Pauli operators.
Check that \(I\) and Pauli operators form a basis also in the multiple qubit case.
But it's a summation of \(4^n\) terms!
In the lab, it would take days to tomography a system of more than 10 qubits.
Theorem. The sample complexity for tomography is \(\tilde{\Omega}(d^2/\eps^2)\), where \(d\) is the dimension and \(\eps\) is the precision in the so called trace distance.
To make things worse, each term needs high precision for the triangle inequality to give meaningful bounds.
There are \(4^n\) small real numbers \(\tr(\rho P)\), and we have to ensure that their error sum is still controlled.
How do we even know the quantum device works as expected when tomography of its state is impractical?
We need to estimate 15 real numbers \(\mu_P = \tr(\ket{\beta_{11}}\bra{\beta_{11}} P)\) for non-identity two-qubit Pauli operators \(P\).
I | X | Y | Z | |
---|---|---|---|---|
I | 0 | 0 | 0 | |
X | 0 | -1 | 0 | 0 |
Y | 0 | 0 | -1 | 0 |
Z | 0 | 0 | 0 | -1 |
For \(P = XX, YY, ZZ\), \(\mu_P = -1\),
For all other \(P\), \(\mu_P = 0\).
Note that in this simple case, it suffices to check the \(XX, YY, ZZ\) average values in the first case to certify that the state is the singlet state.
The singlet state is the unique state stabilized by \(-XX\) and \(-ZZ\).