this post was submitted on 18 Sep 2025
17 points (100.0% liked)

MoreWrite

163 readers
1 users here now

post bits of your writing and links to stuff you’ve written here for constructive criticism.

if you post anything here try to specify what kind of feedback you would like. For example, are you looking for a critique of your assertions, creative feedback, or an unbiased editorial review?

if OP specifies what kind of feedback they'd like, please respect it. If they don't specify, don't take it as an invite to debate the semantics of what they are writing about. Honest feedback isn’t required to be nice, but don’t be an asshole.

founded 2 years ago
MODERATORS
 

For reasons outside the scope of this note, I have been reading Scott Aaronson's Quantum Computing Since Democritus for the first time since it was originally published.

As can happen in books based directly on lectures, it has more "personality" overtly on display than the average technical book. That goes for good and for ill. What Alice finds engaging, Bob can find grating, and vice versa. In this case, I noticed some passages that sound, well, smarmy. I personally find that there's a kind of self-deprecating arrogance at work, as though the book is saying, "I am a nerd, I hold the one true nerd opinion, and everything I assert is evident and simple if you are a nerd, which again, I am the defining example of." It's possible that I would have skipped past all that a decade ago, but now, I can't not see it.

There are big chunks of it that I'm not the best reader to evaluate. I'm a physicist who has casually studied computer science along with many other interests; I haven't tried to teach P vs NP in a classroom setting. But where the book does overlap with more serious interests of mine, I found it wanting. There's a part (chapter 9) about exploring where the rules of quantum theory could come from, and how the mathematics of the theory could potentially be derived from more basic premises rather than taken as postulates. I found this discussion badly organized and poorly argued. In 2013, it was historically shallow, and now in 2025, it's outdated.

Everything he says about Bohr is caricatured to the point of absurdity.

His history of the halting problem is conventional but wrong.

The last chapter is called "Ask me anything" and records a Q&A he held on the last day of the course upon which the book was based. The value of these off-the-cuff remarks on areas where the author has no real experience was perhaps over-estimated. It gets onto the topic of evolution, veers into naive adaptationism and blends that with social Darwinism. Reading this chapter is like reading Puzo’s The Godfather and slamming into the subplot about vaginal surgery. One asks, "What?" And then, "Did this have an editor?"

Above all, we should ask whether the book delivers on its professed theme. Here's Aaronson in the preface, laying out what he considers the book's "central message":

But if quantum mechanics isn’t physics in the usual sense – if it’s not about matter, or energy, or waves, or particles – then what is it about? From my perspective, it’s about information and probabilities and observables, and how they relate to each other.

This is a defensible claim. All the way back in the 1930s, Birkhoff and von Neumann were saying that we should understand quantum physics by modifying the rules of logic, which is about as close to "quantum information" thinking as one could get before the subjects of computer science and information theory had really been invented. Later, E. T. Jaynes was fond of saying that quantum mechanics is an omelette that mixes up nature and our information about nature, and in order to make further progress in physics, we need to separate them. When undergrads came to John Wheeler asking for summer research projects, he liked to suggest, "Derive quantum mechanics from an information-theoretic principle!" But the question at hand is whether Aaronson's book succeeds at making a case. You can talk a lot about quantum information theory or quantum computing without convincing anyone that it illuminates the fundamental subject matter of quantum mechanics. Knuth's Art of Computer Programming is not an argument that classical electromagnetism is "about information".

While we're calling back to historical figures, we should take a moment to recall that John Bell replied to remarks like Wheeler's with, "Whose information? Information about what?" The responses to that have demonstrated that two rabbis will have three opinions. Or, as Wikipedia puts it, "Answers to these questions vary among proponents of the informationally-oriented interpretations." Aaronson does not do anything to sort this out. Instead, he lumps together multiple genera of interpretations and treats them as a single species. If Aaronson has answers to Bell's questions, it is not clear to me what they might be.

Here's Aaronson a bit later:

Here, the physicists assure us, no one knows how we should adjust our intuition so that the behavior of subatomic particles would no longer seem so crazy. Indeed, maybe there is no way; maybe subatomic behavior will always remain an arbitrary brute fact, with nothing to say about it beyond “such-and-such formulas give you the right answer.”

Then he argues,

as the result of decades of work in quantum computation and quantum foundations, we can do a lot better today than simply calling quantum mechanics a mysterious brute fact.

What is this new improved perspective? Here's how his italicized paragraph about it begins:

Quantum mechanics is a beautiful generalization of the laws of probability: a generalization based on the 2-norm rather than the 1-norm, and on complex numbers rather than nonnegative real numbers.

That isn't just a "brute fact". It's the same "brute fact" that an ordinary textbook will tell you! It's the "fourth postulate" in Cohen-Tannoudji et al., equation (1.3) in Griffiths and Schroeter, page 9 of Zwiebach. All that Aaronson has done is change the jargon a tiny bit.

Aaronson declares himself indifferent to the needs of "the people designing lasers and transistors". And fair enough; we all have our tastes for topics. But he has set himself the challenge of demonstrating that studying how to program computers that have not been built, and comparing them to computers that physics says can never be built, is the way to the heart of quantum mechanics.

Aaronson quotes a passage from Carl Sagan, thusly:

Imagine you seriously want to understand what quantum mechanics is about. There is a mathematical underpinning that you must first acquire, mastery of each mathematical subdiscipline leading you to the threshold of the next. In turn you must learn arithmetic, Euclidean geometry, high school algebra, differential and integral calculus, ordinary and partial differential equations, vector calculus, certain special functions of mathematical physics, matrix algebra, and group theory . . . The job of the popularizer of science, trying to get across some idea of quantum mechanics to a general audience that has not gone through these initiation rites, is daunting. Indeed, there are no successful popularizations of quantum mechanics in my opinion – partly for this reason. These mathematical complexities are compounded by the fact that quantum theory is so resolutely counterintuitive. Common sense is almost useless in approaching it. It’s no good, Richard Feynman once said, asking why it is that way. No one knows why it is that way. That’s just the way it is.

Aaronson follows this by saying that he doesn't need convincing: "Personally, I simply believe the experimentalists" when they say that quantum physics works. Again, fair enough on its own. But I think this is poor media literacy here. Sagan's Demon-Haunted World is all about the public understanding of science, the difference between authorities and experts, the challenge of becoming scientifically literate, and that kind of thing. What Sagan means by "what quantum mechanics is about" in this context is what physicists use the theory to do, day by day, and why we have confidence in it. Even if you come along with a better explanation of where the mathematics comes from, all that won't go away!

Aaronson tries to back up his perspective in chapter 9, where he makes the following contention:

Quantum mechanics is what you would inevitably come up with if you started from probability theory, and then said, let’s try to generalize it so that the numbers we used to call “probabilities” can be negative numbers.

This is a bait-and-switch, or more charitably, poor organization. Later he will admit that he needs to introduce not just negative numbers, but complex numbers too. What arguments does he give to justify bringing complex numbers into the picture? Why prefer ordinary quantum theory over what we might call "real-amplitude" quantum theory? He provides three suggestions. The first is based on a continuity argument ("if it makes sense to apply an operation for one second, then it ought to make sense to apply that same operation for only half a second"). He argues that this can only be made to work if the amplitudes are complex rather than only real. But this does not hold. We can bypass this issue simply by making sure that our real-amplitude quantum theory is analogous to the original. To get technical about it: We can simply say that in real-amplitude quantum theory, the time evolution operators belong to the subgroup of the orthogonal group that is continuously connected to the identity. This is actually what would be analogous to regular quantum theory, where we make unitary operators by taking the exponential of -iHt, where H is a Hamiltonian and t is an amount of time. In the real-amplitude theory, we just use an antisymmetric matrix as a generator instead of an anti-Hermitian one.

The second argument is that the number of parameters needed to specify a mixed state scales better for complex amplitudes than for real. This is a style of argument that has a considerable cachet among aspiring reconstructors of the quantum formalism, but it too has shortcomings. Aaronson invokes the principle that states for independent quantum systems combine via the tensor product. He asserts that this is true, and then argues that this makes the parameter counting work out nicely for complex but not real amplitudes. Plainly, then, this case for complex amplitudes can't be better than the case for the tensor product. It replaces one mathematical "brute fact" with another. People who go into more depth about this invoke a premise they call "tomographic locality". The conceptual challenge is then, if tomographic locality failed to hold true, would that actually be so bad? Would we find it stranger than, for example, quantum entanglement? See Hardy and Wootters (2010) and Centeno et al. (2024).

The third argument is given almost in passing. It's a "well, I guess that's nice" property which holds for the complex-amplitude theory and fails for the real-amplitude version. Bill Wootters noticed it. Of course, he also found something that works out nice only when the amplitudes are real instead. See Wootters (2013) for a more recent explanation of the latter, which he first published in 1980.

What Aaronson calls starting "directly from the conceptual core" strikes me instead as merely discarding some old prefatory material, like the Bohr model of hydrogen, and replacing it with new, like some chatter about classical computation. His "conceptual core" is the same old postulate. He just applies it in somewhat different settings, so he ends up doing matrix algebra instead of differential equations. I once thought that would be easier on students, but then I actually had to teach a QM class, and then I ended up "reviewing" a lot of matrix algebra.

A physicist who learned quantum mechanics the old-fashioned way, and who now sees "quantum" being hyped as the next Bitcoin, might well have some questions at this point. "So, you're telling me that these highly idealized models of hypothetical, engineered systems bring us closer to the secrets of the Old One than studying natural phenomena will? I'm sure you have your own good reasons for wanting to know if QURP is contained in PFUNK, but I want to understand why ice floats on water, why both iron and charcoal glow the same kind of red when they get hot, why a magnet will pick up a steel paperclip but not a shiny soda can. I want to map the galaxies with the radio waves from interstellar hydrogen, and I want to know what sustains a dead sun against the pull of gravity." And: "I get the desire for a 'conceptual core' to quantum physics. But have you actually isolated such a thing? From where I stand, it looks like you've just picked one of the important equations and called it the important equation. Shouldn't your 'conceptual core' be a statement with some punch to it, like the big drama premise of special relativity? What's your counterpart to each observer who feels herself motionless will measure the same speed of light?"

Here's how Aaronson begins chapter 9:

There are two ways to teach quantum mechanics. The first way – which for most physicists today is still the only way – follows the historical order in which the ideas were discovered. So, you start with classical mechanics and electrodynamics, solving lots of grueling differential equations at every step. Then, you learn about the “blackbody paradox” and various strange experimental results, and the great crisis these things posed for physics. Next, you learn a complicated patchwork of ideas that physicists invented between 1900 and 1926 to try to make the crisis go away. Then, if you’re lucky, after years of study, you finally get around to the central conceptual point: that nature is described not by probabilities (which are always nonnegative), but by numbers called amplitudes that can be positive, negative, or even complex.

This is wrong in a few ways. First, that "years of study"? Yeah, I saw complex probability amplitudes in my first term of college. Before they showed us all the blobby/cloudy pictures of electron orbitals, they took two minutes to explain what was being plotted. Our first full-blown quantum mechanics course was at the advanced age of ... sophomore year. And we're not talking about something squeezed in on the last day before summer vacation. See above regarding how it's the third equation in the first chapter of the ubiquitous standard undergrad QM textbook. This is not an idea sequestered in the inner sanctum of knowledge; it's babby's first wavefunction.

Second, the orthodox method is not really "historical". It can't be. The physicists who did all that work from 1900 through 1925--27 knew much more physics than college kids do today. They were professionals! Pick up the Dover reprint of the Sources of Quantum Mechanics collection, and see how many of the papers in it make sense using only first-year physics. Dirac was thinking about Poisson brackets, not a block on an inclined plane. The capsule "histories" in QM textbooks are caricatures, and sometimes quite poor ones at that.

Aaronson goes on:

Look, obviously the physicists had their reasons for teaching quantum mechanics that way, and it works great for a certain kind of student. But the “historical” approach also has disadvantages, which in the quantum information age are becoming increasingly apparent. For example, I’ve had experts in quantum field theory – people who’ve spent years calculating path integrals of mind-boggling complexity – ask me to explain the Bell inequality to them, or other simple conceptual things like Grover’s algorithm. I felt as if Andrew Wiles had asked me to explain the Pythagorean Theorem.

And then, did anyone clap?

This is a false analogy. I don't think it's a surprise, I am not convinced that it's an actual problem, and if it is, I don't think Aaronson makes any progress to a solution.

The Pythagorean theorem is part of the common heritage of all mathematics education. Moreover, it's the direct ancestor to the problem that Wiles famously solved. It's going to be within his wheelhouse. But a quantum field theorist who's been deep into that corner of physics might well not have had to think about Bell inequalities since they were in school. It's like asking an expert on the voyages of Zheng He about how Charlemagne became Holy Roman Emperor. There are multiple aspects of Bell inequalities that someone from a different specialization could want "explained", even if they remember the gist. First, there are plenty of questions about how to get a clean Bell test in the laboratory. How does one handle noise, how do we avoid subtly flawed statistics, what are these "loopholes" that experimentalists keep trying to close by doing better and better tests, etc. Aaronson has nothing to say about this, because he's not an experiment guy. And again, that's entirely fair; some of us are best as theorists. Second, there are more conceptual (dare I say "philosophical"?) questions about what exactly are the assumptions that go into deriving Bell-type inequalities, how to divide those assumptions up, and what the violation of those inequalities in nature says about the physical world. Relatedly, there are questions about who proved what and when, what specifically Bell said in each of his papers, who built on his work and why, etc. Aaronson says very little about all of this. Nothing leaps out at me as wrong, but it's rather "101". The third broad category of questions are about mathematical specifics. What particular combination of variables appears in which inequality, what are the bounds that combination is supposed to satisfy, etc. The expressions that appear in these formulae tend to look like rabbits pulled out of a hat. Sometimes there are minus signs and factors of root-2 and such floating around, and it's hard to remember where exactly they go. Even people who know the import of Bell's theorem could well ask to have it "explained", i.e., to have some account given of where all those arbitrary-looking bits came from. I don't think Aaronson does particularly well on this front. He pulls a rabbit out of his hat (a two-player game with Alice and Bob trying to take the XOR of two bits), he quotes a number with a root-2 in it, and he refers to some other lecture notes for the details, which include lots of fractional multiples of pi and which themselves leave some of the details to the interested reader.

Aaronson leads into this rather unsatisfying discussion thusly:

So what is Bell’s Inequality? Well, if you look for an answer in almost any popular book or website, you’ll find page after page about entangled photon sources, Stern–Gerlach apparatuses, etc., all of it helpfully illustrated with detailed experimental diagrams. This is necessary, of course, since if you took all the complications away, people might actually grasp the conceptual point!

However, since I’m not a member of the Physics Popularizers’ Guild, I’m now going to break that profession’s time-honored bylaws, and just tell you the conceptual point directly.

The tone strikes me, personally, as smarmy. But there's also an organizational issue. After saying he'll "just tell you the conceptual point directly", he then goes through the XOR rigmarole, which takes more than a page, before he gets to "the conceptual point" (that quantum mechanics is inconsistent with local hidden variables). It's less direct than advertised, for sure. I have not systematically surveyed pop-science explanations of Bell's theorem prior to 2013, but the "page after page of entangled photon sources..." rings false to me.

There's a chapter called "Free Will". Now, whether any scientifically-inclined reader will want to read a chapter with that name is highly taste-dependent. The imaginary physicist from a few paragraphs ago returns to say, "It is not 3 AM, I am no longer in college, and no one here is stoned." For many, the value of such a chapter will hinge upon whether the technicalities of computation and quantum physics bring anything new to a thoroughly stale discussion. This requires, at a minimum, that the technicalities included are correct. However, Aaronson's description of the Conway--Kochen "free will theorem" is simply wrong. He says it uses "the standard Bell experiment", but as Conway and Kochen explicitly point out, Bell inequalities are insufficient to power the result they derive. A reader will not understand what Conway and Kochen argue, let alone the conceptual context and historical background, from what Aaronson writes.

This is outside my own department, but I think there's a problem with Aaronson's treatment of Gödel's incompleteness theorems. He says that Gödel's first incompleteness theorem follows directly from Turing's proof that the halting problem is undecidable. This doesn't quite work, for reasons that are subtle but not too subtle for a technical text. The result conventionally known as Gödel's theorem is stronger than what you can get from the undecidability of the halting problem. In other words, the result that the Turing machines get you depends upon a more demanding precondition than "consistency", and so it is somewhat less impressive than what was desired. My best stab at a semi-intuitive explanation would be in the vein of, "When you're discussing the consistency of mathematics itself, you have to be double-special-careful that ideas like the number of steps a Turing machine takes really do make sense."

The historical problem is that Turing himself did not prove the undecidability of the halting problem. He wasn't even focused on halting. His main concern was computing real numbers, where naturally a successful description of a number could be a machine that doesn't stop. The "halting state" as we know and love it today was due to Emil Post.

Moreover, this is one of the passages where Aaronson seems to be offering the one and only true Nerd Opinion. He is dismissive of any way to understand Gödel's theorems apart from the story he offers, to the extent that a person who had only read Aaronson would be befuddled by anyone who used Gödel numbering after 1936.

In summary, then: Quantum Computing Since Democritus makes halfhearted efforts at supporting its professed central thesis. It repeats oversimplifications, rather than correcting them. It addresses interesting topics, but not in a satisfying way.

you are viewing a single comment's thread
view the rest of the comments
[–] corbin@awful.systems 4 points 1 month ago (2 children)

Okay, one more post. I re-read Scott's coverage of free will. Here's something he doesn't understand: given the Free Will theorem, it's not possible to build a Newcomb predictor which does well, and Newcomb's paradox can't get off the ground. The way I like to think of it is that we can build a Conway coin: a handheld device that uses the orientation angles of the wrist to contextualize an indefinite measurement over a 50-50 discrete distribution (with exponentially small possibility of erroring out and requiring a second measurement!) by using the wrist orientation as an orthogonal 3D basis and invoking Bell-Kochen-Specker. A predictor cannot reliably influence its victims when they are equipped with Conway coins; the paradox dissolves.

It's very funny, given this, that Scott wants credit for the Free Will theorem. He seems to think that the theorem is about a straightforward rewording of EPR in terms of KS, rather than a fairly deep insight about the indeterministic nature of reality. For example, there's no indication that he has accepted "Conway's shock": there is no experimental evidence in favor of determinism once we notice that most experiments circularly assume that their underlying theory under test is deterministic. Conway insisted that this should shock the reader, as it once shocked Conway himself. Kochen 2017 is an excellent and self-contained explanation of Kochen's view which complements Conway nicely; Conway himself ranted for about 6hrs on the topic in the 2018 Free Will Lectures, if you want the whole story.

[–] blakestacey@awful.systems 4 points 1 month ago* (last edited 1 month ago) (1 children)

It's worse than that: Aaronson doesn't even give a correct description of the Conway--Kochen free will theorem. He says it uses "the standard Bell experiment", but it actually uses a pair of twinned spin-1 particles (maximally entangled qutrits) and a set of measurements given by Asher Peres' 33-ray KS set. Aaronson's review of Wolfram's book doesn't use KS, either, only Bell's theorem phrased as an XOR game. It's like he didn't even read the Conway--Kochen paper, or else he didn't care to explain what was actually in it.

[–] blakestacey@awful.systems 4 points 1 month ago

I have added a paragraph about this to the review, since it seems important enough.

[–] V0ldek@awful.systems 2 points 4 weeks ago

I've never heard of the Newcomb thing, this might be me applyng my CS brain where it doesn't work, but can someone tell me why a perfect predictor isn't immediately incoherent for analagous reasons for which a machine solving HALT is incoherent?

  1. Assume we can build a perfect predictor.
  2. One perfect predictor can predict the prediction of another perfect predictor (in particular, two perfect predictors predicting the outcome of the same experiment must give the same result)
  3. Setup an experiment as in the Newcomb's problem.
  4. Use a second predictor to predict what the predictor inside the experiment would've predicted.
  5. Choose the other option. Contradiction, perfect predictor mispredicted.

This doesn't assume free will and it works exactly the same even if the predictor is not perfect (it's enough that it predicts better than random chance).