Part III · HARDWARE AND INFRASTRUCTURE

The sinews of war: compute, chips and data centers

Chapter 817 min readUpdated: June 2026

8.1Why AI devours compute

8.2GPUs, TPUs and specialized chips

Its dominance does not rest on silicon alone: it also relies on CUDA, the software layer that the entire AI ecosystem uses to program its chips, a competitive "moat" that is hard to cross. At CES in January 2026, NVIDIA detailed its new Vera Rubin generation (succeeding the Blackwell architecture): each Rubin GPU delivers around 50 petaFLOP in FP4 precision, carries ultra-fast HBM4 memory, and is assembled into "NVL72" racks combining 72 GPUs and 36 Vera processors. The next generation, "Feynman," has already been announced.

How do you train living neurons? The process has nothing to do with training an AI in silicon: no backpropagation and no gradient descent (Chapter 2). The neurons are cultured on a grid of electrodes that serves as both their senses and their muscle, able to stimulate them and read their activity. For Pong, the ball's position is translated into electrical stimulations; then, depending on whether the neurons return the right activity (the paddle intercepts the ball) or not, they are sent back a signal that is either regular and predictable, or chaotic noise. Following the so-called free energy principle (formulated by neuroscientist Karl Friston), a neural network seeks to minimize the unpredictability of what it perceives: the cultures therefore reorganize spontaneously to escape the chaos — that is, to play better — and manage it within a few minutes. Nothing is programmed: an environment is shaped, and the tissue adapts to it on its own.

Two major reservations then impose themselves. First, maturity: these systems are very slow, tiny in capacity, and the cultures survive only a few months; several leading neuroscientists judge the idea of competing with silicon this way to be premature, even doomed to fail.

The ethical vertigo. The second reservation is deeper (Chapter 23). The horizon embraced by some is to replace, one day, the artificial neurons used to build AI (mere numbers in a matrix) with genuine biological neurons, far more energy-efficient. But this horizon opens an abyss: as these cultures grew larger, could they develop a form of consciousness, even of suffering? No one today knows how to define or measure the consciousness of such a system, and it is precisely this fuzziness that worries (when the word "sentience" was used about Pong, dozens of researchers published a rebuttal). Three concrete questions follow. Consent first: these neurons come from the cells of human donors, who did not necessarily agree to become a thinking computer. Moral status next: if such a culture could feel anything at all, would we have the right to exploit it, and to switch it off? Cutting the power to a server is trivial; unplugging a potentially sentient brain tissue would no longer be. Regulation, finally: in the absence of scientific consensus, researchers are already calling for safeguards inspired by the ethics committees of animal research. At this stage, then, biological computing is less a credible alternative to chips than a fascinating and uncomfortable field of research, to be followed with as much caution as curiosity.

8.3The semiconductor value chain

Diagram8.1. The AI chip value chain. NVIDIA designs the GPUs but does not manufacture them (the "fabless" model): it is the Taiwanese firm TSMC that etches them, using machines that only the Dutch company ASML knows how to produce. Each link is a mandatory chokepoint, and therefore a point of vulnerability.
  • ASML (Netherlands) holds a global monopoly on extreme ultraviolet (EUV) photolithography machines, the only ones capable of etching the finest circuits. Without ASML, no cutting-edge chips.
  • TSMC (Taiwan) manufactures about 90% of the most advanced chips in the world (etched at 3 nanometers and below) and holds two-thirds of the global foundry market. Its geographic concentration in Taiwan makes it a nerve center of the world economy.
  • NVIDIA (United States) designs the GPUs but does not manufacture them itself (the so-called "fabless" model): it entrusts their etching to TSMC.

8.4Mega-data centers

Worldwide, the power dedicated to data centers is estimated to reach about 132 GW in 2026, and it is estimated that about 10 GW of new AI compute capacity (i.e., 13 to 15 million accelerators) will be added in that single year.

8.5The geopolitics of chips

The 2025-2026 sequence illustrates an eventful game of chess. After the 2025 repeal of the regulatory framework inherited from the previous administration (creating a period of looser control during which hundreds of thousands of chips reportedly transited through third countries), the US administration tightened the rules in late May 2026: any sale of advanced accelerators (NVIDIA's Blackwell and Rubin lines, AMD's MI350x) to a foreign subsidiary of a Chinese company now requires a license. In parallel, the saga of the H200 chip (authorized, then blocked at times by Washington, at times by Beijing, which is pushing toward self-sufficiency) led NVIDIA to reallocate its capacity at TSMC toward the new Vera Rubin generation. More recently, in mid-2026, US pressure shifted higher up the chain, onto ASML itself, suspected by Washington of having let a cutting-edge machine reach China.


Key takeaways (Chapter 8)

  • AI is first of all a matter of compute: models are trained on tens of thousands of GPUs, and inference (each request) dominates the bill over time.
  • NVIDIA dominates thanks to its chips (the Vera Rubin generation in 2026) and above all its CUDA software; Google (TPU), AMD and the cloud giants are developing alternatives.
  • The value chain is ultra-concentrated: ASML (EUV machines, Netherlands), TSMC (manufacturing, Taiwan, ~90% of cutting-edge chips), NVIDIA (design, "fabless" model).
  • Mega-data centers are measured in gigawatts (Stargate 10 GW, Colossus, Hyperion); global capex exceeds 400 billion dollars a year.
  • One frontier path is to place data centers in orbit (near-continuous solar energy, cooling by radiation): first demonstrators in 2025 (an H100 GPU in orbit, Google's Suncatcher project), but major obstacles (launch cost, heat, radiation).
  • An even more radical path, biological computing (computing with living neurons, or "wetware"), draws on the brain's efficiency (about 20 W) but remains slow, tiny and laden with ethical questions.
  • The US-China "chip war," founded on a "chokepoint strategy," is fragmenting the world into two technological blocs.

Faced with this dependence on a few giants and their immense data centers, an alternative is gaining ground: running AI at home, with open models. That is the subject of Chapter 9.