On Easter Sunday 1929,
a group of women walked down Fifth Avenue in New York City smoking cigarettes.
They had been hired by Edward Bernays,
a publicist working for the American Tobacco Company,
to light up in public and treat their cigarettes as what Bernays called “torches of freedom.”
Women smoking in public was a social taboo;
framing the act as feminist defiance was designed to dissolve that taboo
and open the female market to tobacco sales.
It worked.
Bernays was Sigmund Freud’s nephew,
and he had taken his uncle’s ideas about unconscious desire and applied them to commerce.
His insight was that you do not need to argue with people about whether they want something.
You can create the conditions under which they will want it
[Bernays2024].
The previous psychology post explained how
people make decisions differently than the rational-actor model predicts.
This post makes a related point:
before you can ask how people choose,
you need to ask where their preferences come from.
Standard economics treats preferences as given:
people arrive at markets with wants, and markets serve them.
Thorstein Veblen was among the first to argue that this is not how consumption actually works.
People don’t want things in the abstract;
they want things based on their social position,
relative to what the people around them have and display.
The desire for a particular pair of shoes cannot be separated from
the social meaning of those shoes in a specific time and place and to a particular peer group.
John Kenneth Galbraith pushed the argument further in The Affluent Society.
He called it the “dependence effect”:
the wants that production satisfies are themselves created by the process of production.
Advertising does not serve existing desires:
it manufactures new ones and attaches them to products.
The economy doesn’t exist to satisfy people’s needs—it exists to perpetuate itself
[Galbraith1998].
In South Korea,
cosmetic surgery has grown into a multi-billion-dollar industry
drawing patients from across East and Southeast Asia.
The procedures most in demand—eyelid surgery, jaw reduction, and particular forms of rhinoplasty—track
the standards of appearance disseminated through Korean entertainment products,
which are themselves produced by a commercial industry with strong incentives to generate aspiration.
Digital platforms have scaled this dynamic.
Algorithmic recommendation systems don’t just show you content that matches what you already want.
They build a behavioral model from pauses, clicks, shares, and watch time,
then serve content designed to maximize engagement.
Your preferences at the end of an evening’s scrolling are partly an artifact
of what the algorithm chose to show you.
Recommendation systems are also preference-construction systems.
This is distinct from the nudging described in the previous psychology post.
A nudge changes how a choice is presented without altering the options.
What we’re describing here is shaping what you want before you arrive at a choice.
In India, skin-lightening products generate annual revenues in the billions of dollars,
sustained by advertising that associates lighter skin with success, desirability, and social mobility.
The market for these products—many of which contain harmful ingredients—doesn’t reflect a natural preference.
It reflects decades of advertising, colonial inheritance, and film industry imagery.
Markets can satisfy preferences efficiently.
What they can’t do
(or rather, what the people who profit from them won’t do)
is tell you whether your preferences satisfying are worth having,
who created them,
or why [Packard2007,Veblen1899,Wu2016].
Fads and Bubbles
In February 1637, a single bulb of the Semper Augustus tulip sold in Amsterdam
for the price of a house on a canal.
If you had bought one of those bulbs in November 1636 and sold it in January 1637,
you would have tripled your money.
If you held it until March, you were ruined.
Tulip mania is the most entertaining entry in the catalog of financial bubbles
because it involves flowers.
But the mechanics are identical to what happened in 1720 with South Sea Company shares,
in the 1840s with British railway stocks,
in the 1980s with Japanese real estate,
in 1999 with dot-com IPOs,
in 2006 with Florida condos,
and in 2021 with NFTs.
Japan’s stock and real estate bubble peaked in December 1989—
when the Nikkei index hit 38,915 and Tokyo land was theoretically worth more than all US real estate—
and was followed by thirty years of stagnation
that every mainstream forecast predicted would end within a few years,
which is what “this time it’s different” sounds like from the inside.
The first thing to understand about a bubble is that it is not entirely irrational
while it is happening.
Tulips were genuinely scarce and genuinely fashionable.
Railway technology was genuinely transformative,
and the internet did change commerce.
Every bubble grows from a real phenomenon,
which is precisely what makes it so effective at separating people from their money.
The second thing is the greater fool theory.
You do not have to believe that a tulip bulb is worth a house.
You only have to believe that someone else will pay you more for it next month
than you paid for it today.
This is rational behavior when prices are rising,
but disastrous the moment they stop.
[Mackay1841] documented this pattern in 1841,
which remains readable and depressing because every pattern he identified is still with us.
Isaac Newton, one of the smartest people who ever lived,
lost the equivalent of several million dollars in the South Sea Bubble of 1720.
He reportedly said afterward that he could calculate the motions of heavenly bodies,
but not the madness of men.
He had already sold his shares at a profit,
watched the price continue rising,
bought back in near the peak,
and then watched it collapse.
Robert Shiller, who won a Nobel Prize partly for studying this problem,
argues that markets are driven less by rational calculation than by contagious stories [Shiller2015].
He calls this narrative economics:
when a compelling story spreads through a population,
it changes behavior,
and changed behavior changes prices,
which seems to confirm the story,
so it spreads further.
This feedback loop is difficult to interrupt because
the people caught up in it are not obviously doing anything wrong.
They are listening to their neighbors,
watching prices,
reading the news,
and making bets based on the best available information.
The information just happens to be mostly about what other people are betting.
The British Railway Mania of the 1840s combined technological innovation with genuine investment needs
to produce something close to collective insanity.
Between 1844 and 1846, Parliament approved over 400 new railway projects.
Engineers, lawyers, and promoters collected fees
regardless of whether the lines were ever built.
Landowners extracted fortunes from right-of-way negotiations.
Ordinary investors poured their savings into railways that had no surveyed routes,
no locomotives,
and sometimes no identifiable board of directors.
When the bubble collapsed,
it wiped out a significant fraction of the British middle class,
destroyed several major banks,
and left behind a network of railways that,
once consolidated by the surviving companies,
actually worked.
That last part matters.
Bubbles are destructive for investors,
but sometimes productive for infrastructure.
The dot-com crash of 2000 that wiped out trillions of dollars in market value
left behind fiber-optic cable in the ground,
server farms,
and the engineering knowledge to run large-scale internet services cheaply.
Most of the companies that failed deserved to fail:
their business models assumed that selling pet food online at a loss,
while paying for national television advertising,
would somehow eventually produce a profit.
The crash was the market performing its nominal function of
reallocating capital away from bad ideas.
The problem was that the reallocation happened after
several hundred thousand people lost their jobs and several million lost their retirement savings.
The housing bubble that collapsed in 2008 followed a different structure
because it was built on debt rather than equity [Lewis2010].
Banks lent money to people who could not repay it,
packaged those loans into securities that were sold to pension funds and insurance companies,
bought insurance against those securities from companies that could not pay out,
and collected fees at every stage of the transaction.
The economists who designed the risk models assumed,
based on historical data,
that housing prices never declined nationally at the same time.
They were right until they were catastrophically wrong.
Crucially,
the losses did not fall on the banks that had made the bad loans—the US government covered those.
The losses fell on homeowners who lost their houses and
workers who lost their jobs when the broader economy contracted.
John Kenneth Galbraith noted in his history of the 1929 crash
that the capacity of human beings to ignore evidence
that contradicts a profitable belief is essentially unlimited [Galbraith1954].
The 2008 crisis added the observation that when enough money is at stake,
the strongest defenders of free markets suddenly become socialists in all but name.
Another feature of speculative bubbles is that
they tend to be ahistorical.
The dot-com bubble was the first internet bubble;
there were no previous internet bubbles to learn from.
The mortgage securities market of the 2000s was large enough
that its collapse was qualitatively different from anything that had happened before.
The phrase “this time it’s different” has become shorthand
for the moment just before a collapse,
but it is genuinely true in a narrow sense:
the “why” of a particular collapse is usually novel enough that
people who want to believe can point to real differences from previous disasters [Kindleberger2005].
Which brings us to AI.
Large language models are genuinely impressive.
The companies building them have produced tools that fundamentally change how people write and program.
But the valuations of AI companies have,
since roughly 2023,
been running well ahead of their revenue,
their profit margins,
and any plausible estimate of the addressable market.
The story driving those valuations is that AI will soon be able to do everything,
which is a story that has been told before about nuclear power,
and about previous generations of expert systems and neural networks.
This doesn’t mean that AI companies will collapse to nothing,
any more than railways did.
The real question is who is going capture the value
and who is going to absorb the cost
when reality catches up with the hype.
Historically, the engineers and founders who cashed out early do well.
The pension funds, retail investors, and laid-off workers who arrived later in the story do not,
while the people who collected fees at every stage of the transaction do best of all.
The Myth of Meritocracy
The word “meritocracy” was coined by the British sociologist Michael Young
in a satirical novel in 1958.
Young’s dystopia describes a society that has perfected the measurement of individual talent
and ruthlessly sorts people by it,
producing a ruling class convinced of its own deservingness and contemptuous of those beneath it.
The word was adopted by politicians and executives as a term of praise,
and Young spent the rest of his life pointing out the confusion [Young1958].
How elite selection actually works in practice is very different
from how institutions describe it.
Studies of admissions to elite universities,
hiring at professional services firms,
and promotion within corporations consistently
find that formal credentials matter less than social legibility:
the ability to display the cultural capital
like tastes, manners, and cultural references
that signals membership in the right networks.
Lauren Rivera’s research on hiring at elite firms
found that interviewers routinely described candidates as “polished” or “rough around the edges”
in ways that correlated with class background rather than competence.
The stated meritocratic criteria were real,
but they operated within a prior filter that most candidates never saw [Rivera2015,Karabel2006].
Audit studies make the gap between stated and actual criteria measurable.
Researchers send identical resumes to employers,
varying only the name at the top,
and record callback rates.
The results are consistent across countries and decades:
resumes with names read as white receive significantly more callbacks
than identical resumes with names read as Black or Latino.
The same pattern appears when resumes signal class background or gender
or when disability is disclosed.
Believing in meritocracy does not make people fairer.
Research by Emilio Castilla and Stephen Benard found that
organizations that explicitly adopt merit-based pay principles show larger gender pay gaps
than those that do not.
The mechanism appears to be moral licensing:
people who believe the system is already fair feel less need to monitor their own judgments for bias.
The belief in meritocracy also predicts harsher moral judgments of people who fail.
After all,
if outcomes are deserved,
then failure is evidence of some personal deficiency.
This framing conveniently shifts responsibility from structures to individuals
and makes collective responses to poverty or unemployment
harder to sustain politically [Sandel2020,Wilkinson2011].
The meritocratic ideology takes specific forms in the tech industry.
The puzzle interview,
popularized by Microsoft in the 1990s and still widespread,
selects for people who have prepared for puzzle interviews,
which correlates with having time, networks, and educational backgrounds
that make such preparation possible.
“Culture fit” as a hiring criterion is even less bounded:
it is assessed subjectively and consistently reproduces the demographic composition of existing teams.
Neither criterion is without any signal about likely job performance,
but neither is calibrated against actual job performance data,
because most organizations never close that loop.
India’s IIT and IIM entrance examinations were designed as objective filters,
but a coaching industry whose best programs charge fees equivalent to several years of average household income
ensures that upper-caste and wealthy families dominate admission lists
despite formally caste-neutral scoring—the prior filter that formal meritocratic criteria operate within,
invisible to anyone who sees only the exam results.
The Representation Feedback Loop
In 1967,
at the height of the civil rights movement,
Nichelle Nichols played Lieutenant Uhura on Star Trek.
She decided to quit after the first season,
but Dr. Martin Luther King Jr. persuaded her to stay.
He told her she was not playing a role but representing a future.
Whoopi Goldberg,
having grown up without ever seeing a Black woman on television who was not playing a servant,
ran through her house shouting when she first saw Nichols on screen.
Mae Jemison,
the first Black American woman to travel to space,
has said that watching Uhura was formative for her sense of what was possible.
These stories and millions of others show that representation is not just symbolic:
it changed what futures became thinkable.
Research has shown that men outnumber women roughly three to one in speaking roles in family-rated films,
and that the gap is even larger among characters in science and technology careers.
If every doctor, lawyer, and CEO you ever see looks a particular way,
then looking like them becomes part of your picture of what belonging in those roles requires.
This is not a matter of being naïve or impressionable;
it is just how our minds work.
Breaking the cycle requires intervening somewhere in it,
which is precisely where resistance concentrates.
The NFL’s Rooney Rule,
introduced in 2003,
required teams to interview at least one minority candidate for head coaching positions.
The argument against it was that it was condescending—that qualified candidates should be hired on merit,
not to satisfy a quota.
The argument for it was that the existing process was already not meritocratic:
the informal networks through which coaching positions were filled
were built from relationships among people who looked the same,
so identical qualifications led to different outcomes depending on whose network you were in.
The Rooney Rule did not guarantee hiring,
but even that minimal intervention increased
the number of minority head coaches appointed in the years following its introduction.
Algorithmic recommendation systems reproduce the same dynamic at scale.
YouTube’s algorithm surfaces content similar to what previously earned engagement,
which means creators from underrepresented groups face higher barriers to discovery
than their more visible counterparts.
The algorithm is not making a judgment about quality:
it is applying a preference filter built from historical patterns,
and those patterns reflect who had access to platforms and production tools before the algorithm existed.
This framing is what makes the cycle self-perpetuating.
If you believe that whoever is currently represented in any field
is there because they were the most qualified,
then any change in the composition looks like a lowering of standards rather than a correction of a filter.
The response to evidence of the filter is to explain that the filter must be working correctly,
because look at the results.
The loop closes there unless something breaks it open.
Years ago,
when I first started writing for a programming magazine called Doctor Dobb’s Journal,
I decided to do a piece on the object-oriented features that were being added to MATLAB.
I didn’t know much about them,
so I called The MathWorks,
told the publicity rep who I was,
and asked if they could find me a co-author.
A couple of days later I got a call back from a woman who said she had volunteered to help with the article.
I explained again who I was and what I wanted;
she said she’d be happy to provide me with information and examples.
I then said,
“Thanks, but I’d rather have someone technical.”
After a slight pause, she said,
“Well, I have a master’s degree in Computer Science, and I implemented some of the new features.”
The rest of the conversation was short and uncomfortable.
The next day, I had an email from a different guy saying that he’d be working with me.
I was embarrassed at having put my foot in my mouth;
these days, I’m more embarrassed by how long it took me to wonder how she felt.
Why Discrimination Persists
Becker’s 1957 work on discrimination in labor markets generated an optimistic prediction [Becker1971].
Firms that discriminate pay a cost because
they forgo productive workers that non-discriminating competitors will hire instead.
Over time,
competitive pressure should therefore eliminate discrimination.
History has thoroughly falsified this prediction.
Discrimination has not been competed away in any labor market where economists have looked carefully.
The gap between Becker’s prediction and the observed reality reveals that
rational economics misidentifies what discrimination is and how it works.
Economists distinguish taste-based discrimination,
in which employers simply prefer not to hire members of certain groups regardless of productivity,
from statistical discrimination,
in which employers use group membership as a proxy for characteristics they cannot directly observe.
An employer who uses race or gender as a proxy for something like reliability
is drawing inferences from population-level patterns
that are themselves often the product of prior discrimination,
which creates a vicious circle.
The psychological research on implicit bias
complicates the picture further [Banaji2013].
Studies using the Implicit Association Test and its successors show that most people,
including those who explicitly reject prejudice,
harbor automatic associations between social groups and evaluative attributes.
These associations operate below the conscious level,
and lead to well-intentioned people evaluating resumes differently
depending on the apparent race or gender of the applicant.
Audit studies provide the cleanest evidence.
Researchers send pairs of fictitious resumes to real job postings,
holding everything constant except for names that signal racial identity.
The results are consistent across many countries and many decades:
resumes with names coded as white receive significantly more callbacks
than identical resumes with names coded as Black.
The same pattern appears when resumes signal class background, gender, or disability.
But understanding why discrimination persists also requires recognizing that some workers benefit from it.
Workers who belong to favored groups face less competition for jobs, promotions, and wages
when members of excluded groups are kept out.
This gives them an incentive to back political groups that will maintain discriminatory arrangements,
even if they object adamantly to being accused of discrimination [McGhee2021].
Such groups merely need people to believe that not actively being bad
is the same as being good.
India’s caste system and South Africa’s apartheid
both persisted long after any plausible economic case for them had dissolved.
They persisted because they served social and psychological functions
that economic analysis does not capture:
in particular,
they guaranteed those near the bottom of otherwise low-status hierarchies that
they were not at the very bottom.
If you can convince the lowest white man he’s better than the best colored man,
he won’t notice you’re picking his pocket.
Hell, give him somebody to look down on, and he’ll empty his pockets for you.
– Lyndon B. Johnson
The history of how discrimination gets built into professional structures
is worth understanding before applying it to computing.
The Flexner Report of 1910 recommended consolidating medical education
around a small number of university-based schools modeled on Johns Hopkins.
What is less often noted is what this consolidation closed:
the eclectic and homeopathic schools that had trained a significant proportion of women physicians,
and the historically Black medical colleges that were the primary pathway into medicine for Black practitioners.
The reorganization of medicine around a single credentialing model
was also a reorganization of medicine around a single demographic profile.
The American Medical Association’s subsequent decades of opposition
to women’s participation in medical societies
and to Black physicians’ hospital admitting privileges
extended and consolidated what the Flexner Report had begun.
The legal profession followed the same pattern.
Bar associations across Britain, the United States, and continental Europe
formally prohibited women from practice until courts or legislatures intervened.
In the UK, women were not admitted to the legal profession until 1919.
In the US, the Supreme Court upheld Illinois’s exclusion of women from the bar in 1872,
with a concurrence that stated women’s domestic roles were divinely ordained.
The formal barriers were eventually removed.
The informal barriers persisted:
partnerships that did not hire women,
chambers that did not take women as pupils,
and professional associations that did not see women as natural members.
These required decades more of pressure to dislodge,
and have not been fully dislodged.
The undercounting of women’s scientific contributions is documented across many countries and disciplines.
The pattern includes attribution of women’s work to male collaborators or supervisors,
systematic exclusion from authorship norms that credited the laboratory head
rather than the person who performed the experiments,
and deliberate suppression documented in specific cases
such as Rosalind Franklin’s role in the determination of DNA’s structure.
Studies of citation patterns, grant success rates, and peer review outcomes
continue to find systematic disadvantages for women researchers in many scientific fields,
controlling for measurable differences in output.
Mary Beard’s analysis of women’s exclusion from public speech
draws the pattern to its full historical length [Beard2017].
From the moment in the Odyssey where Telemachus tells Penelope to go back inside
and let men deal with public affairs,
through Roman rhetoric’s treatment of women’s public speech as inherently disgraceful,
through centuries of legal and customary prohibition on women speaking in court and in public assemblies,
the mechanisms of exclusion are remarkably consistent.
They include treating women’s speech as categorically out of place in authoritative contexts,
dismissing or ridiculing women who do speak in those contexts rather than engaging with what they say,
and redefining the terms of authority in response to women’s gains
so that what women achieve is always reclassified as less important than what they have not yet achieved.
In contemporary computing,
the primary mechanism of exclusion operates under the label of culture fit.
Hiring decisions made on the basis of cultural fit are rarely experienced as discriminatory
by those who make them.
The judgment is framed as a question of whether a candidate seems like one of us,
whether they would enjoy working with the existing team,
whether they share the team’s values and habits.
But when the existing team is demographically homogeneous,
hiring for cultural fit reproduces that homogeneity.
The criterion functions as a demographic filter while providing its users with a clean conscience.
The automation of hiring decisions does not solve this problem.
It replicates it at scale.
Algorithmic hiring tools trained on historical hiring data
learn to identify candidates who resemble those previously hired and promoted.
If the historical data reflects discriminatory patterns,
the algorithm encodes those patterns into a system that is harder to scrutinize and challenge
than a human decision-maker.
Amazon’s internal recruiting tool, abandoned in 2018,
penalized resumes that included the word “women’s”
and downgraded graduates of all-women’s colleges,
because it had learned from a decade of prior hiring decisions that were heavily male.
The standard response to demographic imbalance in tech is to point to the pipeline:
not enough women or underrepresented minorities in computer science programs,
and the problem will correct itself when universities produce more qualified candidates.
This explanation is flatly contradicted by the evidence.
Women’s participation in computing was substantially higher in the 1980s than it is today.
The decline in women’s representation occurred
while women’s representation in law, medicine, and other professions was increasing.
The pipeline analogy locates the problem in the wrong place.
Demographic problems that originate in hiring practices, retention conditions, and professional culture
cannot be solved by changing who enters the pipeline.
They require changing the conditions that determine who stays in,
who is promoted,
and who leaves.
The Moral Psychology of Building Harmful Things
In September and October 2021,
the Wall Street Journal published a series of articles
based on tens of thousands of internal Facebook documents provided by Frances Haugen.
Among other things,
the documents showed that Facebook’s own researchers had found that
Instagram worsened body image issues and increased suicidal ideation in teenage girls,
and that the company had known this for years while publicly denying that its platforms caused harm.
The documents also showed that Facebook’s algorithms systematically promoted outrage and divisive content
because it generated more engagement,
and that internal teams had identified this as a serious problem without being able to change it.
None of this is surprising in retrospect.
What is surprising is how ordinary the behavior looks when you read the documents:
working groups, slide decks, and recommendations sent to committees.
The harm was not produced by villains.
It was produced by an organization operating in ways that feel completely familiar.
The psychologist Stanley Milgram ran his obedience experiments at Yale in 1961 and 1962,
partly in response to the trial of Adolf Eichmann,
who had organized the logistics of the Holocaust
and whose defense was that he had simply followed orders [Milgram1974].
In the most famous version of Milgram’s experiments,
subjects were told they were participating in a study of learning
and were instructed by an authority figure to administer electric shocks to another person
whenever that person gave a wrong answer.
The other person was a confederate; the shocks were fake; the screams were recorded.
Sixty-five percent of subjects administered what they had been told was the maximum voltage —
labeled “Danger: Severe Shock” — because a person in a lab coat told them to.
Milgram’s explanation was that people in hierarchical situations shift into an agentic state:
they stop regulating their own behavior against their personal moral standards
and start executing instructions from whoever they perceive as legitimate authority,
locating responsibility upward rather than in themselves.
The psychologist Albert Bandura identified eight mechanisms
by which people disengage their moral standards
to engage in or tolerate harmful behavior without feeling responsible for it.
Moral disengagement does not require cruelty or indifference.
It operates through cognitive reframing available to people who consider themselves good.
Moral justification frames the harm as serving a higher purpose,
while euphemistic labeling describes content moderation failures as “trust and safety challenges.”
Displacement of responsibility points to someone else:
the advertisers demanded this, the users clicked on it.
Diffusion of responsibility distributes it across so many people
that no single one feels it:
the engineers wrote the algorithm, the product managers set the metrics, the executives approved the strategy.
Treating teenage users primarily as engagement statistics
accomplishes much of the same work as dehumanization [Bandura1999,Palazzo2025].
The sociologist Diane Vaughan’s analysis of the Challenger disaster identified a related mechanism
she called normalization of deviance:
the gradual process by which organizations come to accept risk thresholds
that would initially have been unacceptable,
through repeated exposure to near-misses that did not immediately produce catastrophe.
NASA engineers had known for years that O-rings on solid rocket boosters failed in cold temperatures.
Each launch that succeeded despite the problem
was treated as evidence that the problem was manageable.
The night before the Challenger launch,
engineers who understood the risk recommended against launching in cold weather.
Their recommendation was overruled by managers
who reframed the decision as requiring proof of danger rather than proof of safety.
No one decided to kill the seven astronauts who died the next day.
NASA had simply accumulated a set of practices that made the outcome possible [Vaughan1996].
The phrase “we just build hammers” is the most common form of moral disengagement in technology work.
It frames engineers as neutral suppliers of capability,
placing all responsibility for consequences on whoever chooses to use the tools.
This framing collapses when you look at what the tools actually do.
A recommendation algorithm that is trained to maximize engagement
and deployed on a platform used by teenagers
is not neutral the way that a hammer is.
It is a system specifically designed to capture attention,
running on knowledge about how human psychology responds to certain kinds of content,
at a scale that no individual user can resist or comprehend.
The programmers who built it made choices about what to optimize,
and their bosses approved those choices.
None of this means that people inside organizations that cause harm are helpless.
It means that the conditions that produce harmful outcomes are predictable,
that they operate through ordinary human psychology rather than through exceptional malice,
and that understanding them is prerequisite to changing them.
When the metrics that determine whether a career succeeds
are the same metrics that produce harm,
most people will find ways to make peace with the harm [Ehmke2025].
In order to understand how the world works,
we have to understand how we got here.
That’s also a bit much for one blog post,
but the sections below summarize a few things I didn’t know
when I started trying to figure it out.
See the first and second posts in this series for context.
The Creation of Money
In the 1660s,
London merchants who needed somewhere safe to store their gold and silver
began using the vaults of goldsmiths,
who already had the locks and the reputation.
The goldsmith would issue a paper receipt;
merchants quickly discovered that it was easier to pay each other with receipts
than to cart metal through the streets.
The receipts circulated as currency.
The goldsmiths making these transactions noticed that
at any given time,
only a fraction of depositors came to claim their metal.
The rest left it in the vault, trusting the paper.
A goldsmith willing to issue more receipts than he actually had gold in the vault
could lend out the extras and collect interest,
as long as depositors never all came at once.
This is the origin of fractional reserve banking,
a name that makes it sound more principled than it is.
This matters for understanding the modern financial system
because the same basic mechanism is still operating,
at much larger scale,
with slightly more oversight,
in digital form.
The story is also the origin of the periodic bank run:
when depositors suspect the receipts exceed the gold,
everyone tries to claim their metal at once,
which confirms the suspicion,
which guarantees the collapse.
In September 2007,
people queued outside branches of Northern Rock, a British mortgage lender,
for three days—the first run on a British bank in 150 years.
The queues were orderly and very British,
but the mechanism was identical to panics that had destroyed banks in Amsterdam in 1763,
in the United States repeatedly between 1873 and 1907,
and in Argentina in 2001.
In 2014,
the Bank of England published a paper with the dry title
“Money Creation in the Modern Economy.”
Its core claim was not subtle:
when a commercial bank makes a loan,
it does not lend out money that was previously deposited.
It creates the deposit simultaneously with the loan.
The loan and the deposit appear on the bank’s books at the same moment.
Money that did not previously exist comes into existence through the act of lending.
This is just as ridiculous as it sounds, but very useful.
When you repay a loan,
the deposit disappears from your account and the corresponding debt disappears from the bank’s books.
Money is destroyed.
The money supply expands when banks are lending
and contracts when loans are being repaid or written off.
The anthropologist David Graeber spent years studying debt across cultures and centuries.
In Debt: The First 5,000 Years,
he showed that the story economists tell about money—that barter came first,
that money was invented to make barter easier,
and that credit developed on top of money—has
almost no support in the historical or anthropological record.
The fish-for-axes economy that appears in introductory economics textbooks
has never actually existed anywhere.
What appears in the record instead are systems of mutual obligation and accounting:
I owe you,
you owe me,
the village keeps track.
Credit relationships are older than markets and older than money.
Money emerged not to simplify barter but to settle and record debts
[Graeber2011,McLeay2014].
Who gets to create money,
and under what constraints,
are political questions.
Andrew Jackson ran his 1832 presidential campaign substantially on the question
of whether a private institution should have the power
to create the country’s money.
The financial panics that followed
eventually produced the Federal Reserve Act of 1913,
passed in the aftermath of the 1907 panic,
to give the money-creation system a lender of last resort.
Contemporary fintech companies are participating in this long argument
without usually acknowledging it.
Klarna, Afterpay, and Affirm extend credit to consumers at the point of purchase.
In doing so they are creating money,
but they are doing it without the deposit insurance,
capital requirements,
or regulatory oversight that apply to licensed banks.
PayPal holds tens of billions of dollars in customer balances that do not carry deposit insurance,
and Stripe extends credit to merchants.
These companies are, in each case, capturing the profit of money creation
while avoiding the obligations that historically came with it—the same maneuver
that London goldsmiths discovered in the 1660s [Kindleberger2005,Chang2012].
The Invention of the Corporation
The limited liability corporation is the dominant form of business organization,
and its history is stranger than most people realize.
Joint-stock companies with limited liability appeared in England and the Netherlands in the early 1600s.
The Dutch East India Company, founded in 1602,
is often called the world’s first publicly traded corporation:
it issued shares to outside investors,
its shareholders could not lose more than they invested,
and those shares were traded on what became the Amsterdam Stock Exchange.
The whole structure was designed to let wealthy merchants finance long and risky trading voyages
without betting their entire fortunes on any single ship.
Limited liability is a legal invention,
created by governments to encourage private investment
in ventures too expensive for any one person to fund alone.
It is not a natural feature of commerce.
It is a grant from the state—a deliberate decision to limit the consequences that flow from a business’s failures.
Before it existed,
a merchant who invested in a failing venture could lose everything they owned.
After it, investors could only lose what they put in.
This changed who was willing to invest, in what, and on what scale.
Why do so many Canadian tech startups end up incorporated in Delaware?
Not because Delaware law is superior, but because American venture capital funds require Delaware C-corporations.
Their legal documents were drafted for that structure,
their lawyers know it,
and their limited partners expect it.
A Canadian founder who takes US venture money
often has to execute what is called a Delaware flip:
creating a new Delaware holding company above the existing Canadian entity,
transferring the intellectual property,
and essentially making the company American on paper.
This is an expensive process that produces no operational benefit for the company;
it is pure path dependency imposed by the capital markets.
What tech companies have innovated on is not the basic corporate form
but the voting structure within it.
Dual-class share structures,
in which founders hold shares with ten or more votes each
while public investors hold shares with one vote,
have become standard in major tech IPOs.
When Google went public in 2004,
Larry Page and Sergey Brin wrote a letter to shareholders explicitly explaining
that they intended to run the company unconventionally
and that investors who disagreed should not buy the stock.
The practical effect is that founders retain effective control regardless of how many shares they have sold:
the company is publicly traded but not publicly governed,
which is exactly what most founders want.
The usual excuse for why tech companies aren’t worker cooperatives or other equitable structures
is that cooperative governance is too slow and cooperative financing too limited.
This explanation is conveniently incomplete.
It omits the fact that the founders, lawyers, and venture capitalists making the choice
benefit most from the conventional structure.
A Delaware C-corp with dual-class shares concentrates decision-making authority and financial upside
in the hands of a small number of people at the top.
Describing this as a neutral response to market conditions,
rather than as a choice made by self-interested parties who control the available alternatives,
takes a certain kind of nerve [Whyte1991,Kelly2012].
The Monopolist’s Playbook
Venture-funded platform competition is a tontine.
In European finance from the seventeenth through the nineteenth centuries,
a tontine was a pooled investment in which,
as each subscriber died,
the remaining subscribers’ shares increased,
with the last survivor inheriting the entire fund.
Most governments eventually banned tontines because they created obvious incentives
to hasten the death of other participants.
In platform competition,
the explicit goal from the first investment round
is to be the one platform that achieves dominance while all competitors fail.
Operating below cost to build market share,
sometimes for years,
makes financial sense
if it eliminates alternatives and enables the extraction of monopoly rents afterward.
Investors who hold positions in multiple competing platforms profit regardless of which one wins;
the workers, users, and communities that depended on those platforms do not.
The life cycle of a dominant firm in a network industry follows a pattern:
enter the market with a genuinely useful product,
use that position to acquire or drive out competitors,
then extract maximum value from the resulting captive market.
Standard Oil, railway companies, the Bell telephone system, cable television,
and major record labels all followed this playbook.
The constraints that eventually constrained them
were the result of political struggles that the industries fought at length.
At its peak,
Standard Oil controlled roughly ninety percent of US oil refining capacity.
It didn’t achieve this through superior efficiency
but through secret railroad rebates,
predatory pricing against competitors,
and strategic acquisitions of rivals.
The Sherman Antitrust Act of 1890 was written in direct response,
and the landmark 1911 Supreme Court decision that broke Standard Oil into thirty-four separate companies
is often cited as the definitive antitrust remedy.
What is less often noted is that several successor companies immediately reconsolidated,
and that John D. Rockefeller’s personal fortune increased after the breakup
as the stock prices of the subsidiaries rose.
The remedy addressed the legal structure of the monopoly
without fundamentally altering the underlying concentration of wealth or market power [Stoller2019].
The Bell network evolved along slightly different lines than Standard Oil.
AT&T’s control over telephone infrastructure from the 1910s until its breakup in 1984
was maintained by a regulatory compact
in which AT&T accepted rate regulation in exchange for a protected monopoly position.
That compact produced genuine achievements:
Bell Labs generated an extraordinary concentration of fundamental research,
including the transistor, the laser, information theory, and Unix.
It also suppressed the development of competitive long-distance service
and technologies that threatened the core telephone business [Wu2010].
The British railway mania of the 1840s followed the same cycle earlier and more visibly.
The consolidation that followed produced a small number of large regional monopolies.
Parliament responded with rate controls and mandated third-party access to track,
which the railway companies contested for decades,
making the same arguments that social media and AI companies make today:
the industry was too complex to regulate effectively,
regulation would destroy investment incentives,
and the market would eventually take care of things anyway.
Network effects and switching costs
are what make monopolies in network industries self-reinforcing.
A telephone network becomes more valuable as more people join it,
giving the dominant network an advantage separate from the quality or price of the underlying service.
Switching costs compound this:
users who have built workflows and contact lists around a platform
can’t afford to move to a competitor even when the competitor offers a better product.
In 2005, a Dutch startup called Booking.com offered hotels a deal:
list your rooms on our platform for a 12% commission,
and we will send you customers you would not otherwise reach.
Hotels signed up;
travelers followed, because the inventory was there,
and by the early 2010s,
Booking.com was the dominant hotel search platform across Europe and much of Asia.
Then the commissions started climbing.
By 2019, many hotels were paying 25-30% per booking,
plus additional fees for “preferred placement” near the top of search results.
Hotels that declined to pay for placement found themselves buried behind those that did.
The traveler experience degraded too:
search results increasingly reflected who had paid for prominence,
not which hotel best matched the search.
Hotels understood what had happened,
but they were locked in.
Their repeat customers now booked through Booking.com rather than directly,
because that was where travelers looked.
A hotel that left the platform did not take its customers with it—those
relationships belonged to the platform.
Leaving meant losing access to a market the platform now controlled.
Cory Doctorow named this pattern enshittification.
A platform enters a market by offering a service below cost to build a user base.
Once users are locked in,
it begins subsidizing business customers,
using the captive user base as leverage.
Once business customers are also locked in,
it harvests both by
degrading service quality,
raising prices,
and extracting the maximum value from a market it now controls.
The platform can do this because the switching costs that lock users in
also protect it from competitive consequences [Doctorow2022].
Enshittification depends on two things:
network effects and structural lock-in.
Neither is not specific to digital platforms.
The record club Columbia House ran the same play in a different era.
Launched in the 1950s in the United States and Canada
and later extended to the United Kingdom, Australia, and Brazil,
it offered new members twelve records or cassettes for a penny.
The first transaction was a real deal.
The extraction came later:
members committed to purchasing eight more titles at “regular club prices,”
which were two to three times the retail price of the same albums in a shop.
If a member forgot to decline the monthly selection,
it arrived automatically and the cost appeared on their bill.
The penny offer built the membership;
the commitment structure extracted the value.
Columbia House recruited around sixteen million members in the United States alone
before the model collapsed when digital music eliminated the inventory advantage.
In 1986, a corporation better known for making women’s underwear
acquired JanSport and, over the following decades,
bought nearly every backpack brand with a reputation for durability.
Controlling more than half the US backpack market,
it had no competitive pressure to maintain quality.
Fabric thickness dropped, cheaper zippers replaced better ones, and stitching density fell.
The products looked identical on the shelf;
customers discovered what they had actually bought when the stitching pulled apart at the stress points.
JanSport continued to advertise a lifetime warranty,
and the suggestion “just use the warranty” sounds entirely reasonable.
In practice, using it required paying $12 to $25 in return shipping,
waiting three to six weeks,
and arguing that a failure qualified as a “defect in materials and workmanship”
rather than “normal wear and tear.”
(The warranty language was not incidental;
it was written to exclude precisely the kind of failure
that was now designed into the product.)
One customer,
when told their zipper failure was wear and tear,
got quotes of $50 to $100 from local tailors,
then bought a used bag at a thrift store for four dollars rather than a new one.
Grab, the ride-hailing and delivery platform dominant across eight countries in Southeast Asia,
followed the same trajectory at scale.
It entered markets including Malaysia, Indonesia, Vietnam, Thailand, and the Philippines
with driver incentives and passenger subsidies
that made rides cheaper than local alternatives.
It acquired Uber’s Southeast Asian operations in 2018,
eliminating its main competitor outright.
With competition gone,
driver commissions rose and passenger fees increased.
The platform began requiring restaurants and drivers to pay for placement
in its food-delivery and services listings.
The pattern was identical to Booking.com’s,
conducted in markets where regulatory capacity to respond was considerably thinner.
Investor dynamics accelerate enshittification.
Platforms in their subsidy phase operate at significant losses,
funded by venture capital in expectation of eventual monopoly returns.
Once the platform achieves dominance,
those investors demand extraction.
Losses during the subsidy phase are booked as investment;
extraction during the harvest phase is booked as profit.
The users who benefited from below-cost service in year one
fund those returns in year ten.
Antitrust enforcement has provided limited relief.
Because acquisitions of potential competitors are evaluated on
whether they raised consumer prices immediately—not
whether they reduced competition structurally—dominant platforms can acquire dozens of rivals
before those rivals threaten their market share.
The result is that enshittification faces no real competitive check:
alternatives are acquired or driven out during the subsidy phase.
Antitrust law in the United States was supposed to prevent this.
Beginning in the 1970s,
however,
it was weakened by a legal school that argued courts should evaluate mergers solely
on whether they reduced consumer welfare in the short run.
Under this standard,
acquisitions of nascent competitors were not anticompetitive
if the acquired company had not yet raised prices.
Platform companies used this framework to acquire dozens of potential competitors
before they could grow large enough to threaten market share.
The current debate over platform regulation is therefore partly a debate about
whether antitrust law should return to its original concern with concentration of power
[Doctorow2025,Shapiro1999].
Intellectual Property as Invention
Copyright, patent, and trademark are not natural rights:
they are legal instruments invented at specific times to serve specific interests.
The expansion of intellectual property rights over the past forty years was a deliberate political project,
pursued by specific industries,
over the objection of economists who predicted (correctly) that it would harm innovation.
Understanding this history is essential for evaluating current claims
about AI training data, open source licensing, and platform ownership of content created by users.
The Statute of Anne,
enacted in England in 1710,
is generally identified as the first copyright law.
It was not passed to protect authors,
but to end a monopoly held by the London Stationers’ Company,
a guild of printers that had controlled the trade in printed books since the sixteenth century,
and to create a new system of publisher monopolies that would operate for limited terms.
Authors received rights in the statute,
but only to the extent that they assigned those rights to publishers,
who were the real beneficiaries of the act.
The framing of copyright as a natural right of creators,
which dominates popular and political discussion today,
was a later construction that reversed the actual historical origin [Bracha2019].
Patent and copyright terms have been extended repeatedly,
almost always to be longer, broader, and allow fewer exceptions.
United States copyright terms have moved from 14 years under the Copyright Act of 1790
to the current life-plus-70 years.
(The 1998 Sonny Bono Copyright Term Extension Act was timed
to prevent the earliest Mickey Mouse films from entering the public domain.)
Patent terms have been less dramatically extended,
but the scope of what is patentable has expanded substantially,
particularly with the rise of software patents.
Each extension was the result of lobbying by specific industries,
over the objections of economists
who consistently argued that the existing terms were already longer than necessary to incentivize creation.
Pharmaceutical companies have been extremely effective at shaping intellectual property law globally.
Beginning with the Agreement on Trade-Related Aspects of Intellectual Property Rights (TRIPS) in 1994,
high-income countries required all members of the World Trade Organization
to adopt patent standards that matched or exceeded their own.
Countries that had deliberately excluded pharmaceutical products from patentability
in order to build domestic generic drug industries,
including India, Brazil, and several African countries,
were required to change their laws.
The result was a substantial transfer of rents to pharmaceutical companies operating in rich countries,
and a significant increase in drug prices in countries that could least afford them.
Software patents are a category of intellectual property right
whose practical operation differs substantially from the stated purpose of the patent system.
The original argument for patents is that they encourage disclosure:
an inventor gets a time-limited monopoly
in exchange for publishing a description of the invention
sufficient to allow others to replicate it after the term expires.
In contrast,
software patents are often written at high levels of abstraction,
covering broad categories of computational method rather than specific implementations.
They are concentrated in the portfolios of large technology companies, financial institutions,
and patent assertion entities:
organizations that hold patents without producing anything,
but derive revenue entirely from licensing and litigation.
Software patents are used primarily against smaller companies and open source projects,
which lack the resources to defend extended litigation.
Open source licensing emerged in part as
a response to the expansion of intellectual property rights over software.
The GNU Public License (GPL) and its relatives use copyright law against itself:
the license grants broad permissions
that apply only on the condition that derivative works carry the same license,
making it legally difficult to take open source software and close it.
This kind of copyleft has only been a partial success:
big tech companies have learned to extract enormous value from open source software
while contributing relatively little back,
because their primary product is not the software but the service running on it
[Doctorow2022,Baldwin2014,Bellos2024].
Setting the Standard
In May 1886,
American railroads completed the largest coordinated industrial operation the world had ever seen.
Over two days,
work crews across the South pulled up thousands of miles of track
and relaid it three inches closer together,
converting the region’s idiosyncratic gauge to match the rest of the country.
The process was planned, organized, and executed in less time than most tech companies take
to schedule a product launch.
The southern railroads had resisted this change for decades
because the incompatibility was not an accident.
Different gauges meant that northern rolling stock could not run on southern track,
so freight crossing a gauge boundary had to be unloaded, transferred, and reloaded.
Every cargo shipment lost time and money at the border,
and that money went to the carriers who controlled the boundary.
The technical choice was a market weapon.
A technical standard is an agreement about how things connect.
The value of having a standard usually far exceeds
the value of any particular outcome in the format war that precedes it,
which is exactly why control of the format is worth fighting for.
Thomas Edison understood this in the 1880s,
when he and George Westinghouse fought over whether the United States would run on direct or alternating current.
Edison had bet on DC infrastructure and stood to lose if AC won,
so he electrocuted animals at exhibitions
and lobbied for AC to be used in the first electric chair,
hoping to fix the association of alternating current with execution in the public mind.
None of this changed the underlying physics, though;
AC transmits over longer distances at lower cost,
and Westinghouse won.
A century later,
Microsoft repeatedly used standards capture as a business strategy.
The company would adopt an open standard like HTML, Java, or Kerberos authentication,
and then add proprietary extensions that were convenient but tied developers to Microsoft’s version.
Once enough code was written against the extensions,
interoperability with non-Microsoft systems degraded.
The strategy had a name inside the company: embrace, extend, extinguish [Russell2014].
(The “extinguish” referred to both the open standard and Microsoft’s competitors.)
Google’s Accelerated Mobile Pages, launched in 2015,
was presented as an open standard for fast-loading mobile web pages.
To receive preferential treatment in Google’s mobile search results,
however,
pages had to be hosted on Google’s own servers and delivered through Google’s infrastructure.
Publishers who adopted AMP handed Google control over their content delivery
in exchange for a favorable page rank.
Google eventually backed away from the hosting requirement,
but only under regulatory pressure.
Apple took a different approach.
Rather than capturing an open standard, it simply refused to implement one.
RCS is a modern successor to SMS that supports end-to-end encryption,
read receipts,
high-resolution media,
and group chat.
For years after Android phones had adopted it,
Apple kept cross-platform messaging on unencrypted SMS,
so messages between iPhones and Android phones appeared as green bubbles.
Internal documents revealed in US antitrust litigation showed Apple executives explicitly acknowledging
that adopting RCS would reduce the social cost of switching from iPhone to Android,
particularly for teenagers for whom the green-bubble distinction had become a social marker.
The EU’s Digital Markets Act eventually compelled Apple to support RCS in 2024.
When a technology becomes embedded in a formal standard,
the patent holder typically commits to license it on FRAND terms:
fair, reasonable, and non-discriminatory.
This sounds reassuring,
but the word “reasonable” has funded decades of litigation.
Qualcomm’s licensing practices for mobile baseband patents
(the technology that connects a phone to a cellular network)
were subject to antitrust proceedings simultaneously in the United States, Europe, South Korea, China,
and Taiwan in the late 2010s.
The core allegation was consistent across jurisdictions:
Qualcomm was using its position as the patent holder
o charge fees that bore no relationship to any recognizable interpretation of “reasonable.”
The legal outcomes varied by country;
Qualcomm remained profitable throughout.
During the browser wars of the late 1990s,
Microsoft bundled Internet Explorer with Windows,
deliberately undercutting Netscape’s business model.
IE’s market dominance then allowed Microsoft to implement proprietary HTML extensions
that worked only in its browser.
What finally broke this cycle was not antitrust action—which produced a consent decree
and little structural change—but the arrival of Firefox and then Chrome,
backed by organizations with different incentives.
If you are a developer building a product,
every choice between an open standard and a proprietary API is
a bet on the future behavior of the platform that owns the API.
History suggests the bet is a bad one.
From Instagram’s third-party API and Twitter/X’s developer API
to Google Reader and Firebase’s original pricing structure,
platforms have repeatedly changed terms after developers are so invested that they can’t walk away.
Each time,
the platform points to its terms of service,
which always reserves the right to change.
The structural remedy for this is interoperability mandates:
legal requirements that platforms accept connections from competing services
on terms that do not depend on the platform’s goodwill.
Phone number portability,
which required carriers to let customers keep their numbers when switching providers,
eliminated one of the most effective lock-in mechanisms in telecoms.
Messaging interoperability requirements in the EU’s Digital Markets Act
are attempting to do the same thing for social platforms.
Whether they succeed depends on implementation details that are actively contested,
and on enforcement that companies have historically resisted every way they can
[Shapiro1999,Wu2010].
Public Subsidy, Private Profit
When Steve Jobs unveiled the iPhone in January 2007,
the crowd responded as if Apple had conjured something from nothing.
What neither Jobs nor the press mentioned was that
every technology in that device had been developed with government money.
The internet it connected to had been built by the Defense Advanced Research Projects Agency.
The GPS it used had been developed and maintained by the US Air Force,
which had turned off the deliberate signal degradation for civilian users only seven years earlier.
Its touchscreen came from research supported by the National Science Foundation,
which Apple had acquired by buying a small company called FingerWorks in 2005.
Siri, added to the iPhone 4S in 2011, had started as a DARPA-funded project at SRI International.
The government took the risk;
the investors who held Apple stock reaped the benefits.
This pattern is not unique to Apple.
The internet began as ARPANET, a network funded by the Department of Defense from 1969
to connect university research computers.
The initial packet-switching protocols,
the domain name system,
and the basic architecture of what became the web
were all developed in publicly funded laboratories and universities.
The commercial internet of the 1990s built on this foundation without paying for it.
The browser itself emerged from the National Center for Supercomputing Applications,
funded by the National Science Foundation.
Mosaic was not a startup product:
It was a research project paid for by American taxpayers.
The drug industry runs the same arrangement at enormous scale.
The National Institutes of Health spends roughly $47 billion annually on biomedical research.
Much of this money funds the basic science
that pharmaceutical companies would not fund themselves because the returns are too distant and uncertain.
When that basic science produces a promising compound,
private companies license it, conduct clinical trials, and patent the result.
The public paid for the underlying knowledge.
The private company captures the patent and sets the price.
Unlike every other wealthy country.
the United States has no legal mechanism to negotiate that price.
As a result,
insulin costs American patients ten times what Canadians pay.
The mRNA vaccine platform that produced the Pfizer-BioNTech and Moderna COVID-19 vaccines
illustrates this dynamic precisely.
The fundamental science of mRNA delivery
was developed over decades by Katalin Karikó and Drew Weissman at the University of Pennsylvania,
supported by the National Institutes of Health.
When COVID-19 hit,
the US government funded the clinical trials and pre-purchased hundreds of millions of doses
before any vaccine had received authorization.
The government provided the science, the capital, and the guaranteed market.
Moderna became a $200 billion company,
and its executives became very rich.
The NIH’s claim to a share of the intellectual property—which
would have given the government some leverage over pricing—was
disputed by Moderna and ultimately not enforced.
The economist Mariana Mazzucato has called this arrangement
the privatization of gains and the socialization of losses.
Her argument is not that private companies add no value:
they obviously do, in manufacturing, distribution, and application.
Her argument is that
the standard story of heroic private entrepreneurs taking risks that timid governments would never accept
inverts the actual history.
Governments took the foundational risks
by funding research that might produce nothing,
maintaining infrastructure that would not attract private capital,
and training the scientists and engineers that firms would later hire.
Technology transfer moves the results into private hands,
almost always at prices that dramatically undervalue the public’s investment
[Mazzucato2013].
The tech companies that have benefited most from publicly funded research
are also among the most sophisticated users of international tax structures
designed to minimize what they pay back into the public systems that enabled them.
Apple’s arrangements in Ireland,
described in a 2016 European Commission ruling,
allowed the company to pay an effective tax rate of 0.005% on European profits.
Over and over,
public investment creates the technology,
private firms capture the profits,
and international tax structures ensure that
only a tiny fraction of those profits flow back into the public budget.
The cycle is effectively a permanent subsidy.
The arrangement looks different in Europe,
partly because European governments built in mechanisms that Americans left out.
Germany’s Fraunhofer-Gesellschaft,
a network of applied research institutes funded jointly by the federal government,
state governments,
and industry,
licenses its discoveries under terms that return revenue to the institutes themselves
rather than transferring intellectual property to private buyers at knockdown prices.
The European Medicines Agency negotiates drug prices on behalf of member states,
which is why the same cancer drugs that cost American patients six figures a year
cost Germans and French patients a fraction of that.
When Moderna tried to sell COVID-19 vaccines to the European Union,
EU negotiators paid roughly half the per-dose price that American buyers paid,
for the same product developed with the same publicly funded science.
The European model still lets private firms profit substantially from public investment.
What it does not do is treat the transfer as a gift.
China has taken a third path that makes the American arrangement look like an oversight rather than a design.
Programs like Made in China 2025, announced in 2015,
identify strategic industries like semiconductors, electric vehicles, robotics, and artificial intelligence,
and pour in state capital with the explicit goal of domestic ownership of the results,
not just the benefits.
Companies like CATL,
which now supplies roughly a third of the world’s electric vehicle batteries,
grew to global scale with protected home markets and state-backed financing before competing internationally.
The distinction between public and private in this system is deliberately blurry:
the government can require access to technology developed with state support,
block the international transfer of profits,
and redirect corporate strategy in ways that American or European regulators legally cannot.
This does solve the problem Mazzucato describes,
since the state that takes the foundational risk never fully loses its claim on the result.
It creates a different problem, though:
accountability runs upward to the Communist Party rather than outward to citizens,
and the line between a national champion and an instrument of state policy
disappears entirely [Acemoglu2023,Chang2012,Webber2011].
In order to understand how the world works,
we have to understand how people think.
That’s a tall order,
so the sections below focus on a few things that I’ve found particularly useful.
In 1971,
Daniel Kahneman and Amos Tversky ran a simple experiment.
They told participants that a disease was expected to kill 600 people
and asked them to choose between two public health programs.
Program A would save exactly 200 people.
Program B had a one-in-three chance of saving all 600 and a two-in-three chance of saving none.
Most people chose A, i.e., they preferred the certain outcome.
Then Kahneman and Tversky rephrased the choice.
Program C would result in exactly 400 deaths.
Program D had a one-in-three chance that nobody would die and a two-in-three chance that all 600 would die.
Statistically, the two programs are identical,
but this time, most people chose D.
Nothing changed except how the outcomes were described.
Classical economics assumes that people are rational agents who consistently maximize their own utility.
When faced with a choice they weigh expected outcomes,
discount the future at a consistent rate,
and select whatever serves them best.
This has been proven false over and over again,
but persists because—well, we’ll get to that later in this essay.
Behavioral economics looks at how people actually make decisions,
and has repeatedly shown that they deviate from “rational” in predictable ways.
The first problem with the rational-agent model is computational.
Optimizing requires evaluating all possible options against all possible outcomes under all possible conditions.
No one can actually do this,
so instead,
people use a strategy that Herbert Simon called satisficing:
they search through available options until they find one that is good enough and then stop.
Herbert Simon called this bounded rationality:
people are rational within the limits of the information, time, and cognitive capacity they actually have,
which makes the heuristics people use to make decisions worth studying.
Kahneman and Tversky spent decades cataloguing people’s heuristics
and the cognitive biases they embody.
Anchoring is one of the most reliably reproduced findings in all of psychology.
When people estimate an unknown quantity,
their estimates are heavily influenced by numbers they have recently encountered,
even ones they know to be irrelevant.
In one study,
participants spun a wheel rigged to land on either 10 or 65,
then estimated the percentage of African countries in the United Nations.
Those who had seen 65 guessed about 45 percent higher than those who had seen 10.
They knew the wheel was random,
but the number shaped their thinking anyway.
This isn’t stupidity or laziness;
it is the brain doing something that is sensible in most contexts.
Nearby numbers are usually informative,
so most of the time, it makes sense to rely on them.
This is why prosecutors set high anchor charges:
juries’ verdicts cluster around the opening number.
It is also why retailers display high “original” prices:
customers anchor to whatever is crossed out.
And research on salary negotiation consistently shows that
the person who names the first number has the advantage,
which is why negotiating advice boils down to the same instruction:
speak first.
The availability heuristic says that
people estimate how likely something is by how easily they can think of examples.
After a plane crash receives extensive media coverage,
people overestimate the risk of flying and underestimate the risk of driving,
even though the underlying statistics have not changed.
The availability heuristic is why catastrophic but rare events dominate public attention
while slow, diffuse harms are systematically underestimated,
which in turn is why it took decades to build public pressure
around tobacco, lead paint, and vehicle safety [Kahneman2011].
Prospect theory describes how people actually evaluate outcomes.
The key finding is loss aversion:
a loss of a given size produces roughly twice the emotional impact of an equivalent gain.
This asymmetry has practical consequences wherever people have a reference point they are trying to protect.
Studies of taxi drivers in New York, Singapore, and other cities show that
drivers work longer hours on bad days when earnings are below their daily target,
and knock off early on good days.
A rational agent who cares about total earnings would do the opposite,
working more hours when conditions are favorable and fewer when they are not.
Instead,
drivers are managing losses relative to a reference point,
not maximizing total income.
Similarly,
the standard model predicts consistent discounting:
a reward next month should be worth a fixed percentage less than the same reward today,
and the same percentage should apply to any two adjacent future periods.
What people actually show is hyperbolic discounting:
an extremely steep preference for the present relative to any future point,
combined with much flatter preferences among future periods.
This is why someone can genuinely plan to quit smoking next year
while lighting a cigarette.
It is why gym memberships are purchased with full intention and then rarely used.
Our future selves are strangers, and we are generous to ourselves and stingy with strangers.
If small changes in how choices are presented can have large effects on behavior,
then choice architecture—the deliberate design
of decision environments—is itself a policy tool.
Thaler and Sunstein called one form of it nudging.
The canonical example is pension enrollment.
When workers must actively opt in to a pension plan,
participation rates are typically around 50 to 60 percent.
When workers are enrolled unless they actively opt out,
participation rises to 80 to 90 percent,
without any change to the financial terms.
The UK government introduced automatic pension enrollment in 2012;
by 2019,
over ten million additional workers had joined workplace pensions as a direct result.
The UK’s Behavioural Insights Team, established in 2010,
found that adding “Nine out of ten people in your area pay their taxes on time”
to letters sent to late tax payers increased on-time payment rates by several percentage points.
The intervention cost essentially nothing and recovered tens of millions of pounds in additional revenue.
Nudges like thiat are not manipulation in the obvious sense—nothing is hidden and no options are removed.
But the line between a nudge and a shove depends entirely on whose interests the design serves.
Automatic enrollment in a pension plan serves the worker.
Automatic enrollment in a subscription that is difficult to cancel serves the company.
Variable reward schedules designed to maximize platform engagement
are also nudges, built on the same science, serving a different master.
Every infinite scroll, every notification badge,
every “people who liked this also liked” recommendation
is a behavioral economics intervention.
The field that began by documenting human irrationality
has become the primary toolkit for industrializing it [Thaler2009].
People Care About Fairness
Imagine you are given ten dollars to split with a stranger.
You can offer them any amount you like.
If they accept, you both keep your shares,
but if they reject the offer,
neither of you gets anything.
A purely self-interested stranger, according to classical economics,
should accept any positive offer—even one dollar—because one dollar is better than nothing.
When researchers ran this experiment across dozens of countries,
they found that offers below thirty percent of the total were rejected roughly half the time:
people would rather walk away with nothing
than accept an outcome they perceived as unfair.
In some communities, rejection rates were even higher.
This experiment, called the ultimatum game,
has been run so many times and reproduced so reliably
that its basic finding is no longer seriously contested.
People care about fairness,
punish violations of it at cost to themselves,
and do so even with strangers they will never see again.
This directly contradicts the assumptions that underlie most of modern economics,
much conservative political thought,
and a substantial proportion of technology design.
The idea that human beings are fundamentally self-interested did not emerge from evidence.
It emerged from political argument,
was later dressed in mathematical formalism,
and eventually achieved the status of dogma.
Thomas Hobbes, writing in 1651,
described the natural condition of humanity as “a war of all against all.”
Life without government, he argued, was “solitary, poor, nasty, brutish, and short.”
Hobbes wasn’t reporting on anthropology—he was making a political case for sovereign authority.
If humans are naturally predatory,
then the powerful state he wanted is the only alternative to chaos.
Two centuries later,
Herbert Spencer read Charles Darwin’s account of natural selection
and announced that it confirmed what Hobbes had suspected.
“Survival of the fittest” is Spencer’s phrase, not Darwin’s.
Darwin described differential reproductive success;
Spencer described a cosmic competition for dominance
in which helping the weak was a biological error.
Social Darwinism,
as this cluster of ideas became known,
provided intellectual cover for opposing labor rights and public health measures,
and for fighting almost any intervention that might protect people from market outcomes.
After all,
if the weak lost,
it was because Nature intended them to lose.
By the mid-twentieth century,
economists had turned these self-serving rationalizations into mathematics.
Homo economicus was a rational agent who consistently and accurately maximized his own utility.
He did not care about fairness or make mistakes;
he also didn’t care about other people’s welfare unless it affected his own.
(And yes, I’m using the male pronoun deliberately.)
The behavioral economics research described in an earlier post explains how this is fiction,
but the model is not just wrong about how people think,
but about what they want.
The ultimatum game is one of dozens of experiments
that have been used to study human social preferences across cultures.
Public goods games ask participants to contribute to a shared fund that pays out to everyone,
including those who contribute nothing.
Standard economic theory predicts that rational individuals will free-ride
(i.e., contribute nothing while collecting their share of others’ contributions)
until the fund collapses.
In practice, initial contribution rates are typically between forty and sixty percent,
and when participants can identify and punish free-riders,
contribution rates rise and stay high.
People punish free-riders even when punishment costs them something.
What’s more,
they do it in one-shot interactions where there is no future reputation at stake.
Samuel Bowles and Herbert Gintis spent years synthesizing this evidence,
arguing that humans evolved not just as individuals competing for resources
but as groups competing against other groups.
Cooperation within groups enforced by altruistic punishment of defectors was
a successful evolutionary strategy.
The capacity for that cooperation,
and the emotional responses that sustain it like fairness, shame, and indignation,
are as deeply embedded in human nature
as any appetite for self-interest [Bowles2011,Bregman2020].
None of this means humans are angels.
Self-interested behavior is real.
But so is cooperation, fairness, and punishment of norm violation.
The question is which tendencies a given institutional design tends to elicit.
The Commons is Not a Tragedy
In 1968, the ecologist Garrett Hardin published an essay
in which he described a common pasture open to all herders.
Each one, acting rationally in their own interest,
would add animals to the pasture until it was destroyed.
The gains from each additional animal went to the individual herder,
but the costs of the degraded pasture were borne by all.
Self-interest would, inevitably, exhaust the commons.
The only solutions Hardin could see were privatization or state regulation.
“The Tragedy of the Commons” became one of the most cited papers in academic history.
It appeared in economics, political science, environmental policy, and law.
Its intellectual framework shaped fisheries policy, water rights law,
and debates about global climate governance [Hardin1968].
There was just one problem:
Hardin hadn’t studied any actual commons.
He had described an unmanaged commons with no rules, no governance, and no community.
The historically managed commons of medieval England,
the Alpine meadows of Switzerland,
the forest communities of Japan,
and the irrigation systems of Valencia and Bali all had elaborate rules developed over generations,
mechanisms for monitoring compliance,
graduated sanctions for violations,
and processes for resolving disputes.
They had been managing shared resources sustainably, in some cases, for centuries.
The real tragedy in Hardin’s work was his ignorance of how the real world actually worked.
In contrast,
the political scientist Elinor Ostrom spent her career studying actual systems.
The picture that emerged was not a tragedy,
but a sophisticated diversity of institutions,
each one adapted to local conditions,
and each one solving the collective action problem
that Hardin had assumed was unsolvable without markets or states.
In Governing the Commons,
Ostrom identified eight design principles that successful self-governing commons tend to share:
Members have clearly defined rights to the resource.
Rules are adapted to local conditions rather than imposed from outside.
People affected by the rules have meaningful input into changing them.
A system exists for monitoring both the resource and the behavior of users.
Sanctions are graduated—minor violations draw minor consequences.
Conflicts can be resolved quickly and cheaply.
External authorities recognize the community’s right to self-organize.
Larger systems are built from nested smaller ones.
In 2009, Ostrom was awarded the Nobel Prize in Economics for her work.
The prize committee cited her demonstration that
“economic analysis can shed light on most forms of social organization.”
What her work actually demonstrated was narrower and more radical than that:
that communities could govern shared resources sustainably
without either privatizing them or handing them to the state,
and that the dominant theoretical model had failed to predict this
because it had assumed the wrong things about human nature.
If the evidence against homo economicus is this extensive,
why does the model retain such a hold on policy and institutional design?
Part of the answer is that the model is self-fulfilling in a useful way.
If you design a system that assumes people will free-ride,
you build in monitoring, penalties, and enforcement mechanisms.
Those mechanisms signal distrust,
which tends to erode the social norms that sustained voluntary cooperation.
People who might have contributed voluntarily
now respond to being treated as suspects.
The system that assumed selfishness produces the selfishness it expected.
In contrast,
Ostrom’s communities worked partly because the institutions expressed trust:
users had a voice in the rules,
sanctions were proportionate rather than punitive,
and the system treated people as members of a community rather than as threats to be managed.
Technology platforms have largely chosen the other path.
Terms of service are written for adversaries.
Moderation systems treat all users as potential bad actors.
Engagement optimization assumes that appetites can be exploited.
These choices reflect a theory of human nature,
and that theory has consequences—not just for the products built on it,
but for the kind of behavior those products elicit and reward.
Ostrom’s lesson is not that humans always cooperate.
It is that cooperation is a realistic outcome
if systems are designed to support it,
and that assuming the worst tends to prevent the better from occurring.
The tragedy of the commons was not inevitable.
It was what happened when community governance was absent.
Building that governance, it turns out, is something humans are rather good at,
so long as institutions give them room to try [Ostrom2015].
People Care About Appearances
In 2001, the Norwegian government made its tax records publicly searchable online,
so that every citizen could now look up what any other citizen earned.
This was not entirely new—the country’s tax data had theoretically public for years—but
the internet made it frictionless.
Journalists could now scrub entire neighborhoods,
neighbors could check each other out,
and colleagues could compare their salaries with one another’s.
Ricardo Perez-Truglia used this moment as a natural experiment [PerezTruglia2020].
He tracked self-reported well-being before and after the records went online
and found that the gap between higher- and lower-income Norwegians widened by 29%.
Absolute incomes did not change;
what did was knowing how you compared to other people.
This is the central finding of research on social standing:
what people care about is not how much they have in absolute terms,
but where they stand relative to those around them.
It explains a long list of behaviors that seem irrational under standard economic assumptions.
Thorstein Veblen noticed this in 1899,
before there were smartphones or social media
(or economists to argue with his heresy).
The Theory of the Leisure Class
introduced the term conspicuous consumption
to describe spending whose primary purpose is to signal social rank [Veblen1899].
His key insight is that the signal only works if it is costly:
something that only the wealthy can afford communicates rank precisely because of its price.
Similarly,
in a world where most people have to do physical labor,
conspicuous leisure is only possible for the rich.
As leisure became more broadly available,
the signal shifted:
today,
being seen to be overworked and constantly in demand signals high status:
the business traveler at the airport in the expensive suit checking email at midnight
is the modern equivalent of the nineteenth-century aristocrat who demonstrably never lifted anything heavy.
“Being seen” may be the most important part of the previous sentence.
Invisible labor like housework,
mentoring junior colleagues,
or smothering your feelings for the benefit of others
has lower status.
It is usually dumped on women, members of minoritized groups, and the economically disadvantaged,
which creates a vicious circle.
Veblen pointed out that status competition is structurally self-defeating.
If I buy a larger house to signal rank and my neighbors respond by buying larger houses,
we have all spent money and all returned to the same relative position.
The competition is real but the gains are illusory;
the spending continues because the first person to stop stops loses ground to those who don’t.
Robert Frank built on Veblen’s work
with a careful study of wage patterns within firms [Frank1985].
Standard economics predicts that workers will always move toward higher absolute pay:
if they can earn more elsewhere, they will go elsewhere.
Frank found that this prediction fails systematically.
Workers at the bottom of a firm’s pay distribution are paid above their marginal productive value,
while workers at the top are paid below it.
The spread is not random:
it is consistent with workers accepting lower total pay in exchange for higher rank within their peer group.
The implication is that a programmer who is the highest-paid person on a small team
may prefer that position to being a lower-ranked member of a higher-paying team,
even if the absolute salary differential favors the larger team.
This is not irrationality:
rank confers real benefits,
so trading some income for rank is a sensible exchange.
Standard economics fails to predict the trade only because it refuses to count rank as a good.
Frank’s local-rank argument helps explain the consistent finding in salary surveys
that the highest correlate of worker satisfaction is not absolute pay
but pay relative to colleagues doing similar work.
Across many countries and industries,
fairness within the reference group matters more than the number itself.
Fred Hirsch introduced the concept of positional goods,
whose value depends on how many other people have it [Hirsch2015].
A house with an ocean view is a positional good:
if everyone had a house with an ocean view, the view would cease to confer distinction.
A senior job title,
a degree from a prestigious school,
or a table at an exclusive restaurant are all examples.
Hirsch pointed out that positional goods cannot be democratized.
Refrigerators and mobile phones can eventually be afford by almost everyone,
and everyone genuinely benefits.
Positional goods cannot work this way.
For example,
if a prestigious university expands admissions to let in everyone who wants to attend,
its value signal collapses.
This is precisely what has happened with university degrees in wealthy countries since the 1960s.
When only a small fraction of the population held degrees,
a degree signaled something.
As participation rates rose from 5 percent to 50 percent,
the same degree began to signal much less,
so the game shifted to which university,
then to postgraduate qualifications,
then to increasingly specific institutional prestige.
Each generation has to spend more to achieve the same relative position as the previous one.
This is not a problem that can be solved by making university cheaper or more accessible:
that simply changes the positional good everyone is competing for.
The empirical case that rank rather than income drives well-being
has been built up over two decades.
An analysis of the British Household Panel Survey,
which tracked thousands of households over many years,
found that once income rank was included in the model,
absolute income had no statistically significant effect on life satisfaction [Boyce2010].
What predicted whether someone was satisfied with their life was
where they stood compared to their peers.
Wilkinson and Pickett extended this argument at the national level
with evidence that more unequal societies perform worse on almost every social indicator,
regardless of their average wealth [Wilkinson2011].
More equal societies have lower rates of homicide, mental illness, obesity, teenage pregnancy, and imprisonment.
They have higher rates of trust, social mobility, and life expectancy.
This pattern holds across wealthy countries:
the United States, the United Kingdom, and Portugal,
which are among the most unequal wealthy nations, perform poorly;
Japan, the Nordic countries, and the Netherlands, which are among the most equal, perform well.
The causal mechanism is status anxiety:
higher inequality creates steeper hierarchies,
which produce more corrosive competition for rank.
Which brings us to social media.
Before digital platforms, status competition based primarily on physical proximity:
you compared yourself to your neighbors, colleagues, and relatives.
Platforms have replaced that bounded reference group with a global feed
curated by algorithms optimized for engagement rather than accurate representation.
The comparison you are now offered is not with your actual neighbors.
It is with the most aspirational version of everyone you have ever met.
The result in South Korea, India, the UK, and Brazil
is an intensification of status anxiety without any corresponding change in absolute circumstances.
Someone whose life is objectively comfortable
can be made to feel inadequate by a platform that continuously serves them evidence
that other people are more attractive, a better parent, or has traveled more widely.
Social media platforms did not create the desire for status.
What they did was put that desire on a subscription model,
charge advertisers to place products in the resulting stream of anxiety,
and call the resulting business a social network.
It’s a game that only they can win.
Corporations are Psychopaths
In October 2011,
Michael Woodford received what should have been the best news of his career.
After thirty years working at Olympus,
the Japanese optics and medical equipment company,
he had been made chief executive:
the first non-Japanese person to run the company in its history.
Six months later, he was fired.
Woodford had found something wrong with a series of acquisitions the company had made.
The amounts paid were enormous,
the assets were nearly worthless,
and the accounting explanations made no sense.
He hired KPMG to look into it and took the resulting report to the chairman,
Tsuyoshi Kikukawa.
Kikukawa’s response was to call an emergency board meeting and vote Woodford out.
Woodford went public.
The Olympus board denied everything for a few weeks,
but then the numbers collapsed, Kikukawa resigned, and criminal charges followed.
The fraud—a sustained effort to conceal $1.7 billion in losses—had been running for nearly twenty years,
through the tenure of multiple CEOs.
What is notable about the Olympus scandal is not that individuals behaved dishonestly.
It is that they weren’t necessarily bad people.
It was as if the organization had developed a mind of its own,
and successive leaders served it rather than the other way around.
Three years before the Olympus scandal,
the Canadian legal scholar Joel Bakan asked Robert Hare,
the psychologist who had spent four decades developing
the clinical tools used to diagnose psychopathy,
to evaluate a publicly traded corporation against his Psychopathy Checklist
as if the corporation were a person [Bakan2005].
The checklist was designed to identify individuals who are:
callously indifferent to harm they cause to others,
skilled at charming and manipulating people around them,
incapable of genuine guilt or remorse,
unwilling to accept responsibility for their own failures, and
willing to lie and deceive when they believe they can get away with it.
Hare’s conclusion was that publicly traded corporations fit the profile.
In most jurisdictions,
corporate executives have a fiduciary duty to shareholders:
they are legally required to pursue shareholder interest,
and a board that sacrificed profit to benefit workers or communities
with no defensible business justification
could be held legally liable.
The resulting entity is therefore prohibited from having a conscience
in the way an individual person might.
It can behave ethically when ethics is good for the brand,
but not when the cost cannot be justified by future returns.
None of this requires any individual inside the organization to be a bad person.
It requires only that the rules governing the organization create incentives
that produce a certain kind of behavior.
However,
this argument becomes harder to sustain
when you look at who rises to the top of large organizations.
In 2005,
Belinda Board and Katarina Fritzon surveyed 39 senior managers and executives in the United Kingdom
and compared their psychological profiles to a matched group of patients at Broadmoor,
a high-security psychiatric hospital [Board2005].
The executives scored higher than the Broadmoor sample on three personality disorder traits:
histrionic, narcissistic, and compulsive.
The researchers called this pattern “successful psychopathy”:
the traits that lead to hospitalization or criminal conviction in their extreme form are,
in a milder and better-managed form,
associated with reaching senior management.
Paul Babiak and Robert Hare spent years studying
how psychopathic individuals navigate organizational environments [Babiak2019].
Their estimate is that
roughly one percent of the general population meets the clinical threshold for psychopathy,
while corporate managers cluster around three to four times that rate.
The mechanism is not mysterious.
Psychopaths tend to perform exceptionally well in job interviews.
They are confident, articulate, and skilled at saying what an interviewer wants to hear.
They feel no social anxiety in high-stakes situations,
and can fabricate credentials and relationships convincingly
because they feel no guilt about doing so.
Once hired,
they are evaluated primarily on how they appear to those above them in the organization,
and making a strong impression on a small number of people across a limited number of interactions
is something psychopaths do better than almost anyone else.
This is where a closely related concept becomes important:
impression management,
a term introduced by Erving Goffman in 1959 [Goffman1959].
His observation was that social life is fundamentally theatrical:
people perform different versions of themselves for different audiences,
and success in social situations depends heavily on managing those performances.
In a small organization where everyone works closely together over years,
this has limited scope because your actual behavior is too visible.
Colleagues know when you take credit for other people’s work,
when your confident predictions turn out wrong,
and when your charm disappears because you no longer need something from someone.
In a corporation with thousands of employees,
on the other hand,
promotions are typically decided by people who have less direct contact with the person in question.
They evaluate based on presentations, meetings, secondhand reports,
and the impressions formed in a relatively small number of interactions.
This is precisely the environment where impression management skills are most valuable,
and where the gap between managing impressions and actually performing well is hardest to detect.
Researchers who study the dark triad of psychopathy, narcissism, and Machiavellianism
have consistently found that individuals high in these traits do particularly well
in the early and middle stages of corporate careers.
Narcissists project confidence and vision.
Machiavellians are skilled at reading and exploiting organizational dynamics.
Psychopaths can absorb stress,
make decisions that harm others without losing sleep,
and deliver bad news without visible discomfort.
Each of these is a behavior that,
in moderation and over short time horizons,
looks like leadership.
Failure only occurs when a crisis requires something the dark triad cannot supply:
integrity,
honest self-criticism,
or concern for people the leader does not need.
Dutton’s research on which professions attract the most psychopaths put CEO at the top of the list,
followed by lawyer, media professional, salesperson, and surgeon [Dutton2013].
What these jobs share is a combination of high stakes,
limited direct accountability,
and the need to remain calm under pressure,
which are precisely the traits psychopaths happen to have.
Wirecard was a German payments company whose rise was celebrated as a European technology success story.
By 2018 it had joined the DAX, Germany’s index of its thirty largest listed companies.
Its chief executive,
Markus Braun,
appeared at industry conferences as the model of a visionary, unflappable founder.
When the Financial Times published articles suggesting that
large portions of the company’s claimed revenue did not exist,
Germany’s financial regulator filed a criminal complaint against the journalist who wrote them.
When the fraud collapsed in 2020,
€1.9 billion turned out never to have existed.
Braun had not built a company.
He had built an extremely convincing impression of one.
South Korea’s chaebol are a structural variation on the same theme.
The heads of Samsung, Lotte, SK, and others
have faced criminal convictions for bribery and embezzlement—and received presidential pardons,
typically on the grounds that their imprisonment would harm the national economy.
This pattern of prosecution followed by pardon describes an organization
that has achieved something psychopathic at the institutional level:
the normal consequences of harmful behavior have been suspended
because the organization is too important to be held accountable.
None of this means that every large company is led by psychopaths,
or that organizational scale inevitably produces moral failure.
It means that the selection pressures of large hierarchies are not neutral.
Hiring processes that rely heavily on interviews systematically favor candidates who are good at interviews.
Promotion decisions made by people with limited direct observation
systematically favor candidates who are good at being observed,
and performance reviews based on self-assessment systematically favor
candidates who think highly of themselves.
These processes aren’t designed to select for the dark triad,
but they are all structured in ways that make dark triad traits an advantage.
The Olympus fraud ran for twenty years
because each successive layer of management found it easier to maintain the deception than to stop it.
No individual needed to be a psychopath;
the organization’s incentives reproduced psychopathic behavior regardless of who ran it.
Bad people come and go;
structures that reward bad behavior reproduce themselves.
Why Don’t People Just Say No?
In 2016, Wells Fargo fired 5300 employees for opening millions of fake accounts in customers’ names
without their knowledge.
These were not executives:
they were branch staff, customer service representatives, and personal bankers.
When the scandal became public, initial coverage framed it as individual misconduct.
The problem with that framing was that number “5300”.
You cannot explain mass participation through individual bad character.
Instead, you have to ask
what conditions cause thousands of ordinary people to do something they know is wrong.
Stanley Milgram started asking this question in the early 1960s,
partly in response to the trial of the Nazi Adolf Eichmann,
who had organized the logistics of the Holocaust [Milgram1974].
Eichmann’s defense was that he had simply followed orders.
Hannah Arendt,
covering the trial for the New Yorker, coined the phrase that has not since been improved on:
the “banality of evil.”
Her point was that Eichmann was not a monster;
he was a bureaucrat doing what his organization told him he was supposed to do [Arendt2006].
Milgram wanted to test how far ordinary people would go.
In his experiments, volunteers were told they were measuring the effect of punishment on learning.
An actor in another room pretended to receive electric shocks when giving wrong answers,
and subjects were instructed to increase the voltage with each error.
Most continued well past the point where the actor was screaming and, eventually, silent.
Two thirds of subjects administered what they believed were the maximum possible shocks.
(When the authority was absent or instructions were given by phone,
compliance dropped sharply.)
Milgram’s subjects were not sadists:
they were people responding to the combination of an authority figure,
a legitimate-seeming purpose,
and gradual escalation.
Corporate hierarchies reproduce all of these conditions.
The Wells Fargo employees were never actually instructed to defraud customers.
They were given sales quotas that were mathematically impossible to meet through legitimate means,
put under daily supervision,
and subjected to a culture in which the phrase “eight is great”
(i.e., eight accounts per customer)
was a daily mantra.
Each individual decision was small enough to feel manageable;
employees who raised concerns were sometimes fired,
and the outcome was fraud on a massive scale.
Albert Bandura spent decades studying what he called moral disengagement:
the psychological mechanisms by which people participate in harmful behavior
without experiencing it as harmful.
These include displacement of responsibility (“I was just following orders”),
diffusion of responsibility (“everyone else was doing it”),
euphemistic labeling (calling fake accounts “cross-selling solutions”),
and treating the people being harmed as abstractions rather than as individuals.
Bandura’s insight is that these mechanisms are not
rationalizations invented after the fact [Bandura1999].
They are available in advance,
and organizations learn to activate them.
When tech companies describe user data as “exhaust”,
call manipulative design patterns “engagement optimization”,
or frame advertising surveillance as “connecting people with relevant products”,
they are providing workers with the vocabulary of moral disengagement they need
to get them to do morally repugnant things [Palazzo2025].
The pattern repeats across industries and cultures.
Mitsubishi Motors covered up safety defects for over two decades,
with participation from engineers, quality controllers, and managers who each knew parts of the problem.
And then there’s Volkswagen’s “Dieselgate” scandal,
which became public in 2015 after researchers at West Virginia University found that
cars on the road emitted far more nitrogen oxides than official test results suggested [Ewing2017].
Engineers had written software that detected when a vehicle was undergoing an emissions test
and activated pollution controls that were switched off during normal driving.
Around eleven million cars worldwide contained this code,
across multiple model lines and product generations.
The engineers who wrote it had to design, maintain, and extend the software for years,
which required knowing exactly what it was for.
What changes the equation?
Milgram found that compliance fell dramatically when subjects could see another person refuse to continue.
One dissenting voice—a confederate planted among the subjects—was enough to break the spell,
which shows that the social proof of refusal is as powerful as the social proof of compliance.
The implication is not that everyone needs to be a hero.
It is that if you want ethical behavior from an organization,
you only need a few visible dissenters.
In October 2021,
Frances Haugen,
a former Facebook product manager,
testified before the US Senate Commerce Committee
with copies of thousands of internal company documents
she had secretly copied before resigning [Frenkel2021].
The Facebook Papers showed that
Facebook’s own researchers had found Instagram was worsening mental health among teenage girls,
that the platform’s recommendation algorithms amplified political outrage,
and that the company had repeatedly chosen not to act on these findings
when acting would have reduced engagement.
Haugen’s testimony prompted legislative proposals in several countries,
but produced no significant change to Facebook’s practices.
Facts Alone Don’t Change Minds
Most scientists and programmers’ implicit model of belief is roughly Bayesian:
when someone who believes something about the world receives new evidence,
they update their beliefs in the way that fits that evidence best.
This model is mostly true in domains that people aren’t emotionally invested in,
but fails in predictable ways for beliefs that are tied to group membership.
Research in social psychology has established that
beliefs about contested political and social issues
function primarily as signals of group identity
rather than as conclusions from evidence [SteinLubrano2024].
Holding the wrong belief does not just mean being misinformed:
it puts you outside the group,
so updating the belief means leaving that group.
The social cost of updating is therefore often higher than
the mental cost of staying wrong.
This is why the tobacco industry’s manufactured uncertainty was so effective:
it did not need to be persuasive on the merits.
It only needed to give people with strong social reasons not to update
a plausible excuse to stay where they were.
Motivated reasoning compounds this.
People do not evaluate arguments neutrally.
They are significantly better at identifying flaws
in arguments that lead to conclusions they dislike
than in arguments that lead to conclusions they support.
A trained scientist who is also a gun owner
will scrutinize studies on gun violence
more skeptically than studies on climate change
if their social circle treats the former as identity-threatening but not the latter.
This isn’t dishonesty in the ordinary sense because the person doesn’t know they’re doing it.
The biased scrutiny is real scrutiny;
the flaws they find are often genuine.
But the asymmetric attention means
they have reached a conclusion before their evaluation begins [Kahneman2011].
A colleague once told me that people want data, but believe stories.
This makes sense in light of motivated reasoning:
data provides cover for a decision already reached on other grounds,
while stories transmit the emotional and social context that actually drives belief change.
It is also why the most effective public health campaigns don’t focus on presenting statistics;
they present specific, named people in specific situations.
This rule is also why
industries that want to prevent people from changing their beliefs
fund think tanks that produce reports full of data.
The data itself is not meant to influence people:
instead,
they count on the appearance of rigor,
which mimics legitimate evidence without its substance.
The fossil fuel industry has been running this operation for thirty years,
and tech companies are now doing it as well.
If we want AI, social media, and the software industry in general to be regulated in meaningful ways,
finding and presenting evidence of harm won’t be enough.
We need to change the social context in which beliefs are held
by finding trusted messengers within the relevant communities,
reframing the issue so that updating does not require identity betrayal,
and working through social networks rather than through arguments.
This is not manipulation:
it is taking the psychology of belief seriously [Achen2017,Hoffer2010].
Many years ago
I was briefly part of a university group trying to get
better working conditions for grad students, post-docs, adjuncts,
and other members of academia’s petite bourgeoisie.
(And yes,
we were the sort of people who used terms like “petite bourgeoisie”
to show each other how clever we were.)
One Tuesday evening
an older gentleman showed up to one of our meetings.
He listened patiently as we talked,
then cleared his throat and said,
“You know, there’s probably an easier way to do this.
If you can get a meeting with the dean,
he might—”
“We’ve already tried talking to dean,” someone said dismissively.
The guy nodded.
“I understand that,
but I think that if you say you want to talk about retention rather than—”
“The problem isn’t just retaining people,”
someone else immediately said,
“We need to broaden the intake.”
I think he tried to speak one more time,
only to be cut off in the same way.
As we all went in circles saying,
“Well actually, the real problem is…”
he quietly got up and went to the blackboard.
(We were meeting in an empty classroom,
and yes,
it was long enough ago that they still had blackboards.)
None of us noticed that he was writing,
but when the door closed behind him a few moments later,
we all saw the message he’d left behind:
You have just cut me off mid-sentence three times in less than a minute.
Based on that,
I don’t think a future built by you
will be better than what we have today.
We dismissed it, of course—I mean, hell, he’d been wearing a tie.
But I found out later that he was the first openly gay man
to hold an administrative position at that university,
and that he’d worked for over twenty years
to make admissions and promotions fairer.
Unfortunately,
I learned that from his obituary notice.
I still wonder what I could have learned from him
if I’d been less concerned about impressing people with how smart I was
and more willing to listen.
Thirty years later,
a billionaire named Marc Andreessen published a manifesto
with the intellectual depth and writing style
of something a freshman would throw together in a caffeine-fueled frenzy
after binging on right-wing podcast [Andreessen2023].
Andreessen’s manifesto attacked anyone who thought technology should be regulated,
that its risks should be weighed against its benefits,
or that workers and communities affected by technological change should have a say in it.
Among the heroes he cited was F.T. Marinetti,
the Italian futurist who wrote in 1909 that war is “the world’s only hygiene”
and that civilization should be cleansed of feminism, democracy, and weakness.
Marinetti’s work inspired Benito Mussolini,
the founder of Fascism;
Andreessen is one of the most powerful venture capitalists in the world,
and is one of a growing number of big tech billionaires
who believe they can dispense with the people and the society
that made their success possible.
Andreessen’s manifesto is part of why I’m writing these essays.
His views are repugnant,
and I’m offended by how shallow and superficial his thinking is,
but the real reason is that most people in tech don’t know enough about how the world actually works
to have an immune response to his self-serving bullshit.
I studied engineering as an undergraduate and then became a programmer;
during and after my degrees,
I made more than my share of disparaging jokes about fluffy disciplines
like politics, sociology, and philosophy.
It took me a long time to admit that these are just as rigorous as math and physics,
and that most of my strongly-held beliefs were just
the opening moves in a chess game that others had been playing for centuries.
These essays are, in a way,
an extended apology to some of the people I sneered at
(but only some, because many were just as pretentious as I was).
Another reason I’m writing this is that I am sixty-three years old.
People my age run countries and make life-and-death decisions that affect millions of people.
As unbelievable as it seems to me,
somehow we are the grownups.
In another few years,
though,
we’ll be retired and you will be in charge.
I’d like you to be readier than I was,
and I don’t think more lessons about the Unix shell or version control are going to help.
But I don’t want you to understand the world
just so that I can sleep at night.
I want you to understand is so that you can make it better,
because that’s the greatest adventure of all time.
In the year I was born most of the world’s people suffered under totalitarian rule,
judges could and did order electroshock therapy to “cure” homosexuals,
people could legally be denied jobs because of their skin color,
and women couldn’t open bank accounts without their husband’s permission.
Yes,
a lot of things are bad and/or getting worse,
but look at how far we’ve come.
Look at how many more choices you have than your grandparents did.
Look at how many more things you can know,
and be,
and enjoy.
And most importantly,
look at how many other people can too.
That didn’t happen by chance.
Every time you buy one brand of running shoe rather than another
or take a minutes to vote
you are choosing one future over another.
Every time you help someone do something they couldn’t do before,
you are giving them more more control over their own life.
The world doesn’t get better on its own.
It gets better because we make it better:
penny by penny,
vote by vote,
and one lesson at a time.
The climate crisis, mass extinction, surveillance capitalism,
inequality on a scale we haven’t seen in a century,
the re-emergence of racist nationalism:
my generation could have prevented it,
but decided that quarterly earnings were more important.
The bills for our cowardice, lethargy, and greed are now coming due;
as they do,
we have left you no easy solutions to these problems.
That doesn’t mean there are no solutions at all, though.
The essays that follow will explore a few things I wish I had known earlier:
where power comes from,
how it is used,
how its use is hidden,
and how people have held the powerful accountable and made the world a fairer place.
I’m not going to try to be comprehensive or even-handed,
but I hope you’ll find it informative, entertaining, and inspiring.
High-priority jobs (class H) arrive frequently and are served quickly.
Low-priority jobs (class L) arrive rarely and take longer to serve.
The server always picks the highest-priority job available. Total server utilization $\rho = \rho_H + \rho_L < 1$, so the server has spare capacity on average. Yet low-priority jobs can wait far longer than the utilization level suggests they should.
Static Priority: Starvation at Moderate Load
With a static priority queue, high-priority jobs never yield to low-priority ones. Even when $\rho_H < 1$, high-priority bursts can lock out low-priority jobs for extended periods. The mean wait for low-priority jobs under a static non-preemptive priority queue is:
This diverges as $\rho_H \to 1$ independently of $\rho_L$. As $\rho_H$ approaches 100%, low-priority jobs wait arbitrarily long, even if only a few low-priority jobs ever arrive.
Aging: Solving Starvation Creates Oscillation
The standard remedy for starvation is priority aging: a waiting job’s priority improves over time until it eventually beats even high-priority arrivals. This guarantees finite wait for all jobs.
However, aging introduces a new pathology. When aged low-priority jobs finally burst through, they occupy the server and leave a backlog of high-priority jobs waiting. The high-priority queue then drains, and the cycle repeats — producing oscillating bursts rather than smooth, uniform service.
What aging does
Aging assigns each waiting L job a maximum patience time $T_{\max}$. After waiting $T_{\max}$, the job is promoted to high priority. This caps the worst-case wait: no L job can wait longer than $T_{\max}$ plus one service time.
Practical Implications
Priority queues appear throughout computing:
OS scheduling: interactive processes (high priority) vs. batch jobs (low priority). Linux uses dynamic priority aging (nice values + sleep bonuses) to avoid starvation.
Network QoS: real-time traffic (VoIP, video) vs. bulk data. Traffic shaping with Deficit Round Robin (DRR) or Weighted Fair Queuing (WFQ) guarantees bandwidth shares without starvation.
Database query planning: short OLTP queries vs. long OLAP queries. Resource groups and query timeouts implement a form of aging.
Understanding the Math
Mean wait for two-priority queues
Let $\lambda_i$, $\mu_i$, and $\rho_i = \lambda_i / \mu_i$ be the arrival rate, service rate, and utilization of class $i \in {H, L}$. For a non-preemptive priority queue:
where $R_0 = \tfrac{1}{2}(\lambda_H \overline{s_H^2} + \lambda_L \overline{s_L^2})$ is the mean residual work seen by an arriving customer. The ratio $W_L / W_H = 1/(1 - \rho_H)$ grows without bound as $\rho_H \to 1$.
Why “on average” is not enough
Even when $\rho < 1$, randomness creates bursts of H arrivals. During a burst, the server is continuously occupied by H jobs, and L jobs must wait in the background. The mean wait for low-priority jobs is:
The critical factor is $(1 - \rho_H)$ in the denominator. As $\rho_H \to 1$, this factor approaches zero and $W_L \to \infty$ — even if $\rho_L$ stays small and the total load $\rho$ is comfortably below 1.
The trade-off
Without aging, $W_L$ can be infinite when $\rho_H$ is large. With aging, $W_L \leq T_{\max} + 1/\mu_L$, but during promotion events the effective $\rho_H$ spikes temporarily, increasing $W_H$. Choosing $T_{\max}$ is a design decision: a small $T_{\max}$ protects L jobs but forces more promotions and penalizes H jobs more often; a large $T_{\max}$ is kinder to H jobs but allows L jobs to wait longer. There is no setting that simultaneously minimizes both — the trade-off is fundamental.
This article was originally written for marimo.io.
A city has two routes from source $S$ to destination $T$:
Top route $S \to A \to T$: link $SA$ is congestion-dependent; link $AT$ has a fixed travel time.
Bottom route $S \to B \to T$: link $SB$ has a fixed travel time; link $BT$ is congestion-dependent.
The network is symmetric. A city planner proposes adding a new shortcut link $A \to B$ with near-zero travel time, creating a third route $S \to A \to B \to T$. To her surprise, adding the shortcut makes everyone’s travel time longer at the selfish-routing Nash equilibrium.
Without the shortcut
Both routes are symmetric. In equilibrium, traffic splits evenly. If $N/2$ drivers use each route and the congested links have delay $\alpha \cdot n$ (where $n$ is the number of cars):
$$t_{\text{top}} = \frac{N}{2}\alpha + c = t_{\text{bottom}}$$
With the shortcut $A \to B$
Each driver thinks, “Link $AB$ is free; I can use $SA$, slip across to $B$, then take $BT$ instead of the slow constant link $AT$.” All $N$ drivers make this choice. The Nash equilibrium has everyone on $S \to A \to B \to T$:
Since $2N\alpha > \frac{N}{2}\alpha + c$ for typical parameters, travel times increase after the road is added. This is the paradox: individually rational decisions produce a collectively worse outcome. The ratio of Nash equilibrium cost to the socially optimal cost is called the price of anarchy.
Braess’s paradox is not theoretical. Seoul, Stuttgart, and New York all observed traffic improvements after closing roads. Conversely, new roads in highly congested networks have sometimes worsened average travel times.
Understanding the Math
Nash equilibrium
A Nash equilibrium is a situation where every player has chosen a strategy and no single player can improve their own outcome by switching to a different strategy so long as everyone else stays put. Think of it as a stable fixed point: if you woke up one morning in a Nash equilibrium, you would have no reason to change what you are doing. Crucially, a Nash equilibrium need not be the best possible outcome for everyone collectively.
The paradox, step by step
Label the number of cars $N$ and suppose the congested links have delay $\alpha \cdot n$ where $n$ is the number of cars currently using that link. Without the shortcut, traffic splits evenly: $N/2$ cars use each route. Each driver’s travel time is $(N/2)\alpha + c$, where $c$ is the fixed delay on the non-congested link. Neither route is faster than the other, so no driver wants to switch — that is Nash equilibrium.
Now add the shortcut $A \to B$ with near-zero travel time $\varepsilon$. A single driver considering a switch reasons: “Link $AB$ is essentially free. If I take $SA$, cross to $B$, and take $BT$, I avoid the fixed cost $c$.” If that driver is the only one to switch, it looks cheaper. But every driver makes the same calculation simultaneously. At the new equilibrium, all $N$ drivers pile onto $SA$ and $BT$:
Since $2N\alpha > (N/2)\alpha + c$ for typical parameters, everyone is worse off than before the shortcut was built.
The price of anarchy
The social optimum would split traffic evenly at cost $(N/2)\alpha + c$, but selfish routing delivers $2N\alpha + \varepsilon$. The price of anarchy exceeds 1, meaning individual rationality destroys collective welfare.
The Prisoner’s Dilemma is the best-known example of this tension. Two suspects each choose independently to cooperate or defect. Defecting is a dominant strategy: it is better for you regardless of what the other person does. Yet if both defect, both get a worse outcome than if both had cooperated. Braess’s paradox is the same logic scaled to $N$ drivers.
The logit model
The simulation uses a probabilistic choice rule: the probability a driver picks route $r$ is proportional to $\exp(-\beta \cdot t_r)$, where $t_r$ is the expected travel time on route $r$ and $\beta$ is a sensitivity parameter. When $\beta$ is large, drivers strongly prefer the fastest route and the outcome approaches the pure Nash equilibrium. When $\beta$ is small, drivers choose nearly randomly and the paradox weakens. The parameter $\beta$ captures how responsive real drivers are to time differences.
This article was originally written for marimo.io.
$N$ commuters all want to leave for work at the same preferred time. The road has a fixed capacity: up to $C$ commuters per time slot travel quickly, but when more than $C$ try to leave in the same slot, everyone in that slot experiences extra delay proportional to the overload.
Each day, commuters observe yesterday’s travel times and shift their departure by one slot toward a less congested option with some probability. Much to their disappointment, the rush hour never disappears. Instead it:
flattens slightly (spreading across more slots), but
shifts its peak position over successive days, and
reaches a new quasi-equilibrium that may be no less congested than the original, just at a different time.
The intuition is that any slot that becomes less congested immediately attracts new commuters from adjacent overloaded slots, refilling it. Individual optimization is self-defeating in aggregate.
The simulation in this tutorial shows emergent dynamics:
The arrival distribution begins concentrated at the preferred slot.
Commuters shift away from congested slots, spreading the peak.
The spreading creates new local peaks at adjacent slots, which then attract their own shifters.
Over many days the distribution oscillates or drifts without converging to zero congestion.
The Vickrey Bottleneck Model
The classic model (Vickrey 1969) treats the road as a bottleneck with flow rate $s$ vehicles per unit time. At equilibrium, every commuter faces the same generalized cost:
where $d$ is queuing delay, $t^*$ is the desired arrival time, and $\beta, \gamma$ are schedule-delay costs for early and late arrival respectively. Vickrey showed that at Nash equilibrium a departure queue forms with length that rises and then falls as commuters spread across time to equalize cost, but total system delay is unchanged.
This model underlies modern road-pricing schemes: a time-varying toll that exactly offsets the schedule-delay cost eliminates queuing entirely while preserving the total commuting burden. In essence, the toll revenue replaces the wasted queuing time.
Understanding the Math
What is a congestion game?
Each commuter (the “player”) independently chooses a departure time slot. The delay experienced in any given slot depends on how many other commuters choose the same slot: if the slot is over capacity $C$, delay grows with the number of extra commuters. No central authority coordinates choices. This structure, where each player’s cost depends on the collective choices of all players, is called a congestion game.
Nash equilibrium in this context
A Nash equilibrium is a distribution of departure times such that no individual commuter can reduce their own delay by unilaterally switching to a different slot. At equilibrium, every occupied slot has the same congestion-adjusted cost. If slot 15 were cheaper than slot 14, commuters from slot 14 would shift to slot 15 until the costs equalized. The equilibrium is therefore defined by: all slots with commuters in them have equal cost, and all empty slots have cost no lower than the occupied ones.
Why Nash equilibrium is not the social optimum
The social optimum minimizes total delay summed over all commuters. The Nash equilibrium minimizes each person’s individual delay given everyone else’s choices. These are generally different objectives. At Nash equilibrium, a commuter choosing a crowded slot ignores the extra delay they impose on every other commuter already in that slot. They feel only their own delay; the cost they impose on others is a negative externality that they do not internalize.
Why the peak shifts but does not vanish
Suppose slot 15 is heavily congested. Some commuters shift to slot 14, relieving slot 15. But now slot 14 is more congested, so its commuters shift to slot 13. The congestion wave ripples outward in both directions. Meanwhile, commuters who shifted away from slot 15 now observe it as less congested and some drift back. The system never reaches zero congestion: it perpetually redistributes congestion across nearby slots in a slow drift. The Nash equilibrium exists in theory, but the day-by-day best-response dynamics cycle around it rather than converging to it, particularly when commuters respond noisily to yesterday’s conditions.
This article was originally written for marimo.io.
A single server processes jobs that arrive randomly according to a Poisson process. Most jobs are quick (exponential service with small mean), but a rare few are very slow (exponential service with large mean). This hyperexponential service distribution has high variance. This post compares the performance of two scheduling disciplines in this situation:
FIFO (First In, First Out): jobs are served in the order they arrive.
SJF (Shortest Job First): the server always picks the shortest queued job next.
The surprising result is that SJF dramatically outperforms FIFO: not just for the small jobs that directly benefit from skipping ahead, but also for mean sojourn time across all jobs. The improvement is most visible at the tail (95th and 99th percentiles) because FIFO creates a convoy effect: one long job blocks many short jobs behind it, inflating everyone’s wait.
The Convoy Metaphor
Picture a one-lane road with one slow truck and many fast cars. Every car behind the truck must drive at truck speed; no overtaking allowed. The truck is the long job; the cars are the short jobs stuck behind it in FIFO order. SJF is like a passing lane: fast cars jump ahead of the truck and reach their destination much sooner. The truck itself arrives at the same time either way, but the total delay experienced by all vehicles plummets.
Why FIFO Hurts with High Variance
In FIFO, the server’s current job is chosen at arrival time, not at decision time. When a slow job begins service, every subsequent arrival must join the queue and wait. The expected excess work in service (the remaining time of the current job, seen by an arriving customer) under FIFO is:
where $\overline{s^2}$ is the second moment of service time. High variance inflates $\overline{s^2}$ without changing $\rho$, directly worsening wait time.
SJF Minimises Mean Sojourn Time
For a single server with non-preemptive SJF and any service-time distribution, the mean sojourn time is given by the formula below (which is discussed in “Understanding the Math” at the end of this lesson):
SJF achieves this minimum because short jobs that would otherwise be blocked by a long job are promoted ahead, reducing the total waiting work in the system.
Practical Relevance
Operating system CPU schedulers use time-quanta and priority aging to approximate SJF without knowing job sizes in advance. Database query planners estimate query cost and reorder execution to minimize blocking. The phenomenon reappears as head-of-line blocking in HTTP/1.1 (one slow response stalls a connection), motivating HTTP/2 multiplexing and HTTP/3’s QUIC stream independence.
Understanding the Math
The second moment
For a random variable $S$ representing service time, the second moment is $E[S^2]$. Recall from your statistics course that variance is $\text{Var}(S) = E[S^2] - (E[S])^2$, which rearranges to:
$$E[S^2] = \text{Var}(S) + (E[S])^2$$
This means high variance inflates $E[S^2]$ even if the mean $E[S]$ stays fixed. Doubling the spread of service times can quadruple $E[S^2]$, even with the same average service time.
Why variance of service time hurts
Imagine a FIFO server handling jobs that are either 0.1 minutes or 10 minutes long, with 90% being short and 10% being long. The mean service time is $0.9 \times 0.1 + 0.1 \times 10 = 1.09$ minutes, so utilization $\rho = \lambda / \mu$ might be modest. But when a 10-minute job starts, every job arriving during those 10 minutes must join the queue and wait. The longer $E[S^2]$, the more average work sits ahead of each arriving job.
The Pollaczek–Khinchine formula
The mean time a job spends waiting (not counting its own service time) in a FIFO single-server queue is:
Here $\lambda$ is the arrival rate, $E[S^2]$ is the second moment of service time, and $\rho = \lambda \cdot E[S]$ is the server utilization. Both $\lambda$ and $E[S^2]$ appear in the numerator, so more variance means more waiting even at the same $\rho$. The $(1-\rho)$ denominator is the familiar blow-up term from M/M/1.
This article was originally written for marimo.io.