Kronosapiens Labs

Understanding Generative Art

Sat, 18 Nov 2023 00:00:00 +0000

I. Overview

This is an essay defending blockchain-based generative art as a contemporary form. It will address three major critiques of generative art:

The artistic legitimacy of generative art,
The value and meaning of digital ownership, and
The advantages of a blockchain-based art market

We will demonstrate how these critiques are not only unfounded, but represent a narrow understanding of the nature and potential of the medium.

For the purposes of this essay, “blockchain-based generative art” will refer to artworks which are produced by successive runs of a computer program, with outputs not known in advance, and which exist in a series whose number is fixed in advance. All artworks are publicly available for viewing, and the specification and ownership of the artworks are stored in a public digital ledger.
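The definition above can be sketched in a few lines of code. This is a minimal illustration under stated assumptions, not a description of how any particular platform works: the names `SERIES_SIZE`, `render`, and `mint`, the text-based "image", and the use of a SHA-256 hash as the seed are all hypothetical choices made for the example.

```python
import hashlib
import random

SERIES_SIZE = 5  # hypothetical: the number of outputs, fixed in advance


def render(seed: str) -> list[str]:
    """Hypothetical artist's program: turn a seed into a tiny text 'artwork'.

    The same seed always yields the same output, so anyone can re-run the
    program and verify a piece, but the artist does not choose the seed.
    """
    rng = random.Random(seed)  # deterministic generator keyed on the seed
    palette = ["#", "*", ".", "o"]
    return ["".join(rng.choice(palette) for _ in range(8)) for _ in range(4)]


def mint(token_id: int, entropy: bytes) -> dict:
    """Create one member of the series from entropy unknown to the artist."""
    if not 0 <= token_id < SERIES_SIZE:
        raise ValueError("series is complete: its size was fixed in advance")
    # Assumption: the seed is derived by hashing outside entropy with the id.
    seed = hashlib.sha256(entropy + token_id.to_bytes(4, "big")).hexdigest()
    return {"token_id": token_id, "seed": seed, "image": render(seed)}


if __name__ == "__main__":
    piece = mint(0, b"example-entropy")
    print("\n".join(piece["image"]))
```

The point of the sketch is the constraint: every call to `mint` is final and public, so the program itself, rather than any hand-picked output, is what the artist is judged on.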

We are specifically not discussing images generated by AI models in response to natural-language prompts, such as those of DALL-E or Midjourney. Such works are also described as “generative art”, but represent a very different form and require a different conversation.

II. Artistic Legitimacy

No one can be blamed for some skepticism concerning the artistic merit of generative art. Compared to our popular images of the fine artist painting for hours in her studio, or the metalworker welding metal in high heat, the idea of an “artist” as someone sitting in front of the computer’s blue light, producing pictures at the touch of a keyboard, seems deeply unsatisfying.

A useful resource for understanding the potential of the medium is generative artist Tyler Hobbs’s essay The Rise of Long-Form Generative Art. His argument can be summarized as follows:

In the past, generative artists could produce unlimited numbers of outputs, and cherry-pick the best ones for display
Now, since outputs are public and their number fixed, the programs must meet a higher standard of consistency and quality
This has placed significant pressure on artists to produce programs which are both consistently varied and high quality
Viewers are evaluating both individual outputs, and also how the whole collection expresses an underlying concept

What this suggests is that the constraints that the blockchain places on the medium significantly raise the technical bar for generative art, to the point of approaching the level of challenge we associate with any other legitimate art form. When evaluating a generative art piece, we are evaluating three things: the artist’s technical skill in producing a program which produces varied outputs of consistent quality; the aesthetics of the individual images and their relationship within the collection as a whole; and the extent to which the total work expresses an underlying concept of interest.

The fact that only a fixed number of random outputs can exist is the key to the genre. To paraphrase Martin Scorsese’s famous 2019 takedown of the last decade’s Marvel phenomenon, “art must put something at risk.” In the case of generative art, the risk is that despite the artist’s best efforts, the result is largely given by chance. An artist spends days or months developing a program, only to be judged based on random outputs after-the-fact. The giving up of control on the part of the artist, the trusting in the process and in themselves and their abilities, is the beating heart of the genre.

Another easy critique is that the production of generative art doesn’t “look like” making art. This is a red herring, and reveals some important assumptions about art production which should be interrogated.

Recall that early Impressionists were excluded from Parisian galleries on the grounds that standing outside and painting en plein air wasn’t “real” artistic production, when contrasted with their classically-trained contemporaries laboring over individual brushstrokes. What they did was too quick, too easy. And yet, faster painting meant works could be sold at lower prices, bringing fine art to the middle class. And while initially mocked, we have come to see the Impressionists as advancing a new and influential creative form, which innovated not only visually, but economically. There is no reason to believe that we will not come to feel the same about generative art, in ten or fifty years’ time.

Another influential advocate of generative art is Erick Calderon, a pioneering generative artist and creator of both the ArtBlocks platform and the Bright Moments network of generative art galleries. Calderon has been described as “your favorite artist’s favorite artist”, and his years as the proprietor of a tile company have given him a unique insight into the dynamics and economics of artistic production.

In his keynote talk at 2023’s “Outer Edge LA” NFT conference, Erick summarized the progression of art ownership as follows:

Initially, fine art was “1 of 1”, in which collectors owned unique artworks
Later, art expanded to include “1 of X”, in which collectors owned identical members of a series
With generative art, we have arrived at “1 of 1 of X”, in which collectors own unique members of a series

In Erick’s telling, the drop in cost of the production of individual artworks has not cheapened the value of the work, but rather opened up a frontier for new kinds of work. He gives an example of personalized conference badges, but this only scratches the surface of possibilities, as Calderon rightly identifies our deep emotional needs for individuation and distinction alongside communion and belonging as key drivers of demand for these new forms.

All that said, this does not mean that all generative artwork being created now will be relevant in ten years’ time, much like the majority of boat paintings made in France during the 19th century are not relevant today. But some certainly will be.

Further, it is not yet clear what emotional modes and moods this art form will best express. The Impressionists found that themes of nature and everyday life were well suited to their new artistic production techniques. To what modes and moods will generative art ultimately be best-suited? The answer remains to be seen.

III. Digital Ownership

A common criticism of digital art can be summarized as “it is meaningless to own artwork which anyone can view”. This critique seems sensible, even self-evident, at first blush, but a closer analysis reveals that, if anything, it gets things backwards.

To understand why, let’s consider some of the motivations for collecting art.

The first and most obvious motivation would be the pure pleasure of having and observing an artwork. Someone collects something that they love, and displays it in a place of their choosing, enhancing their quality of life. This is an important motivation, but it is naive to think that it is the only one.

The second and arguably more important motivation is the psychological need for individuation, for feeling unique and relevant (and ultimately, safe) in the world. As French philosopher René Girard has repeatedly written, our drive for uniqueness is core to social functioning, and as Thorstein Veblen famously observed, people will pay large sums to defend their uniqueness.

Art is culturally relevant, and people collect art because through owning something relevant, they become relevant. But since not all art is relevant, collecting the specific artworks that ultimately become relevant indicates that the collector is perceptive and sophisticated. This is arguably self-evident, judging by how much of the art world is driven by social proof and tastemakers.

With that in mind, we can see why in the domain of cultural relevancy, digital and physical art have very much in common.

Owning physical art gives me the right to restrict access. I can hide it away in my house and prevent anyone from seeing it. Once upon a time, this might have been important, as great works of art were not easy to come by: supply was limited and attention was abundant. But in the 21st century, the logic is reversed. The supply is abundant, yet attention is limited. It means very little to own artwork that no-one cares about.

Ultimately, art collectors are stewards of their art. To buy a piece of art is like adopting a child: the collector takes on the responsibility of protecting the art, of defending it, of sharing it and of making sure that it is appreciated by the world. Recall: by owning something relevant, you become relevant.

This logic applies not only to speculative collectors looking to preserve wealth (a top predictor of an artist’s value at auction is the number of working artists citing them as an influence), but also to social collectors looking to establish reputations in their communities.

Returning to the question of digital ownership, then, we see how little difference it makes whether the art is digital or physical, and how little difference it makes that “anyone can download a .jpg” of an artwork. To own art, whether physical or digital, is to tell the world that you have good taste and good judgement. To own art, whether physical or digital, is to take responsibility for that artwork’s status in the world.

If anything, and this may be a stretch, digital art has the potential of being more valuable than physical art, as its digitally-native nature may make it more amenable to dissemination, and may allow the universe of digital art to be more continuously in conversation. Digital art has certainly shown itself to be highly suitable to interactive display. But the full extent remains to be seen.

IV. Art Market

For a world as lavish and flamboyant as the art world, its markets are famously opaque. Both buyers and sellers express frustration when transacting art, as they are frequently denied information about historical sales and demand. Prices are presented as “take-it-or-leave-it” and pressure tactics run rampant. As colorfully detailed in the Atlantic’s recent profile of Larry Gagosian, art dealers frequently engage in practices which, were they to occur in regulated financial markets, would amount to criminal insider trading.

Because art is a luxury “Veblen” good (one where demand increases, not decreases, with price), one could argue that this opacity is essential, even valuable, in the art market. The more you pay for a piece of art, the more valuable it becomes, and the prospect of huge commissions encourages art merchants to throw extravagant parties, creating an art world teeming with life.

The digital art market, on the other hand, is perfectly transparent. With ownership “on the blockchain”, so to speak, everyone knows exactly when an artwork is sold, and for how much, and to whom (to some extent). This allows potential buyers and sellers to know more exactly what they are getting, and limits the ability of middle-folk to squeeze out value while contributing little.

Conventional art dealers might argue that this level of transparency will lead to depressed art markets, or that price manipulation is needed to protect artists from hype cycles over the long term. But with digital squiggles going for tens of thousands of dollars, that seems not to be the reality.

Will the transparency of the digital art market mean an end to the parties? Or will it mean new kinds of people throwing new kinds of parties?

An optimistic view would be that this more transparent art market would lead to new, more productive, value flows. Art dealers would retain their role as taste-makers, with a blunted edge vis-a-vis their most egregious practices. New models would emerge for facilitating the discovery of art, allowing more varieties of people to sustain themselves in the art world. And of course, more could accrue to the artists themselves, allowing them to produce bolder work. How and what, remains to be seen.

V. Conclusions

As with the Impressionists laboring in the French countryside, on-chain generative art represents a new form for visual art. Like Impressionism, this form comes with a new bundle of affordances: a lower cost-of-production, new creative possibilities and constraints, and a certain sensibility and preference of theme, all underpinned by new art-market dynamics.

It is an exciting time, and it will be interesting to see how the story unfolds in the years ahead.

Thanks to dianne weinthal and Eric Rosenfeld for feedback on earlier versions of this essay.

Los Angeles

Sat, 04 Jul 2020 00:00:00 +0000

I. Overture

Los Angeles is a city dreamt into existence. Unlike the great cities of the old world (or even the American east and midwest), which emerged organically around rivers and harbors, supporting trade which supported life, Los Angeles was simply willed into being: in a desert, the product of salesmanship, speculation, and solipsism.

Truly the “postmodern city”, nestled at the edge of the world: it means nothing, and can thus mean anything.

II. Context

In his ambitious and wide-ranging City of Quartz, historian Mike Davis tells the story of early Los Angeles as one of boosterism, with land speculators imagining and selling an image of Southern California to midwestern retirees – leading to a massive internal migration west and making Los Angeles an unexpected bastion of anglo-protestant power. This influx of wealth, earned elsewhere, shaped Los Angeles into a city of consumption, not of creation. It was a place people came to relax and enjoy, not to struggle and create. A consequence of this is of course, the sloth of being too well-fed: a resistance to change, a streak of conservatism, and an apathy towards the (myriad) other which shades and colors Southern Californian politics to this day.

During the first half of the 20th century, when the prime land of the coast was virgin and undeveloped, and “high modernism” the civic ethos of the day, there seemed like nothing more worthwhile than to envision and build entire communities whole-cloth (like the pioneering, and infamous, Lakewood). Water was stolen, and the then-abundance of land obscured the contradiction inherent in its commodification. The American dream was shrink-wrapped and sold, creating vast communities lacking an orienting mythology, propped up by the (then) large and growing defense industry. Unlike other great cities, Los Angeles never had to produce itself.

Culturally, the gravity of Hollywood distorts the arts, its inexorable logic turning creative output towards profit and mass appeal – the experience of which gave rise to noir, the genre of disillusionment. As Davis observes, this unique distortive force is compounded by the city’s relative youth; unlike Paris or New York, Los Angeles lacks the “accumulated patrimony” of successive generations of homegrown cultural movements. Lacking these roots, the cultural topsoil easily erodes, leading to a city of passing fads and weird cults.

There is no one in charge. The monocentric anglo Downtown power structure of the early 20th century gave way to polycentrism, with the newly risen Jewish Westside vying for influence; the California and Jonathan Clubs contending with Hillcrest. Both live in the shadow of international capital, which can enter freely and distort the economic orbits nigh at will.

III. Trendlines

Where to go? What is the future for the postmodern city which means nothing and therefore can mean anything?

The tragedy of Los Angeles seems to be that, despite its abundance of natural beauty, it is a city which consists mostly of non-space, a graph of vital nodes and hostile edges. For somebody plugged in, the city has treasures to offer. But for a physical body in the physical space, there is nothing but heat and asphalt.

Fortunately – even miraculously – we live in an exceptional time, where the ground (to use a perhaps too-apt metaphor for a California city) is shifting beneath our feet. Decades-long trends are reversing, and an imaginative space is opening up. From the perspective of a native Angeleno recently returned after twelve years away, it seems as though we are witnessing the following:

The failure of anti-development Measure S seems to have marked a watershed moment, in which the multi-decade slow-growth wave has begun to break.
The completion of the Expo train line, and the continuation of Wilshire’s Purple line, point to the increasing effectiveness of the transit lobby.
The ongoing coronavirus pandemic has shattered conventional practices and expectations around work and home, creating space for entirely new (and low-congestion) patterns of travel.
The recent protests contra police violence, leading to proposals to reduce LAPD funding (#defundthepolice) and Mayor Garcetti’s comments that DA Jackie Lacey “might” have to go, suggest a break in the city’s longstanding, unquestioning support for law enforcement.

In these events we see an opening – towards a higher-density, transit-friendly, peaceful city – and what could be the beginning of a transformation from a militarized urban sprawl into a lively urban field. Los Angeles (and America) of the 2020s is not the violent place of the 70’s or 90’s. We live in a more peaceful time, and it is time to begin the work of slowly and carefully taking down the walls of our guarded enclaves and militarized public spaces.

America’s lack of empathy has always been our fatal flaw, but it is never too late to change – and in that change, to revitalize and renew. In the words of the Lebanese poet Kahlil Gibran, “to withhold is to perish”. Ultimately, to borrow a phrase from Lévi-Strauss, the mythic energy of America as a strong and muscular power has been spent. It will not return. We must begin to reimagine our national myth as one of empathy in a time of abundance; only in such a renewal is there hope for a new vitality. There is not so much to fear.

IV. Housing

Housing is where we begin. The remarkable dysfunction of California’s housing market is well-known, but less obvious is exactly why it should be so. Good treatments of the subject matter can be found in City of Quartz, as well as the more recent Golden Gates; here is a summary of the main trends.

California’s housing crisis can be understood as the intersection of a number of forces.

The first and simplest is that we ran out of land. Well, not land per se, but rather prime, easily-developed coastal farm and ranch land. With a remarkable lack of foresight, early southern California was developed with an aggressive suburban attitude, as large tracts of farmland were divided up and freestanding white-picket-fenced homes were mass-produced and priced-to-own, often purchased by the factory workers drawn by the region’s defense and aerospace manufacturing. When the land ran out – and it did, abruptly, mid-century – it was like a splash of cold water after a night of heavy drinking. Supply froze, and so prices went up… and up.

This seemingly abrupt but basically inevitable housing shortage had the awkward consequence of creating an essentially random distribution of political power. Whoever happened to have bought a home in the “before times” was now a part of an abruptly wealthy and powerful interest group; wealth and power which was predicated on the maintenance of status quo – as evidenced by the passing of the highly regressive Prop 13 in 1978, which reduced property taxes significantly. These peculiar political dynamics are explored by Conor Dougherty in Golden Gates, and he makes the point that much of the state’s housing dysfunction is due to the asymmetry in political power between owners and renters: for any given proposed development, the current residents are well-defined, while the potential future residents are not. Building housing for 100 people in a neighborhood of 50 means fighting against that 50 without the 100 to back you up, since they don’t exist yet.

For any given development, this asymmetry holds, which is why local control biases towards stagnation and “NIMBY”-ism. At the municipal and state levels, however, the political calculus changes, as the renter bloc becomes increasingly politically well-defined. As we saw with the decisive failure of Measure S in 2017, the pro-growth bloc, at least at the city level, has overtaken its slow-growth counterpart. As such, it seems that forward-thinking decisions about California housing will inevitably need to be made at the city, county, and state level – neighborhoods thus far have been unwilling to do the job.

Central control, of course, is not without risk. As strikingly described by James Scott in his excellent Seeing Like a State, too much central control and we risk ending up with high-modernist wastelands like Brasilia and Chandigarh, where shallow notions of visual order and harmony preclude the development of the cozy corners and impromptu interactions which (famously described by Jane Jacobs) form the public good of a vibrant urban life. Balancing central and local control is a problem as old as philosophy, and the only way out is through. Ultimately, we must transcend the consumptive retiree mentality and the intense class and racial divides and individually acknowledge the city as a commons, from the obligation to which there is no individual exit. To fail in this recognition is to sign up for a long and losing battle, propping up failing levees in a storm.

How, concretely, can we move forward? Culturally and politically, we were unprepared for the abruptness of the crisis, leading to a series of questionable policy choices (along with Prop 13, we have in the 1950’s the emergence of similarly regressive “contract cities”), which functioned ultimately to shift tax burdens from the rich to the poor, the legacy of which we live with today in the form of underfunded city services and high consumer taxes. Where from here?

One idea, too radical for this moment but something to keep in the back of our minds, is the elimination of single-family zoning in its entirety. New York, considered by many one of the most exciting and dynamic cities in the world, famously included no single-family zoning at all in its 1916 zoning plan. California, with its imaginative legacy of the “open west” will need to make some mental shifts and appreciate that it is time to do the same. The market will put the suburbs where they belong.

A second idea, also ambitious but slightly more practical, is the joint repeal of Prop 13 and the elimination of rent control. It is crucial that they occur together, as these two programs represent the yin and yang of housing misallocation: Prop 13 suppresses the cost of ownership by suppressing property taxes, while rent control suppresses the cost of renting by suppressing rent. In both cases, skewed incentives discourage movement and invariably result in individuals living alone in three-bedroom apartments or five-bedroom houses, while a few streets down, families of five share a single room. Removing one without the other would be rightly seen as classist – pro-owner or pro-renter – but removing them together makes sense.

Realistically, we should proceed with incremental zoning changes, along the lines of SB 50. Here Scott Wiener seems the one to watch.

Ultimately, it is possible to believe in the value of home ownership without needing homes to be speculative assets, and nowhere are we guaranteed a fixed, unchanging urban landscape.

V. Transportation

Housing changes, of course, are impossible without corresponding changes in transportation; they support and enable each other. How we get around determines where we live, and where we live determines how we get around. Density and transit are like two wings of a bird – without them both, it cannot fly.

Ultimately, the Los Angeles of low-slung sprawl is over. It is not wanted or needed. The long parched boulevards of single-story (often auto-focused) retail must be reimagined and rebuilt as two to three story mixed-use districts. There is no need to erect skyscrapers and block out the sun, but new usage patterns which allow for street life on all (or at least, most) of the streets are essential. Rather than separating residence from commerce via tremendous distance, instead mix them more often together, allowing a higher density of residence to support a wider variety of retail and commerce, and breaking the suffocating stranglehold of the car on the city. There is nothing wrong with taking a car to see a friend or to go to a show (discretionary, irregular activities well-served by ride-sharing), but having to take a car to buy milk or a sandwich seems more and more like a tragedy – in which we build more housing for cars than we do people, and high costs of housing subsidize our so-called “free parking”.

Ultimately, rather than inhuman distances necessitating personal cars for quotidian activities, we would like a gradation of distances for different activities, with the proximity of goods and services being directly linked to the frequency of their need. Daily essentials should be accessible by foot; infrequent needs met by rideshare or bicycle. Owning a car in Los Angeles must become an option, rather than a necessity, for a significant part of the population.

Curiously, the coronavirus presents us with a rare opportunity in this regard. With the surge in people working-from-home, the country’s entire private sector is learning just how little a daily commute is needed to run a successful organization. Certainly, video conferencing can never fully replace in-person, face-to-face interaction, but for many people, a lot of the time, in-person interaction is unneeded. Making a guess, we should expect a permanent post-virus reduction in the amount of time spent in an office of anywhere from 10-20% – leading to a corresponding drop in commute days. If 1/5 of workers spent any given workday in their residential neighborhood, rather than commuting to a commercial center, we would simultaneously see a drop in road congestion and demand for parking in these commercial centers, while seeing an increase in demand for business (food, retail, etc) in the residential neighborhoods. More demand for business in the residential neighborhoods means more business density, which will make these neighborhoods more pedestrian-friendly. Overall, a greater uniformity of population distribution over the course of the day (vs. large daily migrations from residential to commercial center and back) allows for a wider variety of businesses to thrive over more of the city’s area, reducing the need to cover large distances on a daily basis.
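To make the arithmetic of that guess concrete, here is a small sketch. The workforce figure of one million is purely hypothetical and chosen for round numbers; only the 10-20% range comes from the estimate above, and the function name `daily_shift` is an invention for this example.

```python
def daily_shift(workers: int, remote_share: float) -> dict:
    """Estimate where workers spend a given workday.

    remote_share is the fraction of workers staying in their residential
    neighborhood that day (0.10 to 0.20 in the guess above).
    """
    if not 0.0 <= remote_share <= 1.0:
        raise ValueError("remote_share must be between 0 and 1")
    at_home = round(workers * remote_share)
    return {"commuting": workers - at_home, "in_neighborhood": at_home}


# Hypothetical workforce of one million, at both ends of the guessed range.
for share in (0.10, 0.20):
    print(share, daily_shift(1_000_000, share))
```

At the upper end of the range, two hundred thousand of those hypothetical workers would be buying lunch near home rather than parking downtown, which is the shift in demand the paragraph describes.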

With fewer cars on the road at any given time, alternative modes of transit can be set up for success. Taking cues from other cities, we can permanently designate certain boulevards bicycle-and-pedestrian only, doing for cyclists and pedestrians what Robert Moses did for Long Island’s beachgoers – giving them pleasant, relaxing ways to get around (and with great opportunities for sidewalk dining). In fact, we have already begun. Fewer cars on the road means more opportunity for dedicated bus lanes, which, if run properly, represent an invaluable supplement to the city’s earnest-but-outmatched rail system, allowing us to provide effective transit beyond the city center.

That said, access to a car is a great convenience, and ride-sharing and one-off rentals can be expensive and are impractical for, for example, camping trips. An easy proptech-adjacent entrepreneurial solution would be to offer “shared-cars-as-a-service”, where groups of people can subscribe to a personal car. The subscription would include normal maintenance and a group driver’s insurance policy, and gas usage would be tracked and the costs distributed automatically. That or something like it would fill an important niche very nicely, allowing two or three cars to meet the needs of perhaps six or ten people living in proximity.
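The automatic cost distribution could work in many ways; the sketch below shows one plausible rule, and every detail of it is an assumption rather than a proposal from any existing service. Fixed costs (subscription, maintenance, insurance) are split evenly, while fuel is split in proportion to miles driven, used here as a stand-in for tracked gas usage. The function name `split_costs` and the sample figures are hypothetical.

```python
def split_costs(fixed_monthly: float, fuel_cost: float,
                miles: dict[str, float]) -> dict[str, float]:
    """Split a shared car's monthly costs among its subscribers.

    Assumed rule: fixed costs are divided evenly; fuel is divided in
    proportion to each member's tracked miles.
    """
    if not miles:
        raise ValueError("at least one subscriber is required")
    fixed_share = fixed_monthly / len(miles)
    total_miles = sum(miles.values())
    bill = {}
    for name, driven in miles.items():
        # If nobody drove this month, there is no fuel cost to attribute.
        fuel_share = fuel_cost * driven / total_miles if total_miles else 0.0
        bill[name] = round(fixed_share + fuel_share, 2)
    return bill


# Hypothetical month: $600 fixed costs, $120 of fuel, three subscribers.
print(split_costs(600, 120, {"ana": 100, "ben": 300, "cho": 0}))
```

A member who did not drive still pays an even share of the fixed costs under this rule; a real service might weight those by usage as well.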

As an aside, Los Angeles could be a cyclist’s paradise. It is relatively flat and the heat is dry, not humid, making cycling suitable for daily use by professionals (no working up a sweat). If the city’s amenities and points of interest were more evenly distributed (as they will likely become), the addition of one or two cyclist-friendly east/west boulevards could greatly expand the potential of cycling in the city. Imagine if Olympic Blvd was reduced to two lanes of car traffic, freeing the rest for a large, protected bike path – crossing the city would be a breeze. And if that same boulevard were lined with residences, eateries, and light commercial spaces?

One could argue that a long-term coronavirus-related reduction in driving would make transit less relevant, rather than more, as it would be consequently easier to get around by car. While there is likely some truth to this, congestion remains half the equation, with parking being the other. As long as every Angeleno needs a car, parking subsidies will continue to be built into the costs of housing, exacerbating the city’s housing affordability crisis. We must seize the opportunity to build transit back into the city.

As memorably put by technology investor Ben Horowitz, “there are no silver bullets, only lead ones”. No single intervention will solve Los Angeles’ substantial transportation problems, but a variety of interventions can interact to provide an effective multi-modal transportation grid. A multi-modal transportation grid allows for more heterogeneous movement patterns, meaning the load on any system is reduced – but if everyone has to drive, we could build freeways for one hundred years and never get the capacity we’d need.

VI. Outro

As mentioned in the introduction, Los Angeles is a postmodern city – a product of idea more than of material. While at first blush this might seem like a weakness or fatal flaw (“a meaningless city”), it may yet be the city’s great strength, as a postmodern city can more easily be imagined and re-imagined again.

There is an opportunity to re-imagine Los Angeles, not as a harsh urban sprawl, but as a vital urban field. A city where a density of variously-priced residences, spread throughout the city, supports a vibrant commercial life throughout. A city where lush, pedestrian-and-bicycle boulevards complement light rail and well-run protected bus routes to make getting around the city easy and pleasant (complemented by ride-share and – gasp – even some car ownership).

This is not wild speculation – much of this, at least directionally, is happening now.

Further, there is a chance to make the postmodern city – one of the most diverse on earth – into a racially integrated city, and to continue to take steps to heal the racial divides which sit at the very heart of American identity. Here again the postmodern city shines – for where else can old and tired myths be reimagined and renewed? Can we find the courage to take funding away from security and from fear, and to put it towards culture and creation? And while we’re at it, make peace with the towns upstream whose water we stole?

None of these aspirational outcomes, of course, are guaranteed. But they are possible. The trendlines are all there. With vision, leadership, courage, and fortitude, they may even be achievable. And that is something.


Sharing the Wealth

Sun, 19 Apr 2020 00:00:00 +0000

This essay was originally prepared for a London-based zine focusing on critical analyses of gentrification. The editors ultimately felt that this essay was too market- and policy-oriented for their arts-oriented audience, so I am publishing it here.

Smoother residential gentrification through shared ownership

From the moment early man first pointed to the ground and said “mine”, we have fought for control over land. For most of our history, this conflict involved bloody violence; in recent decades, however, economic conflict has replaced physical, and “pirate raid” is replaced by “pricing out”. It is right to interpret this as progress, for it is – and to recognize that competition over land has never been absent from the human experience.

The question is not one of “stopping gentrification” – it is intellectual laziness to conclude that our perennial competition to occupy land can be permanently stopped through some perfect policy – but rather one of best managing its energies. Here we can have hope: while a river cannot be stopped, a well-built dam can be a source of tremendous power.

Why do we view residential gentrification – the claiming of land from the less affluent by the more affluent – as “bad”, a social ill to be prevented? Surely it is not the basic experience of change, as change is as constant as breathing. No, we view residential gentrification as bad because, the logic of capitalism aside, home is an emotional sphere which exists outside and apart from the market, even while the physical space remains embedded within it. As Karl Polanyi says in his seminal The Great Transformation, there exist three “fictitious commodities”: land, labor, and money – which we embed in markets, even though truly markets are embedded in them. We should place housing in this category also.

Unfortunately, recognizing home as a fictitious commodity does not solve the underlying problem, because we lack a better alternative for resolving conflict over the underlying land. Nationalized land ownership and rent control – two possible strategies – come with their own pernicious costs: a loss of freedom in the former, and chronic misallocation in the latter. This recognition is valuable, however, in that it suggests that there may be more to the social contract. If we accept that people may always have to leave, the question becomes one of “what does leaving look like”? And here we can find answers and inspiration.

What happens when a neighborhood gentrifies? The prices go up. Rents increase, goods and services become more expensive, and the value of land and property rises. To whom does this value accrue? Overwhelmingly, the landowners. This is the problem: the residents, who participated in the creation of the neighborhood (by living their lives, raising their families, and supporting local businesses), receive none of the upside. To transform the experience of residential gentrification, we must find a way to transfer some of this new value from the landowners to the residents. Being priced out of your home is an unfortunate but sometimes unavoidable reality. What is not unavoidable is walking away with nothing. The emotional experience of gentrification would be very different were a family to walk away with tens of thousands of dollars – or more – to start a new life.

Simply: tenants should receive partial ownership of their buildings, as a function of the duration of their tenancy. This partial ownership recognizes the role the tenants play – as the fabric of the community – in creating the value that is currently captured entirely by landowners. One scheme would be to set aside 20% of the building’s increase in value for tenants (much like companies often set aside a percentage of their stock for employees), and to distribute this share to tenants as a function of tenure – say 0.2% per year. After ten years, a tenant can claim 2% of the increased value of the building. If the value of that building increases by $1,000,000, that tenant receives $20,000.
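The arithmetic above can be sketched in a few lines of Python. This is only an illustration of the numbers in the paragraph: the function name and the use of basis points are assumptions made here, and the sketch does not model how the 20% pool would be shared once many tenants hold claims against it.

```python
def tenant_payout(value_increase, years, basis_points_per_year=20):
    """Dollar claim of one tenant on the building's increase in value.

    20 basis points per year is the 0.2% figure from the text; the
    20% pool shared by all tenants is deliberately not modeled here.
    """
    share_bp = basis_points_per_year * years  # e.g. 200 bp = 2% after ten years
    return value_increase * share_bp / 10_000


payout = tenant_payout(1_000_000, years=10)
print(payout)  # 20000.0
```

Doubling the tenure or the appreciation doubles the payout, which is the sense in which the claim grows "as a function of tenure".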

There are many ways such schemes could be implemented. One approach involves landlords adopting a policy where, every ten or so years, they “buy out” their tenants at the then-appraised value. Another approach involves tenants redeeming their shares against future rental income. If I spent a decade in a neighborhood paying $1000/mo, and then get priced out at $1500/mo, then I could claim a percentage of these higher rents over a period of several years – an empowering passive income stream for a class which has historically struggled to accumulate capital.

It is reasonable to wonder why landowners would ever agree to such a scheme, absent a significant social movement to force new legislation. The answer is that it aligns the incentives between landowners and tenants, by giving tenants a reason to “think like owners”, resulting in better-maintained and more beautiful living spaces. The contemporary landlord-tenant relationship is toxic, with each party encouraged to extract as much value as possible from the other. Neither party is incentivized to invest in the improvement of the property: the tenant does not because they capture none of the value, and the landlord does not because it cuts into their short-term bottom-line. Under a scheme in which the upside is shared, the tenant knows that they will be able to capture some of the upside associated with their contributions, and the landlord knows that the tenant will be motivated to take care of any improvements – saving the landlord in maintenance costs and real, but hard-to-price, emotional conflict. While it may seem outlandish to contemplate, a scheme like this would increase the quality of landlord-tenant relationships, the way in which landlords are viewed by society, and the quality of our living spaces – a win-win-win.

Absent some radical new development in law or philosophy, we should view gentrification as a fact of life, much like death and taxes. The productive train of thought is not how to “stop” gentrification, but rather how to shape the process. Here we propose a shared-ownership scheme which aligns the interests of landlords and tenants in a way which takes much of the bite out of gentrification, and allows the tenants, who may inevitably find themselves needing to look for a new home, to walk away with something substantial – wealth which can be invested in new neighborhoods and new communities. This new type of windfall will transform the emotional experience of residential gentrification (a key aspect of gentrification writ large) from one of powerlessness to one of agency in the face of change, and represents an important step forward in urban policy.

A Review of 'Gaming the Vote'

Sat, 04 Apr 2020 00:00:00 +0000

I. Introduction

Some months ago I read William Poundstone’s Gaming the Vote, a comprehensive survey of the history and attributes of various alternative voting systems – a topic of longstanding personal interest.

The villain of Poundstone’s story is the plurality vote, a fair-seeming but perniciously flawed way of choosing leaders, and Poundstone devotes fully the first third of the book to a survey of the method’s myriad historical failures. The remaining two thirds are a survey of the alternatives, and a discussion of their own (inevitable) Achilles’ heels. Poundstone considers, in turn, the classic methods of Borda and Condorcet, the more recent innovations of Instant Runoff Voting and Approval Voting, and concludes with a bullish assessment of Score Voting as the “least worst” option. Along the way, we meet (among others) the idealistic Marquis de Condorcet, the obsessive-creative Charles Dodgson (Lewis Carroll!), and of course, Kenneth Arrow, our axis mundi.

Let’s now review the various electoral systems (as we will see, each has a fatal flaw), and then home in on a number of points where I think Poundstone’s argument can be extended or corrected.

II. The Voting Systems

Plurality Voting

What: Voters submit single votes; the candidate with the most votes is the winner.
But: Susceptible to vote-splitting, leading to the election of minority candidates (the “spoiler effect”).

Plurality voting (also known as “first-past-the-post”) is an electoral system in which voters cast a single vote for their preferred candidate, out of an arbitrary pool of candidates. The votes are summed up, and the candidate with the most votes is elected.

While simple and fair on its face, the critical flaw of the plurality vote is that when there are more than two candidates, often the larger group (the “majority bloc”) will find themselves choosing between two candidates, while the smaller group (the “minority bloc”) will have only one. In this case, the majority bloc, which should be the one to choose the candidate, will end up losing to the minority bloc. In concrete terms, if the majority bloc is 60% of the vote with two equally-popular candidates (Alice and Bob), then the minority candidate (Charlie) will win with 40% of the vote, while the two majority candidates will each take 30% of the vote and lose the election. Note that no candidate won a majority of the votes, a reasonable standard for legitimacy.
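The vote-splitting example is easy to reproduce. The following is a minimal Python sketch using the 30/30/40 split from the paragraph above; the function name is chosen here for illustration.

```python
from collections import Counter


def plurality_winner(first_choices):
    """Return (winner, tally) for a list of single-candidate votes."""
    tally = Counter(first_choices)
    winner, _ = tally.most_common(1)[0]
    return winner, tally


# The 60% majority bloc splits evenly between Alice and Bob.
votes = ["Alice"] * 30 + ["Bob"] * 30 + ["Charlie"] * 40
winner, tally = plurality_winner(votes)
print(winner, dict(tally))
```

Charlie wins with 40 votes even though 60 of the 100 voters prefer either majority candidate to him, and no candidate reaches a majority.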

In real-world elections, a more common occurrence is that the majority and minority blocs are more closely matched (say 52% vs 48%), and a fringe candidate comes in to pull ~5% of the vote from the majority candidate, handing the victory to the minority bloc – the “spoiler effect” phenomenon which famously sent George W. Bush to the White House in the 2000 United States presidential election. In addition, Poundstone discusses the recent practice (as of the last few decades) in which minority blocs explicitly encourage this phenomenon by discreetly funding fringe candidates to challenge their majority opponent.

From a theoretical perspective, the key limitation of the plurality vote is that a voter is unable to provide sufficient information about their preferences. In many cases, voters who vote for a fringe candidate would still prefer the majority candidate to beat the minority candidate, but this type of “second choice” information is unavailable to the election algorithm. As we will see, the common theme of every other system Poundstone describes is their attempt to incorporate this additional information (with variously mixed results).

The Borda Count

What: Voters submit ranked lists, with candidates receiving more or fewer points depending on their position. The candidate with the most points is the winner.
But: Susceptible to tactical voting (“burying”), which can lead to the election of unintended candidates.

The Borda count is a method which attempts to avoid the flaw of plurality voting by allowing voters to convey their preferences not only for their first choice, but for all the candidates, by submitting a ranked list. Known appropriately as a “ranked choice” method, the Borda count assigns a numeric score to every candidate based on their position in the list (i.e. for the list “Alice > Bob > Charlie”, Alice gets two points, Bob one, and Charlie zero).

The Borda count thus avoids the spoiler effect by allowing multiple candidates to “share” the support of their bloc, in such a way that one of the majority candidates should prevail over the minority. To return to our earlier example, say that Alice and Bob are equally-popular candidates for the 60% majority bloc, while Charlie is the sole candidate for the 40% minority bloc. With the Borda count, each of Alice and Bob will receive a score of at least 30 * 2 + 30 * 1 = 90 from the majority bloc alone (second-place points from the minority bloc only add to this), while Charlie will receive a score of 40 * 2 = 80. If we assume some small amount of randomness such that an exact tie is avoided, one of either Alice or Bob will win.

However, the Borda count’s flaw is that by assigning scores to individual candidates as a function of the number of total candidates, it makes it possible to create “artificial distance” between candidates (i.e. to create a distance of “two” between Alice and Bob, simply by the presence of Charlie). This ability to create distance between candidates leads to the phenomenon of “burying”, a type of strategic vote where strong candidates are ranked “artificially” low.

In extreme cases this can lead to unexpected candidates being elected. Consider an example of Alice, Bob, and Charlie, where Alice is the majority bloc candidate (60%), Bob is the minority bloc candidate (40%), and Charlie is a moderate. With plurality voting, Alice would easily win. But under the Borda count, the minority bloc can vote strategically to put Charlie in office: if the majority bloc ranks Alice, Charlie, and Bob, then the minority bloc can vote tactically, falsely putting Charlie as their top candidate (and Alice last). This would lead to a Charlie victory with a score of 40 * 2 + 60 * 1 = 140 vs. Alice’s score of 60 * 2 = 120.
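Both Borda examples can be checked with a short Python sketch. Two things here are assumptions rather than part of the essay: in the vote-splitting case the minority bloc's second choices are split evenly between Alice and Bob (so each majority candidate ends up above the 90 points earned from the majority bloc alone), and in the burying case the minority's sincere ranking is taken to be Bob > Charlie > Alice.

```python
def borda_scores(ballots):
    """ballots: list of (ranking, voters), ranking listed best-first.

    With n candidates, first place earns n - 1 points and last place 0.
    """
    scores = {}
    for ranking, voters in ballots:
        top = len(ranking) - 1
        for position, candidate in enumerate(ranking):
            scores[candidate] = scores.get(candidate, 0) + voters * (top - position)
    return scores


# Vote-splitting example: Alice and Bob share the 60% bloc, Charlie has 40%.
split = borda_scores([
    (("Alice", "Bob", "Charlie"), 30),
    (("Bob", "Alice", "Charlie"), 30),
    (("Charlie", "Alice", "Bob"), 20),  # assumed even split of second choices
    (("Charlie", "Bob", "Alice"), 20),
])

# Burying example: Alice 60%, Bob 40%, Charlie the moderate.
sincere = borda_scores([
    (("Alice", "Charlie", "Bob"), 60),
    (("Bob", "Charlie", "Alice"), 40),  # assumed sincere minority ranking
])
buried = borda_scores([
    (("Alice", "Charlie", "Bob"), 60),
    (("Charlie", "Bob", "Alice"), 40),  # minority ranks Charlie first, Alice last
])

print(split)
print(sincere)
print(buried)
```

Under sincere ballots Alice leads with 120 points; once the minority bloc moves Charlie to the top, Charlie's 140 overtakes her without a single majority voter changing their mind.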

Theoretically, the problem is rooted in the ability to create absolute (numeric) distance between a pair of candidates, compared to simply relative distance, as a function of something other than the pair of candidates themselves (in this case, the number of candidates total). The number of candidates in the election should not be able to affect the relative preferences of pairs of candidates (where the majority prefers Alice to Charlie), but in the Borda count it does. Another way to frame this limitation is that the Borda count provides no way to express non-uniform “distances” between the candidates – the psychic “space” between Alice and Bob is assumed to be just as large as that between Bob and Charlie. This can be understood as a type of information-theoretic noise caused by the measurement not quite fitting the thing being measured (this will be a recurring theme).

The Condorcet Winner

What: Voters submit ranked lists, which are translated into pairwise contests. The candidate which defeats every other candidate, pairwise, is the winner.
But: No guarantee of transitivity, i.e. a winner might not exist.

The Condorcet winner is the candidate who would beat every other in a two-way contest. This method attempts to both avoid the spoiler effect and the problems of the Borda count by allowing voters to submit multiple preferences without creating artificial distances.

To return to our opening example, in an election where Alice and Bob are majority candidates and Charlie is the minority candidate, both Alice and Bob will beat Charlie (60/40), and then Alice and Bob will themselves face off in what will amount to a 50/50 split (with some small randomness breaking the tie). This method solves the problems of plurality voting by allowing for the incorporation of the information that 60% of the people would prefer Alice and Bob to beat Charlie, while avoiding the problems of the Borda count by never casting relative preference to absolute (i.e. if I prefer Alice to Bob, ranking Bob last doesn’t make Alice beat him by “more” as long as he is ranked after Alice).

The problem with this method, however, is that the Condorcet winner may not exist. In a scenario with three equal-sized blocs (and three candidates), we may find ourselves in a situation where Alice beats Bob by 2:1 (i.e. blocs A and C both prefer Alice to Bob), but Bob beats Charlie 2:1 (i.e. blocs A and B both prefer Bob to Charlie), but Charlie beats Alice by 2:1 (because blocs B and C both prefer Charlie to Alice). This can be written as:

Bloc A: Alice > Bob > Charlie
Bloc B: Bob > Charlie > Alice
Bloc C: Charlie > Alice > Bob

This creates a rock-paper-scissors situation known as a “cycle” (draw it as a graph – or scroll down – to see why), and there is no winner. Obviously it is problematic if there is “no winner” in an election, and so this is seen as a flaw in the method. Defenders of Condorcet methods say that cycles are rare in practice (it requires a particular “arrangement” of candidates and voters), and so the concern is over-blown (compared to the type of failures which can occur in the Borda count and plurality votes, which are much more common).
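To make the cycle concrete, here is a minimal Python sketch that turns ranked ballots into pairwise contests and looks for a candidate who wins all of them. The function names are chosen here for illustration, and the 31/29 split in the second example is an assumed stand-in for the "small randomness" mentioned earlier.

```python
from itertools import combinations


def pairwise_tallies(ballots, candidates):
    """Return {(a, b): number of voters ranking a above b}."""
    total = sum(voters for _, voters in ballots)
    tallies = {}
    for a, b in combinations(candidates, 2):
        a_over_b = sum(voters for ranking, voters in ballots
                       if ranking.index(a) < ranking.index(b))
        tallies[(a, b)] = a_over_b
        tallies[(b, a)] = total - a_over_b
    return tallies


def condorcet_winner(ballots, candidates):
    """Return the candidate who beats every rival head-to-head, or None."""
    tallies = pairwise_tallies(ballots, candidates)
    for c in candidates:
        if all(tallies[(c, o)] > tallies[(o, c)] for o in candidates if o != c):
            return c
    return None


names = ("Alice", "Bob", "Charlie")

# The three equal blocs from the text: a rock-paper-scissors cycle.
cycle = [
    (("Alice", "Bob", "Charlie"), 1),
    (("Bob", "Charlie", "Alice"), 1),
    (("Charlie", "Alice", "Bob"), 1),
]
print(condorcet_winner(cycle, names))  # None

# The opening example, with an assumed 31/29 split inside the majority bloc.
opening = [
    (("Alice", "Bob", "Charlie"), 31),
    (("Bob", "Alice", "Charlie"), 29),
    (("Charlie", "Alice", "Bob"), 20),
    (("Charlie", "Bob", "Alice"), 20),
]
print(condorcet_winner(opening, names))  # Alice
```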

From a theoretical perspective, the problem here is not the lack of information, but rather that the algorithm cannot “see” all the information that is there. As we will learn later, there are other techniques which can see this information and break cycles in a fair way.

Instant Runoff Voting

What: Voters submit ranked lists. An iterative algorithm re-allocates votes until a clear (majority) winner emerges.
But: The algorithm is “non-monotonic”: giving a candidate more votes can cause them to lose. Also, popular second-choice candidates may be eliminated too early.

Instant Runoff Voting (IRV) is an interesting beast. Unlike the other methods, which tally the ballots in a single pass, IRV is iterative: it runs in a while loop, rearranging votes until one candidate has a majority of first-place votes. Like the Borda count and Condorcet winner, IRV attempts to make second-choice votes meaningful by successively eliminating last-place candidates, reallocating votes to second (or third or fourth!) choices in the event that an eliminated candidate was a voter’s first choice. Returning to our opening example, consider the case where Alice and Bob split the 60% majority bloc’s vote down the middle (alternating as first and second choice on the ballots), while Charlie takes all of the 40% minority bloc’s votes. In the first round of the IRV algorithm, either Alice or Bob will be eliminated (having the fewest votes, ~30%, versus Charlie’s 40%). Let’s say that Bob is eliminated. Now, since every voter who ranked Bob first ranked Alice second, all of Bob’s votes will now be transferred to Alice, giving her 60% of the vote and a majority victory.

IRV has proven to be popular in practice, gaining traction in governments and municipalities around the world, in part due to the intuitive nature of the algorithm making the process easy to understand. Among contemporary advocates for voting reform, IRV has become one of the most popular options (rivaling Score Voting). However, IRV is not without flaws. The essential problem with IRV is that it is, in mathematical speak, “non-monotonic”. A “monotonically increasing” function is one which is always either staying the same or increasing, but never decreasing: for IRV, the “non-monotonicity” means that giving more votes to a candidate can actually cause them to lose, a troubling phenomenon which does not appear in any of the other systems considered. Also, the nature of the IRV algorithm means that the most broadly popular candidate may still lose. To see why, consider the case where the population is split 50/50 for Alice and Bob, with Charlie being a universally-popular second choice. Charlie seems like the natural best choice, but because he receives no first-place votes, the IRV algorithm eliminates him in the first round, resulting in a deadlock between Alice and Bob – exactly the situation we would hope IRV would avoid.
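The while loop and the non-monotonicity can both be seen in a small Python sketch. Several things here are assumptions: the tie-break for last place (alphabetical) is arbitrary, the 31/29 split stands in for the "small randomness" of the opening example, and the 39/35/26 ballots in the second half are a standard textbook-style illustration rather than numbers taken from the essay or from Poundstone.

```python
from collections import Counter


def irv_winner(ballots):
    """ballots: list of (ranking, voters), ranking listed best-first.

    Ties for last place are broken alphabetically, an arbitrary
    choice made for this sketch.
    """
    remaining = {c for ranking, _ in ballots for c in ranking}
    while True:
        tally = Counter({c: 0 for c in remaining})
        for ranking, voters in ballots:
            # Each ballot counts for its highest-ranked surviving candidate.
            top = next((c for c in ranking if c in remaining), None)
            if top is not None:
                tally[top] += voters
        total = sum(tally.values())
        leader, votes = max(tally.items(), key=lambda item: item[1])
        if votes * 2 > total or len(remaining) == 1:
            return leader
        loser = min(sorted(remaining), key=lambda c: tally[c])
        remaining.remove(loser)


# Opening example: Bob is eliminated first and his votes move to Alice.
opening = [
    (("Alice", "Bob", "Charlie"), 31),
    (("Bob", "Alice", "Charlie"), 29),
    (("Charlie",), 40),
]
print(irv_winner(opening))  # Alice

# Non-monotonicity: Alice wins, then loses after gaining ten first-place votes.
before = [
    (("Alice", "Bob", "Charlie"), 39),
    (("Bob", "Charlie", "Alice"), 35),
    (("Charlie", "Alice", "Bob"), 26),
]
after = [
    (("Alice", "Bob", "Charlie"), 49),  # ten Bob voters now rank Alice first
    (("Bob", "Charlie", "Alice"), 25),
    (("Charlie", "Alice", "Bob"), 26),
]
print(irv_winner(before))  # Alice
print(irv_winner(after))   # Charlie
```

In the second scenario the extra support for Alice changes who is eliminated first: Bob now drops out instead of Charlie, and Bob's remaining voters hand Charlie a 51 to 49 win.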

Approval Voting

What: Voters submit a binary approve/reject per candidate. Votes are tallied according to plurality rules.
But: The ambiguous semantics of “approval” means that the winner of the election can be hard to predict.

The Borda count, Condorcet winner, and Instant Runoff Voting all fall under the category of “ranked-choice voting” systems: they are all different algorithms which operate on the same measurement, that of a relative ranking of candidates. Approval voting (and its cousin, score voting) are fundamentally different animals, in that their primary representations are not relative, but absolute. As we will see, this approach will spare these methods from many of the flaws of their relative brethren, but introduces pernicious problems of its own.

Approval voting can be thought of as a generalization of plurality voting, but where instead of voting for one candidate, you can vote for as many as you like. This prevents the spoiler effect by allowing second-choice candidates to receive votes alongside the first-choice candidates. This also removes the benefits of voting strategically, since the sincere vote is the optimal vote. Recall Alice, Bob, and Charlie. With approval voting, both Alice and Bob will receive ~60% of the vote, compared to Charlie’s meager 40%. Some small randomness will ensure that one of Alice and Bob is elected, to the satisfaction of the majority bloc.

Unfortunately, the ambiguous semantics of “approval” (what is the standard by which someone is “approved”?) means that, contrary to expectations, mediocre candidates can prevail over strong candidates. Consider a situation where Alice is beloved by 60% of the population, Bob beloved by the other 40%, and Charlie seen as a bumbling but endearing candidate whom no one takes seriously, but no one despises. With approval voting, it is possible that more than 60% (potentially up to 100%) of the population will “approve” of Charlie, on the grounds that, from the perspective of each bloc of the population, Charlie isn’t a bad candidate. As a result, Charlie wins the election – an unintended outcome. More fundamentally, depending on how voters interpret “approval”, the same ranked-ordering of candidates can lead to different election outcomes – a phenomenon known as “indeterminacy”.

Observe that, under a Borda count, Alice would win the election, since a first-place vote from 60% of the population is worth more than a second-place vote from 100% of the population (2 * 60 > 1 * 100). With approval voting, the inability to represent the underlying relative distinctions leads to the measurement error manifesting as indeterminacy. Said another way, by treating the candidates as unrelated in the model, the base concept of “approval” decoheres and loses definition as relativity inevitably re-inserts itself.
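The indeterminacy can be illustrated with a minimal Python sketch in which the same underlying rankings (Alice > Charlie > Bob for 60 voters, Bob > Charlie > Alice for 40) are turned into approval ballots under two different readings of "approval". The "strict" and "lenient" readings are hypothetical labels introduced here.

```python
def approval_tally(ballots):
    """ballots: list of (set_of_approved_candidates, voters)."""
    tally = {}
    for approved, voters in ballots:
        for candidate in approved:
            tally[candidate] = tally.get(candidate, 0) + voters
    return tally


# Strict reading: approve only the favorite.
strict = approval_tally([
    ({"Alice"}, 60),
    ({"Bob"}, 40),
])

# Lenient reading: approve anyone who "isn't a bad candidate".
lenient = approval_tally([
    ({"Alice", "Charlie"}, 60),
    ({"Bob", "Charlie"}, 40),
])

print(max(strict, key=strict.get), strict)
print(max(lenient, key=lenient.get), lenient)
```

The rankings never change, yet the strict reading elects Alice with 60 approvals and the lenient reading elects Charlie with 100.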

Score Voting

What: Voters submit a numeric score per candidate. Votes are tallied according to plurality rules.
But: The ambiguous semantics of “scoring” means that the winner of the election can be hard to predict.

On the surface, score voting arrives on the scene as the funnier and more handsome cousin of approval voting. Rather than precluding any expression of relative preference, score voting permits the assignment of real-valued scores to each candidate, allowing for implicit relative preference. Further, the ability to use a fuller range allows voters to avoid the false “uniformity of differences” which hounds the Borda count.

Unfortunately, this merely kicks the can down the road – it turns out that numerical scores vary as a function of the candidates just as much as binary “approvals”. Consider a beloved Alice, a well-meaning Bob, and a chicanerous Charlie. A voter might give Alice a 1, Bob a .6 (indicating their positive sentiment), and Charlie a 0. Say Charlie is indicted for fraud and drops out of the race – what happens to Bob’s score? The voter wants Alice to win, and giving Bob a 0 maximizes those chances. So we see how Bob’s score, real-valued as it is, decoheres just as much as “approval” does with binary votes. Fundamentally, the voter has no “score” for Bob, only a relative sentiment vis-a-vis Alice and Charlie. In line with our theme, we conclude that the use of real-valued scores is a mirage, providing an illusion of information.
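One way to picture Bob's collapsing score is to model the strategic voter as someone who always stretches their scores across the full range of the candidates actually on the ballot. This min-max rescaling is an assumption made here for illustration, not something the essay or Poundstone specifies.

```python
def rescale(scores):
    """Stretch scores so the best remaining option gets 1 and the worst 0.

    Assumes at least two distinct scores; otherwise the range is zero.
    """
    low, high = min(scores.values()), max(scores.values())
    return {name: (s - low) / (high - low) for name, s in scores.items()}


with_charlie = {"Alice": 1.0, "Bob": 0.6, "Charlie": 0.0}

# Charlie drops out; the voter re-scores the two remaining candidates.
without_charlie = {n: s for n, s in with_charlie.items() if n != "Charlie"}
strategic = rescale(without_charlie)
print(strategic)  # {'Alice': 1.0, 'Bob': 0.0}
```

Nothing about the voter's feelings toward Bob has changed, yet his score falls from 0.6 to 0 purely because the field of candidates did.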

Apart from this, score voting is subject to the same quirks as approval voting, and so there is no need to recount them here. And of course, as with all systems described here, the existence of these types of failure conditions in principle says little about the frequency with which they will be encountered in practice. All of these systems work well most of the time, which is good enough, most of the time.

III. Generalized Relativity

The Confounding of Condorcet

Let us now take the ceremonial potshot at our bugbear, the Independence of Irrelevant Alternatives (IIA) – the most frustrating of Arrow’s criteria. Consider the example on page 225 of Gaming the Vote:

Three candidates (here, Clinton, Bush, and Perot) run in a ranked-choice election with a Condorcet winner. The first ballots arrive:

Clinton > Bush > Perot (30 million)
Bush > Perot > Clinton (30 million)
Perot > Clinton > Bush (30 million)

This leads to the following graph:

As we can see, we have a nightmarish cycle in which each candidate wins and loses by a landslide, and there is intuitively no winner. With this voting data, no system could declare a winner, as the information simply does not exist. Now, additional ballots arrive:

Bush > Clinton > Perot (20 million)
Clinton > Bush > Perot (15 million)

This leads to the following graph:

Now we have sufficient information to declare a winner – but who?

As Poundstone points out, the new votes favor Bush, and yet – despite a folk moral arithmetic implying that a tie plus a victory equals a victory – Clinton is the Condorcet winner. Yet Clinton’s Condorcet victory hides the landslide victory that Bush holds over Perot – much more decisive than Clinton’s meager margin over either. It feels to us that this one landslide means more than Clinton’s two narrow victories. We spectators, with our “artist’s eye” (to borrow a term from Paglia) can “see” the relevance of this background victory to the larger picture, but the simple “machine mind” of the Condorcet algorithm cannot.

Of course, not all is lost – the information is there, clearly – we can see it. Condorcet cannot, but it turns out that Borda can. In this setting the Borda count would “see” Bush’s victory – with a count of 145 million compared to Clinton’s 140. Of course, that the Borda count is “right” in this case does not mean that it is better – as we have discussed earlier, each algorithm can “see” only certain facets of the world – and in this case, the Borda count is the machine that sees the right things.
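The numbers in this example can be verified directly from the five groups of ballots. The sketch below (in Python, with counts in millions and helper names chosen here) computes every pairwise contest and then obtains each Borda score as the sum of that candidate's pairwise tallies, which is equivalent to the usual 2/1/0 points for three candidates.

```python
from itertools import permutations

# Ballot counts in millions: the original cycle plus the late arrivals.
ballots = [
    (("Clinton", "Bush", "Perot"), 30),
    (("Bush", "Perot", "Clinton"), 30),
    (("Perot", "Clinton", "Bush"), 30),
    (("Bush", "Clinton", "Perot"), 20),
    (("Clinton", "Bush", "Perot"), 15),
]
candidates = ("Clinton", "Bush", "Perot")


def prefer(a, b):
    """Millions of voters who rank a above b."""
    return sum(n for ranking, n in ballots if ranking.index(a) < ranking.index(b))


pairwise = {(a, b): prefer(a, b) for a, b in permutations(candidates, 2)}

# A candidate's Borda score is the total of their pairwise tallies.
borda = {c: sum(pairwise[(c, o)] for o in candidates if o != c)
         for c in candidates}

print(pairwise[("Clinton", "Bush")], pairwise[("Bush", "Clinton")])    # 75 50
print(pairwise[("Clinton", "Perot")], pairwise[("Perot", "Clinton")])  # 65 60
print(pairwise[("Bush", "Perot")], pairwise[("Perot", "Bush")])        # 95 30
print(borda)  # {'Clinton': 140, 'Bush': 145, 'Perot': 90}
```

Clinton wins both of her head-to-head contests and is therefore the Condorcet winner, while Bush's 95 to 30 landslide over Perot is what lifts his Borda score past hers.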

There are more recent techniques, such as Power Ranking (the method famously underlying Google’s PageRank), which mix the numerical aspect of the Borda count with the graphical approach of the Condorcet winner to produce compelling results, and which remain on the cutting edge of applied social choice with numerous applications under active development. As promised earlier, Power Ranking is a technique capable of breaking Condorcet cycles, “spinning the wheel” to leverage more of the available information.

There is no algorithm which can fully attain the “artist’s eye”, for the same reason that there is no way to fully express a feeling. However we can get closer, and insisting on representations which as closely as possible reflect their underlying world is the start. If all mental concepts are fundamentally relative, we should stop pretending that they are not, and include relativity explicitly (and thoughtfully) in our models – before it sneaks in unannounced. All alternatives are relevant. The two words are nearly anagrams; clearly there is a cosmic joke being played here.

Signal and Noise

Continuing our theme, let’s turn to another example in Gaming the Vote, the discussion of the infamous “Hot or Not” (chapter 14). Hot or Not, as Poundstone remembers, was (is?) a website in which people can upload photos of themselves, and have the good people of the internet submit judgments as to the photo’s attractiveness. Poundstone ultimately holds the site up as an example of the efficacy of score voting. Quoting directly (Poundstone, 247):

[The creators, Hong and Young] considered having visitors pick their favorite of two on-screen photos. A photo would win points for each time it was preferred over another, random photo. This would loosely simulate a Borda count. (In a true Borda count, a candidate wins a point every time a voter ranks her above a rival. No Hot or Not voter could rank all the millions of pictures on the site, of course. The aggregate effect of random visitors ranking random pairs would be similar.) However, when shown two photos that happen to be of roughly equal attractiveness, “people will look at the pictures and not know,” Hong said. “They have a harder time deciding.”

Hong and Young also considered a simple “hot” or “not” vote on a single picture. This would be an approval vote. There it was “average” Joes and Janes which slowed things down. People would have to ponder whether to click “hot” or “not”. Range voting was faster. It seemed to require less thought. “Sometimes people can’t even express the number,” Hong explained, “they just have a feeling and like having that bar: ‘ah, it’s kinda like here.’” They position the cursor where it feels right and click.

This is a valuable history which deserves closer scrutiny, through the analytical eye of information theory.

First, some definitions: note that a single pairwise preference (“A vs B”) can be represented in one bit (0 for A, 1 for B). So too can a binary “hot or not” (0 for “not”, 1 for “hot”). A real value, on the other hand, requires more bits – for argument’s sake, let’s say 3 bits for an 8-point scale (000 gives a 0, 111 a 7, and the rest in-between). The axioms of information theory tell us that a digital “bit” can contain up to one “bit” of information (the relationship between the digital bit and the information bit being governed by the mathematics of entropy – in the classic example, the outcome of a fair coin is exactly one bit of information, while the outcome of a loaded coin is always a little bit less).
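The coin example corresponds to the standard Shannon entropy formula, which can be sketched in Python as follows. The 90/10 loaded coin is an arbitrary illustration chosen here.

```python
from math import log2


def coin_entropy(p_heads):
    """Shannon entropy, in bits, of a coin landing heads with probability p_heads."""
    if p_heads in (0.0, 1.0):
        return 0.0  # a certain outcome carries no information
    p_tails = 1.0 - p_heads
    return -(p_heads * log2(p_heads) + p_tails * log2(p_tails))


print(coin_entropy(0.5))            # 1.0, the fair coin
print(round(coin_entropy(0.9), 3))  # 0.469, a loaded coin carries less
print(log2(8))                      # 3.0, the ceiling for an 8-point scale
```

The 3 bits of an 8-point score are only a ceiling: it is reached when all eight scores are used equally often, and real responses can fall well short of it.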

Here, we see that score voting yields 3 bits of data, while a binary “hot or not” vote yields 1. Yet, bizarrely, it is easier for users to provide more data rather than less – suspicious. While we cannot prove which measure yields more information (as this requires access to the ineffable truth, which we lack), this juxtaposition should make us wonder how many bits of information we’re really getting in those 3 bits of real-valued scores – likely, it is less than the (up to) 1 bit we get from the binary “hot or not”.

Historical experience supports my argument. Poundstone mentions at the beginning of the chapter that, among others, YouTube uses 5-point scores for its videos. However, as discussed (in some of my work) here, YouTube (along with Netflix) has since dropped the 5-point system in favor of a binary like/dislike, on the grounds that the 5-point scale ultimately provided very little signal above and beyond what was gleaned from a like/dislike, and thus introduced mostly noise.

That it is “easier” for users to provide 3 bits tells us little about the quality of the measurement – it is the easiest thing of all to submit a random number, containing no information at all. It is not the amount of data, but the ratio of data to information, that we should care about. It is difficult decision processes that produce information-rich results.

IV. Conclusion

The foundation of science is the assumed validity of independent repetition – the idea that things which occur in the future are like things which occurred in the past, and that the things that we observe occur somewhat independently of one another. This allows us to, for instance, re-run experiments, test theories, and to develop re-usable mathematical models of the world. Unfortunately, this assumption is, at base, incorrect. Historians know that, while we conceptualize history as a series of interconnected-but-separate episodes, in which the past contains clues about the future, the deeper truth is that history is one, single event, in which every moment is intimately and inextricably wound up with every other, and no differentiation can occur. This reality is more felt in some cases than others. The natural sciences, for one, can often get away with strong assumptions of independence (protons behave largely the same in 21st century California as they did in 10th century China). In art history, this is less true – it is virtually impossible to understand the behavior of English painters in the 19th century without understanding the Italian sculptors of the 15th. The social sciences (and by extension, voting systems) sit, somewhat frustratingly, in the middle.

In many cases, assumptions of independence are necessary to make problems tractable. Fortunately, this is not the case here. There is room in the field of electoral systems and social choice to incorporate notions of relativity alongside notions of psychic intensity, and to develop algorithms which leverage the information encoded in both. Doing so will allow our mechanisms to sit closer to the “reality” of our experience and thus yield more consistently legitimate outputs – a worthy aim in the quixotic quest for better tools of freedom.

A Mild Critique of Quadratic Funding

Fri, 13 Dec 2019 00:00:00 +0000

This essay is meant as a mild and constructive engagement with one part of the constellation of ideas being advanced under the aegis of RadicalxChange (pronounced “radical exchange”), specifically the concept of quadratic funding, and its claim to “optimality”. Let’s review the argument and then assess the strength of that claim. This will involve a few equations but I’ll narrate the whole thing so it shouldn’t be too hard to follow (or just skip to the critique).

A Review

From the “Liberal Radicalism” paper, we have the following notion of social welfare:

\[\sum_p \left( \sum_i V_i^p(F^p) - F^p \right)\]

Here, $i$ is a citizen in a society while $p$ is a public good in that society. $c_i^p$ (which shows up later) is the amount of money that citizen $i$ gives to good $p$, while $F^p$ is the total amount of funding that good $p$ receives. $V_i^p(F^p)$ is the “currency-equivalent utility” that citizen $i$ receives if good $p$ is funded at level $F^p$. Pay extra special attention to the term currency-equivalent utility because it is the hinge of the critique. With these definitions, the equation is straightforward: social welfare is the sum of all individual utilities across all public goods, less the total cost of those goods. Pretty reasonable.

Now, the authors (Buterin, Hitzig, and Weyl) use this equation to show why two existing systems, namely capitalism and one-person-one-vote democracy, lead to suboptimal allocations, while their quadratic methods lead to optimal allocations. An important concept in their argument is the first derivative of the individual utility function, $V_i^{p\prime}$. This tells us how much value citizen $i$ gets from the next dollar which funds the good $p$, i.e. the slope of the curve.

For an optimal allocation, we would expect that the first derivative of the total utility for a given good (summed across all citizens) would be equal to 1, meaning that society as a whole has reached the point where giving more funding to the good would create less value than the funding itself, i.e. $V^{p\prime}(F^p) = 1$. At that point, funding should be placed elsewhere.

Now, under capitalism (the system where all contributions to public goods are made by citizens in isolation), otherwise known as $F^p = \sum_i c_i^p$, citizen $i$ will contribute to a good up until the point where their individual increase in utility is worth what they contribute, i.e. where $V_i^{p\prime}(F^p) = 1$. The problem here is that there is a lot of utility that ends up being “left on the table” – even if an extra $1 of funding can create $.5 of utility for three people (i.e. $1.5 of utility for society), no one will provide that funding since from the perspective of the individual, they are giving $1 and getting only $.5 back in value. Formally, this looks like $V^{p\prime}(F^p) > 1$, i.e. putting in more money will create more utility for society, but no one does it. Sad.
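
To make the arithmetic concrete, here is a minimal sketch of the three-person example. The variable names are mine; the numbers are the ones from the paragraph above.

```python
# The example from the text: $1 of extra funding is worth $0.50 to each of three people.
cost = 1.00
benefit_per_person = 0.50
beneficiaries = 3

# From any single contributor's point of view, the dollar is a loss...
individual_net = benefit_per_person - cost

# ...even though society as a whole would come out ahead.
social_net = beneficiaries * benefit_per_person - cost

print(individual_net, social_net)
```

The individual's net is negative while the social net is positive, which is exactly the gap that leaves the funding unprovided.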

Under one-person-one-vote (1p1v) (the system where citizens vote on allocations), otherwise known as $F^p = N \cdot \text{Median}_i V_i^{p\prime}(F^p)$, the problem is different. Here, the issue is that since the utility is determined by a majority vote (i.e. by the “median voter”), the allocation will be suboptimal to the degree to which the median voter differs from the average or mean voter. Note the appearance of the term mean here, because it sets the stage for (drumroll please) the quadratic methods.

Recall that the median is a measure of centrality which ignores degree of intensity, while the mean is exactly the measure of centrality which incorporates it, i.e. the mean minimizes the square error of itself to all the data points (while the median minimizes the absolute error).
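
The claim about the two error measures is easy to check numerically. What follows is a small sketch, not anything from the paper: the valuations are made up (four lukewarm citizens and one intense one), and the helper names `squared_error` and `absolute_error` are my own.

```python
from statistics import mean, median

def squared_error(center, data):
    """Total squared distance from `center` to every data point."""
    return sum((x - center) ** 2 for x in data)

def absolute_error(center, data):
    """Total absolute distance from `center` to every data point."""
    return sum(abs(x - center) for x in data)

# Hypothetical valuations: four modest supporters and one very intense one.
valuations = [1.0, 2.0, 3.0, 4.0, 40.0]

m = mean(valuations)
med = median(valuations)

# Brute-force search over a grid of candidate "centers" from 0.0 to 40.0.
candidates = [x / 10 for x in range(0, 401)]
best_for_squared = min(candidates, key=lambda c: squared_error(c, valuations))
best_for_absolute = min(candidates, key=lambda c: absolute_error(c, valuations))

print(m, best_for_squared)      # the mean and the squared-error minimizer coincide
print(med, best_for_absolute)   # the median and the absolute-error minimizer coincide
```

Note how the one intense citizen drags the mean far from the median; the median simply does not register how strongly that citizen feels.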

Enter quadratic funding (the system in which the total funding is the square of the sum of the square roots of the individual contributions), otherwise known as $F^p = (\sum_i \sqrt{c_i^p})^2$. Unlike capitalism, in which individuals contribute up until their utility matches their contribution, quadratic funding allows people to contribute until the total utility matches their contribution. We’ll look at the derivation because it’ll be instructive. Starting with the individual’s utility function, $V_i^p(F^p) - c_i^p$, we maximize by taking the derivative and setting to zero (involving several applications of the chain rule), which gives us:

\[V_i^{p\prime}(F^p) = \frac{\sqrt{c_i^p}}{\sum_j \sqrt{c_j^p}} \leq 1\]

This is an odd looking fraction, but note that it is less than (or equal to) 1, and equals one when you sum across all citizens. That is the voilà moment for quadratic funding:

\[V^{p\prime}(F^p) = \sum_i(\frac{\sqrt{c_i^p}}{\sum_j \sqrt{c_j^p}}) = 1\]
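
For anyone who wants the chain-rule step spelled out, here is a sketch using only the definitions above, assuming every $c_j^p > 0$ and that citizen $i$ treats everyone else’s contributions as fixed. First, how total funding responds to one citizen’s contribution:

\[\frac{\partial F^p}{\partial c_i^p} = 2 \left( \sum_j \sqrt{c_j^p} \right) \cdot \frac{1}{2 \sqrt{c_i^p}} = \frac{\sum_j \sqrt{c_j^p}}{\sqrt{c_i^p}}\]

Then the citizen’s first-order condition:

\[\frac{\partial}{\partial c_i^p} \left( V_i^p(F^p) - c_i^p \right) = V_i^{p\prime}(F^p) \cdot \frac{\sum_j \sqrt{c_j^p}}{\sqrt{c_i^p}} - 1 = 0\]

Rearranging for $V_i^{p\prime}(F^p)$ gives the fraction $\sqrt{c_i^p} / \sum_j \sqrt{c_j^p}$, and since the numerators sum to the shared denominator, the fractions sum to exactly 1 across citizens.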

While capitalism provides funding up until individual utility matches the increased funding, quadratic funding provides funding up until the collective utility matches the increased funding, which is optimal. This is a great result and a source of legitimate excitement.
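
A quick numerical sketch may help here. The contribution amounts are invented, and the function names `quadratic_funding` and `marginal_shares` are mine. Note also that the gap between $F^p$ and the raw sum of contributions has to be covered from somewhere; where that subsidy comes from is outside the scope of this review.

```python
from math import sqrt, isclose

def quadratic_funding(contributions):
    """Total funding F = (sum of the square roots of the contributions) squared."""
    return sum(sqrt(c) for c in contributions) ** 2

def marginal_shares(contributions):
    """Each citizen's fraction sqrt(c_i) / sum_j sqrt(c_j) from the first-order condition."""
    root_sum = sum(sqrt(c) for c in contributions)
    return [sqrt(c) / root_sum for c in contributions]

# Hypothetical contributions to a single public good, in dollars.
contributions = [1.0, 4.0, 9.0, 16.0]

total = quadratic_funding(contributions)   # (1 + 2 + 3 + 4) squared
raised = sum(contributions)                # what citizens actually paid in
subsidy = total - raised                   # the shortfall someone else must cover
shares = marginal_shares(contributions)

print(total, raised, subsidy)
print(shares, sum(shares))
```

Each share is at most 1 and together they sum to 1, which is the numerical face of the two equations above.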

The Critique

But (and finally, we reach the critique), let us recall the key assumption of the model: individual (subjective) utility, $V_i^p$, is assumed to be both known and dollar-valued, being inferred per-citizen from the amounts contributed. This is a problematic assumption, as it equates something which is fundamentally subjective (a private feeling) with something which is fundamentally objective (a real number). The collective (subjective) utility is inferred from summing up these numbers, equating them with feelings. This seems… peculiar.

Economists have long wrestled with this problem. In his Social Choice and Individual Values, economist icon Ken Arrow famously argued that since choices are defined by relative preferences (“apple vs orange”), “there is no quantitative meaning of utility for an individual”, and thus “interpersonal comparison [and thus summation] of utilities [have] no meaning”, since something which is not a number can be difficult to compare (i.e. it is easy to see that 6 is less than 7, but not easy to see that red is less than blue).

One might turn around and say that quadratic funding sidesteps the issue by asking citizens to make absolute decisions (“give $25 to the parks department”), rather than relative decisions (“plant apple trees, not orange trees, in the park”). In this case, citizens are telling us their currency-valued utility – $25, problem solved (known as “revealed preference”). But all that really tells us is that the citizen prefers giving the parks department $25 to keeping it for themselves – and tells us nothing about the fuzzy questions of psychic intensities. Further, if we assume that everyone has the same capacity for inner experience (a question with deep ties to identity, our other bugbear), but not everyone has the same amount of money to give, then we paint ourselves into another corner: do those with more wealth, who give more, experience greater utility than those who give less? If I make $100 a day while you make $10, is my experience of satisfaction ten times yours? Probably not.

You might retort that this is excessive pedantry. Our lived experience is full of assessments of the subjective experiences of others, and – although they are based on evolved heuristics, not mathematical proofs – it seems to work well. In his Gaming the Vote (chapter 15), William Poundstone considers this debate and makes the point that “these intellectual positions… entailed a pose of fashionable agnosticism over matters previously held to be common sense.” Many economists agree, with Amartya Sen giving the famous example of Nero’s burning of Rome: it is almost universally seen as self-evident that the negative utility of all the Romans who suffered in that blaze outweighs the positive utility that Nero experienced in the burning, and so the burning was “bad”. Clearly, utilitarian arguments have a place. To conclude that “we cannot model or compare subjective experience” seems like the easy way out, and evokes the behaviorist posture which constrained psychologists up until the “cognitive revolution” of the mid-20th century. Even if it’s not perfect, putting numbers to feelings seems “good enough” and gives us something to work with – so what’s the problem?

The problem is ultimately one of signal and noise, of signifier and signified, and of the risks of optimizing for proxies. Briefly, since we are unable to accurately represent (and thus measure) the thing we really care about (subjective utility), we instead measure a proxy (funding amounts). Unfortunately, there is a fundamentally unknowable gap between these two measurements, and so we cannot know how good our mechanisms really are with regard to our true goal of maximizing welfare – not only is there some error, but we cannot know what that error is.

In casual settings this is a non-issue, since this “proxy gap” will be too small to be consequential. However, the more pressure that is placed on the system (i.e. the more resources are at stake, the more people whose interests are affected), the greater the incentive to exploit the system (a phenomenon known as Goodhart’s Law), and a key vector of exploitation is the gap between the desired measurement and the true measurement (for a well-known example, consider the test-prep industry which coaches students taking high-stakes standardized tests). The more resources which are deployed using quadratic funding, the more pressure is placed on the system, and so the more the gap between “true utility” and “funding amounts” (the proxy) will be exploited – leading to unexpected failures because the gap cannot be modeled by the system. Unlike other kinds of error, which can be modeled and thus handled by the system, this kind of error necessarily lies outside the system and is thus quite pernicious, as the consequences invariably come suddenly and by surprise.

All of this is not to say that quadratic funding is a bad idea – quite the opposite, in fact, as in general it will probably work well (see this experiment) and represents an important step forward. Further, these basic measurement problems do not affect quadratic funding alone – any mechanism which must represent and measure a subjective quality falls into this trap – which includes basically all voting, rating, and reputation systems.

The point is more that one of the banner claims – optimality – is overstated. Ultimately, quadratic funding is “optimal” in the same way that blue is the “optimal” color for the Blue Man Group – it follows from the definition, rather than from some essential truth. Quadratic funding does not really maximize utility – it maximizes some other amorphous “utility-like” thing. Which, again, is fine… until it’s not.

Thanks to Auryn Macmillan for feedback and for making sure I’m not an idiot.

Aragon, DAOstack, Colony, Moloch

Sun, 16 Jun 2019 00:00:00 +0000

In which we compare and contrast the essential approaches of four significant Ethereum DAO projects.

Update: this essay has been expanded into a talk (video, slides), and translated into Chinese.

Prelude

What is a DAO? Here, we take as an (imperfect) definition something simple: “a censorship-resistant means to coordinate the deployment of shared resources towards a shared objective”. The simplest DAO, by this definition, would be a multi-sig wallet, in which individual members can withdraw paltry sums and many members together can withdraw significant sums.

While a multi-sig may be sufficient for a group of friends on a backpacking trip, it quickly becomes apparent that for more ambitious objectives requiring the coordination of more resources, additional mechanisms are necessary. How permeable should the boundaries of the organization be? How much influence should any individual have? How can individuals be protected from the bad behavior of others? How easy or difficult is it to participate?

For a certain type of person, these questions are irresistible, and it is no surprise that many significant projects have emerged in recent years seeking to answer these questions. People frequently ask about the ways in which these projects are similar and different from each other; this essay is a step towards an answer.

This commentary is based on my familiarity with these projects and their technical documentation, much of which I have read, as well as conversations with teammates from the various projects.

Aragon

“Freedom to organize”

Twitter (71.2k followers) – Solidity repo (created 3/2017)

Arguably the most high-profile DAO project, Aragon has achieved mindshare an order of magnitude larger (in terms of Twitter traction) than the other projects discussed here. Named after Aragon, one of Spain’s 17 comunidades autónomas, or autonomous communities, Aragon’s rhetoric and positioning are couched in the language of boundaryless freedom and unstoppability.

In the view of the Aragon team, the problem with organizations in their current form is their subjugation to the capriciousness of their real-world jurisdictions: kleptocratic governments, biased judicial systems, and the like. If organizations could be freed from these jurisdictions, then they would be able to reach their full potential.

Technically, Aragon’s most noteworthy achievements are arguably their permissioning and transaction forwarding systems, designed to allow a very wide range of modules to be safely connected together. These tools are impressive: the permissioning system can, for example, grant access only up to a particular block number or condition access on the response of some oracle; their forwarding system is based on a bespoke scripting language, evmScript.

Rather than build a product around a specific decision-making mechanism, as the other projects discussed do, Aragon has instead focused on developing a secure and general backbone for building organizations in general. On one hand, this is very appealing: by leveraging the foundation built by the Aragon team, end users are able to compose organizations to meet their specifications, with a fraction of the effort. On the other hand, one can ask whether “just putting it on the blockchain” is really the way to go. What is perhaps surprising about Aragon is that, for such a visionary project, they remain pointedly agnostic with regard to organizational form and decision-making mechanisms. While many associate the term “DAO” with some flavor of non-hierarchical, decentralized decision-making, some Aragon teammates have stated explicitly that it doesn’t matter what the organizations built with Aragon look like – as far as they are concerned, one could build an autocratic Apple or Microsoft using Aragon and be in line with the project’s goals.

An open question is whether the de-emphasis on mechanism in favor of a generalized modularity will benefit or hurt Aragon in the long run. As they say, “if nothing changes, nothing changes”. As mentioned above, the Aragon team’s focus is on bringing organizations into uncensorable cyberspace. However, it is possible that in truth it is not capture by jurisdiction, but rather our antiquated decision-making mechanisms (such as pass/fail voting on arbitrary strings of text) which are keeping organizations from rising to meet the challenges ahead. That said, Aragon’s emphasis on modularity will likely make it easier for them to build out a developer ecosystem and add support for novel decision-making mechanisms down the road (once they are proven elsewhere), and there are already three separate teams (Autark Labs, Aragon Black, Aragon One) working on exactly this.

DAOstack

“An operating system for collective intelligence”

Twitter (5.3k followers) – Solidity repo (created 9/2016)

While slightly lower profile than Aragon, DAOstack has been steadily racking up wins, most recently as the platform of choice for the (assuredly non-derivative) dxDAO. Unlike Aragon, DAOstack’s messaging explicitly values decentralized decision-making, and so the project places greater emphasis on solving the problems inherent with decentralized decision-making at scale. As one would expect from two PhDs in theoretical physics, their theory is strong.

In the view of the DAOstack team, organizations with decentralized and broad-based decision-making processes are more resilient (an inherent good), but making decisions on an unbounded sequence of pass/fail proposals is too cognitively burdensome to scale to large numbers of participants and proposals. With large numbers of proposals, it will be difficult for participants to know which proposals are most deserving of attention. With large numbers of voters, it is difficult to motivate participants to take the time to consider proposals which may not be personally relevant. For DAOstack, “just putting it on the blockchain” without engaging with these more fundamental human questions will get us exactly nowhere.

Technically (and unsurprisingly), DAOstack’s most noteworthy achievement and crown jewel is the “holographic consensus” cryptoeconomic mechanism for efficiently and reliably approximating group decisions using small numbers of participants (a hologram, after all, is an image in which every part contains all of the information of the whole). The mechanism functions by incentivizing a network of “predictors” to place bets on whether a proposal will pass or fail; the predictions are then used (along with a number of other rules) to emphasize or de-emphasize proposals and modify the quorum requirements necessary for the proposal to pass (for example, it may take fewer total voters to approve a highly boosted proposal). Functioning well, an organization using holographic consensus can scale to arbitrary numbers of proposals and arbitrary numbers of participants without sacrificing either decision-making speed or quality.
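
To give a feel for the shape of such a mechanism, here is a deliberately toy sketch. Everything specific in it is an assumption of mine rather than DAOstack’s actual rules: the function names, the `boost_ratio` threshold, and the choice of “absolute majority of all eligible voters” versus “simple majority of votes cast” as the two quorum regimes.

```python
def is_boosted(stake_for, stake_against, boost_ratio=2.0):
    """A proposal is 'boosted' when predictors' bets on passing sufficiently
    outweigh bets on failing. The ratio is a hypothetical parameter."""
    return stake_for > boost_ratio * stake_against

def passes(yes_votes, no_votes, eligible_voters, boosted):
    """Boosted proposals need only a majority of the votes actually cast;
    unboosted ones need a majority of everyone eligible (assumed rule)."""
    if boosted:
        return yes_votes > no_votes
    return yes_votes > eligible_voters / 2

# Hypothetical proposal: 40 of 1000 members bother to vote, 30 in favor.
boosted = is_boosted(stake_for=300, stake_against=100)
print(passes(30, 10, 1000, boosted))   # the boosted proposal passes on low turnout
print(passes(30, 10, 1000, False))     # the same votes fail without the boost
```

The point of the sketch is only the structure: the bets do not decide the outcome, they change how much attention and turnout the decision requires.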

The DAOstack team has rightly identified the problem of attention as one of the most fundamental challenges confronting humanity in the 21st century, and while others write thinkpieces and wring their hands, this team has put forward at least part of a solution. For the DAOstack team, the holographic consensus mechanism is the missing piece for unlocking performant, decentralized organizations. However, we will need to see how well the ambitious mechanism works in practice. In particular, the mechanism assumes the independence of the predictors and the voters (“you cannot buy a decision, but you can buy it into consideration”), yet we can imagine situations in which voters may be “swayed” by the predictions, especially if the predictors present themselves as authoritative experts. If this line proves too fine, the approximations may cease to be reliable, threatening the central promise of the project.

Colony

“A platform for open organizations”

Twitter (8.2k followers) – Solidity repo (created 4/2017)

Unlike Aragon and DAOstack, which make their focus the enabling of vote-driven management of an organization, Colony (my employer, forgive my bias) has chosen to focus on the quotidian. With “permissionless by default” as their rallying cry, Colony focuses heavily on mechanisms which, to the extent possible, eliminate the need for voting in daily operations, and let people just “get shit done”, with an eye towards enabling the global digital workforces of the future.

While both Colony and DAOstack take as axiomatic the value of enabling decentralized organizations, their approach to creating them is somewhat orthogonal. In the DAOstack case, scale is achieved by using holographic consensus to accelerate a synchronous process of discrete pass/fail decision making. In the Colony case, on the other hand, scale is achieved via an asynchronous process of continuous financial decision-making (using an org-chart-like domain tree), allowing different domains of the organization to conduct their business relatively autonomously.

Technically, Colony’s biggest achievement is the constellation of mechanisms which leverage time to enable the permissionless allocation of resources (note: for the full experience, you are encouraged to narrate every sentence mentioning “time” in an Alan Watts voice). Time plays a role in Colony in two critical ways: reputation earned decays over time (implemented via an off-chain reputation mining process), and funds are allocated continuously as a function of time (as opposed to discrete pass/fail methods). The more reputation which backs a proposal, the faster it is funded, but even someone with little reputation can slowly claim resources. The acquisition of reputation is driven by work (as opposed to pass/fail proposals to allocate reputation), which makes reputation a carrier of market information (much in the same way prices are).
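
A rough sketch of these two time mechanics follows. The formulas are illustrative assumptions of mine, not Colony’s actual ones: I use a simple exponential half-life for reputation decay and a funding rate that scales linearly with the fraction of reputation backing a proposal. The names `decayed_reputation`, `funds_released`, `half_life`, and `max_rate` are all hypothetical.

```python
def decayed_reputation(initial, elapsed, half_life):
    """Reputation remaining after `elapsed` time units, assuming it halves
    every `half_life` units (an assumed decay curve)."""
    return initial * 0.5 ** (elapsed / half_life)

def funds_released(backing_reputation, total_reputation, max_rate, elapsed):
    """Funds that have flowed to a proposal after `elapsed` time units, assuming
    the flow rate is proportional to the share of reputation backing it."""
    rate = max_rate * backing_reputation / total_reputation
    return rate * elapsed

# Reputation earned 90 days ago, with a hypothetical 90-day half-life.
print(decayed_reputation(100.0, elapsed=90, half_life=90))

# Two proposals after 5 days: one weakly backed, one strongly backed.
print(funds_released(10.0, 100.0, max_rate=20.0, elapsed=5))
print(funds_released(80.0, 100.0, max_rate=20.0, elapsed=5))
```

Under these assumptions the weakly backed proposal is still funded, only more slowly, which is the “permissionless” part.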

More subtly, the incorporation of time mechanics in the operation of a Colony animates the organization and makes it an entity with which individuals interact. Rather than being a static object which requires synchronous effort to overcome inertia (via a voting process), in a Colony resources are always moving, and the key interaction then becomes one of asynchronously influencing the flow of resources over time.

Scholars of organizational design in particular seem excited about Colony: the use of work-driven, time-decaying reputation in funding decisions promises to mix the best of top-down hierarchy (experienced leaders have outsized influence), with the best of independent decision-making at the periphery (leveraging local knowledge). However, it remains to be seen whether the lack of a central coordinating point will lead to organizations which are nonetheless able to work effectively towards collective goals, and further, for what types of organizations this model represents a competitive advantage. Colony takes as its patron saint the image of the ant colony, in which independent ants, unbeknownst to any of them individually, are engaged in a staggering collective enterprise. Colony hopes that its mechanisms can emulate for humans the same stigmergic processes which evolution has bequeathed to ants; yet, as certain dilettantish cultural commentators have observed, the capacity to appreciate novelty is incremental at best. Of the projects discussed here, Colony is the only one yet to have a mainnet launch (although one is imminent), and so much of Colony’s promise has yet to be proven.

Moloch

“Moloch whose mind is pure machinery”

Twitter (2.6k followers) – Solidity repo (created 7/2018)

By far the youngest project described here, Moloch, brainchild of Ethereum’s Romantic hero Ameen Soleimani, has burst onto the scene and acquired significant traction in recent months. Unlike Aragon, which has moral roots in the struggle against bad government, and DAOstack and Colony, which set themselves against the dysfunction of human organization, Moloch’s foundations are solidly rationalist and cryptoeconomic, rooted in, among other things, the trauma of TheDAO.

Moloch can be described succinctly as an experiment in coordination mechanisms, seeking to create the “minimum viable process” which allows people to allocate shared resources towards a shared goal, while aggressively minimizing the vectors for attack and abuse, both technical and social.

Technically, Moloch achieves this in two ways, both of which cleverly mix the computer and social layers. The first is by leveraging time (remember the Alan Watts voice) to create “rails” which focus the attention of participants on exactly one collective decision. Proposals are considered in sequence, on a timer, and malicious participants are unable to overwhelm the participants with many decisions. Further, the decisions themselves are similarly constrained: each proposal includes a credit of some amount of tokens (the “tribute”) and a debit of some amount of influence (“voting rights”). An unknown candidate may offer a large tribute for small influence, while a known and respected candidate may offer small tribute for large influence. By creating a mechanism with a single-minded focus, Moloch has made it likely that at least something will get done well.

The second technical achievement was the innovation around “rage quitting”, in which participants are able to exit (with their portion of the resources) if they are unhappy with the decisions of their colleagues. This innovation creates a disincentive for malice and implicitly places social pressure on participants to remain aligned on the organization’s goals.
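
The tribute-for-influence exchange and the rage quit can be sketched in a few lines. This is a toy model, not Moloch’s contract: the `Guild` class and its method names are hypothetical, and I am assuming that a departing member’s “portion of the resources” is pro-rata to their voting shares.

```python
from dataclasses import dataclass, field

@dataclass
class Guild:
    """Toy model of a shared bank plus a ledger of voting shares."""
    bank: float = 0.0
    shares: dict = field(default_factory=dict)

    def admit(self, member, tribute, shares_requested):
        """Accept a proposal: credit the tribute to the bank, grant voting shares."""
        self.bank += tribute
        self.shares[member] = self.shares.get(member, 0) + shares_requested

    def rage_quit(self, member):
        """Burn all of a member's shares and pay out their pro-rata slice of the bank."""
        total_shares = sum(self.shares.values())
        payout = self.bank * self.shares.pop(member) / total_shares
        self.bank -= payout
        return payout

guild = Guild()
guild.admit("alice", tribute=100.0, shares_requested=10)  # known member, small tribute
guild.admit("bob", tribute=300.0, shares_requested=10)    # newcomer, large tribute

# Bob dislikes a decision and exits with his share of the pooled funds.
payout = guild.rage_quit("bob")
print(payout, guild.bank, guild.shares)
```

Notice that under this pro-rata assumption Bob leaves with half the bank, not with his original tribute; what he bought was influence, and what he withdraws is proportional to it.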

The ease of participation and the guarantees of security and safety, along with the clever memeing on the part of the Moloch team, have made Moloch the dao du jour, with Ethereum heavies Vitalik and Joe Lubin among the members. An open question for Moloch is whether the highly specific and limited interface will prove sufficient for coordinating around the ambitious goals they have set for themselves. At present, Moloch seems to see itself more as an improvement over existing grants committees, and less as a new foundation for an operating organization – it remains to be seen how far the Moloch mechanism can go.

In an important sense, Moloch can be seen as fundamentally a self-determining, plutocracy-resistant reputation system, in which members with influence choose to allocate influence to new members, and reputation just happens to be convertible into money. Inasmuch as reputation systems are important building blocks for other, more complex systems, it may well be that Moloch finds application as an important piece of other puzzles.

Conclusion

Here we present a succinct and high-level overview of four Ethereum DAO projects, and attempt to get at their essential points-of-view and the notable technical innovations which form the basis for their value. Each project has substantial achievements under its belt, and it will be exciting to see which hypotheses are proven out in the years ahead.

Against Voting

Wed, 08 May 2019 00:00:00 +0000

An essay about how the way we make choices shapes the choices we make.

We open by introducing a visual metaphor for describing the behavior of complex systems, and then modify the metaphor to instead describe “coordinating processes” – the mechanisms by which we manage our collective social life. We focus on the legislative process as a central coordinating process, and attempt to understand this process using concepts from mathematics and computer science. Through this lens, we can understand some of the limitations of current processes, and also orient ourselves towards new processes which promise to overcome these limitations. We conclude with a ten-thousand-foot view of coordinating processes in general and a framework for thinking about them going forward.

Introduction: Pace Layering

In counter-cultural icon Stewart Brand’s 1999 book The Clock Of The Long Now, we find a useful image innocuously titled “Pace Layering”:

This image’s thesis is that complex systems can be seen as a “layering” of multiple sub-systems, with “lower” systems (the more foundational) simultaneously enabling an environment and setting the boundaries within which the “higher” systems (the more discretionary) can operate. Lower systems are more critical, in that disruptions in lower systems require recalibrations of all higher systems, and are thus ideally slower to change than the more discretionary systems built on top of them.

Brand’s conceptualization of these systemic layers developed out of his theorizing on the lives of buildings: he noticed that different “layers” of a building had different rates of change: the building’s floor plan; the arrangement of furniture; the location of people and things. This recursive or fractal pattern is found also in nature:

Consider, for example, a coniferous forest. The hierarchy in scale of pine needle, tree crown, patch, stand, whole forest, and biome is also a time hierarchy. The needle changes within a year, the tree crown over several years, the patch over many decades, the stand over a couple of centuries, the forest over a thousand years, and the biome over ten thousand years. The range of what the needle may do is constrained by the tree crown, which is constrained by the patch and stand, which are controlled by the forest, which is controlled by the biome. Nevertheless, innovation percolates throughout the system via evolutionary competition among lineages of individual trees dealing with the stresses of crowding, parasites, predation, and weather.

Stewart Brand, The Clock Of The Long Now

For physical beings like us, visual and spatial metaphors are invaluable aids when reasoning about abstract concepts, and the notion of “pace layering” is a useful and flexible metaphor for organizing our thinking about complex systems.

I. Coordinating Processes I

Let us take Brand’s metaphor and adapt it to questions of social coordination:

This (highly schematic) setup is found, in varying degrees, in states and organizations across the world. The layers can be summarized as follows: some foundational documents bootstrap the entity into being; a specialized body is entrusted with the interpretation of those documents. On top of that, we have more flexible and popular mechanisms via which some subset of the community is periodically entrusted with special decision-making powers; these individuals then engage in a legislative/allocative process which invariably culminates in a series of pass/fail votes on some series of things, strings of text which take many forms but can be frequently bucketed into policies and expenditures. Finally, some other subset of the community is given operational discretion to use the expenditures to enact the policies.

From a theory perspective, innovations which change these systems without changing the arrangement of layers can be thought of as marginal or incremental changes, while innovations which change the number or relationship between the layers can be thought of as paradigmatic changes. This distinction is not meant to diminish the importance of incremental improvements: measures to reduce the impact of gerrymandering, voter suppression, and the influence of moneyed special interests will make the outcomes more representative and thus more legitimate. Such efforts are valuable and needed, as are efforts to improve the efficiency of the pass/fail decision-making process (including but not limited to “just putting it on the blockchain”).

Parliamentary Procedure

The trouble with socialism is that it takes up too many evenings.

Oscar Wilde

The set of rules by which these representatives reach their pass/fail decisions is collectively known as “parliamentary procedure”, the most prototypical of which is “Robert’s Rules of Order”. Parliamentary procedure can be thought of as a type of string-processing algorithm which composes long strings of text via a sequence of branching decision points (although more commonly we speak in terms of motions, seconds, amendments, and so on).

Robert’s Rules (and systems like it) are general and secure, in that they allow groups to make nearly arbitrary decisions (encoded as plain text) while remaining robust to manipulation via strict branching rules, but are consequently (and as we will argue, necessarily) remarkably tedious and inefficient. The tedium of parliamentary procedure is a large part of the reason why our coordinating processes assume the necessity of special representatives, motivated by some mixture of duty, money, and power, to provide the cognitive power needed to run the algorithm.

This setup emerged slowly out of an even older one-layer process in which single individuals held virtually unchecked power for unlimited periods of time, and represented a paradigmatic change in coordinating processes. However, this setup seems to have remained largely uninterrogated in the last two hundred years or so; it seems to be received wisdom that we cannot do better than tedious voting on proposals, and that all we can do is make marginal improvements in the composition of the representatives and the particulars of the parliamentary procedure. But we can do better.

Unfortunately, naive efforts to “broaden the franchise” and invite large groups to participate in parliamentary process often fail due to low participation. This effect can be partially explained via the notion of “rational ignorance” – the idea that if the work required to acquire information is greater than the benefit of that information, it makes no sense to gather that information. In situations where large groups are invited to participate in pass/fail decisions on complex proposals, rational ignorance clearly applies, and further we can speculate that, in cases where many votes are cast, a large fraction may be cast somewhat randomly, intuitively, or emotionally, introducing noise into a critical signal.

So, we select and incentivize representatives because parliamentary procedure is unavoidably tedious. Parliamentary procedure is unavoidably tedious because it is completely general. Were the procedure to be specialized, it could be made more efficient and less burdensome, making broad-based, meaningful participation possible. To develop this argument further, let us consider the relationship between specificity and efficiency in more detail.

II. Representation & Computation

Behold a taxonomy of types of numbers:

The most general type of number (as far as this discussion is concerned) is the real multivariate, consisting of, unsurprisingly, one or more real numbers. A list of baseball batting averages would fall into this category. The most specific type of number is a binary univariate (more commonly known as a bit), taking on only the values of 0 or 1. In between we have a variety of other types of numbers: discrete (the integers), univariate (just one number), and so on. Note that a binary bit is still technically a multivariate real, but where the “multi” happens to be one and the “real” happens to be limited to the values 0 and 1.

The key point is that pass/fail voting makes use only of the most specific number – the binary bit, yes or no. The bit is the most specific type of number, but therefore the most general in terms of possible applications: any decision can be made by a pass/fail vote on a description of the decision (how that description is created, however, is another matter).

We can demonstrate this generality via an example from computer science. Imagine we would like to know an integer $k$. We have two oracles: one which returns $k$ itself (a discrete output), and another which, when given an integer $q$, tells us whether $q$ is greater than $k$ or less than or equal to it (a binary output). While the first oracle would clearly get us to $k$ faster, note that we could use the second as well: we would simply have to query it many times (roughly $\log_2(n)$ times for a $k$ between $0$ and $n$), as we iteratively home in on $k$ via a binary search. The binary output lets us recover the discrete output, but slowly. As an aside, this technique (the reformulation of arbitrary algorithms as binary “decision problems”) is used in computer science as part of the analysis of algorithms, useful in settings where you want to “compare” algorithms with different types of outputs.
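The binary oracle can be sketched in a few lines of Python. This is only an illustration: the names `make_oracle` and `find_k` are invented here, and the query counter exists purely so that we can see how many one-bit answers were needed.

```python
def make_oracle(k):
    """Build a binary oracle for a hidden integer k, plus a query counter."""
    calls = {"count": 0}

    def is_greater(q):
        # One bit of output: True if q > k, False if q <= k.
        calls["count"] += 1
        return q > k

    return is_greater, calls


def find_k(is_greater, n):
    """Recover the hidden k in [0, n] using only the binary oracle."""
    lo, hi = 0, n
    while lo < hi:
        mid = (lo + hi + 1) // 2  # round up so the range always shrinks
        if is_greater(mid):
            hi = mid - 1  # mid > k, so k lies strictly below mid
        else:
            lo = mid      # mid <= k, so k is mid or above
    return lo


oracle, calls = make_oracle(k=37)
print(find_k(oracle, n=1000), "found after", calls["count"], "queries")
```

Each query roughly halves the remaining range, which is where the logarithmic query count comes from. The first oracle would have answered in a single call.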

More specific numbers (bits) are less efficient carriers of information, but are consequently more general with regard to the purposes to which they can be put. A string of bits can be a number, an image on a screen, or the outcome of a vote. Likewise, a mechanism of pass/fail voting can be used to decide on virtually anything (assuming the existence of a secondary process which can compose the string of text representing the decision).

However, this street runs two ways: a more specific number can be used to implement a more general one, while a more general number can always be treated as if it were a more specific one. Consider: a single bit (0) used to implement a uint8 (10101010) used to implement a single bit: (00000001). In fact, we can think of the second oracle in our earlier example as “constructing” the number $k$, bit-by-bit. Clearly, a process which returns one byte of information at a time (assuming a sufficiently high signal-to-noise ratio) is more efficient than one which returns only one bit.

Further, a byte contains more information than eight independent bits (the bits in the oracle example are not independent), for the reason that the number of meaningful states is much larger: 9 possible states for eight independent bits in which the order does not matter (since only the count of ones, from 0 through 8, distinguishes them), compared to 2 ** 8 == 256 for eight interdependent bits in which order does matter and therefore every permutation is unique. Of course, in the real world nothing is ever truly independent, inasmuch as history is in fact a single non-repeating event, but the point is made.
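The two counts can be checked by brute force. The sketch below enumerates every eight-bit pattern, then counts distinct patterns when order matters and distinct tallies of ones when it does not (a tally can be anything from 0 through 8). The variable names are illustrative only.

```python
from itertools import product

# Every possible assignment of eight bits.
patterns = list(product([0, 1], repeat=8))

# Order matters: each permutation is its own state.
ordered_states = len(set(patterns))

# Order does not matter: only the number of ones distinguishes states.
unordered_states = len({sum(p) for p in patterns})

print(ordered_states, unordered_states)
```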

It is a perhaps counterintuitive but nonetheless essential result that if a problem can be made more specific (for example, by connecting the items so that the order becomes a vehicle for encoding information), it may be possible to develop more efficient solutions which are able to process more information per unit-of-computation, encoding this information using more general or expressive types of numbers.

A Case Study

Let us make this discussion concrete via a brief case study of Quadratic Voting, an alternative voting mechanism. With QV, participants are allocated some number of “voice credits”, which they can exchange for actual votes on pass/fail proposals at a quadratic exchange rate (n votes cost n * n voice credits). This mechanic has been shown to significantly increase the clarity of signal coming from voters by allowing them to allocate more voice credits to issues that they feel strongly about, in exchange for less influence on issues they care less about.
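A minimal sketch of the quadratic exchange rate follows. The helper names, the budget of 100 credits, and the use of negative numbers for votes against a proposal are assumptions made for illustration, not part of any particular QV implementation.

```python
import math


def qv_cost(votes):
    """Voice credits needed to cast `votes` votes on one proposal."""
    return votes * votes


def max_votes(credits):
    """Largest number of votes affordable on a single proposal."""
    return math.isqrt(credits)


def total_cost(ballot):
    """Credits spent by a ballot mapping proposal -> signed vote count."""
    return sum(qv_cost(abs(v)) for v in ballot.values())


budget = 100  # hypothetical allocation of voice credits

# Two ways to spend the same budget: concentrate, or spread out.
focused = {"A": 10}
spread = {"A": 5, "B": 5, "C": -5, "D": 5}

print(total_cost(focused), total_cost(spread), max_votes(budget))
```

Both ballots exhaust the same budget, but the focused voter casts 10 votes in total while the spread voter casts 20: intensity on one issue is bought at the price of influence elsewhere.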

In our telling, we can understand Quadratic Voting as a mechanism which extends regular voting in a way which makes the outcomes of various proposals interdependent – since placing multiple votes on a single issue entails withholding many more votes on other issues, the output of this procedure changes from univariate binary (each proposal is independent, with a one-bit outcome) to multivariate binary (we can only meaningfully talk about proposals as a set, since voters need knowledge of the multiple proposals to be able to place their votes). Put another way, the output of a QV process is more relative than absolute in character.

So we see that by making a mechanism more specific (for proposals to be meaningfully compared, they must be of similar “kind” and grouped together), we are able to deploy a more general type of output (multivariate binary), and exploit this structure (the ordering of bits) to convey additional information, achieving significant efficiencies in terms of translating subjective preference into objective decisions.

III. Against Voting

This essay is provocatively titled “Against Voting”, but here we show our hand: we are “against voting” only when “voting” refers to pass/fail decisions on long strings of natural language text. In common use, “voting” is also used to describe the act of selecting representatives, and “voting” can even be used to describe the process by which we determine expenditures. For a modicum of clarity, let’s talk about “voting”, “electing”, and “budgeting” to refer to these separate activities. We “vote” on policies, “elect” our representatives, and “budget” for expenditures.

Here is a table which summarizes a number of popular “input mechanisms”, as well as their applications, their most general number types, and where the “processing” occurs (i.e. cognitive or computational, or “does the computer add any non-trivial value to the inputs”). Recall that anything used for voting can be inefficiently used for electing, and anything used for electing can be inefficiently used for budgeting. Likewise, anything used for budgeting can be re-purposed for electing (rank the items by the value they receive), and anything used for electing can be re-purposed for voting (an election with two candidates: “pass” and “fail”).

Mechanism	Applications	Input	Output	Processing
Majority	Voting	Univariate Binary	Univariate Discrete	Cognitive
QV	Voting	Multivariate Discrete	Multivariate Discrete	Cognitive
Plurality	Electing	Multivariate Binary	Multivariate Discrete	Cognitive
Score	Electing	Multivariate Discrete	Multivariate Discrete	Cognitive
IRV	Electing	Multivariate Discrete	Multivariate Discrete	Computational
Dot	Budgeting	Multivariate Real	Multivariate Real	Cognitive
Power	Budgeting	Multivariate Binary	Multivariate Real	Computational

The first thing to notice is how the problem of voting differs from that of electing and budgeting. With pass/fail voting on strings of text, the only question we can meaningfully ask is “is this thing better or worse than the status quo?”, and the only way to answer that question is by performing extensive cognitive labor, in large part because strings of natural language text have little formal structure and are thus difficult to analyze computationally (not for lack of trying). Calculemus!

The difference with electing and budgeting is that their questions are phrased not in terms of absolutes, but in terms of relationships (“is this thing better or worse than that thing”, and “how much should this thing get compared to that thing”). This distinction is important because, as I am going to suggest, mental concepts (and so our understanding of the world) are relational in character. We understand things not in terms of their absolute nature, but rather by their relationships to the things around them.

This is true for elections, and doubly true for budgeting, a delightfully constrained setting in which the only question we ask is “is this thing more important than that thing”. Very few people will dispute the absolute value of public education, public safety, public health, public infrastructure, and low deficits; the interesting things happen only when you ask people to choose between them. Budgeting is a crucial exercise in relativity.

We return to our theme. Consider for a moment that part of the reason budgeting is seen as difficult is that we are making absolute decisions about relative questions. “Should we fund the police” is a meaningless question; “should we fund police before we fund education” is a meaningful one. By posing the question in terms of absolutes rather than relative degrees, we create an inefficient process (recall the binary oracle), and by framing budgeting questions as binary decisions about strings of text, we push the majority of the information processing “outside of the decision process”, which in addition to increasing the cognitive burden on participants, increases the risk of error, capture, and manipulation.

A Category Error

We have inherited a categorical error in our collective handling of finances. Our familiar coordinating processes were developed in an age where information traveled at the speed of horse and computations ran on ink and parchment. Cognition and computation were one and the same. The methodological questions discussed here were on exactly nobody’s radar (had there even been a radar to be on). But new tools bring into view new solutions, and new solutions shine light on present shortcomings, and it is time to move on. By teasing apart decisions about expenditures from decisions about policies, we can approach the question of allocation with a more precise and powerful set of tools than that of legislation writ large. We can deploy specialized mechanisms with information-rich outputs, and leverage computational techniques to perform the heavy lifting, generating these outputs from cognitively simpler but useful inputs.

Phrased differently, we can move from “simple inputs on complex objects” (cognitive complexity) to “complex inputs on simple objects” (computational complexity). A national budget is a complex object, but a relative choice between “infrastructure” and “education” is a simple one. By providing more complex inputs, such as a multivariate real describing percentage allocations, or a series of pairwise decision bits (the simplest unit of relative information), we can shift the processing burden from cognition to computation, and by extension, change the experience of budgeting entirely.

Specifically, both Dot Voting and Power Budgeting represent alternatives to conventional proposal-based budgeting processes. The former asks participants to allocate percentages among various items, with the constraint that the allocations must not sum to more than one. The latter asks participants to submit a sequence of explicitly pairwise preferences between items (“is A more important than B”), and then converts them into budgets using well-established mathematical techniques. In both cases, inputs are submitted under relative constraint (as with the Quadratic Voting example), which renders them more information-rich than when inputs are made without constraints, as in the case of score voting or Likert scales.
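To make the two input styles concrete, here is a rough sketch of each. Both functions are illustrative assumptions rather than the definitive form of either mechanism: `dot_budget` simply averages the submitted shares, and `pairwise_budget` uses a damped random walk over the preference graph (a PageRank-style power iteration), which is only one of several established ways to turn pairwise comparisons into weights. The damping value and iteration count are arbitrary choices.

```python
def dot_budget(ballots):
    """Average per-item shares across ballots, normalized to sum to one."""
    totals = {}
    for ballot in ballots:
        if sum(ballot.values()) > 1.0 + 1e-9:
            raise ValueError("a ballot's allocations must not exceed one")
        for item, share in ballot.items():
            totals[item] = totals.get(item, 0.0) + share
    grand_total = sum(totals.values())
    if grand_total == 0:
        raise ValueError("no allocations were submitted")
    return {item: share / grand_total for item, share in totals.items()}


def pairwise_budget(items, prefs, damping=0.85, iters=200):
    """Turn (winner, loser) pairs into budget shares via a random walk.

    Weight flows from each loser to the items preferred over it; the
    stationary distribution of that walk is read off as the budget.
    """
    n = len(items)
    idx = {item: i for i, item in enumerate(items)}
    flow = [[0.0] * n for _ in range(n)]
    for winner, loser in prefs:
        flow[idx[loser]][idx[winner]] += 1.0

    weights = [1.0 / n] * n
    for _ in range(iters):
        new = [(1.0 - damping) / n] * n
        for i in range(n):
            out = sum(flow[i])
            for j in range(n):
                # An item that never lost spreads its weight evenly.
                step = flow[i][j] / out if out else 1.0 / n
                new[j] += damping * weights[i] * step
        weights = new
    return dict(zip(items, weights))


ballots = [
    {"education": 0.6, "roads": 0.4},
    {"education": 0.2, "roads": 0.3, "police": 0.5},
]
prefs = [("education", "police"), ("education", "roads"), ("roads", "police")]

print(dot_budget(ballots))
print(pairwise_budget(["education", "roads", "police"], prefs))
```

In the first function the participant does the arithmetic and the computer merely adds; in the second the participant answers simple questions and the computer does the arithmetic, which is the shift from cognitive to computational processing described above.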

As an aside, it’s not clear how it ever became acceptable to have people provide input using unconstrained numeric scales. It would be as informative and more honest to ask people to compose haikus describing their affect towards the item in question.

IV. Coordinating Processes II

Our inherited miscategorization of questions of expenditure as questions of policy has limited the possible solutions available for solving those problems; recategorizing questions of expenditure will firstly allow us to incrementally apply a more powerful set of tools to answering these questions, and secondly and more importantly allow us to contemplate paradigmatic changes to where questions of expenditure are answered:

The relocation of complexity in the computer, rather than the person, means that the computer can act as the chair and facilitator of the process, enabling a large group of participants to asynchronously collaborate in the allocative process. Thus, in addition to the incremental improvements we would see from improving existing budgeting processes while holding the set of participants fixed (i.e. maintaining the practice of electing representatives), the adoption of new budgeting processes would allow us to “extend the budgeting franchise” to the entire population: a paradigmatic change in which allocation becomes a coordinating process more fundamental than the election of representatives: whereas currently we provide oversight of finances via our choices of representatives, in the future we can provide oversight over representatives via our choices about allocation.

Further, while univariate binary outcomes are by definition all-or-nothing, and so many voices ultimately do not matter, multivariate real outcomes are capable of “shades of gray”, of incorporating minority voices in incremental degrees: as a single participant, my input really does matter. This eliminates many of the challenges of collective action, and changes the emotional relationship the participants have to the system: active participants instead of passive validators.

V. Surely You’re Joking

Let us consider and rebut some possible objections to this thesis:

Partial funding will lead to failed projects. Things must be funded fully or not at all.

We believe that this type of thinking is fundamentally flawed.

The future is a big place. To say that something is “fully funded” implies knowledge of the future (fullyFunded => knowledgeOfFuture). The contrapositive of this is that if there is no knowledge of the future, nothing can ever be said to be fully funded (!knowledgeOfFuture => !fullyFunded). Since no one has knowledge of the future (!knowledgeOfFuture), it is not meaningful to think of something as “fully funded” (!fullyFunded). Things change all the time. Organizations grow and shrink into the money they have.
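The inference above has the shape of modus tollens, and its validity can be checked mechanically over all four truth assignments. The snippet is only a sanity check on the logic; the variable names mirror the identifiers used in the text, and `implies` is a helper defined here.

```python
from itertools import product


def implies(a, b):
    """Material implication: a => b."""
    return (not a) or b


assignments = list(product([False, True], repeat=2))

# (fullyFunded => knowledgeOfFuture) always agrees with
# (!knowledgeOfFuture => !fullyFunded).
same_statement = all(
    implies(fully_funded, knowledge) == implies(not knowledge, not fully_funded)
    for fully_funded, knowledge in assignments
)

# Whenever the implication holds and knowledgeOfFuture is false,
# fullyFunded must be false as well.
modus_tollens = all(
    not fully_funded
    for fully_funded, knowledge in assignments
    if implies(fully_funded, knowledge) and not knowledge
)

print(same_statement, modus_tollens)
```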

There may certainly be edge cases where some item is uncomfortably under-resourced for a period of time, but in general this should be considered a straw man argument until experience proves otherwise. More important than pursuing fictitious goals like “fully funded” is to pursue flexible allocation mechanisms which can respond effectively to new information as it becomes available.

The vision of broad-based input is unrealistic. We know from experience that voter turnout is low.

It is true that voter turnout for referenda is often low, but recall that referenda are synchronous decisions overwhelmingly framed in terms of pass/fail. Historical experience of low turnout for largely synchronous, tedious decisions in which most participant input does not matter does not imply low turnout for asynchronous, intuitive decisions in which each participant has an incremental impact on the outcome. It is easier to decentralize allocative decisions than legislative ones; consider our historical experience with markets.

The general population lacks the expertise to make allocative decisions.

Do they? Let us remove our rose-colored glasses and acknowledge that elected representatives rarely budget in our best interest – control over allocations is an invaluable bargaining chip and a vector for opaque political processes leading to suboptimal outcomes. Any risk of suboptimal allocations by the population should be weighed against current allocation failures caused by giving control to a small group of representatives.

Further, no one knows the future and so the ability to flexibly adjust budgets is more important than “correctness” at any fixed moment in time. The mechanisms suggested here allow for constant, asynchronous updating of budgets, making it possible to course-correct should it become clear that errors have been made. We can also allow for liquid democracy-style delegation of influence, so that individuals with specific expertise can accumulate more influence over decisions in their purview.

Finally, we must remember that our memory and experience of budgeting is inextricable from the manner in which budgeting has historically occurred: as tedious, cognitively burdensome processes culminating in a single contentious pass/fail vote. Changing the mechanism changes and relocates the cognitive burden, meaning that something that once required “expertise” (read: tedium) can become widely accessible.

Direct democracy is dangerous! There is too much of a risk of rash mob behavior, and defunding of critical works.

This is a fair and important critique. Republican government developed out of democratic in large part due to a recognition of the temperamentality of direct democracy. Majority rule can easily tend to mob rule, with threat of violence. But there are ways to construct the system to blunt the effects of emotion and temperamentality. One possibility is to limit the extent to which allocations may change over any period of time, which would slow the effect of mood swings until the population has regained its composure. The ability to incorporate this type of “regularization” directly into the process (versus being limited to binary approval of the output of a separate, unspecified process) is a significant advantage of these more specialized techniques.
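One way such a limit might look in practice is sketched below. The function name, the cap of five percentage points per period, and the choice to scale the whole adjustment uniformly are all assumptions made for illustration; other forms of regularization would serve the same purpose.

```python
def limited_update(current, target, max_step=0.05):
    """Move allocations toward `target`, capping the per-item change.

    Both dicts are assumed to share the same items and to sum to one.
    The whole adjustment is scaled down uniformly, so the result still
    sums to one and no item moves by more than `max_step`.
    """
    deltas = {item: target[item] - current[item] for item in current}
    largest = max(abs(d) for d in deltas.values())
    scale = 1.0 if largest <= max_step else max_step / largest
    return {item: current[item] + scale * deltas[item] for item in current}


current = {"education": 0.5, "police": 0.5}
demanded = {"education": 0.9, "police": 0.1}  # a sudden swing in mood

print(limited_update(current, demanded))
```

Applied once per period, a sustained preference still arrives at its target eventually, while a passing mood swing is mostly absorbed before it can do damage.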

Your computational methods seem overly complicated and hard to use.

Not necessarily; it is not true that a complicated process must have a complicated interface. Quite the opposite: by performing more information processing with a computer, you are able to create simpler experiences for the participants. The nature of the processing can be taught in school, much like we teach civics and government to teenagers today. These are all just differences of degree, not of kind. Further, the techniques proposed here are proven and reliable, intuitive, and interpretable by laypersons.

Epilogue: Zeno and the Archery of Artifice

Zeno of Elea was a pre-Socratic Greek philosopher who lived in the fifth century BCE, known for his paradoxes involving objects in motion and his early explorations of infinity. His most famous paradox is that of an arrow in flight: as the arrow moves towards its target, it takes some non-zero amount of time to traverse halfway to the destination. As one can make infinite half-motions towards a destination (each half-motion is half the distance of the one before), and each motion takes non-zero time, paradoxically it seems that it will take the arrow an infinite amount of time to hit the target. Of course, Zeno didn’t know about calculus.

The key image is that of the sequence of half-motions, each traversing half of the remaining distance: the first half-motion is the largest; the second half-motion a quarter of the total distance, and so on. As we add more half-motions, each becomes absolutely smaller until the motions diminish to zero in the infinite limit and the distance is traversed.
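For readers who want the calculus spelled out, the half-motions form a geometric series. Writing $d$ for the total distance and $v$ for a constant arrow speed (both symbols introduced here for illustration), the fractions of the distance covered sum to one, and so the total time is finite:

$$\sum_{n=1}^{\infty} \frac{1}{2^n} = \lim_{N \to \infty} \left(1 - \frac{1}{2^N}\right) = 1, \qquad \sum_{n=1}^{\infty} \frac{d}{v \, 2^n} = \frac{d}{v}.$$

Infinitely many steps, each taking non-zero time, can still add up to a finite total, because the steps shrink fast enough.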

Return to Brand’s Pace Layering. For sake of argument, let’s say that each layer explains half of the problem of understanding the world, a type of “half motion” of responsibility. Each additional layer explains half of the remaining half. If we assume that each layer encompasses a constant amount of work (size * complexity), then we can think of an infinite sequence of increasingly complex layers with progressively smaller amounts of “discretion”.

This provides a framework for thinking about coordinating processes more broadly: we take the easier half of the problem, and develop the simplest and most efficient mechanism which addresses that half. We then take the easier half of the remaining half, which by definition is more complex than the first half which we’ve already solved, and do the same thing again. The more complex mechanisms have higher risk of failure, but simultaneously are responsible for smaller fractions of the “total discretion”, and so the overall risk remains tolerably low at every layer. The entirety of the system, then, is the integration of this long tail of subsystems, with a total risk no higher than the risk represented at every layer. More specialized processes create a less risky system, then, than fewer general processes, for the same amount of functionality.

Consider how our current paradigm is over-reliant on a single system (parliamentary procedure) and is therefore at a higher risk for failure. Contrast this with the previous paradigm, which was over-reliant on a single ur-system (hereditary autocracy) which was at an even higher risk for failure. As we have seen once, and as we will see again, separating a coordinating process into parts, specializing the parts, and reintegrating them allows us to reduce the risk of failure while maintaining or even improving functionality.

To Bounty or Not To Bounty

Mon, 25 Mar 2019 00:00:00 +0000

That, truly, is the question.

On the surface, bounties seem like they should be a good way to leverage the world’s distributed software talent. Projects carve out bits of work which need to be done, put up cash for their completion, and wait for a member of the global engineering cloud to submit some code.

Especially in the world of Ethereum, the idea of bounties has captured the imagination. Two bounty platforms, Gitcoin and Bounties Network, are among the most mature in the ecosystem. And yet, despite all the infrastructure, bountied freelancers have largely not replaced full-time engineers as the primary source of software labor. Curiously, however, bounties have found application as a way to incentivize bug reporting. Why might this be?

My hypothesis is that the reason why bounties work for security but not for general development is that the security skillset is much more fungible from codebase to codebase. Someone who is experienced in security can approach a new codebase and look for bugs without needing to understand the codebase as a whole. The time invested in building a security skillset is amortized over many bug bounties, and so coasting from bug bounty to bug bounty can be a reasonable way to make money.

For general software development, however, the economics are different. When an engineer first joins a project, there is a fair amount of onboarding needed before that engineer can become an effective contributor. They must read through and understand the codebase, the conventions, the roadmap. This is difficult and non-fungible work which generates little to no direct value for the project, while also generating mostly project-specific knowledge which cannot be transferred to other projects. That engineer’s first contribution, which might be small from the project’s perspective, may actually represent a significant amount of work by the engineer. The longer an engineer works on a particular codebase, the more they are able to amortize the cost of their onboarding: their tenth contribution might be significantly more impactful than their first, while actually taking only a fraction of the time. For this reason, bounties are a poor means of incentivizing this type of work, as bounties do not allow the engineer (or the company) to amortize the cost of the onboarding (this is part of the reason why there is a norm to stay at a job for at least one year). For an engineer to sustain themselves via bounties, it will be more efficient for them to contribute exclusively to a single project (or perhaps a few), allowing them to better exploit their project-specific context. If this is the case, then this person is functionally a full-time employee.
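The amortization argument can be put into back-of-the-envelope arithmetic. The figures below (40 hours of onboarding, 4 hours per task) are invented purely for illustration, as is the function name.

```python
def hours_per_contribution(onboarding_hours, hours_per_task, n_tasks):
    """Average hours per contribution once onboarding is spread over n tasks."""
    return (onboarding_hours + hours_per_task * n_tasks) / n_tasks


# Hypothetical figures: 40 hours to learn the codebase, 4 hours per task.
for n in (1, 10, 100):
    print(n, hours_per_contribution(40, 4, n))
```

Under these made-up numbers a one-off bounty hunter pays the whole onboarding cost on a single task, while a long-term contributor approaches the bare cost of the task itself. A bounty priced for the second engineer will look unattractive to the first.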

If it is true that the incentives for bounties for general software development converge naturally to project-specificity, it may be better for projects to recruit a core of salaried engineers than to attempt to solicit labor exclusively via bounties.

Blockchain Governance

Fri, 28 Sep 2018 00:00:00 +0000

In which Hegel and Hofstadter help us set expectations concerning the problem of governance.

I. Naming things is hard

In his Pulitzer Prize-winning magnum opus, Gödel, Escher, Bach, cognitive scientist Douglas Hofstadter discusses the challenge of naming things.

Naming things is hard, Hofstadter explains, because every name given to a thing is inevitably simpler than the thing itself. A name is a summary, and names are chosen to highlight the most important aspects (according to… someone) of the thing being named. Of course, highlighting certain aspects of something necessarily means downplaying other aspects of that thing.

Naming things is hard, then, because there is always something a name leaves out, and that omission usually ends up being important at some point down the road. A “good” name is one which gets us most of the way there, most of the time – but no name is perfect. The same idea underlies the distinction between “the map and the territory”: a map is always a summary of the territory, a reduction of the dimensionality; a perfect map would be the size of the territory, and thus not useful. A “perfect” name would be as complicated as the thing itself, and thus impossible to speak!

It’s worth noting that naming gets harder as the thing becomes more complex. For example, a variable named time-delay is fairly complete, just like the title of “goalie” is a fairly good description of that role on a football team. But what about a job like “president”? Or the name of a person? When dealing with more complicated things, naming gets harder.

Hofstadter plays with this idea in one of his many humorous dialogues between Achilles and the Tortoise, in which the former attempts to solve a puzzle posed by the latter, with little success:

Achilles: Confound it all! Every time you give one of my answers a NAME, it seems to signal the imminent shattering of my hopes that that answer will satisfy you. Why don’t we just leave this Answer Schema nameless?

Tortoise: We can hardly do that, Achilles. We wouldn’t have any way to refer to it without a name. And besides, there is something inevitable and rather beautiful about this particular Answer Schema. It would be quite ungraceful to leave it nameless!

Achilles has an intuition for the answer – he somehow grasps the essence of the puzzle – but every time he actually has to transcend amorphous intuition and commit formally to an answer in words, that formalization contains some inherent limitation. This is the essential insight of Hofstadter’s work: anytime we attempt to formalize something complex (“give it a name”), that formalization necessarily leaves something out. This is because formalizations are fixed summaries of the things themselves, and can never capture things in their entirety.

In a certain quite profound sense, the daily headache of programmers naming variables is essentially connected to Russell and Whitehead’s valiant but vain effort to place mathematics on solid, formal foundations (it is highly suggestive that Whitehead spent the rest of his career developing a metaphysics of dynamism known as “Process Philosophy”).

The truth is that we are limited beings attempting to understand a universe more complex than our powers of comprehension. We can incrementally expand our horizons, and iteratively improve our understanding, but we will never quite reach the boundaries.

II. A guiding metaphor

Before moving on, let’s introduce three thinkers and use their work to develop our argument.

The first is a cognitive linguist out in California who thinks that our minds run on metaphor, and that these metaphors are the essential building blocks of our understanding.
The second is a mystical philosopher living out in the mountains somewhere, who thinks that all life forms can be understood as being recursively composed of more fundamental life forms, according to some general principles.
The third is a long-dead German philosopher, who thought that many of the world’s social phenomena could be understood as a dynamic tension of opposites.

A lot of people think this trio is totally bananas. But let’s play a game and for the next 90 seconds pretend that they were all basically right about everything.

I made you an image containing our important metaphor. Look at it for a minute and we’ll discuss.

What do we see here?

In the center, Da Vinci’s famous Vitruvian Man, here taken as an archetypal human. Off to the sides, we see an amoeba (minimally structured biomass) and a skeleton (highly structured but lifeless). The metaphor here is that humans come into being when the hardness (skeleton) is brought into a balanced tension (magenta arrow) with the softness (the organs and tissue). Too far in either extreme and the delicate complexity of the human cannot survive: a lifeless skeleton, or a puddle of goop.

This image provides us our first example of a dialectical tension, an essential component of Hegel’s philosophy. We see similar tensions elsewhere: in between liberals and conservatives, between process and outcome, between freedom and security, and between individual and community, among others. Each one of those names captures the extreme and ideal end of a spectrum, but life cannot be sustained at ideal extremes. It can be sustained only in the tension which comes from bringing the extremes into a dynamic but stable balance. A small nonprofit with pages of bureaucratic rules will get nothing done; likewise, a multinational firm without adequate procedures and policies will be mired in dysfunction.

Note also that the amoeba can live just fine without a skeleton; but that amoeba is a simple life form. The skeleton literally provides the backbone for supporting more complex types of life (there’s a reason why vertebrates are the headliners in the food chain). “Solving” a dialectical tension is not the end of the story: it simply sets the stage for the beginning of a new story – this is the essence of Wilber’s argument of increasing levels of complexity, each succeeding level of complexity made possible by the foundation provided by the stable resolution of tensions of the level below.

As an aside (indulge me my amateur evolutionary biology), it is interesting to consider the arthropods (beetles, etc.) as representing an alternative solution to the problem of “structural support”. Vertebrates put the structure on the inside, arthropods on the outside. Both solutions provided a stable structure, but had different long-term consequences: the former seems to have been better at scaling up (vertebrates are bigger), while the latter better at scaling across (there are many more species of arthropods).

III. No perfect rules

As mentioned, words can describe the static ends of the spectrum (“freedom”, “security”, “individual”, “community”), but it is much harder to describe the balance in the center, which can only ever be better and better approximated. And so, the struggle of naming variables is the same struggle of developing formal systems is the same struggle of reconciling simple extremes into a dynamic tension: the objects we pursue are always just beyond the limits of our comprehension.

Even our greatest and most universal “systems” assumed some context: Adam Smith’s initial description of free market capitalism assumed some degree of social cohesion (“fellow feeling” in his words). The various flavors of communist and socialist dream assume some baseline of material abundance to go around. The American republic was designed to withstand factionalism, up to a point. Even Bitcoin, the buzzing, burning Ozymandias of trustlessness, assumes some degree of distribution of computing power.

There is a lesson here for the dreamers of utopian dreams, a lesson taught by experience again and again. There is no system (skeleton) which is guaranteed to work from one end of the universe to the other, from now until the end of time. Every system comes alive only in a living context (the amoeba, the living matter) into which it is embedded; it is the interaction of the system and the context which yields a stable dynamism.

In a recent interview, Jaron Lanier and Glen Weyl discuss “living” technologies:

The most successful technologies in history are what you might call living technologies. They engage the people who use them in an ongoing conversation in which both the people and the technology change. Or you could use the word evolve if you like.

What we see here is the difference between plain rules and a set of rules in tension with a living force. The former is necessarily partial and incomplete, while the latter can be capable of amazing things.

Two additional observations:

Note that most of these successful fusions of rules and context developed incrementally (recall our biological metaphor). Even the American constitution, considered at the time a bold experiment, was an incremental response to the failures of the previous Articles of Confederation, and grounded in lessons learned from a study of classical democracies and confederacies. To develop an elaborate new mechanism and deploy it whole cloth is to court failure.

Note also that incorporating adversarial structures into governance can to some extent reduce the need for structure (or at least change the types of structure necessary). Since measuring and adapting to an outcome after the fact is much easier than anticipating the entire process which led to that outcome, competitive or relational mechanisms (think courts of law, games of chess, or even employee reviews) can play an important role in giving governance frameworks the ability to evolve over time. In the language of computer science, they measure at a higher level of abstraction.

IV. What to strive for

In 1984, statisticians Persi Diaconis and David Freedman released a landmark paper titled “Asymptotics of Graphical Projection Pursuit”, in which they prove that under suitable conditions, most low-dimensional projections (names) of high-dimensional data are approximately Gaussian, i.e. very lossy. The trick is to find the projections which are non-Gaussian, i.e. those which capture more of the original structure of the high-dimensional problem.
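To make the claim a little more tangible, here is a rough numerical sketch. It is not the paper's method: the data set (random corners of a 50-dimensional cube) and the use of excess kurtosis as a crude measure of "how Gaussian" a projection looks are assumptions chosen for illustration. A Gaussian has excess kurtosis near 0.

```python
import math
import random

def excess_kurtosis(xs):
    """Crude non-Gaussianity score: roughly 0 for Gaussian-looking data."""
    n = len(xs)
    mean = sum(xs) / n
    m2 = sum((x - mean) ** 2 for x in xs) / n
    m4 = sum((x - mean) ** 4 for x in xs) / n
    return m4 / m2 ** 2 - 3.0

rng = random.Random(0)
DIM, N = 50, 5000

# Illustrative data: each point is a corner of a 50-dimensional cube,
# so every coordinate is either -1 or +1.
data = [[rng.choice((-1.0, 1.0)) for _ in range(DIM)] for _ in range(N)]

# A "typical" projection: a random direction on the unit sphere.
direction = [rng.gauss(0.0, 1.0) for _ in range(DIM)]
norm = math.sqrt(sum(w * w for w in direction))
direction = [w / norm for w in direction]
random_view = [sum(w * x for w, x in zip(direction, point)) for point in data]

# A carefully chosen projection: a single coordinate axis.
axis_view = [point[0] for point in data]

print("random direction:", round(excess_kurtosis(random_view), 2))
print("coordinate axis: ", round(excess_kurtosis(axis_view), 2))
```

The random direction should score close to 0 (it looks like a bell curve and hides the cube), while the coordinate axis should score close to -2 (it shows the two-valued structure plainly).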

Put another way: some names are better than others, and naming something poorly can be worse than not naming it at all. Consider how the monotheistic deity is often referred to indirectly, via signifiers like “The Power(s)”, “The Name”, “My Lord”, etc, while the mystery surrounding the “true name” has become an alluring subject of esoteric study.

Bringing things back to earth, it will be instructive to look at how different crypto communities have been approaching governance on the ground. Current “on-chain” (i.e. explicit) approaches, such as those used by EOS and the original, infamous Ethereum DAO, suffer from meager participation among voters. On the other hand, the “off-chain” approaches taken by Bitcoin, Ethereum, et al. (in which the “mechanisms” are implicit, or “unspoken”) have achieved varying degrees of success, leading Vitalik to reverse his position in favor of off-chain governance.

The issue at hand is that the “rules” for on-chain governance are too crude to avoid being captured by the social context (see: collusion among EOS block producers). Given these crude tools (names), it is better to leave governance “unspoken”: it will necessarily be smaller and more exclusive (the amoeba), but this opacity is a guard against explicit capture by keeping the rules concealed inside of “culture”. The conclusion is not that off-chain governance is better (to conclude that would be to conclude that cliquish one-party rule is better), but rather that we still need to develop the language for describing governance mechanisms that can hold tension with the living force of the community.

What, then, to build? In short, better backbones, embedded within a committed community. Build new social infrastructure which solves a few current problems without reintroducing ones we’ve already solved – while current governance systems might seem far from perfect, don’t underestimate the amount of weight they successfully support. Try to make it robust against most types of failure. Don’t expect your thing to work forever; eventually the social context will change and the tension will cease to hold. But a few years is plenty, a few decades ideal. Buy us time so that the next time around we can see a little bit farther and do a little bit better.

Trie, Merkle, Patricia: A Blockchain Story

Wed, 04 Jul 2018 00:00:00 +0000

In which we tell the story of the Patricia Tree.

I. Introduction

Spend a few days around blockchain engineers and certain words will start to sound familiar. “Merkle Tree” and “Patricia Tree” in particular will start to seem… important somehow. You’ll eventually gather that these are quite essential parts of this whole blockchain thing… but why? What problems, exactly, do they solve?

You might do a quick search and stumble upon more than a few pieces of #content which explain these things, but retreat upon seeing the complicated-looking diagrams. Fear not, dear reader. Here we will explain these things, not with graphs, but with story.

Where to begin? The beginning, I suppose.

II. The Hash Table

In the beginning there was the computer, stretching infinitely in all directions. In fact, it’s hard to say that there even was the computer, since existence implies absence, and there was nothing that wasn’t the computer. So there was the computer, but the computer was inert. Nothing was happening. Boring. So the computer decided to create a programmer. Pop.

At first the programmer wasn’t very good, but over time she got better. There wasn’t much else going on at this time, so the programmer kept going, programming more and more things into the world. Animals and the like. After a while there were a lot of animals, which meant a lot of names to keep track of. This was a problem.

The programmer thought – “how can I keep track of all of the names of these animals? I want to be able to easily look up the name I gave to each species of animal. I could write all the names down in a big list, but eventually looking up the names will get really slow. If only I had the right data structure”.

And so the programmer created the hash table.

What is a hash table? For starters, it’s the basis of everything else that’s going to happen, so we’re going to talk about it for a minute. Essentially, a hash table is a type of “key-value store”. This means that for a given “key” (e.g. an animal species) you can save the “value” (e.g. the name of the animal). The main property of the hash table is that when you have a key, you can find the value fast, regardless of how many other items are in the hash table. In computer science terms, this is known as “constant-time lookup” and is very useful, which is why hash tables are “arguably the single most important data structure known to mankind”. Here’s an example:

>>> hashtable.set("dog", "fido")
>>> hashtable.get("dog")
"fido"

How do they work? To understand the hash table, we have to digress for a moment and talk about the hash function. Hash functions are a magical secret sauce which make some amazing things possible. Hash functions are the “cryptography” people talk about when they talk about blockchains. Hash functions are legit.

What is a hash function? Fortunately, hash functions are simple to understand. They are essentially tiny machines which take in some value, shake it around for a while (imagine a bartender shaking a cocktail), and output some other crazy-looking value (a big number). Their essential properties are:

For a given input (like “cat”), you will always get the same output (like “0x52763589”)
Two similar inputs (like “cat” and “car”) should not have similar outputs. Put another way, given an output, you should not be able to guess the input.
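A quick sketch of both properties, using SHA-256 from Python's standard library. The story never names a particular hash function, so SHA-256 and the helper names `hash_hex` and `differing_hex_digits` are assumptions made for illustration.

```python
import hashlib

def hash_hex(text: str) -> str:
    """Return the SHA-256 digest of text as a 64-character hex string."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()

def differing_hex_digits(a: str, b: str) -> int:
    """Count the positions at which two equal-length hex strings differ."""
    return sum(1 for x, y in zip(a, b) if x != y)

# Property 1: the same input always gives the same output.
print(hash_hex("cat") == hash_hex("cat"))

# Property 2: similar inputs give wildly different outputs.
spread = differing_hex_digits(hash_hex("cat"), hash_hex("car"))
print(spread, "of 64 hex digits differ between 'cat' and 'car'")
```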

This makes hash functions extremely useful because they let us handle sensitive information safely. Have you ever wondered how responsible websites keep your passwords safe? They don’t store your password, they store a hash of your password. When you type in your password to log in, they take the hash of your password and compare that against what they have in their database. But if a hacker ever gets in, all they’ll know is the gibberish hash of your password – useless since they have no way of figuring out what your actual password was.
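Here is a minimal sketch of that login check. One caveat the story glosses over: careful websites do not use a bare hash, but add a random "salt" per user and use a deliberately slow hash, so the sketch does the same with `hashlib.pbkdf2_hmac`. The function names and the iteration count are illustrative assumptions, not a recommendation.

```python
import hashlib
import hmac
import os

ITERATIONS = 100_000  # illustrative work factor only

def make_record(password: str):
    """What the website stores: a random salt and the salted hash, never the password."""
    salt = os.urandom(16)
    digest = hashlib.pbkdf2_hmac("sha256", password.encode("utf-8"), salt, ITERATIONS)
    return salt, digest

def check_password(attempt: str, record) -> bool:
    """Hash the attempt the same way and compare against the stored hash."""
    salt, stored = record
    digest = hashlib.pbkdf2_hmac("sha256", attempt.encode("utf-8"), salt, ITERATIONS)
    # compare_digest avoids leaking information through comparison timing
    return hmac.compare_digest(digest, stored)

record = make_record("hunter2")
print(check_password("hunter2", record))  # the right password matches
print(check_password("hunter3", record))  # a near miss does not
```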

The other thing they’re useful for is making hash tables. Why? Remember that the output of the hash function is a number. So when you hash the key, you essentially get a number telling you where to find the value. Imagine the hash table as a cabinet with 100 drawers. You hash("dog") and get 34 – you go to drawer 34 and get the name out. You hash("cat") and get 89 – you go to drawer 89. No need to look through a whole list – you skip directly to the finish line.

Pretty cool right? Yes it is.

And so the programmer had the hash table, and for a while things were good. Great, even! But it couldn’t last. Eventually, the brogrammer appeared.

At first things were good between them: they shared ideas, they shared code, they shared space. But eventually dark clouds appeared on the horizon. They wanted different things. The programmer was fine with a little randomness thrown into things, but the brogrammer wanted certainty, and he wasn’t happy with hash tables anymore. They’re “not deterministic”, he said.

What did he mean? To understand this point, we’ll have to talk a little more about hash functions and hash tables. The first thing to note is that the “range” of the hash function (the possible values the output can take) is very large – depending on the computer, it can take as many as 2^256, but more typically 2^32 or 2^64, possible values. 2^32 is 4,294,967,296 – and the others are much, much larger. Hash tables have to support this whole range, but we can’t make cabinets with that many drawers – there wouldn’t be room for anything else! So behind the scenes, we do a little trick: we take the hash value modulo the size of the cabinet. The modulo operation (%) is essentially division’s sidekick: it gives you the remainder. The nice thing about modulo is that the output (the remainder) is always at least 0 and strictly less than the base – so no matter how big the input, the output can only be so big.

So behind the scenes, we make a cabinet with 100 drawers, and when deciding where to put the name of "dog", we look in drawer hash("dog") % 100. Because the hash value is random, the remainder will still be random, just smaller. This works great, but there’s a big downside: two animals might end up in the same drawer! Let’s say that hash("dog") is 1,000,034 and hash("shark") is 200,034. Different values, but both will be 34 after the modulo. So we put them in the same drawer, and we have to look through the drawer to find the dog’s name. It’s still fast, since there’s usually only one or two names in the drawer.

So it’s fine in practice, but the brogrammer’s point is that the spot you put the name in is not 100% determined by the hash function you’re using. Two more factors come in: the size of the cabinet, and the other animals! The size of the cabinet matters because a cabinet with 10 drawers will put both 72 and 182 in the same place (2), but a cabinet with 100 drawers will put them in different places (72 and 82). Also, you can’t tell in advance if a name will be alone in a drawer, or if it will have to share with other names.
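The whole arrangement fits in a few lines. This is a minimal sketch, not how any particular language implements its hash tables: the `Cabinet` class and `stable_hash` helper are made-up names, and SHA-256 stands in for the story's hash function.

```python
import hashlib

def stable_hash(key: str) -> int:
    """Stand-in for the story's hash(): a very large integer derived from SHA-256."""
    return int.from_bytes(hashlib.sha256(key.encode("utf-8")).digest(), "big")

class Cabinet:
    """A hash table: a fixed number of drawers, each holding a short list of entries."""

    def __init__(self, drawers: int = 100):
        self.drawers = [[] for _ in range(drawers)]

    def _drawer(self, key: str):
        # The drawer depends on the hash AND on the size of the cabinet.
        return self.drawers[stable_hash(key) % len(self.drawers)]

    def set(self, key: str, value: str) -> None:
        drawer = self._drawer(key)
        for i, (existing_key, _) in enumerate(drawer):
            if existing_key == key:
                drawer[i] = (key, value)  # overwrite an existing entry
                return
        drawer.append((key, value))       # may share the drawer with other keys

    def get(self, key: str) -> str:
        # Look through the (usually very short) drawer for our key.
        for existing_key, value in self._drawer(key):
            if existing_key == key:
                return value
        raise KeyError(key)

cabinet = Cabinet(drawers=4)  # deliberately tiny, so drawers must be shared
for animal, name in [("dog", "fido"), ("cat", "tom"), ("shark", "bruce"),
                     ("owl", "hedwig"), ("horse", "ed"), ("pig", "babe")]:
    cabinet.set(animal, name)

print(cabinet.get("dog"))
print(72 % 10, 182 % 10)     # same drawer in a 10-drawer cabinet
print(72 % 100, 182 % 100)   # different drawers in a 100-drawer cabinet
```

With six animals and four drawers, at least one drawer has to hold more than one name, which is exactly the situation the brogrammer was unhappy about.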

The brogrammer wasn’t happy about this, but dealt with his feelings in a healthy way and went off into the mountains for a few weeks to think about alternatives. “A place for everything, and everything in its place,” he kept repeating in his head. When he eventually came down, he had a new idea.

III. The Trie

The problem, the brogrammer had realized, was that we were trying to put everything into a single huge cabinet, which could never be big enough. The solution, the brogrammer said, was to use a sequence of smaller cabinets. The first cabinet would give you the address of the second cabinet, the second cabinet the third, and eventually you would get to the cabinet which had the name you were looking for. You would need more cabinets (but not that many, as it turns out), and each cabinet could be quite small (maybe 16 drawers, or even 2!). Here’s an example, using an 8-drawer, 3-cabinet system (which gives us 8^3 = 512 drawers total):

>>> hash3("dog")
0x237

### The first cabinet always sits at the same, well-known location
>>> firstCabinet = trie.find(firstCabinetLocation)

>>> secondCabinetLocation = firstCabinet.drawer(7).contents
>>> secondCabinet = trie.find(secondCabinetLocation)

>>> thirdCabinetLocation = secondCabinet.drawer(3).contents
>>> thirdCabinet = trie.find(thirdCabinetLocation)

>>> thirdCabinet.drawer(2).contents
"fido"

Note that each digit of the hash tells us which drawer to open (here we read 0x237 from right to left: 7, then 3, then 2), and each digit means one more cabinet. The brogrammer called this system a “Trie” (as in retrieve), and said that the beauty of it was that you didn’t need to build all the cabinets at once – you could start out with just one cabinet, and only build new cabinets the first time you needed them, wherever there was room. And while it means a little more work (opening more drawers), every name will have a dedicated drawer, always in the same place. And the brogrammer knew that no one would ever need all the drawers, and so most of the cabinets would never need to be built (although you can’t rule it out).
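The lazy cabinet-building can be sketched as follows. This is an illustration under assumptions: the `Trie` class, its `cabinets_built` counter, and the choice to pass the path as a tuple of drawer numbers (in the order the drawers are opened) are all made up for this example.

```python
class Trie:
    """Fixed-depth trie: every cabinet has `width` drawers, built only when needed."""

    def __init__(self, width: int = 8, depth: int = 3):
        self.width = width
        self.depth = depth
        self.root = [None] * width  # the first cabinet always exists
        self.cabinets_built = 1

    def _check(self, path) -> None:
        if len(path) != self.depth or any(not 0 <= d < self.width for d in path):
            raise ValueError("path must be %d drawer numbers below %d"
                             % (self.depth, self.width))

    def set(self, path, value) -> None:
        self._check(path)
        cabinet = self.root
        for drawer in path[:-1]:
            if cabinet[drawer] is None:
                # First visit: build the next cabinet on demand.
                cabinet[drawer] = [None] * self.width
                self.cabinets_built += 1
            cabinet = cabinet[drawer]
        cabinet[path[-1]] = value

    def get(self, path):
        self._check(path)
        cabinet = self.root
        for drawer in path[:-1]:
            cabinet = cabinet[drawer]
            if cabinet is None:
                raise KeyError(path)
        value = cabinet[path[-1]]
        if value is None:
            raise KeyError(path)
        return value

trie = Trie()
trie.set((7, 3, 2), "fido")   # the drawers opened for "dog" in the story
print(trie.get((7, 3, 2)))
print(trie.cabinets_built)    # only the cabinets on dog's path exist so far
```

Every key has exactly one possible home, regardless of what else is stored, which is the determinism the brogrammer was after.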

The programmer looked at the Trie and agreed it was a clever idea (although it involved quite a bit more walking), and there was harmony between them.

Years passed, and a new people started to appear in the nearby valley. Curious, the programmer and the brogrammer journeyed over to see these people and learn about their culture. They found the people intriguing, with a curious religion revolving around the worship of a particular arrangement of carved granite blocks.

The people were quite friendly, and after meeting with some of their priests, the programmer and brogrammer learned that these people had once been warlike, but after years of conflict developed a new system of “trust” which allowed them to co-exist in remarkable peace and prosperity. The computer, they said, was only as good as the programmer, and humans could not be trusted to program alone. These people knew of the hash table and the trie, but they had found that people would cheat: sometimes people would come in the night and change the names in the drawers; there was no way to prove that the names in the drawers were the right names. For a while these people had a warrior class who guarded the cabinets, but found that this only led to more conflict.

Eventually a number of their most skilled artisans developed the technique of carving blocks of granite; these blocks, they realized, were very difficult to carve, and so things carved into these blocks could be trusted in a way that the names in the cabinets could not. It was infeasible to carve every name into the block, however, and to carve new blocks when the names changed. What they needed, they said, was some way to carve a signature of the names onto the block, such that if any one name changed, the signature would change; but if the names were the same, the signature would always be the same. Eventually, one of their scientists, Ralph, developed a solution: the Merkle tree.

IV. The Merkle Tree

The Merkle tree behaves much like a Trie, but with a new rule: the drawers of each cabinet will not contain the location of the next cabinet, but rather the hash of all of the contents of the next cabinet. Separately, we keep track of the location of each cabinet (using, of all things, a simple hash table):

>>> hash3("dog")
0x237

### The first cabinet's hash is the one value we keep track of outside the tree
>>> firstCabinetLocation = hashtable.get(firstCabinetHash)
>>> firstCabinet = trie.find(firstCabinetLocation)

>>> secondCabinetHash = firstCabinet.drawer(7).contents
>>> secondCabinetLocation = hashtable.get(secondCabinetHash)
>>> secondCabinet = trie.find(secondCabinetLocation)

>>> thirdCabinetHash = secondCabinet.drawer(3).contents
>>> thirdCabinetLocation = hashtable.get(thirdCabinetHash)
>>> thirdCabinet = trie.find(thirdCabinetLocation)

>>> thirdCabinet.drawer(2).contents
"fido"

Remember our hash function? Earlier we talked about hashing simple values like “dog” and “cat”, but in truth you can hash anything, including other hashes or sets of hashes. What Ralph realized was that by keeping the hashes in the cabinets, you can create a “hash trail” which will change whenever any value changes (remember how websites store your passwords? Same idea). Here is how you update a value:

>>> hash3("dog")
0x237

### Find cabinet same as before

>>> thirdCabinet.drawer(2).contents = "rover"

### But then you start working backwards...

>>> thirdCabinetHash = hash3(thirdCabinet.drawers)
>>> hashtable.set(thirdCabinetHash, thirdCabinetLocation)

>>> secondCabinet.drawer(3).contents = thirdCabinetHash
>>> secondCabinetHash = hash3(secondCabinet.drawers)
>>> hashtable.set(secondCabinetHash, secondCabinetLocation)

>>> firstCabinet.drawer(7).contents = secondCabinetHash
>>> firstCabinetHash = hash3(firstCabinet.drawers)
>>> hashtable.set(firstCabinetHash, firstCabinetLocation)

>>> firstCabinetHash
0x375

Now the final value, 0x375, is a “fingerprint” of the entire Merkle tree. You can save this fingerprint (or engrave it into a granite block), and know that if anyone changes any of the names in the drawers, the process of making the hashes will give a different result – you’ll know something has changed. Notice that this adds more steps compared to a simple Trie: you need to have a separate hashtable to keep track of locations. But what you get is security.
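Putting lookup and update together, a toy version might look like the sketch below. It rests on several assumptions: SHA-256 stands in for `hash3`, a Python dict plays the role of the separate hashtable (here it maps each hash straight to the cabinet rather than to a location), and the `MerkleTrie` class is a made-up name. Real blockchain tries are considerably more involved.

```python
import hashlib

WIDTH, DEPTH = 8, 3
EMPTY = (None,) * WIDTH  # a cabinet with nothing in its drawers

def cabinet_hash(cabinet) -> str:
    """Hash the full contents of a cabinet (all of its drawers)."""
    return hashlib.sha256(repr(tuple(cabinet)).encode("utf-8")).hexdigest()

class MerkleTrie:
    def __init__(self):
        self.store = {cabinet_hash(EMPTY): EMPTY}  # hash -> cabinet
        self.root = cabinet_hash(EMPTY)            # the "fingerprint"

    def get(self, path):
        cabinet = self.store[self.root]
        for drawer in path[:-1]:
            child_hash = cabinet[drawer]
            if child_hash is None:
                raise KeyError(path)
            cabinet = self.store[child_hash]
        value = cabinet[path[-1]]
        if value is None:
            raise KeyError(path)
        return value

    def set(self, path, value) -> None:
        # Walk down, remembering every cabinet we pass through.
        trail = [self.store[self.root]]
        for drawer in path[:-1]:
            child_hash = trail[-1][drawer]
            trail.append(self.store[child_hash] if child_hash is not None else EMPTY)
        # Work backwards: write the value, then re-hash each cabinet up to the root.
        entry = value
        for cabinet, drawer in zip(reversed(trail), reversed(path)):
            updated = cabinet[:drawer] + (entry,) + cabinet[drawer + 1:]
            entry = cabinet_hash(updated)
            self.store[entry] = updated
        self.root = entry

tree = MerkleTrie()
tree.set((7, 3, 2), "fido")
fingerprint_fido = tree.root
tree.set((7, 3, 2), "rover")
fingerprint_rover = tree.root
print(fingerprint_fido != fingerprint_rover)  # any change shows up at the root
```

Setting the drawer back to "fido" brings the original fingerprint back, since the fingerprint depends only on the contents.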

The programmer and the brogrammer walked up to get a closer look at the granite blocks, and to their surprise, on them they saw engraved a series of hashes! 0x736, 0x264, 0x123, and so on, with 0x542 being the most recent. They were amazed! Nearby, they noticed some activity: one of this peculiar tribe wanted to prove that he had purchased a horse from another. He brought forward the name of the horse and his own name, called trie.set(horse, name) and through an elaborate ritual showed that his name, hashed with certain other names, and then again with certain other names… voila! He arrived at 0x542, and thus all agreed that the horse was his.
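The ritual itself can be sketched in a few lines. The assumptions here: SHA-256 stands in for the tribe's hash function, the buyer presents the three cabinets along the horse's path, and the names `verify`, `cabinet_hash`, and `carved_root` are invented for this example.

```python
import hashlib

def cabinet_hash(cabinet) -> str:
    """Hash the full contents of a cabinet (all of its drawers)."""
    return hashlib.sha256(repr(tuple(cabinet)).encode("utf-8")).hexdigest()

def verify(carved_root: str, path, value, cabinets) -> bool:
    """Check a claimed value against the hash carved into the granite.

    `cabinets` are the cabinets along the path, first cabinet first.
    """
    if len(cabinets) != len(path):
        return False
    expected = value
    # Start at the last cabinet and work back toward the first.
    for cabinet, drawer in zip(reversed(cabinets), reversed(path)):
        if cabinet[drawer] != expected:
            return False
        expected = cabinet_hash(cabinet)
    return expected == carved_root

def empty_cabinet():
    return [None] * 8

# Build a tiny tree by hand: the horse's path is 7, 3, 2 and its owner is "alice".
third = empty_cabinet()
third[2] = "alice"
second = empty_cabinet()
second[3] = cabinet_hash(third)
first = empty_cabinet()
first[7] = cabinet_hash(second)
carved_root = cabinet_hash(first)

print(verify(carved_root, (7, 3, 2), "alice", [first, second, third]))

# A forger who swaps in his own name cannot reach the carved hash.
forged_third = empty_cabinet()
forged_third[2] = "mallory"
print(verify(carved_root, (7, 3, 2), "mallory", [first, second, forged_third]))
```

The verifier only needs the carved hash and the handful of cabinets on one path, not the contents of every drawer in the valley.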

What a remarkable society, the programmer and brogrammer agreed. There was something nagging at the programmer, though. This was a small tribe – only 512 members. As they grow, they will need a new hash function with a larger range – thousands, millions, billions. And so updating and verifying the values in the Merkle tree will become more and more costly – from three cabinets to five, to ten, to sixty and beyond! And for what? Most of these drawers will be empty. It seems like an expensive system, slow and costly. Surely there must be a better way? If only there was a Practical Algorithm To Retrieve Information Coded In Alphanumeric…

V. The Patricia Tree

To gather their thoughts, the programmer and the brogrammer took a walk into the hills above the valley. “There must be some way to optimize this tree!” they thought to themselves. The brogrammer suggested they look at a few random hashes, to build some intuition:

>>> hash8("cat")
0x14350235

>>> hash8("dog")
0x14350762

Then the brogrammer got excited – he noticed that both of these hashes happened to start with the same digits: 14350. With just these two entries, getting to the final drawer should only need two cabinets: one for 14350, and one for whatever was left: 235 or 762. This would be much faster than using eight cabinets. You could always add more cabinets later, but why make more than you need? On each drawer we tape a little slip of paper, where we write down the common prefix for that drawer. Finally, the first cabinet is actually just a single drawer.

Looking up values would go like this:

>>> hash8("dog")
0x14350762

>>> firstDrawerLocation = hashtable.get(firstDrawerHash)
>>> firstDrawer = trie.find(firstDrawerLocation)
>>> split(14350762, firstDrawer.commonPrefix)
(14350, 762)

>>> secondCabinetHash = firstDrawer.contents
>>> secondCabinetLocation = hashtable.get(secondCabinetHash)
>>> secondCabinet = trie.find(secondCabinetLocation)
>>> secondDrawer = secondCabinet.drawer(7)
>>> split(62, secondDrawer.commonPrefix)
(62,)

>>> secondDrawer.contents
"fido"

The programmer got excited – she felt pretty good about this. It would make the algorithm a little trickier, to make sure that cabinets were created appropriately and that common prefixes are kept up-to-date, but nothing they couldn’t figure out. A little more work at the beginning to set this all up would save the valley people a lot of time over the long run.
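A sketch of the "trickier" bookkeeping follows. It covers only the prefix-sharing idea: the hashes-of-cabinets from the Merkle tree are left out to keep it short, and the layout differs slightly from the story (each slip of paper here holds the whole remaining run of digits). The class and function names are assumptions, and keys are written as strings of digits.

```python
class Node:
    def __init__(self):
        self.children = {}  # first digit of a label -> (label, child Node)
        self.value = None

def common_prefix_len(a: str, b: str) -> int:
    n = 0
    while n < len(a) and n < len(b) and a[n] == b[n]:
        n += 1
    return n

class PatriciaTree:
    def __init__(self):
        self.root = Node()

    def set(self, key: str, value) -> None:
        node = self.root
        while key:
            entry = node.children.get(key[0])
            if entry is None:
                # Nothing shares a prefix here: one drawer holds the whole remainder.
                leaf = Node()
                leaf.value = value
                node.children[key[0]] = (key, leaf)
                return
            label, child = entry
            n = common_prefix_len(key, label)
            if n < len(label):
                # Split: keep the shared prefix, push the old remainder one level down.
                middle = Node()
                middle.children[label[n]] = (label[n:], child)
                node.children[key[0]] = (label[:n], middle)
                child = middle
            node, key = child, key[n:]
        node.value = value

    def get(self, key: str):
        node, rest = self.root, key
        while rest:
            entry = node.children.get(rest[0])
            if entry is None or not rest.startswith(entry[0]):
                raise KeyError(key)
            label, node = entry
            rest = rest[len(label):]
        if node.value is None:
            raise KeyError(key)
        return node.value

tree = PatriciaTree()
tree.set("14350235", "tom")    # hash8("cat") in the story
tree.set("14350762", "fido")   # hash8("dog") in the story
print(tree.get("14350762"))
print(tree.root.children["1"][0])  # the shared prefix written on the first drawer
```

With only these two entries, a lookup passes through two steps (the shared prefix, then the remainder) instead of eight.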

The pair sat down and worked out the details of this new system, which they called the “Patricia Tree”. Satisfied, they descended to the valley and presented their work to the people there. The people were joyous; the slow Merkle tree had been a drag on their society. With the Patricia tree, they hoped, they would be able to advance their arts, sciences, and industry faster.

Satisfied, the programmer and the brogrammer left the valley. As they crested the ridge and began to make their way through the surrounding grassland, they heard a soft humming sound. Looking up, they saw a flying car sailing off into the horizon.

VI. Summary

What did we learn from this completely stylistically original story?

First, that hash tables, tries, Merkle trees, and Patricia trees all do essentially the same thing: they let you map keys to values. While there are differences between them, this is essentially what they do.

Second, in computer science, nothing is free (but some things are cheap). Everything has a trade-off. Hash tables are fast, but have some randomness. Tries are fully deterministic, but slower. Merkle trees have nice security properties, but use a more complicated algorithm and are slower to update. Finally, Patricia trees are faster than Tries and Merkle trees, but require an even more complicated algorithm.

Third, Patricia trees are useful for blockchains because they let you “prove” a potentially large amount of data is correct, without having to store all of that data. This is very convenient: you can have a big tree with a lot of data (such as all of the transactions in the last 24 hours), but you only have to store a few numbers (like 0x323757382) on the actual blockchain. You can keep the rest of the data on a regular database somewhere and know that no one will be able to tamper with it and get away with it. Note that here the blockchain is only part of the system: it is co-dependent on other data stores to function.

Fourth, the hash function is the magical machine that makes all of this possible. The design and implementation of hash functions have been the ongoing work of computer scientists for decades, and they are very hard to get right. You should take a moment and appreciate the years of work that made this magical technology possible.