

We present a heuristic for correcting for one kind of bias (status quo bias), which we suggest affects many of our judgments about the consequences of modifying human nature. The Reversal Test: Eliminating Status Quo Bias in Applied Ethics Strategic Implications of Openness in AI DevelopmentĪn analysis of the global desirability of different forms of openness (including source code, science, data, safety techniques, capabilities, and goals). Humans are relatively expensive but absolutely cheap. This paper makes a load of specific claims that begin to stake out a position. Hail Mary, Value Porosity, and Utility Diversification, working paperĮthics & Policy Propositions Concerning Digital Minds and SocietyĪIs with moral status and political rights? We'll need a modus vivendi, and it’s becoming urgent to figure out the parameters for that.

Strategic Implications of Openness in AI Development, in Global Policy.Also German book (Suhrkamp, 2020) adaptation in Aeon The Vulnerable World Hypothesis, in Global Policy.Sharing the World with Digital Minds, w/ Carl Shulman, in edited volume (Oxford University Press, 2021).Propositions Concerning Digital Minds and Society, w/ Carl Shulman, working paper.New Yorker profile (now a bit obsolete), Bio, CV, Contact. Sign up for newsletter to receive (rare) updates.įor more on me, see e.g. Working on another paper with some colleagues that will focus on some technical challenges in detecting internal states of potential moral significance in large transformer models and other ML systems. Have also recently been doing some thinking on metaethics, and have released two papers on the ethics of (future) digital minds. Though sometimes I have the impression that the world is a conspiracy to distract us from what's important.

Hunkering down to focus on completing a book project (not quite announcement-ready yet).
