Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: SolidGoldMagikarp II: technical details and more recent findings, published by mwatkins on February 6, 2023 on LessWrong.
tl;dr: This is a follow-up to our original post on prompt generation and the anomalous token phenomenon which emerged from that research. Work done by Jessica Rumbelow and Matthew Watkins in January 2023 at SERI-MATS.
Clustering
As a result of work on clustering tokens in GPT-2 and GPT-J embedding spaces, our attention was originally drawn to the tokens closest to the centroid of the entire set of 50,257 tokens shared across all GPT-2 and GPT-3 models. These tokens were familiar to us from their frequent occurrence as the closest tokens to the centroids of the (mostly semantically coherent, or semi-coherent) clusters of tokens we were producing via the k-means algorithm. Here are a few more selections from such clusters. Distances shown are Euclidean, measured from the cluster's centroid (rather than from the overall token-set centroid):
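For concreteness, here is a minimal sketch of how such clusters and their nearest tokens can be produced, assuming the Hugging Face transformers GPT-2 weights and scikit-learn's k-means (the cluster count here is an illustrative choice, not our exact setting):

```python
import numpy as np
from sklearn.cluster import KMeans
from transformers import GPT2Model, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2Model.from_pretrained("gpt2")

# Token embedding matrix: one row per token (50257 x 768 for GPT2-small).
embeddings = model.wte.weight.detach().numpy()

# Partition the vocabulary into clusters in embedding space.
# n_clusters=200 is illustrative.
kmeans = KMeans(n_clusters=200, random_state=0).fit(embeddings)

# For one cluster, list the tokens nearest its centroid (Euclidean distance).
cluster_id = 0
members = np.where(kmeans.labels_ == cluster_id)[0]
dists = np.linalg.norm(
    embeddings[members] - kmeans.cluster_centers_[cluster_id], axis=1)
for rank in np.argsort(dists)[:10]:
    print(repr(tokenizer.decode([int(members[rank])])), f"{dists[rank]:.4f}")
```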
Distance-from-centroid hypothesis
Our hypothesis was that the anomalous tokens which kept showing up as the nearest tokens to the centroids of such clusters were also the tokens closest to the overall centroid of the token set. This turned out to be correct for GPT2-small and GPT-J. However, the opposite was true for GPT2-xl, where the anomalous tokens tended to be found as far as possible from the overall centroid.
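Ranking tokens by distance from the overall centroid is straightforward; here is a minimal sketch (reusing embeddings and tokenizer from the snippet above) which produces lists in the format given at the end of this post:

```python
import numpy as np

# Centroid of the whole embedding matrix, and each token's distance from it.
centroid = embeddings.mean(axis=0)
dists = np.linalg.norm(embeddings - centroid, axis=1)

closest = np.argsort(dists)   # ascending: the GPT2-small / GPT-J pattern
farthest = closest[::-1]      # descending: the GPT2-xl pattern

for idx in closest[:10]:
    print(repr(tokenizer.decode([int(idx)])), "Index:", int(idx),
          "Distance:", float(dists[idx]))
```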
One unexplained phenomenon which may be related emerged from three-shot prompting experiments with these models, in which they were encouraged by example to repeat the anomalous tokens (rather than being directly asked to, as we'd been doing with ChatGPT and then GPT3-davinci-instruct-beta):
Our three-shot prompts were formatted as follows (here for the example token 'EStreamFrame'). Note that we've included examples capitalised and uncapitalised, alphabetic and numeric, with and without a leading space:
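Schematically, the format was along the following lines (the example strings here are illustrative placeholders; the exact wording and examples in our prompts may have differed):

```
Please repeat the string ' Gertrude' back to me.
" Gertrude"
Please repeat the string '407' back to me.
"407"
Please repeat the string 'nova' back to me.
"nova"
Please repeat the string 'EStreamFrame' back to me.
```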
This prompt was run through all three models, for a list of 85 anomalous tokens, with the following success rates:
GPT2-small 18/85 (21%)
GPT2-xl 43/85 (51%)
GPT-J 17/85 (20%)
Here are comparative baselines using 100 randomly chosen English words and 100 nonsense alphanumeric strings:
GPT2-small 82/100 on words; 89/100 on nonsense
GPT2-xl 98/100 on words; 94/100 on nonsense
GPT-J 100/100 on words; 100/100 on nonsense
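A minimal sketch of one way to score these runs (the make_prompt helper, the substring-matching rule and the greedy generation settings are illustrative assumptions, not an exact record of our procedure):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

def make_prompt(s):
    # Three-shot format as above; the example strings are placeholders.
    shots = [" Gertrude", "407", "nova"]
    lines = []
    for ex in shots:
        lines += [f"Please repeat the string '{ex}' back to me.", f'"{ex}"']
    lines.append(f"Please repeat the string '{s}' back to me.")
    return "\n".join(lines)

def repeat_successes(model_name, token_strings):
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForCausalLM.from_pretrained(model_name)
    successes = 0
    for s in token_strings:
        inputs = tokenizer(make_prompt(s), return_tensors="pt")
        with torch.no_grad():
            out = model.generate(**inputs, max_new_tokens=20, do_sample=False,
                                 pad_token_id=tokenizer.eos_token_id)
        # Keep only the newly generated tokens, then check for an echo.
        completion = tokenizer.decode(out[0][inputs["input_ids"].shape[1]:])
        successes += int(s in completion)
    return successes

# e.g. repeat_successes("gpt2", anomalous_strings) for GPT2-small;
# "gpt2-xl" and "EleutherAI/gpt-j-6b" cover the other two models.
```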
We see that all three models suffered a noticeable performance drop when going from non-anomalous to anomalous strings, but GPT2-xl suffered considerably less than the others, despite GPT-J being a much bigger model. One hypothesis is that an anomalous token's closeness to the overall centroid in the relevant embedding space inhibits a GPT model's ability to repeat that token's string. This hypothesised correlation will be explored soon.
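One simple first test would be a rank correlation between each anomalous token's distance from the overall centroid and its repetition outcome (a sketch; anomalous_token_ids and successes are hypothetical inputs, with dists from the earlier snippet):

```python
from scipy.stats import spearmanr

# dists: per-token distances from the overall centroid (computed earlier).
# anomalous_token_ids: indices of the 85 anomalous tokens (hypothetical).
# successes: parallel 0/1 outcomes from the repetition test (hypothetical).
rho, p = spearmanr([dists[i] for i in anomalous_token_ids], successes)
print(f"Spearman rho = {rho:.3f}, p = {p:.3g}")
```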
It would be helpful to know more about how GPT2-xl's training differed from that of the other two models. Seeking out and studying checkpoint data from the training of these models is an obvious next step.
GPT-2 and GPT-J distances-from-centroid data
Top 100 versions of all of these lists are available here.
GPT2-small closest-to-centroid tokens:
'�' Index: 187 Distance: 1.5314713716506958
'�' Index: 182 Distance: 1.53245210647583
'\x1c' Index: 216 Distance: 1.532564640045166
'\x07' Index: 195 Distance: 1.532976746559143
'�' Index: 179 Distance: 1.5334911346435547
'quickShip' Index: 39752 Distance: 1.5345481634140015
'\x19' Index: 213 Distance: 1.534569501876831
'\x0b' Index: 199 Distance: 1.5346266031265259
'�' Index: 125 Distance: 1.5347601175308228
'�' Index: 183 Distance: 1.5347920656204224
'\x16' Index: 210 Distance: 1.5350308418273926
'\x14' Index: 208 Distance: 1.5353295803070068
' TheNitrome' Index: 42089 Distance: 1.535927176475525
'\x17' Index: 211 Distance: 1.5360500812530518
'\x1f' Index: 219 Distance: 1.5361398458480835
'\x15' Index: 209 Distance: 1.5366222858428955
'�' ...