OpenAI’s Recent Announcement: What Went Wrong, and How It Could Be Better

Share It

Earlier this month, OpenAI revealed an impressive language model that can generate paragraphs of believable text. It declined to fully release their research “due to concerns about malicious applications of the technology.” OpenAI released a much smaller model and technical paper, but not the fully-trained model, training code, or full dataset, citing concerns that bad actors could use the model to fuel turbocharged disinformation campaigns.

Whether or not OpenAI’s decision to withhold most of their model was correct, their “release strategy” could have been much better.

The risks and dangers of models that can automate the production of convincing, low-cost, realistic text is an important debate to bring forward. But the risks attached to hinting about dangers without backing them up with detailed analysis and while refusing public or academic access, need to be considered also. OpenAI has appeared to consider one set of risks, without fully considering or justifying the risks they have taken in the opposite direction. Here are the concerns we have, and how OpenAI and other institutions should handle similar situations in the future.

Some Background: What Is Language Modeling and Why Is It Important?

OpenAI’s new language model does surprisingly well at a number of tasks, most notably generating text from a couple of seed sentences. The information they have released shows that the research could be a leap forward in language modeling.

Language modeling is an area of contemporary machine learning research where a statistical model is trained to assign probabilities to sentences or paragraphs of text. This model can serve as a building block for a variety of language-related machine learning research tasks. Beyond generating coherent paragraphs of text, language models can also help answer questions, summarize passages, and translate text. Recent work from the past year has massively improved the state-of-the-art for language modeling, most of which has been fully open-sourced.

Researchers Should Encourage a More Nuanced Dialogue Around AI

OpenAI gave access to the the full model to a small number of journalists before releasing their research. On release, the media published articles with titles like, “Brace for the robot apocalypse.”

The amount of misinformation now spreading about the current capabilities of state-of-the-art language modeling are reminiscent of past media hype around Facebook’s AI research. This points to the necessity for deliberate education around AI capabilities, not existential fear-mongering about vague threats.

Unfortunately, thanks to OpenAI’s decision to give model access to journalists instead of sharing the model with the public—or even merely hosting a discussion between journalists and other experts both inside and outside the AI research community—it’s hard to say just how advanced or scary OpenAI’s model really is. Despite the conversations they may have had internally, all OpenAI published was a bullet-point list of general “risks” associated with its model, failing to provide any semblance of a rigorous risk assessment. This blocks independent researchers from studying the risks identified and trying to identify ways to mitigate those risks. This is particularly problematic because the risks OpenAI pointed to are all possible for powerful actors to recreate.

Release the Full Model to Academics

Since releasing trained language models bolsters academic work on other downstream machine learning tasks, like the ones mentioned above, most work in this field is heavily open-sourced. BERT, Google’s language model and a predecessor to OpenAI’s GPT-2, was published and fully open-sourced less than half a year ago. Since then, it has already generated (and continues to generate) massive waves of downstream language understanding research. OpenAI broke this trend of defaulting to openness by questioning the societal repercussions of releasing fully-trained language models.

And when an otherwise respected research entity like OpenAI make a unilateral decision to go against the trend of full release, it endangers the open publication norms that currently prevail in language understanding research.

In the AI space, open publication norms are all the more important, given that research capabilities are already so highly centralized. Many frontiers of AI research require massive amounts of computing power and data to train and test, and OpenAI’s GPT-2 language model is no different. In this sort of ecosystem, a lot of groundbreaking AI research comes from private research labs as well as publicly funded sources. Private institutions are already disincentivized from releasing large chunks of their research, like datasets and code that may contain proprietary information. An open research culture, however, provides a social incentive for private entities to publish as much as possible.

OpenAI’s decision threatens that open publication culture. That means large privatized AI research centers may start thinking this is an acceptable thing to do. We could start to see fewer publications—and a world that resembles an arms race between corporate giants rather than a collaborative research forum.

To minimize this sort of impact, OpenAI should at least offer full model access to academic researchers and continue to encourage a culture of peer-to-peer knowledge sharing in the AI research community.

Stop Using “Responsible Disclosure” Analogies to Justify Withholding Research

In defending its decision to withhold its fully-trained model, training code, and full dataset, OpenAI characterized its decision as an “experiment in responsible disclosure.” “Responsible disclosure” refers to a process in the information security industry where security researchers withhold vulnerabilities from the public until they are patched. But responsible (or coordinated) disclosure means eventual public disclosure: the goal is always for the knowledge to become public in a reasonable amount of time.

OpenAI’s decision here has nothing to do with “responsible disclosure”—this is a misplaced analogy and misunderstands the purpose of the term of art. Rather, they use the term here in order to justify withholding research from the public, with no date or plan for final release.

The analogy is broken in so many ways as to make it fundamentally useless. In the case of generating “believable fake news,” there is no “vendor” that OpenAI can approach to mitigate the problem. The “vulnerability” and risk is societal. It is us, and society as a whole, who must be informed and take steps to develop ways to detect or otherwise manage the consequences of convincing computer-generated text. Even if this research were as dangerous as OpenAI suggests, there is no finite period of time for which failing to disclosure would lessen the risks; in fact, the risks would only increase as powerful institutions begin to reproduce their research, and widespread understanding of the risks would be stymied.

Create a Discussion Outside OpenAI Around Release Decisions

This incident points to the need for consensus-building among independent researchers—from both public institutions and private corporations and ranging in expertise—before such decisions are made.

From this demonstration, we’re not convinced that a single research organization is capable of performing objective and comprehensive evaluations of the ethical and policy impacts of its own work. OpenAI’s post indicates they were hesitant to take on that responsibility, and they defaulted to locking their model down, rather than defaulting to openness. Until the AI research community can come to a consensus on process for such decisions in the future, we hope that OpenAI and other research organizations will step back—and consult a broader quorum of policy experts and researchers—before making such dramatic “releases” in the future.

Related Updates

Deeplinks Blog by Mario Trujillo, Jacob Hoffman-Andrews, Tori Noble | December 2, 2025

AI Chatbot Companies Should Protect Your Conversations From Bulk Surveillance

AI companies have a responsibility to their users to make sure the warrant requirement is strictly followed, to resist unlawful bulk surveillance requests, and to be transparent with their users about the number of government requests they receive.

Deeplinks Blog by Hayley Tsukayama | November 20, 2025

The Trump Administration’s Order on AI Is Deeply Misguided

Widespread news reports indicate that President Donald Trump’s administration has prepared an executive order to punish states that have passed laws attempting to address harms from artificial intelligence (AI) systems. This approach is deeply misguided.

Deeplinks Blog by Molly Buckley | November 14, 2025

A Surveillance Mandate Disguised As Child Safety: Why the GUARD Act Won't Keep Us Safe

A new bill sponsored by Sen. Hawley (R-MO), Sen. Blumenthal (D-CT), Sen. Britt (R-AL), Sen. Warner (D-VA), and Sen. Murphy (D-CT) would require AI chatbots to verify all users’ ages, prohibit minors from using AI tools, and implement steep criminal penalties for chatbots that promote or solicit certain harms. That...

Deeplinks Blog by Josh Richman | September 30, 2025

Wave of Phony News Quotes Affects Everyone—Including EFF

Whether due to generative AI hallucinations or human sloppiness, the internet is increasingly rife with bogus news content—and you can count EFF among the victims. WinBuzzer published a story June 26 with the headline, “Microsoft Is Getting Sued over Using Nearly 200,000 Pirated Books for AI...

Deeplinks Blog by Matthew Guariglia | September 16, 2025

California, Tell Governor Newsom: Regulate AI Police Reports and Sign S.B. 524

Californians should urge Gov. Gavin Newsom to sign S.B. 524: a common-sense bill that takes important first-step reforms to regulate police reports written by generative AI. This is crucial, as watchdogs struggle to figure out where and how AI is being used in a police context. S.B. 524 does several...

Deeplinks Blog by Matthew Guariglia | September 4, 2025

California Lawmakers: Support S.B. 524 to Rein in AI Written Police Reports

EFF urges California state lawmakers to pass S.B. 524, authored by Sen. Jesse Arreguín. This bill is an important first step in regaining control over police using generative AI to write their narrative police reports. This bill does several important things: It mandates that police reports written by AI...

Deeplinks Blog by Tori Noble, Kit Walsh | August 14, 2025

President Trump’s War on “Woke AI” Is a Civil Liberties Nightmare

A new executive order called “Preventing Woke AI in the Federal Government,” released alongside the AI Action Plan, seeks to strong-arm AI companies into modifying their models to conform with the Trump Administration’s ideological agenda.

Deeplinks Blog by Josh Richman | August 13, 2025

Podcast Episode: Separating AI Hope from AI Hype

If you believe the hype, artificial intelligence will soon take all our jobs, or solve all our problems, or destroy all boundaries between reality and lies, or help us live forever, or take over the world and exterminate humanity. That’s a pretty wide spectrum, and leaves a lot of people...

Press Release | July 10, 2025

EFF Investigation: AI Product for Police Reports is Designed to Hinder Audits

SAN FRANCISCO – Axon Enterprise's Draft One product, which uses generative artificial intelligence to write police report narratives based on body-worn camera audio, seems designed to stymie any attempts at auditing, transparency, and accountability, an Electronic Frontier Foundation (EFF) investigation has found. The investigation – based...

Deeplinks Blog by Tori Noble | June 23, 2025

Copyright Cases Should Not Threaten Chatbot Users’ Privacy

Like users of all technologies, ChatGPT users deserve the right to delete their personal data. Nineteen U.S. States, the European Union, and a host of other countries already protect users’ right to delete. For years, OpenAI gave users the option to delete their conversations with ChatGPT, rather than let their...

Some Background: What Is Language Modeling and Why Is It Important?

Researchers Should Encourage a More Nuanced Dialogue Around AI

Release the Full Model to Academics

Stop Using “Responsible Disclosure” Analogies to Justify Withholding Research

Create a Discussion Outside OpenAI Around Release Decisions

Related Issues

Related Issues

OpenAI’s Recent Announcement: What Went Wrong, and How It Could Be Better

OpenAI’s Recent Announcement: What Went Wrong, and How It Could Be Better

Some Background: What Is Language Modeling and Why Is It Important?

Researchers Should Encourage a More Nuanced Dialogue Around AI

Release the Full Model to Academics

Stop Using “Responsible Disclosure” Analogies to Justify Withholding Research

Create a Discussion Outside OpenAI Around Release Decisions

Related Issues

Related Updates

Related Issues

Follow EFF:

Contact

About

Issues

Updates

Press

Donate