Help EFF Track the Progress of AI and Machine Learning

Share It

The field of machine learning and artificial intelligence is making rapid progress. Many people are starting to ask what a world with intelligent computers will look like. But what is the ratio of hype to real progress? What kinds of problems have been well solved by current machine learning techniques, which ones are close to being solved, and which ones remain exceptionally hard?

There isn’t currently a good single place to find the state of the art on well-specified machine learning metrics, let alone the many problems in artificial intelligence that are still so hard that there are no good datasets and benchmarks to keep track of them yet. So we are trying to make one. Today, we’re launching the EFF AI Progress Measurement experiment, and encouraging machine learning researchers to give us feedback and contribute to the effort.

We want to know what types of AI we need to start engaging with on legal, political, and technical safety fronts.

We have drawn data from a number of sources: blog posts that report on snapshots of progress; websites that try to collate data on specific subfields of machine learning; and review articles. Where those sources didn’t have coverage, we’ve gone to the research literature itself and gathered data.

We’ve placed this information in an Jupyter / IPython Notebook, which you can read at https://eff.org/ai/metrics. The Notebook is hosted on Github, where the community can directly contribute.

What we have thus far is an experiment, and we’d like to know: Is this information useful to the machine learning community? What important problems, datasets, and results are we missing?

EFF’s interest in AI progress is primarily from a policy perspective. We want to know what types of AI we need to start engaging with on legal, political, and technical safety fronts. Beyond that, we’re also just excited to see how many things computers are learning to do over time.

Given that machine learning tools and AI techniques are increasingly part of our everyday lives, it is critical that journalists, policy makers, and technology users understand the state of the field. When improperly designed or deployed, machine learning methods can violate privacy, threaten safety, and perpetuate inequality and injustice. Stakeholders must be able to anticipate such risks and policy questions before they arise, rather than playing catch-up with the technology. To this end, it’s part of the responsibility of researchers, engineers, and developers in the field to help make information about their life-changing research widely available and understandable. We hope you’ll join us.

Related Updates

Deeplinks Blog by Mario Trujillo, Jacob Hoffman-Andrews, Tori Noble | December 2, 2025

AI Chatbot Companies Should Protect Your Conversations From Bulk Surveillance

AI companies have a responsibility to their users to make sure the warrant requirement is strictly followed, to resist unlawful bulk surveillance requests, and to be transparent with their users about the number of government requests they receive.

Deeplinks Blog by Hayley Tsukayama | November 20, 2025

The Trump Administration’s Order on AI Is Deeply Misguided

Widespread news reports indicate that President Donald Trump’s administration has prepared an executive order to punish states that have passed laws attempting to address harms from artificial intelligence (AI) systems. This approach is deeply misguided.

Deeplinks Blog by Molly Buckley | November 14, 2025

A Surveillance Mandate Disguised As Child Safety: Why the GUARD Act Won't Keep Us Safe

A new bill sponsored by Sen. Hawley (R-MO), Sen. Blumenthal (D-CT), Sen. Britt (R-AL), Sen. Warner (D-VA), and Sen. Murphy (D-CT) would require AI chatbots to verify all users’ ages, prohibit minors from using AI tools, and implement steep criminal penalties for chatbots that promote or solicit certain harms. That...

Deeplinks Blog by Josh Richman | September 30, 2025

Wave of Phony News Quotes Affects Everyone—Including EFF

Whether due to generative AI hallucinations or human sloppiness, the internet is increasingly rife with bogus news content—and you can count EFF among the victims. WinBuzzer published a story June 26 with the headline, “Microsoft Is Getting Sued over Using Nearly 200,000 Pirated Books for AI...

Deeplinks Blog by Matthew Guariglia | September 16, 2025

California, Tell Governor Newsom: Regulate AI Police Reports and Sign S.B. 524

Californians should urge Gov. Gavin Newsom to sign S.B. 524: a common-sense bill that takes important first-step reforms to regulate police reports written by generative AI. This is crucial, as watchdogs struggle to figure out where and how AI is being used in a police context. S.B. 524 does several...

Deeplinks Blog by Matthew Guariglia | September 4, 2025

California Lawmakers: Support S.B. 524 to Rein in AI Written Police Reports

EFF urges California state lawmakers to pass S.B. 524, authored by Sen. Jesse Arreguín. This bill is an important first step in regaining control over police using generative AI to write their narrative police reports. This bill does several important things: It mandates that police reports written by AI...

Deeplinks Blog by Tori Noble, Kit Walsh | August 14, 2025

President Trump’s War on “Woke AI” Is a Civil Liberties Nightmare

A new executive order called “Preventing Woke AI in the Federal Government,” released alongside the AI Action Plan, seeks to strong-arm AI companies into modifying their models to conform with the Trump Administration’s ideological agenda.

Deeplinks Blog by Josh Richman | August 13, 2025

Podcast Episode: Separating AI Hope from AI Hype

If you believe the hype, artificial intelligence will soon take all our jobs, or solve all our problems, or destroy all boundaries between reality and lies, or help us live forever, or take over the world and exterminate humanity. That’s a pretty wide spectrum, and leaves a lot of people...

Press Release | July 10, 2025

EFF Investigation: AI Product for Police Reports is Designed to Hinder Audits

SAN FRANCISCO – Axon Enterprise's Draft One product, which uses generative artificial intelligence to write police report narratives based on body-worn camera audio, seems designed to stymie any attempts at auditing, transparency, and accountability, an Electronic Frontier Foundation (EFF) investigation has found. The investigation – based...

Deeplinks Blog by Tori Noble | June 23, 2025

Copyright Cases Should Not Threaten Chatbot Users’ Privacy

Like users of all technologies, ChatGPT users deserve the right to delete their personal data. Nineteen U.S. States, the European Union, and a host of other countries already protect users’ right to delete. For years, OpenAI gave users the option to delete their conversations with ChatGPT, rather than let their...

Related Issues

Related Issues

Help EFF Track the Progress of AI and Machine Learning

Help EFF Track the Progress of AI and Machine Learning

Related Issues

Related Updates

Related Issues

Follow EFF:

Contact

About

Issues

Updates

Press

Donate