Examples of AI Improving AI

Author: Thomas Woodside, Center for AI Safety
Contributors: Herbie Bradley, James Campbell, Jun Shern Chan, Aidan O'Gara, Dan Hendrycks, Esben Kran, Nathaniel Li, Mantas Mazeika, Aaron Scher, Zach Stein-Perlman, Fred Zhang, Oliver Zhang, Andy Zou.
Last Updated: October 2, 2023

As machine learning algorithms become capable of outperforming humans on some narrow tasks, they are increasingly being used to improve themselves, other machine learning systems, or inputs to those systems such as hardware. In some cases, human feedback used to improve models has been replaced with AI feedback; in others, GPU circuits once designed by humans are now designed by AI systems. Some have warned that this "recursive self-improvement," if scaled up, could lead to AI spiraling beyond human control [1][2][3].
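As an illustration of the first pattern (AI feedback replacing human feedback), the core step in such pipelines is a judge model choosing between two candidate responses, with the preference used as a training label. The sketch below shows only that control flow; `judge` is a stand-in callable, not any particular model API, and the prompt wording is an assumption for illustration.

```python
def ai_preference_label(judge, prompt, response_a, response_b):
    """Return 'a' or 'b' according to a judge model's stated preference.

    `judge` is any callable mapping a text prompt to a text verdict;
    in a real pipeline it would wrap an LLM call.
    """
    verdict = judge(
        f"Prompt: {prompt}\n"
        f"Response A: {response_a}\n"
        f"Response B: {response_b}\n"
        "Which response is better? Answer with exactly 'A' or 'B'."
    )
    # Treat anything that doesn't clearly start with 'A' as a vote for B.
    return "a" if verdict.strip().upper().startswith("A") else "b"
```

The resulting labels can then feed the same preference-learning step that would otherwise consume human comparisons.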

The table below collects some current examples of AI systems being used to improve AI systems. It should not be taken as an exhaustive list, since these applications appear in many subfields of AI and we have not been able to review all recent AI papers. The "Authors" and "Author Affiliations" columns refer to the authors of the paper; the "Submitter" column refers to the person who originally brought the paper to my attention. If you know of an example not listed here, you may submit it here.

[1] Nick Bostrom, Superintelligence
[2] Joseph Carlsmith, Is Power-Seeking AI an Existential Risk?
[3] Dan Hendrycks, Natural Selection Favors AIs over Humans

| ID | Description | Source | Date Published | Authors | Author Affiliations | Submitter |
|----|-------------|--------|----------------|---------|---------------------|-----------|
| 38 | LLMs used in a genetic algorithm to generate LLM prompts. | | 9/28/2023 | Fernando et al. | DeepMind | Zach Stein-Perlman |
| 37 | LLM used to help refine prompts for vision-language models. | | 9/12/2023 | Liu et al. | CMU | Aidan O'Gara |
| 36 | Language model used to generate and filter prompts that could correspond to training data, with the resulting data used to train a stronger language model. | | 8/3/2023 | Li et al. | Meta | |
| 39 | Fine-tuned decision transformer used to generate data that is then used to retrain a new version of the base model in a loop. | | 6/20/2023 | Bousmalis et al. | DeepMind | Thomas Woodside |
| 32 | Uses LLMs to help automate ML experiments. | | 5/4/2023 | Zhang et al. | UT Austin | |
| 33 | Uses LLM feedback for LLM prompt engineering. | | 5/4/2023 | Pryzant et al. | Microsoft | Zach Stein-Perlman |
| 30 | GPT-4 can be used to perform neural architecture search. | | 4/21/2023 | Zheng et al. | Various | Fred Zhang |
| 34 | Uses LLMs to debug LLM code outputs without access to unit tests. | | 4/11/2023 | Chen et al. | Google & UC Berkeley | Zach Stein-Perlman |
| 26 | Language model gives itself feedback to improve its own generations. | | 3/30/2023 | Madaan et al. | Various | Andy Zou |
| 28 | Uses a language model to pick the best refinement of an LLM response based on language feedback. | | 3/28/2023 | Scheurer et al. | Various | Thomas Woodside |

Showing 1–10 of 39 entries.
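Several of the entries above (e.g. entry 26, Madaan et al., and entry 28, Scheurer et al.) share the same self-refinement control flow: a model drafts an output, critiques its own draft, and revises until the critique passes or a round limit is hit. A minimal sketch of that loop, assuming a generic `model` callable rather than any particular paper's API, and with illustrative prompt wording:

```python
def self_refine(model, task, max_rounds=3):
    """Draft, critique, and revise an answer using a single model.

    `model` is any callable mapping a text prompt to a text response;
    in a real pipeline it would wrap an LLM call. The stopping check
    ("no issues" in the critique) is a simplification for illustration.
    """
    answer = model(f"Answer the task: {task}")
    for _ in range(max_rounds):
        feedback = model(f"Critique this answer to '{task}': {answer}")
        if "no issues" in feedback.lower():
            break  # the model judges its own answer acceptable
        answer = model(f"Revise the answer '{answer}' using this feedback: {feedback}")
    return answer
```

The same skeleton covers both self-feedback (one model plays all three roles) and the refinement-selection variant, where a separate model ranks candidate revisions instead of accepting the first one.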