ACM Transactions on Evolutionary Learning and Optimization
Language Model Crossover
Variation through Few-Shot Prompting
Abstract
Language Model Crossover: Variation through Few-Shot Prompting
Elliot Meyerson, Mark J. Nelson, Herbie Bradley, Adam Gaier, Arash Moradi, Amy K Hoover, Joel Lehman
This paper pursues the insight that language models naturally enable an intelligent variation operator similar in spirit to evolutionary crossover. In particular, language models of sufficient scale demonstrate in-context learning, i.e., they can learn from associations between a small number of input patterns to generate outputs incorporating such associations (also called few-shot prompting). This ability can be leveraged to form a simple but powerful variation operator, i.e., to prompt a language model with a few text-based genotypes (such as code, plain-text sentences, or equations), and to parse its corresponding output as those genotypes’ offspring. The promise of such language model crossover (which is simple to implement and can leverage many different open-source language models) is that it enables a simple mechanism to evolve semantically-rich text representations (with few domain-specific tweaks), and naturally benefits from current progress in language models. Experiments in this paper highlight the versatility of language-model crossover, through evolving binary bit-strings, sentences, equations, text-to-image prompts, and Python code. The conclusion is that language model crossover is a flexible and effective method for evolving genomes representable as text.
Download publicationAssociated Researchers
Elliot Meyerson
Cognizant AI Labs
Mark J. Nelson
American University
Herbie Bradley
University of Cambridge & CarperAI
Arash Moradi
New Jersey Institute of Technology
Amy K. Hoover
New Jersey Institute of Technology
Joel Lehman
Carper AI
Related Resources
2024
DesignQA: A Multimodal Benchmark for Evaluating Large Language Models’ Understanding of Engineering DocumentationNovel benchmark aimed at evaluating the proficiency of multimodal…
2022
Leveraging Robotics for Cleaner Construction JobsitesThe value in Spot’s ability to execute repeatable, autonomous missions…
2022
Communicating Natural Programs to Humans and MachinesThe Abstraction and Reasoning Corpus (ARC) is a set of procedural…
2002
Computational Anatomy and BiomechanicsComputational anatomy incorporates the use of geometric- and…
Get in touch
Something pique your interest? Get in touch if you’d like to learn more about Autodesk Research, our projects, people, and potential collaboration opportunities.
Contact us