Language models can explain neurons in language models

May 9, 2023 Steve

We use GPT-4 to automatically write explanations for the behavior of neurons in large language models and to score those explanations. We release a dataset of these (imperfect) explanations and scores for every neuron in GPT-2.
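The summary above does not say how a score is computed. As a minimal sketch, assume an explanation is used to predict ("simulate") a neuron's activation on each token, and that the score is the Pearson correlation between those predictions and the neuron's real activations. The function name `correlation_score` and the example values are hypothetical and are not taken from the released dataset.

```python
from math import sqrt
from typing import Sequence


def correlation_score(simulated: Sequence[float], actual: Sequence[float]) -> float:
    """Pearson correlation between simulated and actual activations.

    Assumption for illustration: a higher correlation means the explanation
    predicts the neuron's behavior better.
    """
    if len(simulated) != len(actual) or len(simulated) < 2:
        raise ValueError("need two equal-length sequences with at least 2 values")
    n = len(simulated)
    mean_s = sum(simulated) / n
    mean_a = sum(actual) / n
    cov = sum((s - mean_s) * (a - mean_a) for s, a in zip(simulated, actual))
    var_s = sum((s - mean_s) ** 2 for s in simulated)
    var_a = sum((a - mean_a) ** 2 for a in actual)
    # A constant sequence carries no signal; treat the score as 0 by convention.
    if var_s == 0 or var_a == 0:
        return 0.0
    return cov / sqrt(var_s * var_a)


# Hypothetical per-token activations, invented for this example.
simulated = [0.0, 1.0, 2.0, 0.5]
actual = [0.1, 0.9, 2.2, 0.4]
score = correlation_score(simulated, actual)
```

A score near 1 would indicate that the simulated activations track the real ones closely; a score near 0 would indicate that the explanation captures little of the neuron's behavior.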
