A downloadable project

We expand Neel Nanda's Interactive Neuroscope to view an entire layer.

Looking at Neel Nanda's Interactive Neuroscope, we were stymied by the question of
which neuron we ought to try to look at. It seemed potentially useful to be able to quickly map the activations of every neuron in the layer, particularly for smaller models with manageable numbers of neurons. To that end, we build a new version of the Neuroscope which generates a graphical representation of the entire layer. In the figure below, we show layer 7, generated using the default text in Nanda's Neuroscope: "The following is a list of powers of 10: 1, 10, 100, 1000, 10000, 100000, 1000000, 10000000".

We analyze more prompts with this tool and identify some interesting patterns and
possible avenues for further research.


Template hackaton-2.pdf 266 kB

Leave a comment

Log in with itch.io to leave a comment.