INDEX
Explanations
references to neurons in brain-related contexts
references to neurons and their characteristics
Negative Logits
Token         Logit
Vend          -0.85
aid           -0.77
weather       -0.72
edd           -0.72
Carnival      -0.71
styles        -0.70
irl           -0.70
tar           -0.69
Destruction   -0.65
bris          -0.65
Positive Logits
Token         Logit
neurons       3.47
neuron        3.05
neuronal      2.27
hippocamp     1.71
cortical      1.69
cortex        1.67
synaptic      1.64
neural        1.58
hippocampus   1.52
dopamine      1.50
Activations Density 0.033%