Explanations
Code logos
The neuron activates on runs of underscores (and similar repetitive punctuation), i.e. the horizontal line elements used in ASCII art.
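As a rough illustration of the pattern this explanation describes, the sketch below flags runs of repeated punctuation in a string. It is only a text-level proxy and does not read the neuron's activations. The character set, the minimum run length of three, and the name `find_punctuation_runs` are all assumptions made for the example.

```python
import re

# Hypothetical proxy for the described pattern: a run of 3 or more identical
# characters from a small set of line-drawing punctuation. Both the character
# set and the threshold are assumptions, not properties of the neuron.
RUN_PATTERN = re.compile(r"([_\-=~*#])\1{2,}")


def find_punctuation_runs(text):
    """Return a list of (start, end, run) tuples, one per repeated-punctuation run."""
    return [(m.start(), m.end(), m.group(0)) for m in RUN_PATTERN.finditer(text)]


if __name__ == "__main__":
    sample = "Name: ______  Date: ____\n+-----+\n| a_b |"
    for start, end, run in find_punctuation_runs(sample):
        print(start, end, repr(run))
```

A single underscore inside an identifier such as `a_b` is not flagged, which matches the explanation's emphasis on runs rather than isolated characters.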
Negative Logits
Token       Logit
�           -0.07
ARB         -0.07
علی         -0.06
پخش         -0.06
فراهم       -0.06
숨          -0.06
歴          -0.06
ується      -0.06
varsa       -0.06
değildir    -0.06
Positive Logits
Token           Logit
______          0.07
____            0.07
___             0.07
Appro           0.07
approve         0.07
___             0.06
Transitional    0.06
_____           0.06
neoliberal      0.06
Buckley         0.06
Activations Density 0.003%