INDEX
Explanations
numbers and code
This neuron activates on the token “US,” i.e. it flags mentions of “US” in the text.
Negative Logits
ACT           -0.06
HEX           -0.06
HTTPHeader    -0.06
shar          -0.06
arrests       -0.06
_Show         -0.06
relay         -0.06
Kickstarter   -0.06
furry         -0.06
_USED         -0.06
Positive Logits
odule         0.07
Dresden       0.07
Memory        0.06
nije          0.06
sitesinde     0.06
complaints    0.06
são           0.06
esp           0.06
100           0.06
volcan        0.06
Activations Density 0.016%
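
For a sense of scale, the density figure can be read as the fraction of token positions on which the neuron fires. The sketch below assumes that reading (activation strictly above a threshold of zero); the function name `activation_density`, the threshold, and the sample data are all hypothetical and are not taken from the dashboard itself.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions with a
    nonzero (above-threshold) activation.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 100,000 token positions, 16 of which are active.
sample = [0.0] * 99_984 + [1.0] * 16
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.016% corresponds to roughly 16 active positions per 100,000 tokens, which marks this as a sparsely firing neuron.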