Explanations
This neuron never fires: it does not detect any pattern in the text and is effectively “dead”.
Negative Logits
Token       Logit
acid        -0.06
Strip       -0.06
عبدال       -0.06
imperial    -0.06
dictated    -0.06
.counter    -0.06
_non        -0.06
"',         -0.06
_AUD        -0.06
Lua         -0.06
Positive Logits
Token         Logit
qualitative   0.06
forecast      0.06
TODO          0.06
107           0.06
europé        0.06
změn          0.06
специалист    0.06
940           0.06
LIST          0.06
イ            0.06
Activations Density 0.004%
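
For reference, an activation density figure like the one above can be read as the share of sampled tokens on which the neuron produces a nonzero activation. The sketch below illustrates that reading only. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical, and the dashboard's actual computation is not stated here. Under this assumption, a density of 0.004% would correspond to roughly 4 firing tokens per 100,000.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens with a nonzero
    activation; the dashboard may define it differently.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return 100.0 * fired / len(activations)


# Hypothetical sample: 100,000 tokens, only 4 of which activate the neuron.
sample = [0.0] * 99_996 + [0.3, 0.1, 0.7, 0.2]
density = activation_density(sample)
print(f"{density:.3f}%")
```

A neuron that truly never fires would give a density of exactly 0 under this definition, so a small positive value indicates rare rather than absent activity.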