INDEX
Explanations
prepositions
The neuron never activates—it does not respond to any token pattern.
New Auto-Interp
Negative Logits
planner
-0.07
Worcester
-0.07
επισ
-0.07
--------------------------------
-0.07
weakened
-0.07
университ
-0.07
Pist
-0.06
(text
-0.06
livest
-0.06
Sul
-0.06
POSITIVE LOGITS
fís
0.07
िब
0.07
Bundle
0.06
Nib
0.06
IMG
0.06
.hadoop
0.06
remind
0.06
ربه
0.06
ropical
0.06
ーブ
0.06
Activations Density 0.024%