INDEX
Explanations
This neuron primarily responds to the English pronoun “it.”
New Auto-Interp
Negative Logits
Wallpaper
-0.06
单
-0.06
_sym
-0.06
(cc
-0.06
elsif
-0.06
観
-0.06
runners
-0.06
irq
-0.06
Assembler
-0.06
Ün
-0.06
POSITIVE LOGITS
ži
0.07
ών
0.07
ATEG
0.07
структу
0.06
password
0.06
ців
0.06
Salary
0.06
ikleri
0.06
pocit
0.06
Armenia
0.06
Activations Density 0.133%