INDEX
Explanations
FIrst impressions: The neuron appears to be interested in the letter "ģ" and phrases associated with scientific and intellectual discourse
the repeated character "ģ"
New Auto-Interp
Negative Logits
raints
-0.77
Schr
-0.66
VW
-0.64
dialogue
-0.63
Sno
-0.63
Hunts
-0.63
icken
-0.63
Nike
-0.62
Euph
-0.62
Rapp
-0.62
POSITIVE LOGITS
ģ
1.49
¼
1.20
Ĩ
1.14
¡
1.13
ĭ
1.12
Ĭ
1.10
µ
1.10
Ģ
1.09
´
1.09
¸
1.08
Activations Density 0.004%