INDEX
Explanations
discussions around self-determination and separatist sentiments.
This neuron selectively activates on the token “United” (as in “United States,” “United Kingdom,” “United Nations,” etc.).
New Auto-Interp
Negative Logits
imony
-0.07
_remove
-0.07
Negative
-0.07
amine
-0.06
uno
-0.06
ichen
-0.06
结果
-0.06
anal
-0.06
rolled
-0.06
-fe
-0.06
POSITIVE LOGITS
وب
0.07
(distance
0.06
=W
0.06
obstruct
0.06
�
0.06
ुछ
0.06
Aws
0.06
�
0.06
velope
0.06
есте
0.06
Activations Density 0.028%