Explanations
references to political leaders, specifically prime ministers
Negative Logits
eker          -0.15
.googleapis   -0.14
opper         -0.14
ательно       -0.14
ohl           -0.14
ئ             -0.14
alim          -0.14
BAŞ           -0.14
uer           -0.14
abo           -0.13
Positive Logits
.scalablytyped   0.14
yen              0.14
ash              0.14
angen            0.13
marvin           0.13
ham              0.13
pal              0.13
ジ               0.13
perature         0.13
iy               0.13
Activations Density 0.013%