INDEX
Explanations
names of political leaders and related entities
New Auto-Interp
Negative Logits
OTA
-0.18
idis
-0.16
ži
-0.15
AMA
-0.14
خاÙĨÙĩ
-0.14
.documentation
-0.14
seedu
-0.14
主任
-0.14
NgÃłnh
-0.14
ACTION
-0.14
POSITIVE LOGITS
hé
0.16
коз
0.15
cher
0.15
emodel
0.15
orch
0.14
ané
0.14
ubbles
0.13
qua
0.13
bia
0.13
Cher
0.13
Activations Density 0.074%