INDEX
Explanations
references to Iranian political figures and leadership
New Auto-Interp
Negative Logits
pson
-0.15
enheim
-0.15
ogy
-0.14
ná
-0.14
OGLE
-0.14
elihood
-0.14
Copyright
-0.14
ui
-0.14
guy
-0.13
à¹ĥà¸Ī
-0.13
POSITIVE LOGITS
hil
0.15
ARIO
0.15
ilha
0.15
ÑĸÑĢ
0.14
recreation
0.14
ì¼ĢìĿ´
0.14
215
0.14
allet
0.13
ario
0.13
pts
0.13
Activations Density 0.003%