INDEX
Explanations
references to geographical locations or entities related to Gaza
New Auto-Interp
Negative Logits
otland
-0.16
ниÑĩеÑģ
-0.15
redo
-0.15
es
-0.15
edes
-0.15
eon
-0.15
OLON
-0.14
avanaugh
-0.14
à¹Ģà¸
-0.14
andez
-0.14
POSITIVE LOGITS
za
0.24
zy
0.19
anja
0.18
quez
0.17
eb
0.17
akhstan
0.16
epam
0.16
t
0.16
zer
0.16
Ĥæķ°
0.15
Activations Density 0.019%