INDEX
Explanations
references to a specific person named "Arafat"
mentions of the name "Arafat" and related figures
New Auto-Interp
Negative Logits
eners
-0.83
ership
-0.82
bilt
-0.76
Legendary
-0.68
Vector
-0.67
tainment
-0.67
liness
-0.67
gamer
-0.66
sylvania
-0.66
itution
-0.64
POSITIVE LOGITS
Ara
1.03
ikawa
0.90
ishi
0.87
uses
0.85
yip
0.79
rison
0.78
azel
0.77
byss
0.77
iries
0.76
Palestin
0.75
Activations Density 0.017%