INDEX
Explanations
names of specific political figures
mentions of specific individuals, particularly politicians
New Auto-Interp
Negative Logits
ripp
-0.92
fman
-0.92
BOOK
-0.83
chrom
-0.82
hered
-0.79
Seattle
-0.78
naire
-0.77
plex
-0.77
earing
-0.77
bodied
-0.76
POSITIVE LOGITS
Mahmoud
1.22
Abbas
1.08
Ahmad
0.90
Mahm
0.87
ollah
0.86
Gh
0.86
Mubarak
0.84
Ahmed
0.80
Meh
0.79
Abdel
0.79
Activations Density 0.011%