INDEX
Explanations
mentions of specific organizations or individuals
mentions of organizations or titles that relate to service or religious affiliations
New Auto-Interp
Negative Logits
ãĥ£
-0.86
اÙĦ
-0.71
ascus
-0.69
hedral
-0.69
pora
-0.66
axies
-0.64
acca
-0.64
itals
-0.61
ocular
-0.61
abba
-0.60
POSITIVE LOGITS
shire
1.00
mare
0.96
lich
0.88
mann
0.86
ancies
0.83
sworth
0.81
ezvous
0.80
geist
0.80
heit
0.79
leness
0.77
Activations Density 0.065%