INDEX
Explanations
mentions of political figures
references to political representatives
Negative Logits
DragonMagazine    -0.70
glers             -0.70
ahime             -0.70
underpin          -0.67
phal              -0.61
Belfast           -0.59
Gleaming          -0.58
bul               -0.56
)=(               -0.55
Siberian          -0.55
Positive Logits
orters            1.50
orter             1.44
rint              1.36
utation           1.25
orted             1.22
rieve             1.22
resents           1.15
ository           1.14
utable            1.09
atri              1.07
Activations Density: 0.024%