INDEX
Explanations
mentions of political leaders and government-related terms
New Auto-Interp
Negative Logits
INGTON
-0.87
Seah
-0.78
Operation
-0.74
TPS
-0.71
Population
-0.70
Grab
-0.69
LAN
-0.68
aband
-0.68
ITNESS
-0.67
Boy
-0.66
POSITIVE LOGITS
hips
1.64
hip
1.55
paces
1.22
ervatives
1.16
mith
1.11
ettings
1.02
pring
0.96
'
0.95
pace
0.95
ervative
0.91
Activations Density 10.303%