INDEX
Explanations
mentions of specific political figures
New Auto-Interp
Negative Logits
channelAvailability
-0.64
separat
-0.62
CPC
-0.60
confines
-0.59
causation
-0.56
STATS
-0.56
BIP
-0.55
plings
-0.55
auga
-0.55
bourg
-0.54
POSITIVE LOGITS
Wan
0.86
leck
0.83
orio
0.81
wald
0.78
uary
0.78
worldly
0.76
ratulations
0.72
actory
0.70
ugh
0.70
isoft
0.68
Activations Density 0.069%