INDEX
Explanations
personal pronouns referring to a political figure
Negative Logits
Token        Logit
earch        -0.69
Columb       -0.66
Veter        -0.65
Gems         -0.64
Leban        -0.64
ective       -0.63
GV           -0.62
pregn        -0.62
Millennium   -0.61
ASA          -0.61
Positive Logits
Token        Logit
zbollah      1.36
'll          1.20
'd           1.12
campaigned   1.10
eded         1.09
vowed        0.97
swore        0.97
tweeted      0.96
've          0.96
appoint      0.95
Activations Density 0.221%
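
The density figure above is not defined in this listing. A common reading, assumed here, is the fraction of token positions on which the feature's activation is above zero. The sketch below illustrates that assumed definition only; the function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and are not taken from the tool that produced this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means (positions where the feature fires) / (all positions).
    """
    if not activations:
        raise ValueError("need at least one activation value")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical data: 100,000 token positions, 221 of which have a nonzero activation.
acts = [0.0] * 99_779 + [1.5] * 221
density = activation_density(acts)
print(f"{density:.3%}")  # 0.221%
```

Under this reading, a density of 0.221% means the feature fires on roughly 2 out of every 1,000 tokens, which fits a feature tied to a narrow context such as pronouns referring to a political figure.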