INDEX
Explanations
mentions of political figures and their actions
New Auto-Interp
Negative Logits
ylon
-0.64
otin
-0.64
onite
-0.55
avis
-0.55
ata
-0.55
breaker
-0.55
poly
-0.53
landers
-0.52
Canadians
-0.52
Nationals
-0.51
POSITIVE LOGITS
own
0.83
tremend
0.67
introductory
0.66
travels
0.66
memoir
0.66
Own
0.64
briefs
0.61
remarks
0.61
autobiography
0.59
endeavors
0.58
Activations Density 14.102%