INDEX
Explanations
proper nouns representing locations, likely political in nature
repeated mentions of "Washington."
New Auto-Interp
Negative Logits
gger
-0.78
Torrent
-0.73
ongo
-0.71
eworld
-0.69
{*-0.66
Redditor
-0.66
unct
-0.65
complete
-0.65
order
-0.64
Scroll
-0.63
POSITIVE LOGITS
ASHINGTON
1.12
WASHINGTON
1.02
STATE
0.98
aukee
0.93
STATES
0.89
NESS
0.89
GOODMAN
0.87
MENT
0.85
YORK
0.83
REPORT
0.83
Activations Density 0.011%