INDEX
Explanations
references to the USA in the text
occurrences of the abbreviation "USA."
New Auto-Interp
Negative Logits
ragon
-0.93
Redditor
-0.81
flush
-0.81
prototype
-0.80
unci
-0.75
acles
-0.75
rums
-0.74
interstitial
-0.73
amel
-0.72
ezvous
-0.71
POSITIVE LOGITS
TODAY
1.24
Today
0.89
Patriot
0.81
ICAN
0.80
Airlines
0.79
ESSION
0.78
Freedom
0.76
FRE
0.76
Hockey
0.75
ITE
0.75
Activations Density 0.017%