INDEX
Explanations
mentions of the United States
occurrences of the abbreviation "U.S." or references to the United States
New Auto-Interp
Negative Logits
theless
-0.87
STATS
-0.71
simmer
-0.61
unpre
-0.60
caution
-0.59
proof
-0.56
Cancel
-0.55
KP
-0.55
proportions
-0.55
organising
-0.54
POSITIVE LOGITS
.,
1.63
.?
1.36
.;
1.27
.,"
1.25
.:
1.24
.-
1.21
./
1.19
.—
1.14
.$
1.05
.),
1.01
Activations Density 0.053%