INDEX
Explanations
references to news articles or reporters
possessive phrases indicating attribution to various sources or authors
New Auto-Interp
Negative Logits
PLA
-0.82
Sov
-0.81
#$#$
-0.78
unin
-0.78
ét
-0.77
$$$$
-0.76
oves
-0.74
utory
-0.71
RFC
-0.71
sov
-0.71
POSITIVE LOGITS
Geoff
1.25
Jonathan
1.22
Jeffrey
1.22
Jason
1.21
Jennifer
1.19
Andrew
1.19
Matthew
1.19
Ian
1.19
Erik
1.18
Jesse
1.18
Activations Density 0.098%