INDEX
Explanations
political language and references
repetitive mentions of the word "the."
New Auto-Interp
Negative Logits
eret
-0.72
venture
-0.71
NB
-0.68
SPONSORED
-0.67
bet
-0.67
ESPN
-0.67
rand
-0.65
ée
-0.64
akeru
-0.64
ea
-0.64
POSITIVE LOGITS
entirety
1.29
entire
1.28
remainder
1.19
latter
1.09
possibility
1.06
same
1.05
slightest
1.01
whole
0.99
aforementioned
0.98
majority
0.98
Activations Density 0.267%