INDEX
Explanations
questions or statements expressing confusion or curiosity
negative contractions and questioning phrases
New Auto-Interp
Negative Logits
urst
-0.80
Fra
-0.71
nants
-0.67
Fant
-0.63
furt
-0.63
à¼
-0.62
Frag
-0.62
Puzzles
-0.61
Flav
-0.61
Vand
-0.60
POSITIVE LOGITS
sooner
1.01
itia
0.83
reinvest
0.81
adequately
0.76
properly
0.72
prosecute
0.70
prosec
0.70
reciproc
0.68
ounty
0.68
vacc
0.67
Activations Density 0.145%