INDEX
Explanations
phrases containing contractions such as "Don't" or "It's"
negative phrases related to self-doubt or inability
Negative Logits
acron       -0.69
Guatem      -0.67
simultane   -0.66
decomp      -0.66
Slav        -0.64
blanket     -0.64
Osw         -0.63
SHARES      -0.63
satell      -0.61
Alph        -0.60
Positive Logits
t           1.65
tion        1.22
ti          1.21
tan         1.16
tin         1.16
tu          1.14
tions       1.13
tis         1.12
td          1.12
nt          1.06
Activations Density 0.133%