INDEX
Explanations
instances of the pronoun "it" and phrases emphasizing importance or significance
New Auto-Interp
Negative Logits
ichert
-0.17
aghetti
-0.16
endet
-0.14
oprav
-0.14
udeau
-0.14
мага
-0.13
erotik
-0.13
nio
-0.13
seksi
-0.13
predecess
-0.13
POSITIVE LOGITS
beh
0.25
scarcely
0.23
need
0.21
matters
0.19
ill
0.19
g
0.17
trans
0.17
strains
0.17
distress
0.17
grat
0.17
Activations Density 0.134%