INDEX
Explanations
second-person pronouns and phrases indicating personal engagement
New Auto-Interp
Negative Logits
íĹĮ
-0.15
讯
-0.15
士
-0.15
uslim
-0.14
lavÃŃ
-0.14
öl
-0.14
онÑĮ
-0.14
zv
-0.14
ypad
-0.14
osi
-0.14
POSITIVE LOGITS
encounters
0.18
experiences
0.18
encounter
0.17
spoiled
0.17
ipt
0.17
spol
0.17
Encounter
0.17
truly
0.16
enter
0.16
sense
0.15
Activations Density 0.059%