Explanations
second-person pronouns and phrases that suggest action or opportunities for the reader
Negative Logits
inspace    -0.16
bsp        -0.16
Mayer      -0.15
jax        -0.15
bserv      -0.15
osi        -0.14
짝         -0.14
istar      -0.14
будь       -0.14
estroy     -0.14
Positive Logits
cko        0.19
ipt        0.17
sure       0.17
truly      0.16
Spo        0.16
surely     0.16
_defs      0.15
can        0.15
certainly  0.15
spoiled    0.15
Activations Density 0.066%