INDEX
Explanations
emotions and personal desires expressed in the text
New Auto-Interp
Negative Logits
igma
-0.15
kola
-0.14
[q
-0.14
rubu
-0.13
ARB
-0.13
éIJĺ
-0.13
rts
-0.13
_refer
-0.13
stile
-0.13
qualification
-0.13
POSITIVE LOGITS
Kear
0.16
ixe
0.15
ordo
0.14
Cord
0.14
afil
0.14
cord
0.14
lesh
0.14
cord
0.14
urch
0.14
Div
0.14
Activations Density 0.000%