INDEX
Explanations
phrases related to personal feelings and emotional states
Negative Logits
owie        -0.17
umen        -0.16
erli        -0.15
èĬ¸         -0.14
ivan        -0.14
637         -0.14
âl          -0.14
aille       -0.14
-Origin     -0.14
ounge       -0.13
Positive Logits
linger      0.14
things      0.14
eno         0.14
Treatment   0.14
perse       0.14
persever    0.14
Tag         0.14
structor    0.14
Tip         0.14
lab         0.14
Activations Density: 0.122%