INDEX
Explanations
expressions of desire or longing
New Auto-Interp
Negative Logits
shan
-0.17
eam
-0.17
ader
-0.16
stown
-0.16
eel
-0.16
ussen
-0.15
mpr
-0.15
ÑĢик
-0.14
bá»ı
-0.14
ught
-0.14
POSITIVE LOGITS
ful
0.25
bone
0.20
FUL
0.20
fulness
0.20
fully
0.18
inkle
0.17
ibe
0.17
omers
0.17
(es
0.16
aida
0.15
Activations Density 0.012%