INDEX
Explanations
expressions of desire or intent
New Auto-Interp
Negative Logits
witter
-0.17
wert
-0.14
Burl
-0.14
ppv
-0.14
rather
-0.13
if
-0.13
lien
-0.13
Daly
-0.13
Lans
-0.13
tak
-0.13
POSITIVE LOGITS
ersonic
0.16
کاراÙĨ
0.15
quam
0.15
amaha
0.14
mith
0.14
пеÑĢÑĸод
0.14
uge
0.14
/bind
0.14
Merc
0.14
_successful
0.14
Activations Density 0.014%