Explanations
words expressing choice or flexibility
Negative Logits
andan     -0.16
kins      -0.16
рач       -0.16
Kiss      -0.16
nip       -0.15
kiss      -0.15
ilo       -0.15
syn       -0.15
ultimate  -0.15
ato       -0.14
Positive Logits
峰        0.18
ücken     0.16
rary      0.15
座        0.15
asio      0.14
Pazar     0.14
owa       0.13
EDIA      0.13
oliberal  0.13
acific    0.13
Activations Density 0.010%