INDEX
Explanations
phrases expressing emotions or feelings
New Auto-Interp
Negative Logits
ekl
-0.15
ahl
-0.15
iaz
-0.15
CARD
-0.15
دث
-0.15
agger
-0.15
ëĵĿ
-0.15
avit
-0.14
odo
-0.14
à¸Ńà¸Ń
-0.14
POSITIVE LOGITS
365
0.16
lessly
0.16
burg
0.15
omite
0.15
uctor
0.14
ãĥ¼ãĥŃ
0.14
arton
0.14
flo
0.14
reserve
0.14
opcion
0.14
Activations Density 0.040%