INDEX
Explanations
expressions of personal feelings and opinions
New Auto-Interp
Negative Logits
kinh
-0.16
μαÏĦο
-0.15
abic
-0.15
éĺµ
-0.14
å·»
-0.14
considering
-0.14
Îļά
-0.14
Decre
-0.13
.criteria
-0.13
Ķ
-0.13
POSITIVE LOGITS
crib
0.24
sieve
0.16
hood
0.16
anyways
0.16
visual
0.16
vala
0.15
IFA
0.15
pole
0.15
intros
0.15
anyway
0.15
Activations Density 0.261%