Explanations
phrases that indicate frequency or prevalence of conditions or situations
Negative Logits
Token    Logit
tea      -0.17
phan     -0.16
odem     -0.15
อน       -0.15
Sham     -0.14
egan     -0.14
ray      -0.14
ān       -0.14
avl      -0.14
olest    -0.13
Positive Logits
Token        Logit
xuyên        0.22
-used        0.20
occurrence   0.18
ly           0.16
/common      0.16
yny          0.15
682          0.15
among        0.15
/pop         0.15
fare         0.15
Activations Density 0.098%