INDEX
Explanations
quantitative phrases indicating prevalence or counts among various subjects
Negative Logits
óng       -0.16
weakest   -0.15
gnu       -0.15
mares     -0.15
anela     -0.14
isen      -0.14
zo        -0.14
دام       -0.14
aurant    -0.14
darkest   -0.14
Positive Logits
Dag       0.14
erture    0.14
olv       0.14
arger     0.14
andler    0.14
atal      0.13
DeV       0.13
جو        0.13
Banc      0.13
hap       0.13
Activations Density 0.040%