INDEX
Explanations
dollar signs indicating monetary values or references to currency
New Auto-Interp
Negative Logits
مشين
-0.94
للمعارف
-0.91
олові
-0.90
Билгалдахарш
-0.90
rungsseite
-0.89
省市镇
-0.87
ंदीखरीदारी
-0.86
UnsafeEnabled
-0.86
Мексичка
-0.84
GEBURTSDATUM
-0.84
POSITIVE LOGITS
way
0.49
وا
0.44
δι
0.43
ecap
0.43
modo
0.42
stream
0.41
lini
0.40
δι
0.40
พ
0.40
so
0.39
Activations Density 0.012%