INDEX
Explanations
proper nouns, particularly names and geographical locations
place names and foreign words
Negative Logits
GTCX             -0.38
verwijspagina    -0.37
impact           -0.32
imp              -0.31
herzog           -0.30
inten            -0.30
impon            -0.30
Jr               -0.29
springframework  -0.29
alve             -0.29
Positive Logits
ंदीखरीदारी  0.79
batore  0.73
astéroïdes  0.59
Hozzáférés  0.59
ibouti  0.57
Билгалдахарш  0.56
Савезне  0.56
COUVER  0.56
Мексичка  0.55
للمعارف  0.54
Activations Density 0.576%