Explanations
references to individuals or entities associated with a certain group or organization
Negative Logits
Hentet        -0.45
Искәрмәләр    -0.42
Дереккөздер   -0.39
Iné           -0.39
قایناقلار     -0.37
חיצוניים      -0.37
theless       -0.36
Zust          -0.36
टॉप           -0.36
prevail       -0.35
Positive Logits
member        1.16
member        0.94
Member        0.94
MEMBER        0.86
Member        0.83
members       0.79
membre        0.78
MEMBER        0.73
gap           0.70
membro        0.69
Activations Density 0.167%