INDEX
Explanations
phrases indicating origin or source of participants or subjects
Negative Logits
is                -0.55
مرئيه             -0.52
ľa                -0.47
To                -0.44
SuppressLint      -0.44
She               -0.44
isEqualToString   -0.44
nakalista         -0.43
me                -0.43
là                -0.43
Positive Logits
Portail       0.95
sizeCache     0.94
kteří         0.81
новништво     0.72
zufolge       0.72
بوابة         0.71
whom          0.71
ۜ             0.70
ويكيپيديا     0.68
ktorí         0.68
Activations Density: 0.386%