INDEX
Explanations
references to social interactions or engagements
Negative Logits
tena  -0.43
.  -0.42
oras  -0.41
directe  -0.41
rsiniz  -0.41
вперед  -0.40
aprobó  -0.39
ھر  -0.38
forward  -0.38
tropical  -0.37
Positive Logits
Roskov  0.94
ویکیپدی  0.92
########.  0.83
kasarigan  0.80
ंदीखरीदारी  0.78
]");  0.75
bezeichneter  0.75
Geplaatst  0.74
:+:  0.73
بيها  0.73
Activations Density: 0.327%