INDEX
Negative Logits
উপে
0.46
监理
0.43
ustainability
0.39
医
0.39
সার্টিফি
0.38
અનુભ
0.38
忘
0.38
symptômes
0.37
ྨ
0.37
Explorer
0.37
POSITIVE LOGITS
arrests
1.74
arrest
1.63
arrest
1.51
arrested
1.48
逮捕
1.48
Arrest
1.38
arresting
1.29
apprehended
1.24
गिरफ्तार
1.23
apprehend
1.17
Activations Density 0.011%