INDEX
Explanations
actions related to legal or disciplinary measures
Negative Logits
itchen    -0.18
agas      -0.15
alach     -0.15
Ậ         -0.14
Äįet      -0.14
indem     -0.14
aggi      -0.13
berman    -0.13
ntl       -0.13
anches    -0.13
Positive Logits
due        0.43
due        0.40
Due        0.35
_due       0.34
Due        0.34
debido     0.33
vido       0.30
بسبب       0.29
following  0.29
wegen      0.28
Activations Density 0.190%