INDEX
Explanations
references to bans, suspensions, and rejections in various contexts
New Auto-Interp
Negative Logits
ĮĢ
-0.14
åĪ»
-0.14
.cms
-0.14
lash
-0.13
Shed
-0.13
berger
-0.13
-counter
-0.13
oth
-0.13
DED
-0.13
decks
-0.13
POSITIVE LOGITS
due
0.37
due
0.35
because
0.34
because
0.32
åĽłä¸º
0.31
wegen
0.29
بسبب
0.28
_due
0.26
debido
0.26
Due
0.26
Activations Density 0.217%