INDEX
Explanations
terms related to bans and prohibitions
New Auto-Interp
Negative Logits
بيها
-0.78
p
-0.70
mtext
-0.68
ณะ
-0.59
)}</
-0.59
}}"></
-0.59
mes
-0.59
tagext
-0.58
":[{-0.57
</em>
-0.56
POSITIVE LOGITS
bans
1.45
banning
1.41
banned
1.38
Bans
1.27
prohibitions
1.23
Bans
1.23
prohibition
1.22
prohibiting
1.21
prohibit
1.20
banned
1.14
Activations Density 0.188%