INDEX
Explanations
terms indicating permission or enabling actions
the word "allow" in various contexts.
Negative Logits
projectName         -0.33
архивлан            -0.33
CIS                 -0.31
Rie                 -0.31
StringCopy          -0.30
<strong>            -0.29
Démographie         -0.29
KommentareTeilen    -0.28
mad                 -0.28
Rie                 -0.28
Positive Logits
autorytatywna       0.81
ALLOW               0.69
allow               0.67
Allows              0.67
Allows              0.66
ALLOW               0.65
Allow               0.65
Italijanski         0.64
Allow               0.62
GTCX                0.62
Activations Density 0.154%