Explanations
terms related to prohibition or restriction
words related to prohibitions or restrictions
Negative Logits
Token        Logit
sie          -0.81
framework    -0.80
wm           -0.71
wake         -0.71
lycer        -0.70
reset        -0.69
holm         -0.69
dds          -0.69
factor       -0.68
ead          -0.68
Positive Logits
Token            Logit
imports          0.99
smoking          0.88
discrimination   0.87
outright         0.86
abortions        0.85
anyone           0.84
unauthorized     0.83
importing        0.82
exporting        0.80
outsiders        0.80
Activations Density: 0.084%
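
As a rough illustration of what an activation density figure like the one above may mean, the sketch below treats density as the fraction of sampled token positions on which the feature's activation is above zero. That definition, the function name activation_density, the threshold parameter, and the toy data are all assumptions made for illustration; the source does not state how its figure was computed or how many tokens were sampled.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: density means "share of sampled tokens on which the feature fires".
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy sample: 100,000 token positions, 84 of which fire.
# These numbers are made up so the result lines up with the reported figure.
sample = [0.0] * 99_916 + [1.0] * 84
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.084%
```

Under this reading, a density of 0.084% would mean the feature fires on fewer than one token in a thousand, which would fit a feature tied to a narrow theme such as prohibition or restriction.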