Explanations
prohibitions or restrictions set by rules or regulations
terms related to prohibition or restrictions
Negative Logits
Token       Logit
eah         -0.84
prise       -0.78
sonian      -0.75
ensional    -0.74
eon         -0.72
gener       -0.72
ctl         -0.72
ipel        -0.70
lings       -0.70
bench       -0.70
Positive Logits
Token         Logit
prohibited    0.96
prohibits     0.83
forbids       0.83
violations    0.81
etheless      0.79
prohibition   0.79
bidden        0.79
avorite       0.78
prohibit      0.78
forbidden     0.76
Activations Density: 0.029%
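
The density figure is not defined on this page. A common convention for feature dashboards, assumed here, is the share of sampled tokens on which the feature has a strictly positive activation. A minimal Python sketch under that assumption (the function name `activation_density` and the toy input are hypothetical, not taken from the source):

```python
def activation_density(activations):
    """Return the percentage of entries with a strictly positive activation.

    Assumption: "density" means the fraction of sampled tokens on which
    the feature fires at all, expressed as a percentage.
    """
    values = list(activations)
    if not values:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in values if a > 0)
    return 100.0 * active / len(values)


# Hypothetical toy input: the feature fires on one token out of four.
sample = [0.0, 0.0, 1.5, 0.0]
print(f"{activation_density(sample):.3f}%")
```

Under this reading, a density of 0.029% would correspond to the feature firing on roughly 29 of every 100,000 sampled tokens.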