INDEX
Explanations
terms related to laws, specifically focusing on prohibition
references to prohibition laws
New Auto-Interp
Negative Logits
Fault
-0.65
events
-0.65
Tur
-0.65
Tur
-0.65
PE
-0.64
ource
-0.63
eon
-0.63
Oval
-0.62
rics
-0.61
)</
-0.59
POSITIVE LOGITS
prohibition
1.20
regimes
0.87
ategory
0.85
prohibited
0.85
prohibit
0.84
outlaw
0.84
preclude
0.84
prohibiting
0.83
restriction
0.82
alist
0.82
Activations Density 0.015%