INDEX
Explanations
keywords related to rules, regulations, and decision-making processes
references to formal requests or procedures in a regulatory context
Negative Logits
essel      -0.51
roma       -0.50
namese     -0.49
BU         -0.48
ongyang    -0.48
oppable    -0.48
iverpool   -0.45
HOME       -0.45
ggles      -0.44
slaught    -0.44
Positive Logits
statistical   0.59
ibly          0.58
uly           0.53
jurisd        0.51
Legal         0.51
discretion    0.51
ELY           0.50
Plaint        0.49
prosecut      0.49
lined         0.49
Activations Density: 1.812%