Explanations
mentions of non-compliance and violations of regulations or laws
phrases and words related to legal compliance
Negative Logits
orah     -0.85
ndra     -0.74
hler     -0.70
ulz      -0.69
ouf      -0.68
bably    -0.65
chief    -0.65
mAh      -0.64
ju       -0.64
edin     -0.64
Positive Logits
laws            0.70
stringent       0.69
orders          0.69
obligations     0.69
directives      0.69
norms           0.69
instructions    0.68
provisions      0.67
regulations     0.67
their           0.66
Activations Density: 0.073%
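
For orientation, an activation density like the one above is usually read as the fraction of token positions on which the feature fires. The sketch below assumes that definition. The function name `activation_density`, the zero threshold, and the sample data are all hypothetical, chosen for illustration rather than taken from the dashboard that produced these numbers.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of activations strictly above `threshold`.

    Assumption: "density" means the share of token positions where the
    feature is active. The source page does not state its exact definition.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical data: 10,000 token positions, 7 of which activate the feature.
acts = [0.0] * 10_000
for i in (3, 512, 1024, 2048, 4096, 8191, 9999):
    acts[i] = 1.5

density = activation_density(acts)
print(f"{density:.3%}")  # 7 / 10,000 positions, printed as a percentage
```

Under this reading, a density of 0.073% would correspond to the feature firing on roughly 7 of every 10,000 tokens, which fits a narrow feature tied to compliance and regulatory vocabulary.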