INDEX
Explanations
references to compliance or non-compliance with rules, laws, or regulations
references to compliance with regulations and laws
New Auto-Interp
Negative Logits
iewicz
-0.76
rage
-0.75
nl
-0.72
quarrel
-0.67
nova
-0.66
convol
-0.63
bind
-0.63
ument
-0.63
Gorge
-0.61
aster
-0.61
POSITIVE LOGITS
comply
0.94
Compliance
0.86
compliance
0.82
ively
0.81
ibilities
0.81
encies
0.80
with
0.79
complying
0.78
enza
0.78
compliance
0.76
Activations Density 0.049%