INDEX
Explanations
phrases related to violations of rules or laws
terms related to legal or regulatory violations
New Auto-Interp
Negative Logits
arger
-0.84
Tycoon
-0.84
NetMessage
-0.76
roxy
-0.70
ilts
-0.63
zie
-0.60
igmat
-0.59
rica
-0.58
river
-0.58
glamorous
-0.58
POSITIVE LOGITS
violations
1.00
viol
0.98
violation
0.90
punishable
0.88
committed
0.83
Viol
0.79
thereof
0.76
viol
0.76
lig
0.72
infring
0.70
Activations Density 0.076%