INDEX
Explanations
mentions of rules, regulations, violations, and associated penalties or punishments
terms and phrases related to legal violations and penalties
New Auto-Interp
Negative Logits
optim
-0.78
Miracle
-0.69
wow
-0.67
ween
-0.67
Beaut
-0.66
pheus
-0.66
NetMessage
-0.65
bits
-0.63
akes
-0.63
Perfect
-0.63
POSITIVE LOGITS
penalties
1.63
punishment
1.53
fines
1.51
punishments
1.50
penalty
1.49
reprim
1.48
fined
1.44
expulsion
1.38
imprisonment
1.37
jail
1.37
Activations Density 0.383%