INDEX
Explanations
words related to regulatory or de-regulatory processes
terms related to regulation
New Auto-Interp
Negative Logits
¿½
-1.02
lihood
-0.83
ĺħ
-0.77
\\\\\\\\
-0.71
Haram
-0.70
Fine
-0.69
Misty
-0.68
Dangerous
-0.68
=-=-
-0.67
ĪĴ
-0.63
POSITIVE LOGITS
nant
1.14
ardless
1.10
arded
1.04
ulatory
1.02
istration
0.99
rett
0.99
arious
0.98
roup
0.96
inal
0.95
ulations
0.95
Activations Density 0.008%