INDEX
Explanations
words related to regulations or regulatory bodies
terms related to regulations or governing principles
Negative Logits
¿½          -0.95
lihood      -0.76
ĺħ          -0.76
Haram       -0.74
=-=-        -0.70
ACP         -0.68
\\\\\\\\    -0.67
Fine        -0.66
Dangerous   -0.64
chal        -0.64
Positive Logits
ardless     1.18
arious      1.04
nant        1.04
istration   1.03
arded       1.00
rett        1.00
ulatory     0.98
ulators     0.97
ulations    0.97
roup        0.94
Activations Density 0.014%