INDEX
Explanations
references to authority or rules, particularly in the context of governance or political structures
New Auto-Interp
Negative Logits
/*#__
-0.15
rades
-0.14
atitis
-0.14
lej
-0.14
uco
-0.14
ÏĢει
-0.14
rez
-0.13
enu
-0.13
ustum
-0.13
óst
-0.13
POSITIVE LOGITS
rule
1.59
Rule
1.44
rule
1.38
rules
1.35
Rule
1.34
-rule
1.33
RULE
1.28
Rules
1.27
_rule
1.23
ruled
1.16
Activations Density 0.333%