INDEX
Explanations
references to government-issued instructions or commands, particularly executive orders
instances of executive orders and related terminology
Negative Logits
Token     Logit
Argon     -0.78
dden      -0.71
apest     -0.69
espie     -0.68
olls      -0.68
Cats      -0.67
rown      -0.67
Unity     -0.67
stocks    -0.66
rum       -0.66
Positive Logits
Token          Logit
prohibiting    1.04
authorizing    1.01
issued         1.00
granting       0.96
restricting    0.95
barring        0.91
banning        0.91
decree         0.90
prohibits      0.88
restraining    0.87
Activations Density 0.051%