Explanations
mentions of specific policies or efforts being pursued or implemented
references to governmental policies
Negative Logits
Token       Logit
ITNESS      -0.83
parts       -0.75
lihood      -0.72
Flavoring   -0.71
issan       -0.71
Stain       -0.69
athan       -0.69
Sud         -0.69
Barker      -0.67
CLASSIFIED  -0.65
Positive Logits
Token          Logit
enacted        0.96
governing      0.95
policies       0.94
implemented    0.83
prescriptions  0.82
restricting    0.82
affecting      0.81
olicy          0.80
imposed        0.80
policy         0.79
Activations Density: 0.022%