INDEX
Explanations
references to governmental or organizational policies
references to government policies
New Auto-Interp
Negative Logits
ITNESS
-0.85
issan
-0.85
athan
-0.75
ãĤ¨ãĥ«
-0.73
Vel
-0.70
Rocket
-0.69
Brotherhood
-0.69
Sabha
-0.69
avez
-0.69
Flavoring
-0.67
POSITIVE LOGITS
policies
1.11
prescriptions
0.91
Policies
0.90
policy
0.90
preferences
0.84
stances
0.83
olicy
0.82
policy
0.79
governing
0.79
aroo
0.77
Activations Density 0.012%