INDEX
Explanations
concepts related to the influence of corporations and their impact on public health and safety
New Auto-Interp
Negative Logits
Status
-0.16
Status
-0.16
030
-0.16
addon
-0.16
Logic
-0.15
ök
-0.14
erox
-0.14
ahas
-0.14
deaux
-0.14
866
-0.14
POSITIVE LOGITS
decision
0.34
decisions
0.33
decision
0.29
policies
0.27
policy
0.25
policym
0.24
Decision
0.24
Decision
0.23
Policies
0.23
karar
0.23
Activations Density 0.102%