INDEX
Explanations
words related to legislation and policy-making
phrases related to environmental and economic impacts
New Auto-Interp
Negative Logits
'.
-0.80
.).
-0.79
''.
-0.76
.'
-0.73
?).
-0.71
.]
-0.66
attRot
-0.65
".
-0.65
}.
-0.64
'."
-0.64
POSITIVE LOGITS
outpatient
0.62
-,
0.55
geographically
0.55
âμ
0.54
physically
0.54
physical
0.54
nighttime
0.51
physical
0.51
foundational
0.51
nonviolent
0.51
Activations Density 1.793%