INDEX
Explanations
instances where an action is taken or an event occurs
phrases related to social and economic issues
New Auto-Interp
Negative Logits
.).
-0.57
!.
-0.53
!).
-0.51
+.
-0.51
.''
-0.50
)!
-0.50
%.
-0.49
'.
-0.49
$.
-0.49
.ãĢį
-0.49
POSITIVE LOGITS
ividual
0.60
osponsors
0.59
bernatorial
0.59
subur
0.55
regarding
0.54
associated
0.52
imentary
0.51
pez
0.51
oided
0.50
ivalent
0.50
Activations Density 1.420%