INDEX
Explanations
action-related terms or calls for intervention
references to taking action or the concept of action
Negative Logits
conservancy   -0.81
abund         -0.69
Province      -0.63
iciency       -0.63
Kou           -0.62
Fey           -0.62
inately       -0.61
illusions     -0.61
abundant      -0.61
ringe         -0.60
Positive Logits
ional         0.93
action        0.87
Replay        0.87
iveness       0.84
ality         0.82
ivism         0.81
igraph        0.81
ACTIONS       0.79
uations       0.79
uary          0.77
Activations Density: 0.034%