INDEX
Explanations
phrases indicating evaluation or contemplation of information
instances of the word "considering."
New Auto-Interp
Negative Logits
ode
-0.60
atre
-0.57
orc
-0.56
eva
-0.55
shuttle
-0.55
crashed
-0.54
cade
-0.54
rows
-0.53
ater
-0.53
binds
-0.53
POSITIVE LOGITS
considering
3.46
contemplating
1.91
Considering
1.68
Considering
1.65
consider
1.65
evaluating
1.41
reconsider
1.37
comparing
1.37
judging
1.37
consideration
1.35
Activations Density 0.015%