INDEX
Explanations
phrases prompting to consider actions or decisions
instances of the word "consider" and its variations, indicating suggestions or recommendations
New Auto-Interp
Negative Logits
oiler
-0.66
mith
-0.64
place
-0.62
IER
-0.61
ijn
-0.59
oil
-0.58
ifact
-0.58
orld
-0.57
cart
-0.57
attendant
-0.57
POSITIVE LOGITS
phas
0.92
MFT
0.90
ilitarian
0.86
ibility
0.85
ationally
0.80
ate
0.80
ably
0.79
mental
0.75
prising
0.73
akeru
0.73
Activations Density 0.024%