INDEX
Explanations
first-person singular pronouns followed by verbs indicating leading or guiding actions
New Auto-Interp
Negative Logits
ese
-0.75
ary
-0.71
RP
-0.62
Enlarge
-0.62
supplied
-0.61
YY
-0.60
BR
-0.59
sorted
-0.59
ially
-0.59
yz
-0.58
POSITIVE LOGITS
believe
1.03
realize
0.86
conclude
0.86
conclusions
0.84
realise
0.80
pursue
0.79
discover
0.77
embrace
0.76
contemplate
0.75
extremes
0.73
Activations Density 0.161%