INDEX
Explanations
personal pronouns followed by positive statements about one's actions or beliefs
pronouns and references to individuals, particularly in contexts of achievement and recognition
New Auto-Interp
Negative Logits
beware
-0.71
longing
-0.65
Dragonbound
-0.61
RET
-0.60
quartered
-0.60
approximation
-0.59
URI
-0.59
ranged
-0.59
ulsion
-0.58
onde
-0.56
POSITIVE LOGITS
accomplished
1.03
did
0.93
did
0.89
accomplish
0.86
done
0.85
've
0.83
wrought
0.81
do
0.79
achieved
0.77
done
0.76
Activations Density 0.110%