INDEX
Explanations
phrases related to advocating for or calling for action
instances of the word "have" in various contexts
New Auto-Interp
Negative Logits
rift
-0.59
wounding
-0.58
backdrop
-0.58
laus
-0.57
catentry
-0.56
Creep
-0.56
hill
-0.56
etter
-0.55
Dough
-0.52
etting
-0.52
POSITIVE LOGITS
been
0.98
gotten
0.98
recourse
0.91
gotten
0.86
undergone
0.84
been
0.83
eaten
0.81
taken
0.79
access
0.78
seen
0.77
Activations Density 0.114%