INDEX
Explanations
instances of the word "still" in various contexts
New Auto-Interp
Negative Logits
origin
-0.73
ighth
-0.70
ocol
-0.66
vals
-0.64
yond
-0.63
ISE
-0.63
imer
-0.63
rosso
-0.62
illet
-0.62
benef
-0.62
POSITIVE LOGITS
pacing
0.91
staring
0.85
amid
0.77
amidst
0.77
screaming
0.76
asleep
0.75
enance
0.75
chanting
0.73
yelling
0.73
helpless
0.73
Activations Density 0.216%