INDEX
Explanations
instances of the word "shift" and its variations, indicating a focus on changes or transitions
New Auto-Interp
Negative Logits
erate
-0.16
uento
-0.15
ilet
-0.15
lek
-0.14
hide
-0.14
åĤĻ
-0.14
eled
-0.14
ervation
-0.14
hek
-0.14
ITY
-0.14
POSITIVE LOGITS
shift
0.21
Shift
0.21
shifts
0.20
shift
0.20
sands
0.19
(shift
0.19
SHIFT
0.19
Shift
0.19
-shift
0.19
Hlav
0.17
Activations Density 0.017%