INDEX
Explanations
Instances of the word "look" in its various forms, typically in expressions of anticipation or interest.
Negative Logits
Token      Value
uali       -0.18
æĤł        -0.16
ooth       -0.16
haven      -0.15
otta       -0.15
_LEG       -0.15
Gol        -0.14
occo       -0.14
åħ«        -0.14
EMPLARY    -0.14
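Entries such as `æĤł` and `åħ«` in the negative-logit list look like byte-level BPE display strings rather than the characters the tokens actually represent. The sketch below assumes the tokenizer uses the GPT-2-style byte-to-unicode mapping, which the source does not state. The helper name `decode_token` is hypothetical. Under that assumption, it maps a displayed token back to the UTF-8 text it encodes.

```python
def bytes_to_unicode():
    """Byte -> display-character table, as used by GPT-2-style byte-level BPE.

    Printable bytes keep their own code point; the remaining bytes are
    shifted to code points starting at 256 so every byte has a visible glyph.
    """
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    byte_values = printable[:]
    code_points = printable[:]
    shift = 0
    for b in range(256):
        if b not in printable:
            byte_values.append(b)
            code_points.append(256 + shift)
            shift += 1
    return dict(zip(byte_values, (chr(c) for c in code_points)))


# Inverse table: display character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(display: str) -> str:
    """Hypothetical helper: turn a displayed token string back into text.

    Assumes the display string came from a GPT-2-style byte-level tokenizer.
    Byte sequences that are not valid UTF-8 on their own are replaced.
    """
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in display)
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for token in ["æĤł", "åħ«", "uali"]:
        print(repr(token), "->", repr(decode_token(token)))
```

If the assumption holds, `æĤł` decodes to `悠` and `åħ«` decodes to `八`, while plain ASCII entries such as `uali` come back unchanged.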
Positive Logits
Token         Value
forward       0.46
forward       0.31
-forward      0.27
fwd           0.27
FORWARD       0.27
forwarding    0.27
forwarded     0.26
.forward      0.26
_forward      0.26
f             0.24
Activations Density 0.013%
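To give a sense of scale for the density figure, here is a minimal sketch. It assumes that "activations density" means the fraction of sampled tokens on which the feature has a nonzero activation, a definition the source does not give. Both function names are hypothetical.

```python
def activation_density_percent(activations):
    """Percentage of tokens with a nonzero activation (assumed definition)."""
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > 0)
    return 100.0 * active / len(activations)


def expected_active_tokens(density_percent, n_tokens):
    """Expected number of active tokens in a sample at the given density."""
    return density_percent / 100.0 * n_tokens


if __name__ == "__main__":
    # At a density of 0.013%, roughly 130 of every million tokens are active.
    print(round(expected_active_tokens(0.013, 1_000_000)))
```

Under that reading, a density of 0.013% means the feature is active on about 13 tokens in every 100,000.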