Explanations
instances of the word "look" in various forms, indicating a focus on sight or observation-related actions
Negative Logits
Token     Logit
iom       -0.16
olik      -0.15
ollah     -0.15
.shiro    -0.15
idar      -0.14
150       -0.14
ula       -0.14
eker      -0.14
elix      -0.14
uada      -0.14
Positive Logits
Token         Logit
closely       0.21
up            0.20
istrovství    0.17
at            0.17
carefully     0.17
online        0.16
through       0.16
around        0.16
elsewhere     0.16
Twice         0.16
Activations Density 0.049%