INDEX
Explanations
instances of the word "look" in various forms and contexts
Negative Logits
undi    -0.15
doi     -0.15
uns     -0.14
Gors    -0.14
urs     -0.14
лÑĸк    -0.14
oad     -0.14
XX      -0.13
pling   -0.13
Fa      -0.13
Positive Logits
Kon         0.15
par         0.15
par         0.15
ÙĦÛĮت       0.15
utherland   0.14
inf         0.14
askell      0.14
buz         0.14
ekk         0.14
esson       0.14
Activations Density 0.006%