Explanations
instances of the word "wrinkle" or variations of it
Negative Logits
iji       -0.16
panse     -0.15
еп        -0.15
cede      -0.14
lor       -0.14
perpet    -0.14
uzzle     -0.14
223       -0.14
jour      -0.14
bombs     -0.14
Positive Logits
Wr        0.38
wr        0.33
wr        0.30
Wr        0.29
.wr       0.26
angler    0.26
Wrapped   0.24
wrapping  0.24
Wrap      0.23
wrap      0.23
Activations Density: 0.009%