INDEX
Explanations
occurrences of the word "turn" in various contexts
Negative Logits
../../../   -0.17
imized      -0.16
ually       -0.15
ged         -0.15
zing        -0.15
ãĥ¥         -0.14
jour        -0.14
ersonic     -0.14
nard        -0.14
lou         -0.14
Positive Logits
pike        0.39
stile       0.37
tables      0.30
itin        0.27
tabl        0.27
about       0.26
table       0.25
ips         0.25
around      0.24
-around     0.23
Activations Density 0.023%