INDEX
Explanations
the word "only" in various contexts, emphasizing exclusivity or singularity
New Auto-Interp
Negative Logits
elt
-0.15
_iff
-0.13
cken
-0.13
ennen
-0.13
кова
-0.13
enough
-0.13
.import
-0.13
atica
-0.13
_$
-0.13
pga
-0.12
POSITIVE LOGITS
thing
0.35
remaining
0.28
Thing
0.24
remaining
0.24
thing
0.24
way
0.23
Thing
0.23
Remaining
0.22
ones
0.21
(thing
0.21
Activations Density 0.043%