INDEX
Explanations
instances of the word "come" and its variations
New Auto-Interp
Negative Logits
dzi
-0.16
rit
-0.16
ume
-0.15
forme
-0.15
iei
-0.15
infer
-0.15
amac
-0.14
ritz
-0.14
licer
-0.14
urai
-0.14
POSITIVE LOGITS
back
0.21
here
0.19
home
0.18
upp
0.17
into
0.17
backs
0.16
_here
0.16
onto
0.16
-back
0.16
here
0.15
Activations Density 0.052%