INDEX
Explanations
instances of the word "initial" in various contexts
New Auto-Interp
Negative Logits
ess
-0.18
ensch
-0.15
831
-0.14
thane
-0.14
oes
-0.14
Ces
-0.14
inev
-0.14
oi
-0.13
ìĭ±
-0.13
running
-0.13
POSITIVE LOGITS
ity
0.19
ÃŃch
0.14
ilst
0.14
lest
0.14
mente
0.14
DCALL
0.14
åı·
0.14
leo
0.14
aptors
0.14
ENCH
0.14
Activations Density 0.009%