INDEX
Explanations
instances of the word "make" and its variations, indicating a focus on creation or action
New Auto-Interp
Negative Logits
actics
-0.15
\<^
-0.15
nda
-0.15
Ì£
-0.14
itis
-0.14
warn
-0.14
±Ð¾ÑĤ
-0.14
çŃĶæ¡Ī
-0.14
awan
-0.14
StdString
-0.13
POSITIVE LOGITS
mistake
0.29
mistakes
0.28
noises
0.26
contribution
0.26
noise
0.26
choices
0.23
mist
0.23
distinction
0.23
connection
0.23
strides
0.22
Activations Density 0.145%