INDEX
Explanations
various forms of the word "idea" and related concepts
New Auto-Interp
Negative Logits
ucha
-0.16
endon
-0.16
BSD
-0.15
dess
-0.15
our
-0.15
ir
-0.15
imes
-0.14
conj
-0.14
ello
-0.14
don
-0.14
POSITIVE LOGITS
ohn
0.15
istic
0.15
aida
0.15
ative
0.15
ually
0.14
beh
0.14
istically
0.14
.idea
0.14
behind
0.14
epoch
0.14
Activations Density 0.049%