INDEX
Explanations
the word "Dom" or variations of it
terms related to dominance in various contexts
New Auto-Interp
Negative Logits
packing
-0.75
SEE
-0.66
seeker
-0.61
arrow
-0.59
UGH
-0.59
arson
-0.58
arrows
-0.57
forms
-0.57
eye
-0.57
eyes
-0.57
POSITIVE LOGITS
estic
1.40
ino
1.31
inant
1.27
inating
1.20
ination
1.18
ains
1.15
aine
1.05
inate
1.04
inated
1.00
inion
0.98
Activations Density 0.029%