INDEX
Explanations
terms related to things that are hidden, undisclosed, or not widely known
prefixes related to negation or lack
New Auto-Interp
Negative Logits
=-=-=-=-=-=-=-=-
-0.75
bull
-0.74
bulls
-0.72
sarc
-0.67
sucking
-0.67
simulator
-0.63
initials
-0.62
Knights
-0.62
straw
-0.61
rooting
-0.60
POSITIVE LOGITS
leased
1.58
achable
1.48
ported
1.47
ason
1.30
ached
1.26
peat
1.25
vised
1.20
ired
1.13
acted
1.10
emed
1.03
Activations Density 0.028%