INDEX
Explanations
variations of the term "Twist."
New Auto-Interp
Negative Logits
oct
-0.17
eur
-0.17
ei
-0.16
ead
-0.16
undis
-0.15
UDO
-0.15
sing
-0.15
ozo
-0.15
RESS
-0.15
oa
-0.15
POSITIVE LOGITS
tw
0.34
Tw
0.31
elfth
0.26
Tw
0.26
tw
0.24
elve
0.24
inkle
0.24
isted
0.24
-tw
0.21
addle
0.20
Activations Density 0.013%