INDEX
Explanations
words related to transition and transformation concepts
Negative Logits
ows        -0.16
spiel      -0.16
ices       -0.16
strap      -0.15
ouch       -0.15
benh       -0.15
ialized    -0.15
embro      -0.14
osal       -0.14
owski      -0.14
Positive Logits
/trans     0.28
aksi       0.25
ylvania    0.21
-trans     0.20
verse      0.20
parency    0.18
ylv        0.18
sexual     0.18
.trans     0.17
mere       0.17
Activations Density: 0.037%