INDEX
Explanations
words related to the act of shaping or creating
New Auto-Interp
Negative Logits
emas
-0.21
rome
-0.16
formally
-0.16
formal
-0.16
bane
-0.16
vert
-0.15
esis
-0.15
ken
-0.15
grams
-0.15
verter
-0.15
POSITIVE LOGITS
ulating
0.35
ulates
0.31
idable
0.29
ative
0.25
ulate
0.25
ulary
0.25
ulators
0.24
atted
0.23
ulas
0.23
íĥľ
0.23
Activations Density 0.068%