INDEX
Explanations
words related to growth and development
New Auto-Interp
Negative Logits
kova
-0.17
olini
-0.17
692
-0.16
ized
-0.16
oice
-0.15
urn
-0.15
.dx
-0.15
ize
-0.15
ged
-0.15
going
-0.14
POSITIVE LOGITS
pains
0.23
íij¸
0.16
аниÑĨ
0.16
spender
0.15
deaux
0.15
ling
0.15
vek
0.15
arde
0.14
asser
0.14
orest
0.14
Activations Density 0.039%