INDEX
Explanations
words related to growth or escalation
Negative Logits
nore      -0.16
ед        -0.16
vod       -0.15
360       -0.15
Vance     -0.14
IEWS      -0.14
okt       -0.14
hog       -0.13
smouth    -0.13
eci       -0.13
Positive Logits
³         0.16
udit      0.15
ulus      0.14
averse    0.14
Gaw       0.13
Gn        0.13
quine     0.13
gerne     0.13
fers      0.13
HN        0.13
Activations Density 0.010%