INDEX
Explanations
words related to motion or directional movement
Negative Logits
ullah       -0.79
ament       -0.76
creen       -0.68
Horus       -0.64
eers        -0.64
ificent     -0.62
icio        -0.61
oret        -0.58
idable      -0.57
lake        -0.56
Positive Logits
overboard   1.04
verning     1.04
lems        0.99
vt          0.99
ggle        0.96
viral       0.88
unnoticed   0.84
Ń·          0.82
bankrupt    0.81
extinct     0.79
Activations Density: 2.193%