INDEX
Explanations
words related to actions or processes denoting movement or change
actions or processes related to management and control
New Auto-Interp
Negative Logits
oos
-0.72
SHIP
-0.70
sw
-0.68
aly
-0.64
squ
-0.63
isp
-0.62
fare
-0.62
coord
-0.62
ners
-0.61
away
-0.60
POSITIVE LOGITS
ometimes
1.04
ilver
0.95
ensibly
0.87
hirt
0.84
omething
0.84
hift
0.79
uggest
0.78
paces
0.78
afety
0.78
heet
0.74
Activations Density 0.377%