INDEX
Explanations
verbs that denote actions related to placement or movement
New Auto-Interp
Negative Logits
sto
-0.16
him
-0.16
wil
-0.15
ize
-0.15
kon
-0.15
od
-0.15
anga
-0.15
ulate
-0.15
itis
-0.14
uner
-0.14
POSITIVE LOGITS
chten
0.15
resh
0.15
496
0.15
entrev
0.15
ì§ģ
0.14
.createServer
0.14
ovnÃŃ
0.14
uren
0.14
ská
0.14
ablish
0.14
Activations Density 0.107%