INDEX
Explanations
mentions of physical locations or directions
keywords related to movement and activity
Negative Logits
forcer      -0.70
istor       -0.69
timer       -0.63
iece        -0.62
sequence    -0.62
iom         -0.61
utan        -0.61
outine      -0.60
raper       -0.59
unit        -0.59
Positive Logits
ones         1.46
nesses       1.10
mails        1.05
rows         1.03
fronts       1.02
aries        1.01
positions    0.99
outputs      0.99
votes        0.99
voices       0.99
Activations Density 0.602%
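
The density figure above is usually read as the share of token positions on which the feature fires. The sketch below shows one way to compute such a number, assuming that "fires" means an activation strictly above zero. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: a feature counts as active on a token when its
    activation is strictly greater than the threshold (default 0.0).
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical data: 1,000 token positions, the feature fires on 6 of them.
sample = [0.0] * 994 + [0.8, 1.2, 0.3, 2.1, 0.5, 0.9]
density = activation_density(sample)
print(f"{density:.3%}")
```

A density near 0.6% would then mean the feature is active on roughly 6 of every 1,000 tokens, which is a sparse feature under this reading.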