INDEX
Explanations
verbs related to transportation and movement
New Auto-Interp
Negative Logits
ior
-0.70
grading
-0.69
pher
-0.68
zai
-0.67
duction
-0.64
onymous
-0.64
itone
-0.63
chan
-0.63
bies
-0.63
icago
-0.63
POSITIVE LOGITS
weights
1.01
weight
0.94
loads
0.92
loads
0.82
luggage
0.81
baggage
0.80
weights
0.79
suitcase
0.79
belongings
0.78
crates
0.78
Activations Density 4.852%