INDEX
Explanations
action verbs related to combat and violence
New Auto-Interp
Negative Logits
Bullet
-0.16
Bullet
-0.15
é¢Ĩ
-0.15
GOR
-0.15
INK
-0.15
aura
-0.14
ĥĿ
-0.14
bins
-0.14
_bindings
-0.14
_DOT
-0.14
POSITIVE LOGITS
clubs
0.32
axes
0.31
staff
0.30
Clubs
0.30
sword
0.29
axe
0.28
pole
0.27
staff
0.27
blade
0.26
clubs
0.26
Activations Density 0.203%