INDEX
Explanations
items or actions related to knives
references to knives in various contexts
Negative Logits
mberg     -0.84
phas      -0.78
rians     -0.74
rian      -0.73
rix       -0.73
auri      -0.68
alse      -0.68
Emin      -0.67
nces      -0.66
ysical    -0.65
Positive Logits
scissors  1.09
blade     1.04
blades    0.99
knife     0.99
wielded   0.94
knife     0.92
knives    0.92
claws     0.91
edge      0.89
cutter    0.88
Activations Density 0.043%
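
A density figure like the one above is commonly read as the fraction of sampled tokens on which the feature has a nonzero activation. The sketch below shows that calculation under this assumption. The function name, the threshold parameter, and the sample data are hypothetical and do not come from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions where the activation exceeds threshold."""
    if not activations:
        raise ValueError("activations must be non-empty")
    # Count positions where the feature is considered active.
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 43 active tokens out of 100,000 sampled tokens.
sample = [1.0] * 43 + [0.0] * 99_957
density = activation_density(sample)
print(f"{density:.3%}")
```

A density this low would mean the feature fires on only a few tokens in every ten thousand, which fits a narrow, topic-specific feature.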