INDEX
Explanations
words related to tools, specifically hammers
references to hammers and hammering actions
New Auto-Interp
Negative Logits
VICE
-0.75
apes
-0.71
Leary
-0.71
ected
-0.66
Wide
-0.66
Sav
-0.66
Relations
-0.64
resp
-0.64
ITIES
-0.64
Es
-0.64
POSITIVE LOGITS
hammer
1.36
hammered
1.03
sonian
1.00
bats
0.96
hammer
0.93
hamm
0.93
matical
0.91
nails
0.85
axe
0.82
wrench
0.78
Activations Density 0.008%