Explanations
words related to tools or actions involving a hammer
Negative Logits
uce       -0.75
RIP       -0.72
VICE      -0.71
ITIES     -0.70
cript     -0.69
ected     -0.69
otes      -0.68
ocard     -0.68
Charity   -0.67
ceed      -0.66
Positive Logits
sonian    1.09
heads     0.93
bats      0.93
head      0.90
hammer    0.90
hammer    0.90
ing       0.83
lock      0.80
hamm      0.80
fists     0.77
Activations Density: 0.028%