INDEX
Explanations
References to the term "Hammer" and its variants, often in music- or tool-related contexts
Negative Logits
andler     -0.17
@$_        -0.16
conde      -0.15
绩         -0.14
ramework   -0.14
bos        -0.14
sim        -0.14
omanip     -0.14
бора       -0.14
-я         -0.14
Positive Logits
head      0.19
heads     0.17
Hammer    0.17
dul       0.17
hammer    0.17
oen       0.17
ing       0.16
mith      0.16
sublic    0.16
hammer    0.15
Activations Density 0.009%