INDEX
Explanations
mentions of footwear, particularly boots
mentions of boots
New Auto-Interp
Negative Logits
terday
-0.71
udic
-0.68
ught
-0.68
Ĭ±
-0.68
enced
-0.68
NAD
-0.67
encers
-0.67
neurot
-0.67
LD
-0.66
kefeller
-0.65
POSITIVE LOGITS
strap
1.89
loader
1.14
stra
1.11
legged
1.03
camp
0.99
leg
0.88
tails
0.87
loader
0.83
boot
0.81
lake
0.81
Activations Density 0.012%