INDEX
Explanations
occurrences of the word "hump"
references to camels and their humps
New Auto-Interp
Negative Logits
Vale
-0.75
wra
-0.72
unal
-0.70
xen
-0.69
Xan
-0.68
Ren
-0.68
hern
-0.68
ren
-0.65
conn
-0.64
Stamford
-0.63
POSITIVE LOGITS
hump
3.09
pedal
1.44
sculpt
1.27
grind
1.25
plateau
1.20
ridge
1.18
herd
1.14
ules
1.06
calves
1.05
fuck
1.02
Activations Density 0.074%