INDEX
Explanations
the word "Denny" with different activation strengths, specifically with an emphasis on the higher activations
the name "Lenny" in various contexts
Negative Logits
pmwiki        -0.77
behav         -0.74
itud          -0.72
76561         -0.71
istical       -0.70
ioch          -0.70
juven         -0.70
ifier         -0.68
spring        -0.68
Hurricanes    -0.65
Positive Logits
oshenko       0.88
enny          0.75
hyde          0.75
utical        0.73
ollar         0.73
Arcade        0.72
roach         0.67
ption         0.67
bles          0.67
wallet        0.67
Activations Density: 0.009%