INDEX
Explanations
mentions of the name "Jay" with a strong emphasis on "Jay" as the name activates the neuron significantly
the name "Jay" in various contexts related to the individual or character
New Auto-Interp
Negative Logits
ITIES
-0.80
iture
-0.78
milo
-0.72
idad
-0.71
ENTION
-0.68
htaking
-0.66
ities
-0.66
ITY
-0.66
iments
-0.65
ational
-0.65
POSITIVE LOGITS
hawks
1.28
hawk
1.17
walking
1.14
den
0.96
haw
0.96
lon
0.94
bird
0.91
pee
0.90
Jay
0.88
len
0.86
Activations Density 0.025%