Explanations
proper nouns, specifically names like "Jim"
the name "Jim" across various contexts
Negative Logits
NetMessage      -0.85
DragonMagazine  -0.80
cffff           -0.80
BOOK            -0.80
Spoiler         -0.79
Constructed     -0.75
ylum            -0.74
CONCLUS         -0.70
Charges         -0.70
xual            -0.68
Positive Logits
enez            1.33
mie             1.17
bo              0.88
Yong            0.85
Crow            0.85
Beam            0.84
Ross            0.80
bles            0.80
Murphy          0.79
iny             0.78
Activations Density 0.009%