INDEX
Explanations
proper nouns, specifically names, especially "Jane."
mentions of the name "Jane" in various contexts
Negative Logits
Token       Logit
orescent    -0.91
ctic        -0.91
*/(         -0.84
idated      -0.81
rophe       -0.79
PDATE       -0.78
cffff       -0.76
akespe      -0.75
fecture     -0.75
ulative     -0.73
Positive Logits
Token          Logit
Doe            1.25
Jane           0.94
Aust           0.88
Jane           0.85
Roe            0.84
Approximately  0.84
Jacobs         0.82
Seymour        0.81
Mayer          0.80
ju             0.79
Activations Density 0.016%
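
The density figure is presumably the share of sampled tokens on which the feature fires. The following is a minimal sketch under that assumption; the source does not state its exact definition, and the function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical illustrations rather than anything taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens where the feature
    fires (activation > threshold); the source does not define it.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    # Count the tokens on which the feature is active.
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical data: the feature fires on 2 of 12,500 tokens.
sample = [0.0] * 12_498 + [3.2, 1.7]
print(f"{activation_density(sample):.3f}%")
```

Under this reading, a density of 0.016% would mean the feature is active on roughly 16 of every 100,000 tokens, which fits a feature tied to a narrow set of names.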