INDEX
Explanations
proper nouns related to political figures from California
references to California and its representatives
New Auto-Interp
Negative Logits
nings
-0.74
Grail
-0.70
schild
-0.68
culosis
-0.67
Rumble
-0.65
Ukrainian
-0.64
RIS
-0.63
Sleeping
-0.63
theless
-0.62
bol
-0.62
POSITIVE LOGITS
Calif
1.07
sylvania
0.91
aii
0.85
qua
0.80
uti
0.79
ishable
0.78
osi
0.77
orn
0.76
eno
0.76
utation
0.75
Activations Density 0.004%