INDEX
Explanations
names of individuals, specifically women, with the title "Mrs."
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
544
+0.13
0.4%
1527
+0.13
0.4%
1964
+0.11
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1527
+0.13
0.01
544
+0.13
0.01
1964
+0.11
0.01
Negative Logits
unisex
-0.56
">“
-0.51
cardigan
-0.50
internas
-0.50
letti
-0.48
Dishwasher
-0.48
satin
-0.47
проєкту
-0.46
charcoal
-0.46
crochet
-0.46
POSITIVE LOGITS
Mrs
1.26
Mrs
1.21
MRS
0.99
mrs
0.92
MRS
0.83
mrs
0.81
höl
0.72
Meksi
0.72
Chapitre
0.71
Mexique
0.70
Activations Density 0.025%