Explanations
references to women's health and wellbeing in various contexts
Neuron Alignment
Index   Value   % of L₁
271     +0.20   1.2%
261     +0.16   1.0%
186     +0.16   1.0%
Correlated Neurons
Index   P. Corr.   Cos Sim.
261     +0.20      0.08
441     +0.16      -0.02
271     +0.16      -0.06
Negative Logits
sheet          -1.54
edition        -1.49
statement      -1.41
offers         -1.41
recogn         -1.40
defenses       -1.38
itself         -1.38
won            -1.36
findViewById   -1.35
competence     -1.30
Positive Logits
ĻĤ     2.91
Īĺ     2.54
¿½     2.41
IJ     2.36
»¿     2.24
ĭ      2.21
à¯į    2.17
º      2.17
Ļª     2.12
ħ      2.05
Activations Density: 6.194%