Explanations
mentions of legal ages and legal restrictions related to marriage and abortion
Neuron Alignment
Index    Value    % of L₁
1978     +0.12    0.3%
1842     +0.11    0.3%
453      +0.10    0.3%
Correlated Neurons
Index    P. Corr.    Cos Sim.
776      +0.12       0.05
453      +0.11       0.05
1978     +0.10       0.04
Negative Logits
unspeak      -1.45
apprehen     -1.32
indescri     -1.26
gaily        -1.22
vainly       -1.21
intersper    -1.20
intrigu      -1.16
shenan       -1.13
tolerably    -1.11
unwarran     -1.11
Positive Logits
karton      1.69
silikon     1.65
lapto       1.55
alkoh       1.53
makro       1.52
kask        1.52
lampa       1.50
mikrofon    1.49
elek        1.47
torba       1.43
Activations Density 0.247%