INDEX
Explanations
mentions of equality, specifically related to rights, worth, and love
Neuron Alignment
Index   Value   % of L₁
596     +0.18   0.6%
1387    +0.18   0.6%
1581    +0.13   0.4%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1387    +0.18      0.04
596     +0.18      0.03
130     +0.13      0.03
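
The two columns above measure different things, which may explain why a correlation of +0.18 can sit next to a cosine similarity of only 0.03. The page does not say how either number is computed. The sketch below assumes that P. Corr. is the Pearson correlation between two activation series over the same tokens, and that Cos Sim. is the cosine similarity between two weight vectors. The function names and the toy inputs are hypothetical and are not taken from the dashboard.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length activation series."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Covariance and variances are left unnormalized; the 1/n factors cancel.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def cosine(u, v):
    """Cosine similarity of two equal-length weight vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


# Hypothetical toy data: activations of a feature and a neuron on four tokens.
feature_acts = [0.0, 1.0, 2.0, 3.0]
neuron_acts = [1.0, 3.0, 5.0, 7.0]
print(pearson(feature_acts, neuron_acts))

# Hypothetical toy data: two weight directions that share no components.
print(cosine([1.0, 0.0], [0.0, 1.0]))
```

Pearson correlation subtracts the mean of each series and compares behaviour across inputs, while cosine similarity compares directions in weight space without looking at any inputs, so the two values need not agree.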
Negative Logits
liberality      -0.59
thereupon       -0.58
Voilà           -0.56
Shakspeare      -0.56
tolerably       -0.55
Aras            -0.54
endeavouring    -0.54
Autre           -0.53
Inhabitants     -0.52
Sugges          -0.51
Positive Logits
Equal       1.08
equal       1.07
equal       0.99
EQUAL       0.99
Equal       0.97
equality    0.85
equ         0.83
EQUAL       0.82
EQU         0.81
equals      0.80
Activations Density 0.103%