INDEX
Explanations
phrases related to political elections and appointments among other surrounding activities
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1124
+0.19
1.0%
1677
+0.17
0.9%
1909
+0.15
0.8%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1097
+0.19
0.06
1677
+0.17
0.04
1741
+0.15
-0.00
Negative Logits
<bos>
-0.81
unspeak
-0.78
gaily
-0.76
Wtf
-0.70
Whence
-0.67
Monticello
-0.67
Lmao
-0.66
kani
-0.66
unwarran
-0.65
Placer
-0.64
POSITIVE LOGITS
Mur
1.06
Mur
0.94
MUR
0.86
MUR
0.84
Murphy
0.79
mur
0.78
mur
0.78
^
0.69
Murphy
0.69
majest
0.68
Activations Density 0.625%