INDEX
Explanations
mentions of ballot-related terms, likely related to elections and voting
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
889
+0.23
1.0%
204
+0.20
0.9%
1757
+0.14
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
204
+0.23
0.04
889
+0.20
0.04
227
+0.14
0.06
Negative Logits
كومونز
-0.64
Himo
-0.61
fontName
-0.60
EndInit
-0.58
PerformLayout
-0.57
styleType
-0.57
Fö
-0.56
kosme
-0.54
Tél
-0.53
Ӧ
-0.53
POSITIVE LOGITS
🤣🤣
0.84
unspeak
0.84
rval
0.83
laft
0.82
:'(
0.81
pegasus
0.81
thut
0.81
pixar
0.81
intersper
0.80
indescri
0.80
Activations Density 0.432%