INDEX
Explanations
recurring conjunctions in a list context
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
247
+0.12
0.7%
369
+0.11
0.6%
39
+0.11
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
494
+0.12
0.45
209
+0.11
0.29
402
+0.11
0.26
Negative Logits
errals
-1.68
alus
-1.67
iterr
-1.57
eson
-1.55
engers
-1.54
ackets
-1.48
onica
-1.45
acters
-1.43
etts
-1.43
tingham
-1.41
POSITIVE LOGITS
Ĥ¬
2.92
µ
2.79
½
2.79
ķ
2.75
Ĺ
2.73
IJ
2.73
¿
2.70
§
2.68
ĸ´
2.67
¯
2.63
Activations Density 1.569%