INDEX
Explanations
Information related to technical specifications and measurements
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.13
0.5%
1859
+0.05
0.2%
297
+0.05
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1343
+0.13
0.07
1859
+0.05
0.06
1529
+0.05
0.05
Negative Logits
<bos>
-1.82
public
-0.82
//
-0.81
-0.75
-0.75
for
-0.75
0
-0.73
<h1>
-0.73
-0.72
-0.72
POSITIVE LOGITS
stockholm
1.80
maneu
1.66
lidl
1.65
milf
1.58
wikihow
1.55
impra
1.55
quoique
1.55
véhic
1.50
affor
1.49
peppa
1.49
Activations Density 0.284%