INDEX
Explanations
phrases related to supporting evidence or development plans in a text
Neuron Alignment
Index   Value   % of L₁
50      +0.10   0.4%
25      +0.06   0.3%
1052    +0.06   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1771    +0.10      0.04
1256    +0.06      0.04
1052    +0.06      0.04
Negative Logits
Token           Logit
<bos>           -1.63
public          -0.77
േ               -0.71
///**           -0.71
enumerate       -0.71
displayquote    -0.71
CreateIndex     -0.71
HasColumnType   -0.69
,               -0.69
//              -0.69
Positive Logits
Token       Logit
stockholm   2.15
Carrying    2.13
maneu       2.13
affor       2.11
accla       2.05
Carried     2.02
Carry       1.99
impra       1.98
increa      1.96
shenan      1.96
Activations Density 0.115%