INDEX
Explanations
specifically mentioned or desired actions or targets within a context
Neuron Alignment
Index    Value    % of L₁
50       +0.17    0.8%
1385     +0.14    0.6%
678      +0.10    0.5%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1385     +0.17       0.09
468      +0.14       0.06
1793     +0.10       0.06
Negative Logits
<bos>            -2.77
styleType        -0.85
AlterField       -0.85
|}               -0.84
HasIndex         -0.81
contentLoaded    -0.79
HasAnnotation    -0.78
AddField         -0.78
WriteLiteral     -0.77
//               -0.76
Positive Logits
affor        2.00
napoli       1.99
maneu        1.93
milano       1.92
stockholm    1.91
milf         1.90
fta          1.90
ricardo      1.89
roberto      1.88
jorge        1.88
Activations Density 0.927%