INDEX
Explanations
terms related to the act of moving forward or taking action
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.20
1.0%
122
+0.14
0.7%
1404
+0.12
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
122
+0.20
0.02
156
+0.14
0.02
1404
+0.12
0.02
Negative Logits
<bos>
-2.74
public
-0.74
echo
-0.72
hub
-0.69
api
-0.67
JspWriter
-0.67
root
-0.67
console
-0.67
str
-0.66
static
-0.65
POSITIVE LOGITS
bandung
1.88
affor
1.87
milf
1.71
jaya
1.71
maneu
1.70
impra
1.65
scrat
1.64
lamborghini
1.63
bangkok
1.63
fta
1.62
Activations Density 0.063%