INDEX
Explanations
mentions of following or contacting individuals and organizations
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.22
0.9%
1177
+0.11
0.4%
856
+0.09
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
254
+0.22
0.06
1445
+0.11
0.06
1799
+0.09
0.04
Negative Logits
<bos>
-2.16
ⓧ
-1.05
<?
-0.92
-0.78
/**
-0.73
.
-0.71
introduce
-0.69
/*
-0.66
in
-0.66
realize
-0.66
POSITIVE LOGITS
accla
2.35
affor
2.28
reluct
2.19
maneu
2.12
wherea
2.09
emphat
2.04
depic
2.03
increa
2.02
inev
2.01
véhic
2.00
Activations Density 0.210%