INDEX
Explanations
words related to database and technical support
Neuron Alignment
Index    Value    % of L₁
50       +0.10    0.5%
394      +0.06    0.3%
1288     +0.04    0.2%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1385     +0.10       0.25
1870     +0.06       0.01
2011     +0.04       0.09
Negative Logits
<bos>        -1.70
public       -0.89
’            -0.88
if           -0.86
/**          -0.83
/**          -0.83
.            -0.83
protected    -0.83
of           -0.83
//           -0.82
Positive Logits
increa       2.29
affor        2.29
Juf          2.23
maneu        2.18
aen          2.15
impra        2.13
emphat       2.09
stockholm    2.06
Augu         2.06
dises        2.05
Activations Density 6.293%