INDEX
Explanations
references to technical terms and company names
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.12
0.5%
1978
+0.07
0.3%
344
+0.06
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
16
+0.12
0.06
1343
+0.07
0.07
344
+0.06
0.05
Negative Logits
<bos>
-1.59
ransition
-0.82
ตร์
-0.74
ensure
-0.73
public
-0.69
///**
-0.69
define
-0.69
ുള്ള
-0.68
-0.68
-0.68
POSITIVE LOGITS
affor
2.43
increa
2.23
volunte
2.13
guarante
2.10
véhic
2.07
philanth
2.06
maneu
2.04
inev
2.02
Intere
2.01
Confu
2.00
Activations Density 0.242%