INDEX
Neuron Alignment
Index
Value
% of L₁
50
+0.13
0.7%
405
+0.07
0.3%
101
+0.06
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
405
+0.13
0.07
492
+0.07
0.04
2032
+0.06
0.04
Negative Logits
<bos>
-1.72
/*
-0.83
ⓧ
-0.81
<?
-0.80
-0.75
public
-0.74
/**
-0.73
//
-0.72
#
-0.70
protected
-0.68
POSITIVE LOGITS
maneu
2.18
accla
2.18
affor
2.17
impra
2.12
disagre
2.12
increa
2.04
reluct
2.01
emphat
1.98
excru
1.97
strick
1.89
Activations Density 0.071%