INDEX
Explanations
terms or phrases related to legal statements and documentation
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
650
+0.08
0.3%
50
+0.07
0.2%
25
+0.07
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
338
+0.08
0.03
1268
+0.07
0.02
1880
+0.07
0.02
Negative Logits
<bos>
-1.24
/**
-0.69
public
-0.68
void
-0.65
/**
-0.63
</tbody>
-0.61
den
-0.60
}{||-0.60
///**
-0.60
ɵɵ
-0.60
POSITIVE LOGITS
accla
2.12
increa
2.05
affor
2.01
maneu
1.98
impra
1.93
reluct
1.88
wherea
1.88
disagre
1.86
fortn
1.85
strick
1.81
Activations Density 0.131%