Neuron Alignment
Index   Value   % of L₁
50      +0.24   0.9%
1150    +0.12   0.5%
1491    +0.08   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
131     +0.24      0.04
1150    +0.12      0.03
1995    +0.08      0.04
Negative Logits
<bos>           -2.87
/*              -0.88
/**             -0.85
ⓧ               -0.81
ɵɵ              -0.75
Література      -0.74
<?              -0.70
/*++            -0.68
addGroup        -0.68
PerformLayout   -0.66
Positive Logits
reluct     1.91
maneu      1.85
affor      1.83
impra      1.78
disagre    1.77
accla      1.76
unlaw      1.73
increa     1.71
volunte    1.70
philanth   1.68
Activations Density: 0.331%