INDEX
Explanations
statistical data and numerical values
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
765
+0.15
0.5%
878
+0.14
0.4%
1372
+0.11
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
878
+0.15
0.04
765
+0.14
0.03
382
+0.11
0.03
Negative Logits
unspeak
-0.98
vainly
-0.97
shenan
-0.96
Rine
-0.93
philanth
-0.89
unavoid
-0.89
horrend
-0.89
endeav
-0.89
encomp
-0.88
Vaugh
-0.87
POSITIVE LOGITS
cipolla
0.77
herbes
0.72
torba
0.72
">=
0.68
farbe
0.67
distanciation
0.65
augus
0.64
lievito
0.64
grises
0.64
autunno
0.63
Activations Density 0.070%