INDEX
Explanations
references to procedures and techniques
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.23
1.4%
316
+0.11
0.6%
1262
+0.09
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
316
+0.23
0.13
858
+0.11
0.10
1023
+0.09
0.09
Negative Logits
<bos>
-3.39
<?
-0.89
ⓧ
-0.86
-0.86
/**
-0.82
/***
-0.76
nahilalakip
-0.69
kasarigan
-0.68
<?
-0.68
Jeografia
-0.67
POSITIVE LOGITS
affor
1.44
maneu
1.35
accla
1.20
increa
1.20
disagre
1.19
reluct
1.18
volunte
1.17
véhic
1.15
shenan
1.13
impra
1.12
Activations Density 0.522%