INDEX
Explanations
dates in the format 'YYYY' occurring in the text
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
2019
+0.23
0.8%
381
+0.16
0.5%
1842
+0.15
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
950
+0.23
0.07
1265
+0.16
0.05
736
+0.15
0.06
Negative Logits
himo
-0.85
Ceinture
-0.67
arrivant
-0.67
appuy
-0.67
Mémoires
-0.66
règlement
-0.65
nowu
-0.64
règne
-0.63
MediatR
-0.63
FBref
-0.63
POSITIVE LOGITS
maneu
0.84
reluct
0.78
disagre
0.78
apprehen
0.74
encomp
0.72
which
0.72
disad
0.70
ineffec
0.69
shenan
0.68
inev
0.68
Activations Density 0.167%