INDEX
Explanations
official titles or positions within organizations
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
453
+0.15
0.4%
1177
+0.14
0.4%
227
+0.12
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
16
+0.15
0.04
752
+0.14
0.03
227
+0.12
0.04
Negative Logits
purée
-0.56
BEHAV
-0.52
sprigs
-0.52
frites
-0.51
jenner
-0.49
viewWillAppear
-0.49
shallots
-0.49
viewDid
-0.48
INFLU
-0.46
EFFICIENCY
-0.46
POSITIVE LOGITS
destinées
0.99
soulign
0.98
soigneusement
0.98
Cfr
0.96
Souha
0.96
CiNii
0.86
Leurs
0.86
Keny
0.86
Mə
0.86
spécialement
0.86
Activations Density 0.162%