INDEX
Explanations
references to various sports
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
350
+0.14
0.8%
389
+0.14
0.8%
155
+0.13
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
389
+0.14
0.03
155
+0.14
0.03
112
+0.13
0.02
Negative Logits
rowser
-1.73
manner
-1.61
miscar
-1.58
fois
-1.52
nlm
-1.50
wake
-1.50
bye
-1.48
fore
-1.47
meu
-1.47
libc
-1.44
POSITIVE LOGITS
smen
2.97
icity
2.31
ivity
2.21
sw
2.20
sm
2.16
iest
2.13
ivation
2.12
ive
2.08
iveness
2.01
ivated
1.96
Activations Density 0.207%