INDEX
Explanations
connections and associations within a network
Neuron Alignment
Index   Value   % of L₁
260     +0.13   0.7%
127     +0.12   0.7%
494     +0.12   0.7%
Correlated Neurons
Index   P. Corr.   Cos Sim.
372     +0.13      0.01
260     +0.12      0.02
148     +0.12      0.01
Negative Logits
akers        -1.64
orer         -1.58
endment      -1.56
»            -1.55
clamation    -1.52
slogan       -1.47
Ĩ            -1.46
Competing    -1.40
ģ            -1.39
excuse       -1.34
Positive Logits
thereto      1.98
aterally     1.65
ely          1.64
rier         1.63
ioned        1.60
ues          1.57
thon         1.57
itudinal     1.56
hereto       1.55
vest         1.52
Activations Density: 0.145%