INDEX
Explanations
information related to weight measurement
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
690
+0.14
0.4%
453
+0.09
0.3%
1398
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1398
+0.14
0.03
369
+0.09
0.04
1311
+0.09
0.04
Negative Logits
galo
-1.13
lele
-1.12
kram
-1.12
silikon
-1.11
makro
-1.10
frans
-1.09
wien
-1.09
meis
-1.09
milano
-1.08
pira
-1.08
POSITIVE LOGITS
approximately
0.77
roughly
0.70
Até
0.65
Embora
0.64
Apesar
0.64
approx
0.63
shayari
0.62
Enquanto
0.61
absl
0.61
Muito
0.61
Activations Density 0.271%