INDEX
Explanations
time-related events or experiences
Neuron Alignment
Index   Value   % of L₁
2034    +0.15   0.5%
1376    +0.13   0.4%
1194    +0.11   0.4%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1376    +0.15      0.03
1194    +0.13      0.03
1135    +0.11      0.03
Negative Logits
Estilo        -0.61
Diret         -0.56
Continu       -0.56
detal         -0.55
Febrero       -0.55
Loja          -0.55
Abril         -0.54
Ofer          -0.54
databinding   -0.54
Avez          -0.54
Positive Logits
maneu         0.86
reluct        0.86
ago           0.85
gaily         0.83
reconno       0.83
fep           0.82
milf          0.81
Pamphlet      0.81
ftu           0.81
disagre       0.80
Activations Density: 0.073%