INDEX
Explanations
short phrases indicating progression or continuation
Neuron Alignment
Index   Value   % of L₁
1137    +0.14   0.5%
897     +0.14   0.5%
1265    +0.12   0.4%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1137    +0.14      0.05
131     +0.14      0.03
554     +0.12      0.04
Negative Logits
Argumento      -0.60
Vedi           -0.57
Inoltre        -0.56
katapos        -0.55
Contactez      -0.55
Avez           -0.55
Eksteraj       -0.54
Sinopse        -0.54
NamedQueries   -0.52
kemer          -0.52
Positive Logits
ALONG     1.13
along     0.98
along     0.96
Along     0.93
reluct    0.92
milf      0.92
Along     0.90
shenan    0.90
madonna   0.87
scrat     0.85
Activations Density 0.088%