INDEX
Neuron Alignment
Index   Value   % of L₁
906     +0.19   0.6%
108     +0.13   0.4%
736     +0.13   0.4%
Correlated Neurons
Index   P. Corr.   Cos Sim.
736     +0.19      0.08
392     +0.13      0.04
2045    +0.13      0.05
Negative Logits
Token      Logit
↵          -0.59
↵↵         -0.58
I          -0.55
.          -0.54
FORMANCE   -0.53
makeText   -0.53
}          -0.53
:          -0.53
↵↵↵        -0.52
;          -0.52
Positive Logits
Token       Logit
simplif     1.34
hentai      1.29
milf        1.24
emphat      1.23
Souha       1.22
michelin    1.21
Mlle        1.21
ritard      1.19
intermitt   1.16
incess      1.15
Activations Density: 0.489%