Explanations
mentions of mixing or combinations
Neuron Alignment
Index    Value    % of L₁
1516     +0.13    0.5%
1096     +0.12    0.4%
889      +0.12    0.4%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1516     +0.13       0.03
1548     +0.12       0.02
1096     +0.12       0.02
Negative Logits
akade       -0.56
biograf     -0.51
biografi    -0.50
solidar     -0.50
akut        -0.48
geograf     -0.48
gole        -0.48
kriminal    -0.47
vermel      -0.47
kosme       -0.47
Positive Logits
mix       1.20
Mix       1.18
MIX       1.15
mixes     1.14
Mix       1.12
mix       1.11
mixing    1.07
mixed     1.03
MIX       1.03
mixed     0.98
Activations Density: 0.078%