INDEX
Explanations
information related to the design and development process of a character or concept
Neuron Alignment
Index    Value    % of L₁
1343     +0.23    0.7%
1967     +0.08    0.2%
630      +0.08    0.2%
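The page does not say how the "% of L₁" column is derived. One plausible reading, offered here only as an assumption, is that each neuron's value is reported as its absolute share of the L₁ norm (the sum of absolute values) of the full weight vector. A minimal sketch of that reading, with a hypothetical helper name `percent_of_l1` and a toy vector rather than the real weights:

```python
def percent_of_l1(weights, top_k=3):
    """Return (index, value, percent_of_l1) for the top_k largest-magnitude entries.

    Assumption: "% of L1" means 100 * |w_i| / sum_j |w_j|. The source page
    does not confirm this definition.
    """
    l1 = sum(abs(w) for w in weights)
    if l1 == 0:
        return []  # an all-zero vector has no meaningful shares
    # Sort indices by descending magnitude; sorted() is stable, so ties keep index order.
    order = sorted(range(len(weights)), key=lambda i: -abs(weights[i]))[:top_k]
    return [(i, weights[i], 100.0 * abs(weights[i]) / l1) for i in order]


# Hypothetical toy vector, not data from the page.
toy_weights = [0.5, -0.25, 0.125, 0.125]
rows = percent_of_l1(toy_weights, top_k=2)
for index, value, pct in rows:
    print(f"{index:<8}{value:+.2f}    {pct:.1f}%")
```

Under this reading, a value of +0.23 at 0.7% would imply an L₁ norm of roughly 33 for the full vector, which is at least consistent with the weight being spread thinly over many neurons.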
Correlated Neurons
Index    P. Corr.    Cos Sim.
1625     +0.23       0.03
1801     +0.08       0.02
1343     +0.08       0.03
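The two columns report different similarity measures: "P. Corr." is presumably the Pearson correlation and "Cos Sim." the cosine similarity. The page does not state which vectors are being compared, so the sketch below only illustrates the two metrics themselves on hypothetical toy data. Pearson correlation is cosine similarity applied after subtracting each vector's mean, which is why the two numbers can differ for the same pair.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # convention chosen for this sketch; undefined mathematically
    return dot / (norm_a * norm_b)


def pearson_correlation(a, b):
    """Pearson correlation: cosine similarity of the mean-centered vectors."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    return cosine_similarity([x - mean_a for x in a], [y - mean_b for y in b])


# Hypothetical toy data: b is an exact affine function of a (b = 2a + 1).
a = [1.0, 2.0, 3.0]
b = [3.0, 5.0, 7.0]
print(f"pearson = {pearson_correlation(a, b):.4f}")
print(f"cosine  = {cosine_similarity(a, b):.4f}")
```

In the toy data the Pearson correlation is 1 (up to floating-point rounding) because the relationship is exactly linear, while the cosine similarity is slightly below 1 because the offset changes the direction of the raw vector.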
Negative Logits
Token       Logit
fter        -0.76
gilet       -0.69
hoody       -0.68
ecru        -0.68
sceptre     -0.67
efty        -0.66
timately    -0.66
unwarran    -0.65
sophie      -0.63
hairc       -0.62
Positive Logits
Token       Logit
tré         0.51
4           0.46
iseta       0.45
8           0.44
6           0.44
7           0.43
2           0.43
1           0.43
obti        0.42
3           0.42
Activations Density 0.055%