INDEX
Explanations
adjectives reflecting emotions or personal qualities
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
198
+0.13
0.4%
1385
+0.11
0.3%
946
+0.11
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1363
+0.13
0.06
802
+0.11
0.06
260
+0.11
0.04
Negative Logits
duomen
-0.53
PointSize
-0.53
maxWidth
-0.52
]**
-0.52
setLayout
-0.51
dhism
-0.50
)>>
-0.50
fillStyle
-0.50
PCell
-0.50
)>=
-0.50
POSITIVE LOGITS
mef
0.88
triton
0.86
uncin
0.85
pubg
0.84
Juf
0.82
passim
0.81
casio
0.80
Augu
0.80
Khart
0.79
logitech
0.79
Activations Density 0.343%