INDEX
Explanations
descriptions related to architectural structures and interior spaces
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
394
+0.23
0.8%
184
+0.16
0.6%
1224
+0.15
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
184
+0.23
0.04
599
+0.16
0.09
394
+0.15
0.06
Negative Logits
dises
-0.81
maroc
-0.74
hina
-0.73
milano
-0.72
buta
-0.72
mariana
-0.71
baum
-0.71
vogli
-0.71
bandung
-0.70
petto
-0.70
POSITIVE LOGITS
,”
0.84
,"
0.81
”,
0.69
,'
0.65
",
0.64
[
0.63
”
0.63
,’
0.63
,''
0.60
,”
0.58
Activations Density 0.732%