INDEX
Explanations
mentions of physical objects and actions related to chairs
Neuron Alignment
Index   Value   % of L₁
1385    +0.10   0.3%
1013    +0.09   0.3%
1857    +0.09   0.3%
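
The "% of L₁" column is not defined on this page. A minimal sketch of one plausible reading, assuming it is the absolute value of a single neuron's weight divided by the L₁ norm (sum of absolute values) of the feature's full weight vector. The function name `l1_share` and the toy vector are hypothetical and are not taken from the dashboard.

```python
def l1_share(weights, index):
    """Return |weights[index]| as a fraction of the vector's L1 norm.

    Assumption: "% of L1" means one entry's share of the sum of
    absolute values. The dashboard itself does not state this.
    """
    l1_norm = sum(abs(w) for w in weights)
    if l1_norm == 0:
        raise ValueError("L1 norm is zero; share is undefined")
    return abs(weights[index]) / l1_norm


# Toy vector for illustration only (not dashboard data).
toy = [0.5, -0.25, 0.25]
print(f"{l1_share(toy, 0):.1%}")  # prints 50.0%
```

Under this reading, a share of 0.3% for the top neurons would mean the feature's weight is spread thinly across many neurons rather than concentrated in a few.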
Correlated Neurons
Index   P. Corr.   Cos Sim.
1857    +0.10      0.02
505     +0.09      0.02
1119    +0.09      0.03
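
The table reports both a Pearson correlation and a cosine similarity. A minimal sketch of the standard definitions of the two measures follows, to show how they can differ: Pearson correlation is cosine similarity applied after subtracting each vector's mean. Which vectors the dashboard passes to each measure is not stated here, so the inputs below are toy values and the function names are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Dot product of a and b divided by the product of their Euclidean norms."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def pearson_correlation(a, b):
    """Cosine similarity of the mean-centered vectors."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    centered_a = [x - mean_a for x in a]
    centered_b = [y - mean_b for y in b]
    return cosine_similarity(centered_a, centered_b)


# Toy vectors for illustration only (not dashboard data).
a = [1.0, 2.0, 3.0]
b = [3.0, 2.0, 1.0]
print(round(cosine_similarity(a, b), 3))
print(round(pearson_correlation(a, b), 3))
```

For these toy vectors the cosine similarity is positive while the Pearson correlation is negative, so the two columns need not agree in magnitude or even in sign.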
Negative Logits
Token    Logit
Cfr      -0.86
Ibidem   -0.70
Queste   -0.68
§.       -0.64
myn      -0.63
Febru    -0.62
Bibl     -0.61
Khart    -0.61
Idem     -0.60
Altri    -0.60
Positive Logits
Token     Logit
chairs    1.03
chair     0.99
seating   0.98
seat      0.96
seats     0.91
Chairs    0.86
chairs    0.86
chair     0.84
seated    0.83
seat      0.81
Activations Density 0.229%