INDEX
Explanations
references to specific individuals or names
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
356
+0.15
0.8%
410
+0.14
0.8%
350
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
410
+0.15
0.03
338
+0.14
0.02
350
+0.12
0.02
Negative Logits
boards
-1.74
board
-1.65
banks
-1.56
)](#
-1.56
strength
-1.53
blown
-1.51
entire
-1.41
),$$
-1.38
boarding
-1.36
lando
-1.36
POSITIVE LOGITS
ournal
2.33
º
2.33
ĻĤ
2.19
¥
2.13
§
2.12
¤
2.11
°
2.02
Ń
1.95
³
1.94
ŀ
1.93
Activations Density 0.156%