INDEX
Explanations
references to specific geographical locations or directions
Neuron Alignment
Index  Value  % of L₁
156    +0.18  1.0%
313    +0.16  0.9%
544    +0.16  0.9%
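The "% of L₁" column is not defined on this page. A minimal sketch of one plausible reading, assuming it is the absolute alignment value divided by the L₁ norm of the full alignment vector: the function name `percent_of_l1` and the norm `implied_l1` are hypothetical, and the norm is back-computed from the first row rather than taken from the source. Because the displayed figures are rounded, the implied norm is only approximate.

```python
def percent_of_l1(value: float, l1_norm: float) -> float:
    """Share of an L1 norm contributed by one entry, as a percentage.

    Assumption: "% of L1" means |value| / sum(|all values|). The page
    itself does not state this definition.
    """
    return 100.0 * abs(value) / l1_norm


# Hypothetical L1 norm implied by the first row (0.18 listed as 1.0%).
# The inputs are rounded on the page, so this is an estimate only.
implied_l1 = 0.18 / 0.010

# Rows as listed under "Neuron Alignment": (index, value).
rows = [(156, 0.18), (313, 0.16), (544, 0.16)]

for index, value in rows:
    print(index, round(percent_of_l1(value, implied_l1), 1))
```

Under this assumption the remaining two rows come out at 0.9%, which matches the listed figures, so the reading is at least consistent with the table.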
Correlated Neurons
Index  P. Corr.  Cos Sim.
1741   +0.18     -0.00
227    +0.16     0.07
1097   +0.16     0.06
Negative Logits
<bos>      -0.90
Sanderson  -0.57
mettent    -0.53
Pyrr       -0.52
Duncan     -0.52
Tierney    -0.51
Kerr       -0.50
hdr        -0.50
Ales       -0.50
Kir        -0.49
Positive Logits
ritard     0.94
paff       0.94
vna        0.92
makro      0.91
marseille  0.90
ohr        0.90
juft       0.89
broder     0.89
mef        0.89
ftre       0.88
Activations Density 0.806%