Explanations
dates, locations, and specific events or details
Neuron Alignment
Index   Value   % of L₁
2034    +0.13   0.4%
1967    +0.13   0.4%
453     +0.12   0.4%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1750    +0.13      0.05
1398    +0.13      0.04
1803    +0.12      0.04
Negative Logits
Token   Logit
fta     -1.39
ftu     -1.37
thut    -1.29
vnt     -1.24
mef     -1.24
fte     -1.23
»>      -1.23
fup     -1.22
aen     -1.20
fign    -1.18
Positive Logits
Token               Logit
[token not shown]   0.81
2                   0.63
$                   0.59
1                   0.56
however             0.55
at                  0.55
while               0.55
,                   0.55
but                 0.54
in                  0.54
Activations Density: 0.113%