INDEX
Explanations
mentions of time periods or sequenced events related to sports
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
674
+0.10
0.3%
1479
+0.09
0.2%
1081
+0.09
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1081
+0.10
0.05
1225
+0.09
0.03
1479
+0.09
0.02
Negative Logits
increa
-1.87
fuf
-1.85
inev
-1.78
guarante
-1.78
disagre
-1.77
wherea
-1.76
maneu
-1.75
depic
-1.74
emphat
-1.74
purcha
-1.72
POSITIVE LOGITS
without
0.78
knowing
0.75
Autoritní
0.73
asteroide
0.73
with
0.72
формление
0.71
Με
0.70
consultato
0.68
without
0.67
<bos>
0.67
Activations Density 0.266%