INDEX
Explanations
mentions of sports teams and their performance, specifically emphasizing wins and losses
Neuron Alignment
Index   Value   % of L₁
1842    +0.14   0.4%
108     +0.09   0.3%
1967    +0.09   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1551    +0.14      0.04
818     +0.09      0.05
392     +0.09      0.03
Negative Logits
ACKNOWLEDGMENTS   -0.62
andreas           -0.60
$:$               -0.55
sfera             -0.54
NOO               -0.52
effluents         -0.51
leonardo          -0.51
gmbh              -0.51
Ampli             -0.50
anical            -0.50
Positive Logits
opponent     0.59
tougher      0.55
opponents    0.53
saurait      0.53
enrique      0.50
caliber      0.50
calibre      0.49
formidable   0.49
toughest     0.49
weaker       0.49
Activations Density: 0.314%