INDEX
Explanations
phrases related to sports events and player performance, with a focus on goals, wins, and key plays
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
906
+0.16
0.5%
1842
+0.13
0.4%
394
+0.13
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
736
+0.16
0.07
861
+0.13
0.06
392
+0.13
0.03
Negative Logits
NOO
-1.01
meis
-0.95
particolar
-0.95
Expt
-0.95
pollut
-0.93
Manufact
-0.91
dirit
-0.90
!...
-0.89
aquarelle
-0.89
gmbh
-0.89
POSITIVE LOGITS
touchdown
0.59
Juventud
0.54
Bibliote
0.53
Hermoso
0.53
scored
0.53
Voto
0.52
Glej
0.52
Significado
0.52
Beij
0.52
highlight
0.51
Activations Density 0.369%