INDEX
Explanations
terms related to American football strategies and tactics
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.22
0.8%
1842
+0.21
0.8%
764
+0.19
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
468
+0.22
0.05
1842
+0.21
0.04
1166
+0.19
0.06
Negative Logits
<bos>
-1.17
simplifié
-0.69
consulté
-0.68
-0.67
famí
-0.66
"..\..\..\
-0.65
bibnamefont
-0.64
"..\..\
-0.63
htbp
-0.63
desertcart
-0.62
POSITIVE LOGITS
disagre
2.39
unwarran
2.20
indestru
2.16
reluct
2.16
emphat
2.15
increa
2.15
impra
2.12
unspeak
2.11
depic
2.10
pamph
2.10
Activations Density 0.464%