INDEX
Explanations
sport-related terms and phrases
Neuron Alignment
Index    Value    % of L₁
50       +0.13    0.6%
1491     +0.07    0.3%
629      +0.06    0.3%
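The "% of L₁" column can be read as each neuron's share of the L₁ norm of the feature's weight vector. The page does not define the column, so that reading is an assumption. A minimal sketch under that assumption follows; `top_alignment` and the toy vector are hypothetical and not taken from the dashboard.

```python
def top_alignment(weights, k=3):
    """Return the k largest-magnitude entries as (index, value, percent_of_l1).

    Assumption: "% of L1" means |w_i| / sum_j |w_j|, expressed as a percentage.
    """
    l1 = sum(abs(w) for w in weights)
    if l1 == 0:
        return []
    # Rank neuron indices by absolute weight, largest first.
    ranked = sorted(range(len(weights)), key=lambda i: abs(weights[i]), reverse=True)
    return [(i, weights[i], 100.0 * abs(weights[i]) / l1) for i in ranked[:k]]


# Hypothetical toy vector (L1 norm = 1.0), not real dashboard data.
toy = [0.5, -0.25, 0.125, 0.125]
rows = top_alignment(toy, k=2)
for index, value, pct in rows:
    print(f"{index:<8}{value:+.2f}    {pct:.1f}%")
```

Under this reading, small percentages like those in the table would indicate that the feature is spread over many neurons rather than concentrated in a few.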
Correlated Neurons
Index    P. Corr.    Cos Sim.
629      +0.13       0.08
851      +0.07       0.07
1491     +0.06       0.07
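"P. Corr." and "Cos Sim." presumably stand for Pearson correlation and cosine similarity. Which quantities the dashboard compares (for example activations for the first, weight directions for the second) is not stated on the page, so the sketch below only shows the two standard formulas on hypothetical inputs.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def cosine(u, v):
    """Cosine similarity of two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


# Hypothetical values, not taken from the dashboard.
feature_acts = [0.0, 1.0, 2.0, 3.0]
neuron_acts = [1.0, 3.0, 5.0, 7.0]   # an exact linear function of feature_acts
r = pearson(feature_acts, neuron_acts)
c = cosine([1.0, 0.0], [0.0, 1.0])   # orthogonal directions
```

Both functions assume non-constant, non-zero inputs; a constant sequence or a zero vector would cause a division by zero.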
Negative Logits
<bos>         -1.87
/***          -0.96
ⓧ             -0.94
              -0.93
<?            -0.90
/**           -0.85
/*            -0.84
<?            -0.73
subdue        -0.67
inaugurate    -0.66
Positive Logits
Right    1.80
Right    1.67
right    1.65
RIGHT    1.63
Righ     1.48
RIGHT    1.48
right    1.47
righ     1.25
maroc    1.14
lele     1.11
Activations Density 0.103%
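Activation density is commonly the fraction of tokens on which a feature fires. The page does not say how its figure was computed, so the sketch below assumes "fires" means a strictly positive activation; `activation_density` and the sample data are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Percentage of tokens whose activation exceeds `threshold`.

    Assumption: a token counts as active when its activation is > threshold.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical sample: one active token out of four.
sample = [0.0, 0.0, 0.5, 0.0]
density = activation_density(sample)
print(f"Activations Density {density:.3f}%")
```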