INDEX
Explanations
sports-related terms and leagues
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
227
+0.12
0.4%
1499
+0.09
0.3%
280
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
77
+0.12
0.04
1915
+0.09
0.04
227
+0.09
0.05
Negative Logits
Norvège
-0.60
phosphoric
-0.57
vedette
-0.56
tetrach
-0.55
pylab
-0.55
regardant
-0.55
BeforeAll
-0.55
rictions
-0.54
fromnode
-0.53
oubted
-0.53
POSITIVE LOGITS
level
0.52
verhe
0.51
abstrak
0.50
leagues
0.50
league
0.50
League
0.50
ikyuu
0.49
McLaugh
0.49
achella
0.47
tuong
0.47
Activations Density 0.239%