INDEX
Explanations
sports-related content
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1343
+0.13
0.4%
1445
+0.10
0.3%
924
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1813
+0.13
0.04
924
+0.10
0.06
1190
+0.10
0.05
Negative Logits
chert
-0.68
limestones
-0.63
despotism
-0.62
alberto
-0.61
sergio
-0.61
ecru
-0.61
sherds
-0.60
feldspar
-0.58
blackish
-0.58
tubercle
-0.58
POSITIVE LOGITS
véri
0.66
expandindo
0.56
Sep
0.54
écri
0.51
exé
0.51
rédig
0.51
aimer
0.51
Apr
0.49
Published
0.49
Multivariate
0.49
Activations Density 0.308%