INDEX
Explanations
references to NBA teams or basketball players
Neuron Alignment
Index   Value   % of L₁
1777    +0.14   0.5%
1141    +0.14   0.5%
1472    +0.13   0.5%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1141    +0.14      0.04
1351    +0.14      0.03
1178    +0.13      0.03
Negative Logits
vorrei    -0.85
[''       -0.75
trovo     -0.73
sappi     -0.72
claudia   -0.70
espri     -0.69
surpl     -0.69
réfugi    -0.69
scopri    -0.68
brille    -0.65
Positive Logits
basketball   1.27
Basketball   1.17
NBA          1.16
Basketball   1.12
NBA          1.09
basketball   0.99
basket       0.87
Basket       0.81
basket       0.76
Basket       0.75
Activations Density: 0.152%