INDEX
Explanations
names of football players or individuals
Neuron Alignment
Index   Value   % of L₁
204     +0.15   0.7%
200     +0.15   0.6%
1053    +0.14   0.6%
Correlated Neurons
Index   P. Corr.   Cos Sim.
200     +0.15      0.02
204     +0.15      0.02
1053    +0.14      0.02
Negative Logits
Atsauces      -0.64
zurück        -0.59
település     -0.57
wieś          -0.57
Parmi         -0.56
Törté         -0.53
Formazione    -0.53
municipi      -0.52
Další         -0.51
více          -0.51
Positive Logits
Jones         1.62
Jones         1.48
jones         1.36
JONES         1.34
jones         1.28
Machine       0.66
Machine       0.64
machine       0.64
machine       0.64
machines      0.61
Activations Density 0.089%