INDEX
Explanations
names of individuals, particularly in the context of various industries like games, computing, and entertainment
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1343
+0.31
1.1%
1741
+0.22
0.8%
382
+0.18
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
16
+0.31
0.06
382
+0.22
0.05
1343
+0.18
0.05
Negative Logits
sprend
-0.64
dager
-0.62
asado
-0.61
FORMANCE
-0.60
PONENTS
-0.59
špat
-0.59
különböz
-0.58
poważ
-0.58
måneder
-0.58
forsø
-0.58
POSITIVE LOGITS
confé
0.89
supplé
0.84
vôtre
0.81
vété
0.80
nôtre
0.80
Cfr
0.80
Gemeinsame
0.77
éclairage
0.76
réfé
0.76
Anm
0.76
Activations Density 0.201%