INDEX
Explanations
information related to celebrities and their activities or achievements
Neuron Alignment
Index   Value   % of L₁
1108    +0.14   0.5%
1177    +0.13   0.4%
50      +0.13   0.4%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1108    +0.14      0.04
1786    +0.13      0.01
1018    +0.13      0.02
Negative Logits
ⓧ             -0.61
URBANA        -0.56
              -0.55
<?            -0.52
amaged        -0.51
vantaggi      -0.50
rlrl          -0.49
/**           -0.47
原始内容      -0.46
shewn         -0.46
Positive Logits
виправивши    0.68
Apro          0.53
sule          0.52
stdafx        0.52
mavi          0.52
rémun         0.51
Conclu        0.51
lindo         0.51
Autres        0.51
antiago       0.50
Activations Density 0.151%