INDEX
Explanations
phrases related to movement, particularly going upwards and downwards
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1565
+0.11
0.3%
1757
+0.11
0.3%
397
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1757
+0.11
0.05
1398
+0.11
0.03
1565
+0.10
0.04
Negative Logits
vogli
-0.51
AfterEach
-0.51
pylab
-0.51
dimenti
-0.51
inguém
-0.51
XmlEnum
-0.49
shutil
-0.49
seaborn
-0.47
triangleq
-0.47
trovar
-0.47
POSITIVE LOGITS
up
1.04
up
0.95
Up
0.95
UP
0.93
Up
0.90
UP
0.84
ups
0.75
ups
0.74
upy
0.67
アップ
0.67
Activations Density 0.127%