INDEX
Explanations
words related to movement, specifically going up and down in various contexts
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.10
0.3%
2022
+0.07
0.2%
1385
+0.07
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
491
+0.10
0.04
1884
+0.07
0.04
1317
+0.07
0.04
Negative Logits
<bos>
-1.53
GeneratedMessage
-0.63
ⓧ
-0.62
/*++
-0.60
}{||-0.58
migrationBuilder
-0.57
setClass
-0.56
glBind
-0.56
rowspan
-0.55
glBindBuffer
-0.55
POSITIVE LOGITS
jaya
1.78
lele
1.73
thut
1.59
aen
1.57
hcm
1.56
stockholm
1.55
meis
1.55
!...
1.54
?...
1.51
„,
1.50
Activations Density 0.383%