INDEX
Explanations
patterns related to code syntax
Neuron Alignment
Index    Value    % of L₁
1343     +0.20    0.6%
876      +0.11    0.3%
1967     +0.10    0.3%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1343     +0.20       0.05
453      +0.11       0.05
1108     +0.10       0.05
Negative Logits
rval            -0.75
unspeak         -0.74
Lmao            -0.70
gaily           -0.70
Whence          -0.70
Whoa            -0.70
toils           -0.69
despotism       -0.68
McLaugh         -0.68
Considerable    -0.68
Positive Logits
ù             1.20
lomb          1.02
Gemeinsame    1.01
luy           0.98
bont          0.97
dì            0.96
BnF           0.95
adal          0.95
magis         0.95
dè            0.94
Activations Density 0.241%