INDEX
Explanations
verbs and actions related to scientific findings and processes
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
23
+0.24
1.4%
478
+0.16
0.9%
376
+0.13
0.8%
Correlated Neurons
Index
P. Corr.
Cos Sim.
376
+0.24
0.03
148
+0.16
0.03
166
+0.13
0.04
Negative Logits
nem
-1.53
mage
-1.48
emergency
-1.43
plus
-1.42
ERC
-1.41
arter
-1.39
Plus
-1.39
starter
-1.37
rette
-1.37
illo
-1.37
POSITIVE LOGITS
Ĥ
2.27
«
2.25
ĥ
2.21
²
2.11
¿½
2.10
¿
2.06
Ŀ
2.01
akov
2.00
¦
1.94
¾
1.94
Activations Density 0.399%