INDEX
Explanations
phrases related to data analysis and information sharing
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
752
+0.17
0.5%
394
+0.11
0.3%
690
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
752
+0.17
0.07
16
+0.11
0.07
1156
+0.10
0.04
Negative Logits
<bos>
-0.76
instancetype
-0.68
Còn
-0.66
))){-0.64
ViewImports
-0.64
XMLSchema
-0.64
==""){-0.63
Tôi
-0.63
)});
-0.63
Beneficios
-0.61
POSITIVE LOGITS
reluct
2.18
increa
2.00
impra
1.99
indestru
1.99
depic
1.99
shenan
1.98
unspeak
1.96
snoopy
1.96
disagre
1.94
gaily
1.92
Activations Density 0.441%