INDEX
Explanations
terminology related to contracts and agreements
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.28
1.2%
2034
+0.15
0.6%
1870
+0.14
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1870
+0.28
0.08
2034
+0.15
0.11
1705
+0.14
0.07
Negative Logits
<bos>
-2.86
intersper
-1.34
encomp
-1.05
/**
-1.03
timately
-1.02
<?
-1.00
ⓧ
-0.98
-0.95
Transkript
-0.93
racon
-0.92
POSITIVE LOGITS
==""){0.60
)<=
0.60
jawa
0.57
))*
0.56
=="")
0.54
)>=
0.54
droj
0.52
=""></
0.51
:]:
0.51
kutu
0.51
Activations Density 1.163%