INDEX
Explanations
for and against arguments in a document
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
764
+0.21
0.7%
184
+0.19
0.7%
50
+0.15
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
184
+0.21
0.01
16
+0.19
0.04
764
+0.15
0.03
Negative Logits
ьаж
-0.94
Географиясе
-0.82
jgl
-0.79
Aholisi
-0.76
<bos>
-0.76
Duración
-0.76
>//
-0.76
HasColumnName
-0.75
endwhile
-0.74
WindowConstants
-0.73
POSITIVE LOGITS
reluct
2.30
affor
2.25
shenan
2.23
intersper
2.22
disagre
2.19
impra
2.17
encomp
2.15
increa
2.12
indestru
2.10
depic
2.07
Activations Density 0.198%