INDEX
Explanations
references to various indexes or tables
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.19
1.1%
501
+0.13
0.8%
892
+0.11
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
501
+0.19
0.03
892
+0.13
0.03
316
+0.11
0.03
Negative Logits
<bos>
-3.27
ⓧ
-0.86
EndProject
-0.81
/**
-0.80
<?
-0.77
/***
-0.67
елның
-0.67
-0.66
AssemblyCompany
-0.65
EndGlobalSection
-0.62
POSITIVE LOGITS
maneu
1.51
bandung
1.45
affor
1.42
Minang
1.41
increa
1.39
emphat
1.31
stockholm
1.31
jacques
1.29
Juf
1.29
volunte
1.28
Activations Density 0.051%