INDEX
Explanations
professional terms or titles related to specific fields
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.15
0.8%
168
+0.12
0.6%
555
+0.12
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
168
+0.15
0.03
1041
+0.12
0.03
370
+0.12
0.02
Negative Logits
<bos>
-3.12
//---
-0.76
/***
-0.73
public
-0.71
SourceChecksum
-0.70
<?
-0.66
/*---
-0.65
ⓧ
-0.64
/***
-0.64
/*!
-0.64
POSITIVE LOGITS
increa
1.59
Minang
1.54
disagre
1.53
unlaw
1.50
affor
1.50
thut
1.50
maneu
1.49
gaily
1.48
jaya
1.47
unwarran
1.44
Activations Density 0.091%