INDEX
Explanations
punctuation and sentence endings
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.40
1.4%
453
+0.20
0.7%
2019
+0.20
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
360
+0.40
0.05
2019
+0.20
0.05
543
+0.20
0.04
Negative Logits
<bos>
-1.57
Abbé
-1.27
Mlle
-1.27
tremb
-1.22
pamph
-1.20
suscep
-1.20
suspic
-1.14
fta
-1.12
hentai
-1.10
reluct
-1.09
POSITIVE LOGITS
Discografia
0.58
MEMORANDUM
0.56
Tecnologia
0.55
CharField
0.54
وصلة
0.53
HKEY
0.53
存于互联网档案馆
0.53
Typeface
0.52
Historie
0.52
Timetable
0.52
Activations Density 0.207%