INDEX
Explanations
common words used in technical instructions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1042
+0.11
0.3%
50
+0.11
0.3%
1892
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1806
+0.11
0.06
1892
+0.11
0.05
50
+0.10
0.06
Negative Logits
XmlEnum
-0.78
Kategor
-0.69
FBref
-0.67
exé
-0.66
komik
-0.65
ihnachten
-0.64
BIBSYS
-0.64
kaos
-0.63
kategor
-0.63
soutient
-0.63
POSITIVE LOGITS
intersper
0.91
encomp
0.85
whatnot
0.77
shenan
0.77
unspeak
0.75
disagre
0.75
vainly
0.72
indescri
0.71
increa
0.71
reccom
0.70
Activations Density 0.389%