INDEX
Explanations
single words ending in 'ed'
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1350
+0.11
0.3%
690
+0.11
0.3%
1379
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1350
+0.11
0.06
569
+0.11
0.06
690
+0.10
0.04
Negative Logits
makro
-0.85
kriminal
-0.77
kosme
-0.77
traktor
-0.76
antik
-0.74
kaos
-0.73
Nö
-0.73
Strukt
-0.72
Kombin
-0.72
kase
-0.71
POSITIVE LOGITS
relenting
0.87
solicited
0.83
wavering
0.82
nahilalakip
0.72
sightly
0.69
thinkable
0.69
hornblende
0.68
compromising
0.66
tramonto
0.65
mistak
0.63
Activations Density 0.294%