INDEX
Explanations
terms related to guiding or instructing others and innovation
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.21
0.8%
1387
+0.13
0.5%
897
+0.08
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
897
+0.21
0.06
1387
+0.13
0.05
2000
+0.08
0.05
Negative Logits
<bos>
-1.89
/**
-0.71
propose
-0.65
ⓧ
-0.64
-0.63
began
-0.63
started
-0.62
also
-0.62
spoke
-0.61
begin
-0.59
POSITIVE LOGITS
saar
1.33
lele
1.31
jaya
1.30
maksi
1.29
gomma
1.28
maroc
1.28
ceramica
1.27
autunno
1.27
levis
1.27
Juf
1.27
Activations Density 0.520%