INDEX
Explanations
the word "then" occurring in sequences
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
61
+0.17
0.6%
645
+0.14
0.5%
1872
+0.12
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
61
+0.17
0.06
645
+0.14
0.06
1872
+0.12
0.05
Negative Logits
rafra
-0.77
réve
-0.76
mû
-0.72
angelo
-0.69
Attr
-0.68
perpé
-0.68
monaster
-0.67
éta
-0.66
fons
-0.66
éto
-0.65
POSITIVE LOGITS
then
0.99
THEN
0.85
then
0.78
subsequently
0.76
proceed
0.74
thereafter
0.71
afterwards
0.71
Then
0.71
Then
0.70
THEN
0.69
Activations Density 0.108%