Explanations
Conversation patterns involving the phrase 'And I' and its variations
Neuron Alignment
Index   Value   % of L₁
1445    +0.14   0.5%
381     +0.13   0.5%
1892    +0.13   0.4%
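The page does not define the "% of L₁" column. One plausible reading, offered here only as an assumption, is that it is the share of a weight vector's L₁ norm (the sum of absolute values) contributed by a single neuron. The sketch below illustrates that reading with a hypothetical helper `l1_share` and made-up toy values, not data from this page.

```python
def l1_share(weights, index):
    """Fraction of the vector's L1 norm contributed by one component.

    Assumption: "% of L1" means |w_i| / sum_j |w_j|. The dashboard itself
    does not confirm this definition.
    """
    l1_norm = sum(abs(w) for w in weights)
    if l1_norm == 0:
        raise ValueError("L1 norm is zero; the share is undefined")
    return abs(weights[index]) / l1_norm


# Toy vector (hypothetical values, chosen so the arithmetic is easy to check).
toy_weights = [0.5, -0.25, 0.25]
share = l1_share(toy_weights, 0)
print(f"{share:.1%}")
```

Under this reading, a small percentage next to the top-ranked neuron would indicate that the weight is spread across many neurons rather than concentrated in a few.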
Correlated Neurons
Index   P. Corr.   Cos Sim.
1892    +0.14      0.06
1265    +0.13      0.06
390     +0.13      0.06
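The two columns above are abbreviated and the page does not say what they are computed over. Assuming "P. Corr." is the Pearson correlation and "Cos Sim." is the cosine similarity, the sketch below shows how the two measures differ. The function names and the sample data are illustrative only.

```python
import math


def pearson_corr(x, y):
    """Pearson correlation: cosine similarity of the mean-centered series."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    xc = [a - mean_x for a in x]
    yc = [b - mean_y for b in y]
    num = sum(a * b for a, b in zip(xc, yc))
    den = math.sqrt(sum(a * a for a in xc)) * math.sqrt(sum(b * b for b in yc))
    return num / den


def cosine_sim(u, v):
    """Cosine similarity: dot product divided by the product of the norms."""
    num = sum(a * b for a, b in zip(u, v))
    den = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return num / den


# Hypothetical data: y is x shifted by a constant offset.
x = [1.0, 2.0, 3.0]
y = [11.0, 12.0, 13.0]
print(pearson_corr(x, y))  # centering removes the offset
print(cosine_sim(x, y))    # the offset still affects the angle between the raw vectors
```

Because Pearson correlation centers each series first, the two measures can disagree on the same pair of vectors, which is one reason a dashboard might report both.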
Negative Logits
magis   -1.74
hina    -1.71
dises   -1.70
mef     -1.70
haup    -1.69
lele    -1.68
meis    -1.65
umo     -1.64
hcm     -1.64
aen     -1.63
Positive Logits
And         0.98
they        0.98
then        0.98
I           0.98
it          0.97
we          0.97
you         0.95
everybody   0.95
there       0.91
he          0.91
Activations Density: 0.226%