INDEX
Explanations
the word "still" indicating a sense of continuity or permanence
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
376
+0.15
0.8%
239
+0.12
0.7%
335
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
335
+0.15
0.02
1
+0.12
0.02
461
+0.12
0.02
Negative Logits
»¿
-2.75
¯
-2.69
ģ
-2.66
ĨĴ
-2.65
ĥ½
-2.57
ĵ
-2.50
ij
-2.45
Ģ
-2.41
Ķ
-2.38
Ĵ
-2.33
POSITIVE LOGITS
ilage
1.71
stock
1.71
sti
1.67
birth
1.57
ération
1.56
ylvania
1.54
jam
1.53
reaction
1.49
capacity
1.49
iosis
1.47
Activations Density 0.060%