Explanations
information related to duration or passage of time
Neuron Alignment
Index    Value    % of L₁
889      +0.14    0.5%
297      +0.12    0.4%
1013     +0.12    0.4%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1830     +0.14       0.06
297      +0.12       0.07
1856     +0.12       0.07
Negative Logits
Δια          -0.69
asteroide    -0.69
Walkover     -0.68
Selección    -0.67
impon        -0.67
Diciembre    -0.67
endpush      -0.66
dezembro     -0.65
novembro     -0.64
outubro      -0.64
Positive Logits
unspeak      1.23
sophistic    1.21
disagre      1.20
unwarran     1.18
maneu        1.17
impra        1.16
gaily        1.14
affor        1.12
pamph        1.11
excru        1.10
Activations Density 0.170%