INDEX
Explanations
dates and months of events or occurrences
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
156
+0.18
1.0%
15
+0.11
0.6%
467
+0.09
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
467
+0.18
0.03
15
+0.11
0.03
70
+0.09
0.03
Negative Logits
amais
-1.74
STEM
-1.71
quo
-1.62
untu
-1.60
xico
-1.60
obbsee
-1.58
fessor
-1.58
oved
-1.55
ellow
-1.53
unge
-1.53
POSITIVE LOGITS
et
1.59
tragedy
1.55
motive
1.52
wood
1.51
works
1.48
blows
1.42
ikh
1.42
interference
1.39
implications
1.39
ledge
1.38
Activations Density 0.098%