INDEX
Explanations
mentions of time and temporal references
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.29
1.5%
479
+0.10
0.5%
2004
+0.10
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
2004
+0.29
0.03
479
+0.10
0.03
474
+0.10
0.02
Negative Logits
<bos>
-3.09
deepen
-0.70
mobilize
-0.68
seek
-0.67
abolish
-0.66
educate
-0.66
ⓧ
-0.66
organize
-0.64
expel
-0.63
nourish
-0.63
POSITIVE LOGITS
lele
1.38
Minang
1.36
jaya
1.35
kaos
1.31
jawa
1.30
seksi
1.27
panik
1.25
dises
1.22
silikon
1.22
bandung
1.21
Activations Density 0.060%