Explanations
phrases related to time and temporal concepts
Head Attr Weights
0:0.02
1:0.03
2:0.05
3:0.14
4:0.02
5:0.05
6:0.10
7:0.26
8:0.06
9:0.07
10:0.05
11:0.09
Negative Logits
ENE            -1.51
��             -1.31
��             -1.21
uay            -1.20
Territories    -1.16
ylum           -1.16
覚醒           -1.15
GAN            -1.15
AE             -1.11
Saharan        -1.09
Positive Logits
clock          1.28
desks          1.27
brim           1.22
countdown      1.20
fram           1.19
screens        1.16
clock          1.15
rigs           1.14
rot            1.13
ceiling        1.13
Activations Density 0.012%