INDEX
Explanations
phrases related to time
occurrences of the word "the"
New Auto-Interp
Negative Logits
tle
-0.74
galitarian
-0.70
conom
-0.68
ATURES
-0.68
INO
-0.67
hyde
-0.67
oma
-0.64
krit
-0.63
plain
-0.62
terness
-0.62
POSITIVE LOGITS
bunch
1.26
year
1.25
week
1.23
season
1.21
night
1.18
month
1.16
evening
1.16
millennium
1.15
day
1.13
weekend
1.13
Activations Density 0.081%