INDEX
Explanations
mentions of specific years and the context surrounding historical events
New Auto-Interp
Head Attr Weights
0:0.06
1:0.06
2:0.04
3:0.04
4:0.03
5:0.31
6:0.03
7:0.02
8:0.05
9:0.10
10:0.11
11:0.09
Negative Logits
emis
-1.48
ahi
-1.46
fish
-1.44
apons
-1.42
ava
-1.37
atin
-1.36
onis
-1.32
Tycoon
-1.32
killer
-1.30
ola
-1.28
POSITIVE LOGITS
onwards
1.75
timeframe
1.62
onward
1.46
Attempt
1.29
Ago
1.22
ruary
1.16
Composite
1.15
Era
1.14
Coh
1.14
terms
1.14
Activations Density 0.193%