INDEX
Explanations
references to historical events and their implications
Head Attr Weights
0:0.02
1:0.03
2:0.07
3:0.06
4:0.02
5:0.07
6:0.14
7:0.30
8:0.06
9:0.03
10:0.07
11:0.08
Negative Logits
atum          -1.54
Chan          -1.44
assi          -1.43
tein          -1.42
icidal        -1.40
onding        -1.39
chan          -1.36
Cosponsors    -1.32
り            -1.32
ello          -1.29
Positive Logits
prol          1.25
Pharaoh       1.23
legends       1.22
fiction       1.20
histor        1.20
Archae        1.18
Tropical      1.15
historians    1.15
Glac          1.15
folklore      1.15
Activations Density: 0.059%