INDEX
Explanations
references to specific locations and dates related to significant events
New Auto-Interp
Head Attr Weights
0:0.04
1:0.06
2:0.08
3:0.04
4:0.03
5:0.05
6:0.40
7:0.03
8:0.03
9:0.06
10:0.07
11:0.06
Negative Logits
acebook
-1.61
IMAGES
-1.37
soever
-1.33
secondly
-1.32
itiveness
-1.32
allowances
-1.31
stamps
-1.24
SHIP
-1.24
offerings
-1.23
staples
-1.23
POSITIVE LOGITS
ahu
1.68
ober
1.63
ivo
1.54
aeda
1.54
arch
1.46
aghd
1.46
ouf
1.45
nih
1.43
fi
1.43
az
1.42
Activations Density 0.003%