Explanations
phrases related to historical events and milestones
Head Attribution Weights
0:0.02
1:0.03
2:0.10
3:0.06
4:0.18
5:0.03
6:0.03
7:0.31
8:0.03
9:0.04
10:0.05
11:0.06
Negative Logits
udder      -1.75
ascript    -1.70
anamo      -1.57
aido       -1.39
swer       -1.39
gency      -1.38
aimon      -1.35
qt         -1.34
aft        -1.32
▓          -1.31
Positive Logits
Invasion   1.78
hegemony   1.53
CLASS      1.45
魔         1.43
citation   1.41
appropri   1.39
Forbes     1.38
ESCO       1.38
UNESCO     1.38
Recall     1.34
Activations Density 0.001%