INDEX
Explanations
references to significant moments in time, particularly focusing on personal or historical milestones
New Auto-Interp
Head Attr Weights
0:0.06
1:0.01
2:0.34
3:0.07
4:0.04
5:0.08
6:0.02
7:0.05
8:0.10
9:0.04
10:0.10
11:0.05
Negative Logits
rored
-1.31
resistant
-1.09
vulner
-1.06
ignty
-1.05
mun
-1.01
rongh
-1.00
ん
-1.00
zai
-0.99
accordingly
-0.97
ancial
-0.94
POSITIVE LOGITS
standpoint
2.39
perspective
1.82
ratch
1.76
vantage
1.64
outset
1.52
viewpoint
1.47
confines
1.39
inception
1.38
beginnings
1.36
outer
1.29
Activations Density 0.335%