INDEX
Explanations
significant temporal indicators and dates within the text
New Auto-Interp
Head Attr Weights
0:0.03
1:0.01
2:0.12
3:0.06
4:0.07
5:0.03
6:0.10
7:0.32
8:0.03
9:0.02
10:0.07
11:0.10
Negative Logits
distraction
-1.37
soType
-1.36
姫
-1.33
behavi
-1.32
emis
-1.30
REF
-1.29
IELD
-1.28
vigilant
-1.25
neglected
-1.25
WF
-1.24
POSITIVE LOGITS
Lancaster
1.45
phabet
1.42
runs
1.42
qus
1.41
sburgh
1.38
sequ
1.36
playthrough
1.35
assembly
1.34
cible
1.32
sembly
1.32
Activations Density 0.007%