INDEX
Explanations
references to crises and significant challenges
Head Attr Weights
0:0.02
1:0.01
2:0.11
3:0.05
4:0.07
5:0.03
6:0.18
7:0.27
8:0.05
9:0.04
10:0.05
11:0.06
Negative Logits
srfAttach:-1.76
ansom:-1.75
rats:-1.61
ledge:-1.59
sheets:-1.52
Lies:-1.51
title:-1.46
ername:-1.43
intel:-1.41
nings:-1.40
Positive Logits
downtime:1.70
vity:1.64
flashbacks:1.56
encountering:1.54
improv:1.52
heightened:1.46
baugh:1.44
filming:1.44
discomfort:1.43
hearing:1.41
Activations Density 0.002%