INDEX
Explanations
instances of contrasting perspectives or characterizations within narratives
Head Attr Weights
0:0.02
1:0.02
2:0.02
3:0.03
4:0.09
5:0.04
6:0.03
7:0.05
8:0.02
9:0.19
10:0.02
11:0.43
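
The head attribution weights above can be ranked to see which heads contribute most. The following is a minimal sketch, assuming the `head:weight` pairs are read as a mapping from attention-head index to attribution weight. The names `head_attr_weights` and `top_heads` are hypothetical and do not come from the dashboard itself.

```python
# Head attribution weights copied from the listing above (head index -> weight).
# Assumption: each "i:w" entry means attention head i has attribution weight w.
head_attr_weights = {
    0: 0.02, 1: 0.02, 2: 0.02, 3: 0.03,
    4: 0.09, 5: 0.04, 6: 0.03, 7: 0.05,
    8: 0.02, 9: 0.19, 10: 0.02, 11: 0.43,
}


def top_heads(weights, k=2):
    """Return the k (head, weight) pairs with the largest weights."""
    return sorted(weights.items(), key=lambda kv: kv[1], reverse=True)[:k]


# Sum of all listed weights (may differ slightly from 1 because of rounding).
total = sum(head_attr_weights.values())

# The two heads with the largest attribution, and their share of the total.
top = top_heads(head_attr_weights)
top_share = sum(w for _, w in top) / total

print(top)
print(round(total, 2), round(top_share, 2))
```

Under this reading, heads 11 and 9 rank first, and together they account for roughly two thirds of the listed attribution. The listed weights sum to about 0.96 rather than 1, presumably because each value is rounded to two decimals.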
Negative Logits
leneck       -2.05
anned        -2.00
landowners   -2.00
stash        -1.96
ochet        -1.88
Founders     -1.87
cler         -1.86
backlog      -1.86
angler       -1.83
throats      -1.83
Positive Logits
manner       4.16
ways         4.07
way          3.56
terms        3.52
sense        3.43
fashion      3.29
guise        3.23
WAY          3.13
respects     2.95
fashion      2.83
Activations Density 0.154%