INDEX
Explanations
narrative elements that reflect complexity and challenges faced in various situations
Head Attr Weights
0:0.05
1:0.01
2:0.04
3:0.12
4:0.03
5:0.06
6:0.01
7:0.08
8:0.02
9:0.01
10:0.48
11:0.02
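
The weights above can be read as a per-head breakdown. A minimal sketch, assuming each entry is `head index:attribution weight` for 12 attention heads (the listing itself does not define the quantity), ranks the heads and totals the weights. The names `head_weights` and `rank_heads` are illustrative and not part of any tool.

```python
# Head attribution weights copied from the listing above (head index -> weight).
# Assumption: the keys are attention-head indices; the listing does not say so.
head_weights = {
    0: 0.05, 1: 0.01, 2: 0.04, 3: 0.12, 4: 0.03, 5: 0.06,
    6: 0.01, 7: 0.08, 8: 0.02, 9: 0.01, 10: 0.48, 11: 0.02,
}


def rank_heads(weights):
    """Return (head, weight) pairs sorted by weight, largest first."""
    return sorted(weights.items(), key=lambda kv: kv[1], reverse=True)


ranked = rank_heads(head_weights)
top_head, top_weight = ranked[0]

# Round to two decimals to avoid floating-point noise in the total.
total = round(sum(head_weights.values()), 2)

print(f"top head: {top_head} (weight {top_weight})")
print(f"sum of listed weights: {total}")
```

On these values, head 10 carries the largest weight (0.48), followed by head 3 (0.12), and the listed weights sum to 0.93.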
Negative Logits
Citation: -2.42
inspect: -2.07
probing: -2.07
ribute: -2.06
borrow: -2.03
inqu: -2.00
seek: -1.95
browse: -1.91
cove: -1.88
descendant: -1.88
Positive Logits
icably: 2.72
icable: 2.44
noticed: 2.40
smoothly: 2.40
Wrong: 2.37
unnoticed: 2.33
Success: 2.26
nown: 2.15
debacle: 2.13
wrong: 2.07
Activations Density 0.530%