INDEX
Explanations
sentences conveying a sense of tension or challenge
instances of the word "but" indicating contrast or complications in narratives
New Auto-Interp
Negative Logits
UD
-0.73
pron
-0.72
pec
-0.72
tnc
-0.71
NULL
-0.67
AMP
-0.67
usal
-0.65
actionDate
-0.64
igmat
-0.63
ungle
-0.62
POSITIVE LOGITS
luckily
1.52
fortunately
1.51
nevertheless
1.47
nonetheless
1.42
thankfully
1.27
hey
1.25
mirac
0.90
otherwise
0.88
somehow
0.87
tolerated
0.87
Activations Density 0.216%