INDEX
Explanations
instances of words indicating a contrast or contradiction
the conjunction "but" indicating contrast or opposition in statements
New Auto-Interp
Negative Logits
sky
-0.70
ige
-0.67
tnc
-0.65
itto
-0.65
oir
-0.65
Times
-0.64
roy
-0.63
ump
-0.63
obyl
-0.63
cloth
-0.63
POSITIVE LOGITS
fortunately
1.09
nevertheless
1.08
luckily
1.07
alas
1.06
nonetheless
0.98
thankfully
0.93
tons
0.89
ersen
0.87
chery
0.85
unfortunately
0.84
Activations Density 0.186%