INDEX
Explanations
instances where the text presents a counterpoint or contrast to an idea
the conjunction "but," often used to introduce contrasting ideas or exceptions
New Auto-Interp
Negative Logits
uto
-0.68
irl
-0.65
amus
-0.64
Times
-0.63
edu
-0.62
ige
-0.62
awan
-0.62
ories
-0.61
hal
-0.60
leck
-0.59
POSITIVE LOGITS
alas
1.26
nevertheless
1.25
nonetheless
1.16
fortunately
1.06
luckily
0.99
tons
0.93
chery
0.92
ultimately
0.92
hey
0.91
unfortunately
0.86
Activations Density 0.190%