INDEX
Explanations
phrases with opposing viewpoints or actions
the conjunction "but" indicating contrast or opposition in statements
New Auto-Interp
Negative Logits
nce
-0.66
resa
-0.63
tnc
-0.60
mary
-0.60
vance
-0.59
ammu
-0.59
lish
-0.58
uto
-0.58
venue
-0.58
ogical
-0.58
POSITIVE LOGITS
tons
1.15
chery
1.04
alas
0.87
chers
0.86
nevertheless
0.85
tered
0.80
luckily
0.80
fortunately
0.79
nonetheless
0.79
ler
0.76
Activations Density 0.105%