INDEX
Explanations
conjunctions, specifically the word "but"
the word "but" indicating contrast or contradiction
New Auto-Interp
Negative Logits
agra
-0.79
tnc
-0.72
itto
-0.67
idon
-0.64
entry
-0.62
analysis
-0.62
代
-0.62
itaire
-0.61
olution
-0.61
venue
-0.61
POSITIVE LOGITS
tons
1.10
chery
0.83
chers
0.75
ts
0.74
nevertheless
0.72
still
0.69
alas
0.69
nonetheless
0.68
hey
0.64
tern
0.63
Activations Density 0.107%