INDEX
Explanations
phrases containing the word "but"
instances of the word "but" indicating contrasting statements or objections
New Auto-Interp
Negative Logits
pmwiki
-0.77
famous
-0.71
Written
-0.65
fuck
-0.65
catch
-0.62
Merch
-0.61
MY
-0.61
ghost
-0.60
Purchase
-0.59
代
-0.59
POSITIVE LOGITS
noting
1.15
cautioned
1.08
concedes
1.03
stressing
1.02
admits
1.01
acknowledging
0.95
acknowledges
0.94
disagreed
0.92
stressed
0.91
nonetheless
0.91
Activations Density 0.370%