INDEX
Explanations
sentences containing the word "would" followed by a verb in the infinitive form
conditional statements or hypothetical scenarios
New Auto-Interp
Negative Logits
noticed
-0.70
Named
-0.68
Cosponsors
-0.65
Brill
-0.63
Vis
-0.59
Returning
-0.59
Xuan
-0.57
lobb
-0.57
Territories
-0.56
SUN
-0.56
POSITIVE LOGITS
imply
1.16
seem
1.12
be
1.06
require
1.06
surely
1.04
certainly
1.02
entail
1.01
ordinarily
0.99
constitute
0.98
violate
0.98
Activations Density 0.144%