INDEX
Explanations
phrases indicating a comparison, exception, or differentiation
comparative phrases that indicate exceptions or alternatives
New Auto-Interp
Negative Logits
orian
-0.78
ahime
-0.73
fell
-0.71
foreseen
-0.71
uese
-0.70
natureconservancy
-0.70
oro
-0.70
ãĥł
-0.69
Cosponsors
-0.67
izer
-0.67
POSITIVE LOGITS
oneself
0.80
maybe
0.79
those
0.74
perhaps
0.74
possibly
0.71
ours
0.70
namely
0.70
theirs
0.70
ourselves
0.69
hers
0.66
Activations Density 0.054%