INDEX
Explanations
comparisons and contrasts in sentences
expressions that indicate contrast or conditionality
New Auto-Interp
Negative Logits
DAQ
-0.75
hig
-0.73
exting
-0.73
bas
-0.71
SPONSORED
-0.71
orthy
-0.71
predec
-0.70
pez
-0.70
atl
-0.68
Rated
-0.68
POSITIVE LOGITS
we
0.80
they
0.72
acknowledging
0.71
spirits
0.68
fy
0.68
it
0.67
SOME
0.67
you
0.66
there
0.66
he
0.63
Activations Density 0.134%