INDEX
Explanations
instances of exceptions or exclusions
phrases expressing conditional or qualifying contexts
New Auto-Interp
Negative Logits
rift
-0.79
orthy
-0.78
ongyang
-0.76
yssey
-0.72
kefeller
-0.72
orian
-0.72
ahead
-0.68
Cosponsors
-0.67
oured
-0.67
cean
-0.66
POSITIVE LOGITS
occasional
0.83
exceptions
0.80
caveats
0.74
caveat
0.68
occasionally
0.67
ones
0.66
Exception
0.66
emort
0.65
glaring
0.64
faint
0.63
Activations Density 0.093%