INDEX
Explanations
contractions of the word "are" in sentences
the contraction "we're"
Negative Logits
concealed    -0.58
att          -0.56
prior        -0.55
sund         -0.55
observation  -0.53
entry        -0.53
online       -0.52
herald       -0.52
following    -0.52
abroad       -0.52
Positive Logits
're    2.97
've    1.92
'll    1.68
'd     1.46
aren   1.43
'm     1.41
weren  1.38
are    1.25
Are    1.22
ARE    1.15
Activations Density 0.034%