Explanations
negations or contractions with "aren't"
negative contractions of the verb "are."
Negative Logits
assi          -0.58
UNHCR         -0.55
com           -0.51
amaz          -0.49
continuous    -0.48
Ext           -0.48
undert        -0.48
APP           -0.48
ces           -0.48
conv          -0.48
Positive Logits
aren           2.84
weren          2.57
haven          1.78
shouldn        1.49
isn            1.48
don            1.45
Aren           1.42
wouldn         1.35
ain            1.34
hadn           1.28
Activations Density 0.018%