INDEX
Explanations
phrases indicating a call for social justice and equality
instances of conditional phrases and qualifiers in discourse
New Auto-Interp
Negative Logits
ery
-0.70
edia
-0.70
eny
-0.66
etting
-0.62
agram
-0.61
eln
-0.60
icky
-0.58
qus
-0.58
gged
-0.58
ech
-0.58
POSITIVE LOGITS
except
1.53
except
1.48
Including
1.47
irrespective
1.35
including
1.35
including
1.33
regardless
1.29
imaginable
1.18
INCLUD
1.15
excluding
1.09
Activations Density 0.474%