INDEX
Explanations
phrases related to negative consequences or problems
mentions of public health and environmental concerns
New Auto-Interp
Negative Logits
answered
-0.68
********************************
-0.61
Thu
-0.58
arial
-0.58
Lit
-0.57
interstitial
-0.56
\/
-0.55
Identification
-0.55
=\"
-0.54
Quote
-0.54
POSITIVE LOGITS
but
1.31
but
1.30
But
0.96
BUT
0.94
But
0.91
nor
0.79
}{0.76
BUT
0.74
})
0.73
though
0.73
Activations Density 0.307%