INDEX
Explanations
quotes or reported statements in a news context
New Auto-Interp
Negative Logits
sidx
-0.75
imposed
-0.74
ILCS
-0.73
adesh
-0.71
elsh
-0.69
mun
-0.65
Interstitial
-0.65
mite
-0.65
aband
-0.65
racted
-0.65
POSITIVE LOGITS
us
1.23
tale
1.18
me
1.14
him
1.02
ingly
0.98
reporters
0.90
them
0.83
listeners
0.81
viewers
0.78
passers
0.76
Activations Density 1.571%