INDEX
Explanations
responses to requests or comments in news articles
Negative Logits
Surv         -0.72
termin       -0.69
conserv      -0.68
riet         -0.68
ILCS         -0.65
kil          -0.65
istan        -0.64
hered        -0.64
artifacts    -0.64
flix         -0.62
Positive Logits
requests      1.43
criticism     1.39
criticisms    1.31
inquiries     1.29
queries       1.20
complaints    1.20
suggestions   1.14
request       1.10
objections    1.08
accusations   1.08
Activations Density: 1.648%