INDEX
Explanations
references to media criticism and public reactions to current events
New Auto-Interp
Negative Logits
pgsql
-0.15
uers
-0.15
attern
-0.14
ivic
-0.14
วล
-0.14
athy
-0.14
Damn
-0.14
emoc
-0.14
polit
-0.14
atter
-0.13
POSITIVE LOGITS
softball
0.16
ped
0.16
trafficking
0.15
puff
0.15
sik
0.14
chy
0.14
pur
0.14
misleading
0.14
spinner
0.14
onenumber
0.14
Activations Density 0.059%