INDEX
    Explanations

    phrases related to media discourse and alleged misinformation

    New Auto-Interp
    Negative Logits
     UsersController
    -0.15
    iland
    -0.15
     Usa
    -0.14
    Į¨
    -0.14
     ç¦
    -0.13
    inya
    -0.13
    zego
    -0.13
    empo
    -0.13
    ÄĻ
    -0.13
    lease
    -0.13
    POSITIVE LOGITS
     misunder
    0.18
     Greatest
    0.17
    progress
    0.15
     Progress
    0.15
     misunderstood
    0.15
    woke
    0.15
    Progress
    0.15
     progress
    0.14
    verage
    0.14
     Sax
    0.14
    Act Density 0.182%

    No Known Activations