INDEX
    Explanations

    the presence of news reporting and editorial commentary language

    New Auto-Interp
    Negative Logits
    amera
    -0.07
    Ñĩив
    -0.07
    ênh
    -0.07
    олÑĮно
    -0.06
    orage
    -0.06
    implify
    -0.06
    azz
    -0.06
     ret
    -0.06
    udge
    -0.06
    ippet
    -0.06
    POSITIVE LOGITS
     NPR
    0.08
     listeners
    0.08
    Listeners
    0.08
    Listener
    0.07
     listener
    0.07
     listen
    0.07
     baise
    0.07
    listen
    0.07
    listeners
    0.07
     listens
    0.06
    Act Density 0.003%

    No Known Activations