INDEX
    Explanations

    instances of criticism or negative commentary about films

    New Auto-Interp
    Negative Logits
    zyst
    -0.20
    ugin
    -0.17
    uhan
    -0.15
    OLID
    -0.15
    umo
    -0.14
    odash
    -0.14
    ucha
    -0.14
    èıĮ
    -0.14
     hopefully
    -0.14
     Buch
    -0.14
    POSITIVE LOGITS
    iddi
    0.16
     Messenger
    0.16
     hack
    0.15
     artificial
    0.14
    generic
    0.14
    Messenger
    0.14
     conveniently
    0.14
    権
    0.14
    misc
    0.14
    733
    0.14
    Act Density 0.132%

    No Known Activations