INDEX
    Explanations

    phrases indicating a critical or evaluative perspective on films

    New Auto-Interp
    Negative Logits
     sát
    -0.15
    erset
    -0.15
    asco
    -0.14
    _rq
    -0.14
    Seriously
    -0.14
    omor
    -0.14
     LAP
    -0.14
    ovu
    -0.14
    æĬľ
    -0.14
    retim
    -0.13
    POSITIVE LOGITS
     decent
    0.25
     OK
    0.17
     nicely
    0.17
     ok
    0.16
     okay
    0.16
    iler
    0.16
    nic
    0.15
     redeem
    0.15
     nice
    0.15
    basic
    0.15
    Act Density 0.290%

    No Known Activations