INDEX
    Explanations

    positive expressions and emotional reactions to artistic works

    Negative Logits
    Token              Logit
    " endast"          -0.55
    (missing)          -0.52
    "curator"          -0.52
    " terenie"         -0.52
    "例句"             -0.50
    "PhysRevLett"      -0.50
    " tegas"           -0.50
    "chaffenheit"      -0.49
    " Boletín"         -0.48
    "Convey"           -0.48
    Positive Logits
    Token                Logit
    "ThroughAttribute"   0.72
    " reading"           0.68
    " rewatch"           0.67
    " disambiguazione"   0.66
    " watching"          0.65
    " nahilalakip"       0.60
    "ValueGenerated"     0.59
    "watching"           0.59
    "RenderAtEndOf"      0.57
    " enjoyment"         0.57
    Activation Density: 0.337%

    No Known Activations