INDEX
    Explanations

    positive descriptions of acting performances

    New Auto-Interp
    Negative Logits
    ilaire
    -0.46
    Ptr
    -0.44
    ην
    -0.44
     simplifié
    -0.42
    setBorder
    -0.42
    ieteur
    -0.41
    clearInterval
    -0.41
    歌词
    -0.41
    節目
    -0.41
     Sécurité
    -0.41
    POSITIVE LOGITS
     actors
    1.08
     actor
    0.99
     actress
    0.97
     Actor
    0.92
     Actors
    0.92
     actresses
    0.92
    Actors
    0.89
    actors
    0.86
     acting
    0.83
    Actor
    0.82
    Act Density 0.198%

    No Known Activations