INDEX
    Explanations

    expressions of preference and enjoyment

    New Auto-Interp
    Negative Logits
     Asher
    -0.70
    <h6>
    -0.69
    ExecuteReader
    -0.68
    MockBean
    -0.68
     brazos
    -0.67
    èvre
    -0.65
     Crowe
    -0.64
    রণ
    -0.64
    ZEL
    -0.63
     Brant
    -0.63
    POSITIVE LOGITS
     liked
    1.40
     liking
    1.27
     Liked
    1.25
     likes
    1.24
     Likes
    1.22
    likes
    1.14
    dislike
    1.14
    liked
    1.11
    Likes
    1.10
     gusta
    1.08
    Act Density 0.062%

    No Known Activations