INDEX
    Explanations

    references to interpersonal relationships and emotional experiences

    New Auto-Interp
    Negative Logits
     hopefully
    -0.17
    Hopefully
    -0.16
    hopefully
    -0.15
     tonight
    -0.15
    Tonight
    -0.14
     Hopefully
    -0.14
    okable
    -0.14
    orge
    -0.14
    /wiki
    -0.14
    ä½ķ
    -0.14
    POSITIVE LOGITS
    they
    0.18
     PureComponent
    0.17
     they
    0.16
     basically
    0.15
     during
    0.15
    They
    0.15
    ]={↵
    0.15
    when
    0.15
     spoiler
    0.15
    during
    0.15
    Act Density 0.355%

    No Known Activations