INDEX
    Explanations

    words expressing positivity and admiration

    New Auto-Interp
    Negative Logits
    IsContent
    -0.69
    gypti
    -0.64
     almeno
    -0.62
    Slf
    -0.62
     allegedly
    -0.60
     autora
    -0.60
    AdapterView
    -0.57
     silencio
    -0.55
    CacheManager
    -0.55
     Cassius
    -0.55
    POSITIVE LOGITS
     wonderful
    3.04
    wonderful
    2.76
    Wonderful
    2.57
     Wonderful
    2.56
     marvelous
    2.35
     marvellous
    2.18
     fantastic
    2.16
     fabulous
    2.05
     terrific
    2.04
     wonderfully
    2.02
    Act Density 0.052%

    No Known Activations