INDEX
    Explanations

    words related to positive impact or approval

    New Auto-Interp
    Negative Logits
    anos
    -0.65
    bley
    -0.62
    atten
    -0.59
    mare
    -0.58
    kas
    -0.58
    zan
    -0.55
    mur
    -0.55
    mber
    -0.55
    rys
    -0.55
    inity
    -0.55
    POSITIVE LOGITS
     as
    0.64
    },"
    0.63
    onse
    0.61
    ]);
    0.59
    Parameters
    0.59
     };
    0.58
    FTWARE
    0.58
    isSpecialOrderable
    0.57
    atically
    0.56
    .:
    0.56
    Act Density 0.933%

    No Known Activations