INDEX
    Explanations

    phrases and terms indicating costs or value judgments

    New Auto-Interp
    Negative Logits
    .googlecode
    -0.16
    yps
    -0.15
     Downs
    -0.15
    atel
    -0.14
    ctors
    -0.14
     Wonder
    -0.14
    Ñıн
    -0.14
    à¸ģà¸ķ
    -0.14
    ekten
    -0.14
    itur
    -0.13
    POSITIVE LOGITS
    auth
    0.27
     authentic
    0.26
     Stitch
    0.26
     Authentic
    0.26
     outlet
    0.25
    /Auth
    0.25
    .auth
    0.25
    jer
    0.25
     auth
    0.24
    Auth
    0.24
    Act Density 0.031%

    No Known Activations