INDEX
    Explanations

    phrases related to free offers and promotions

    New Auto-Interp
    Negative Logits
    нÑĥ
    -0.15
    pike
    -0.15
     pricey
    -0.15
    la
    -0.14
    ubu
    -0.14
    460
    -0.14
    arians
    -0.14
     freopen
    -0.14
    que
    -0.14
    phalt
    -0.14
    POSITIVE LOGITS
    bies
    0.40
    bie
    0.39
    zers
    0.26
    zing
    0.25
    -standing
    0.24
    /free
    0.24
    zes
    0.23
    zer
    0.23
    bsd
    0.22
    -floating
    0.21
    Act Density 0.030%

    No Known Activations