INDEX
    Explanations

    words related to promotions or bonuses

    New Auto-Interp
    Negative Logits
    <bos>
    -0.97
    -0.59
    public
    -0.57
    -0.54
    //
    -0.54
     become
    -0.52
    /*
    -0.52
    //
    -0.52
    ੱਚ
    -0.52
    protected
    -0.51
    POSITIVE LOGITS
     starter
    2.49
     Starter
    2.39
    Starter
    2.12
     starters
    2.06
    starter
    2.04
     affor
    1.25
     lidl
    1.23
     stockholm
    1.22
     maneu
    1.14
     scrat
    1.12
    Act Density 0.121%

    No Known Activations