INDEX
    Explanations

    expressions of repetition or recurrence

    New Auto-Interp
    Negative Logits
    lrt
    -0.15
     gratuiti
    -0.15
    wdx
    -0.15
    -Semit
    -0.15
    achable
    -0.15
     giả
    -0.15
    posables
    -0.14
    ÛĮÙĨÙĩ
    -0.14
    antes
    -0.14
    advertisement
    -0.14
    POSITIVE LOGITS
     gain
    0.33
     ag
    0.32
    gain
    0.28
     Gain
    0.27
    ag
    0.25
    Gain
    0.25
    _gain
    0.22
     AN
    0.22
    -ag
    0.21
     against
    0.20
    Act Density 0.011%

    No Known Activations