INDEX
    Explanations

    variations of the word "slightly" indicating small degrees of change or difference

    New Auto-Interp
    Negative Logits
    ars
    -0.16
    eters
    -0.15
     plein
    -0.15
    ource
    -0.14
    like
    -0.14
    lot
    -0.14
    stairs
    -0.14
    èά
    -0.14
    dependent
    -0.14
    eter
    -0.14
    POSITIVE LOGITS
    /stdc
    0.17
    y
    0.17
    .ly
    0.17
    umen
    0.17
    ingly
    0.17
    omore
    0.16
    /errors
    0.15
    /mod
    0.15
     (<
    0.15
    ewan
    0.15
    Act Density 0.015%

    No Known Activations