INDEX
    Explanations

    punctuations and transitional phrases indicating argument flow

    New Auto-Interp
    Negative Logits
       
    -0.17
    eling
    -0.15
    oran
    -0.14
    _Impl
    -0.14
    arella
    -0.14
    erva
    -0.14
    à¸Īะà¹Ħà¸Ķ
    -0.14
    åĴ²
    -0.14
    phis
    -0.13
    pressions
    -0.13
    POSITIVE LOGITS
     Fol
    0.20
    esel
    0.19
    Attempt
    0.16
    åĬ¡
    0.16
     exactly
    0.16
    à¥ģण
    0.15
     Bene
    0.14
    remen
    0.14
    åIJ¯
    0.14
     inch
    0.14
    Act Density 0.065%

    No Known Activations