INDEX
    Explanations

    references to changes or modifications within a context

    New Auto-Interp
    Negative Logits
    mius
    -0.56
     j
    -0.54
     get
    -0.54
     st
    -0.54
     mo
    -0.53
     kadang
    -0.53
    zeti
    -0.53
     so
    -0.53
    そもそも
    -0.53
    ,
    -0.51
    POSITIVE LOGITS
     changes
    1.55
    Changes
    1.46
     Changes
    1.46
    changes
    1.39
    CHANGES
    1.32
     CHANGES
    1.29
     itſelf
    1.13
     improvements
    1.13
     amendments
    1.11
     changements
    1.09
    Act Density 0.280%

    No Known Activations