INDEX
    Explanations

    phrases indicating caution or premature conclusions about future events

    New Auto-Interp
    Negative Logits
    amma
    -0.15
    gua
    -0.14
    177
    -0.14
    nier
    -0.14
     afterwards
    -0.14
    ucken
    -0.14
    unden
    -0.13
    زا
    -0.13
    é¡ĺ
    -0.13
    queda
    -0.13
    POSITIVE LOGITS
     premature
    0.60
     early
    0.55
    early
    0.52
     prematurely
    0.51
     Early
    0.49
    Early
    0.47
    æĹ©
    0.42
     too
    0.41
     Prem
    0.40
    Prem
    0.39
    Act Density 0.138%

    No Known Activations