INDEX
    Explanations

    instances of admitting mistakes or being proven wrong

    New Auto-Interp
    Negative Logits
    iÃŃ
    -0.17
    linger
    -0.15
    alc
    -0.14
    .bio
    -0.14
    azar
    -0.14
    anel
    -0.14
    имÑĥ
    -0.14
    ÙĴت
    -0.13
    ứ
    -0.13
     sap
    -0.13
    POSITIVE LOGITS
     proved
    0.16
    ears
    0.16
     silenced
    0.15
     proving
    0.15
    ampa
    0.15
     disp
    0.15
    egov
    0.15
    obsolete
    0.15
    sil
    0.15
    ковод
    0.14
    Act Density 0.105%

    No Known Activations