INDEX
    Explanations

    references to historical or legal events and developments

    New Auto-Interp
    Negative Logits
     myſelf
    -0.61
     leſs
    -0.59
    ſelf
    -0.59
    يكب
    -0.59
    Tembelea
    -0.57
     itſelf
    -0.56
     houſe
    -0.56
     himſelf
    -0.54
    tagHelperRunner
    -0.54
     препратки
    -0.54
    POSITIVE LOGITS
     dopiero
    0.81
     until
    0.70
     Until
    0.59
    Until
    0.59
    until
    0.57
    直到
    0.56
    ようやく
    0.52
    やっと
    0.52
     untill
    0.51
     aż
    0.48
    Act Density 0.454%

    No Known Activations