INDEX
    Explanations

    questions and invitations for reader engagement

    New Auto-Interp
    Negative Logits
    umpt
    -0.17
     inquire
    -0.15
    ocz
    -0.15
    Äįka
    -0.15
    hora
    -0.14
    inst
    -0.14
    座
    -0.14
    Invoke
    -0.14
     lah
    -0.13
    licht
    -0.13
    POSITIVE LOGITS
     leave
    0.35
     share
    0.33
     let
    0.33
     Leave
    0.32
     Share
    0.32
    Leave
    0.31
    leave
    0.31
    Share
    0.30
    share
    0.30
     tell
    0.29
    Act Density 0.040%

    No Known Activations