INDEX
    Explanations

    possessive forms of "is."

    New Auto-Interp
    Negative Logits
    ’s
    -0.21
    ’na
    -0.20
    ’n
    -0.17
    ’t
    -0.17
    ën
    -0.17
    ’re
    -0.16
    tsky
    -0.16
    forms
    -0.16
    sher
    -0.15
    ’in
    -0.15
    POSITIVE LOGITS
     Worth
    0.18
    -'
    0.17
    /'
    0.17
     worth
    0.17
    itter
    0.15
    assy
    0.14
    tatus
    0.14
     sake
    0.14
     Guide
    0.13
    ibs
    0.13
    Act Density 0.118%

    No Known Activations