INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    sp
    -0.11
    adamente
    -0.10
    soever
    -0.10
    teenth
    -0.10
    (strict
    -0.09
    ména
    -0.09
    spi
    -0.09
    ngo
    -0.09
    ingly
    -0.09
    ypi
    -0.08
    POSITIVE LOGITS
     Presence
    0.18
     presence
    0.17
    presence
    0.15
    Presence
    0.14
    ences
    0.14
    ence
    0.14
    idon
    0.12
    -abs
    0.11
    -day
    0.11
    ential
    0.11
    Act Density 0.031%

    No Known Activations