INDEX
    Explanations

    short, brief statements or comments within a text

    New Auto-Interp
    Negative Logits
    eworld
    -0.75
    ammed
    -0.73
    existent
    -0.71
    ardless
    -0.71
    aden
    -0.71
    Reilly
    -0.69
    uvian
    -0.69
    ankind
    -0.68
    "},"
    -0.68
    ð
    -0.68
    POSITIVE LOGITS
     note
    1.37
     caveat
    1.36
     disclaimer
    1.34
     caveats
    1.29
     reminder
    1.19
     recap
    1.13
     refres
    1.12
     clarification
    1.11
     Note
    1.10
     notes
    1.01
    Act Density 0.136%

    No Known Activations