INDEX
    Explanations

    phrases related to making claims or accusations

    empty or non-conventional text markers

    New Auto-Interp
    Negative Logits
    emale
    -0.52
     afterwards
    -0.52
    ornings
    -0.51
    chuk
    -0.51
    *.
    -0.50
    ée
    -0.49
    worth
    -0.48
    rade
    -0.48
     alas
    -0.48
    arettes
    -0.48
    POSITIVE LOGITS
     same
    0.96
     aforementioned
    0.83
     latest
    0.82
     latter
    0.81
    ses
    0.76
     following
    0.75
     simplest
    0.75
     hottest
    0.73
     fastest
    0.73
     largest
    0.73
    Act Density 1.169%

    No Known Activations