INDEX
    Explanations

    numbers within text

    numerical or chronological references in a text

    Negative Logits
    heit            -0.70
    boro            -0.64
     diseng         -0.63
    ardless         -0.62
    ocial           -0.60
    umni            -0.59
    abouts          -0.59
    isexual         -0.56
    activity        -0.54
    assador         -0.53
    Positive Logits
    Scroll          0.75
    However         0.69
     Debor          0.68
    reditary        0.68
    ECK             0.66
    Until           0.65
    ³³³             0.63
    Lear            0.63
    Specifically    0.63
    SEE             0.63
    Activation Density 0.627%

    No Known Activations