INDEX
    Explanations

    the word 'te' or 'te' followed by a number

    the word "te" in various contexts throughout the document

    Negative Logits
    Token             Logit
    "interrupted"     -0.73
    "ipolar"          -0.68
    "antha"           -0.66
    "olicy"           -0.62
    " sweep"          -0.62
    "lessly"          -0.62
    "usher"           -0.61
    "rha"             -0.60
    "OST"             -0.60
    " allowances"     -0.60
    Positive Logits
    Token             Logit
    "brate"           1.54
    "brates"          1.48
    "ller"            1.20
    "llers"           1.18
    "levision"        1.18
    "achers"          1.12
    "llo"             1.09
    "lla"             1.04
    "achable"         1.04
    "eve"             1.03
    Activation Density: 0.033%

    No Known Activations