INDEX
    Explanations

    terms related to things that are hidden, undisclosed, or not widely known

    prefixes related to negation or lack

    New Auto-Interp
    Negative Logits
    =-=-=-=-=-=-=-=-
    -0.75
     bull
    -0.74
     bulls
    -0.72
     sarc
    -0.67
     sucking
    -0.67
     simulator
    -0.63
     initials
    -0.62
     Knights
    -0.62
     straw
    -0.61
     rooting
    -0.60
    POSITIVE LOGITS
    leased
    1.58
    achable
    1.48
    ported
    1.47
    ason
    1.30
    ached
    1.26
    peat
    1.25
    vised
    1.20
    ired
    1.13
    acted
    1.10
    emed
    1.03
    Act Density 0.028%

    No Known Activations