    Explanations

    words related to hindering or preventing something

    the word "deter" in various contexts

    Negative Logits
    Token              Logit
    "ammy"             -0.78
    "olini"            -0.78
    "ocene"            -0.75
    " halls"           -0.73
    " rooft"           -0.67
    "hetti"            -0.65
    "ioch"             -0.64
    "ocalypse"         -0.64
    " appointments"    -0.62
    " openings"        -0.60
    Positive Logits
    Token              Logit
    "ministic"         1.84
    "minist"           1.50
    "rence"            1.05
    "gent"             1.00
    "ior"              0.96
    "ring"             0.94
    "ency"             0.87
    "rer"              0.87
    "red"              0.86
    " deter"           0.85
    Activation Density: 0.018%

    No Known Activations