INDEX
    Explanations

    words related to genetic mutations or alterations

    terms related to mutilation and its implications

    New Auto-Interp
    Negative Logits
    ¯¯
    -0.66
     Defenders
    -0.64
     park
    -0.64
    OUGH
    -0.62
    boarding
    -0.62
    board
    -0.61
     Fellow
    -0.61
     ours
    -0.60
     Standing
    -0.60
     Fulton
    -0.59
    POSITIVE LOGITS
    iple
    1.19
    agen
    1.14
    ually
    1.11
    ilated
    1.07
    ilation
    1.07
    atis
    1.06
    iny
    1.03
    ations
    0.96
    agi
    0.93
    reating
    0.92
    Act Density 0.025%

    No Known Activations