INDEX
    Explanations

    references to in-depth investigation or thorough analysis

    New Auto-Interp
    Negative Logits
     Alive
    -0.72
    pal
    -0.69
    Frames
    -0.69
    gged
    -0.67
    alias
    -0.67
    nery
    -0.67
    Safe
    -0.66
    fighter
    -0.66
    pos
    -0.66
    Earth
    -0.65
    POSITIVE LOGITS
     amounts
    0.96
     extensive
    0.82
     portions
    0.80
     thorough
    0.78
     quantities
    0.77
    mble
    0.74
     tracts
    0.73
     disadvantages
    0.72
     stretches
    0.72
     expansions
    0.70
    Act Density 0.008%

    No Known Activations