INDEX
    Explanations

    words related to involvement or participation

    New Auto-Interp
    Negative Logits
     unchecked
    -0.66
    launch
    -0.66
    enberg
    -0.64
    ear
    -0.63
    \\\\\\\\
    -0.61
     testament
    -0.59
     asylum
    -0.59
    thur
    -0.59
     supremacy
    -0.59
    ework
    -0.58
    POSITIVE LOGITS
    enza
    0.86
     therein
    0.82
    imental
    0.75
    hips
    0.73
    iers
    0.72
    lees
    0.68
    inity
    0.67
    iments
    0.67
    atively
    0.67
    iltr
    0.66
    Act Density 2.582%

    No Known Activations