INDEX
    Explanations

    phrases related to physical harm or injury

    gerunds and participles in relation to actions or activities

    New Auto-Interp
    Negative Logits
    Present
    -0.79
    lance
    -0.73
    present
    -0.72
    κ
    -0.68
    draw
    -0.68
    ingen
    -0.68
    Jump
    -0.66
    writer
    -0.66
    staff
    -0.66
    lander
    -0.65
    POSITIVE LOGITS
    imentary
    0.85
    axy
    0.81
    gorith
    0.79
    azar
    0.78
    gorithm
    0.76
    ergic
    0.73
    enaries
    0.70
    ISTER
    0.70
    arine
    0.68
    abama
    0.68
    Act Density 0.016%

    No Known Activations