INDEX
    Explanations

    terms related to physical arms or weapons

    New Auto-Interp
    Negative Logits
    light
    -0.68
    ples
    -0.61
    ters
    -0.61
    DER
    -0.58
    PER
    -0.57
    charged
    -0.57
    rers
    -0.56
    faced
    -0.56
    TER
    -0.55
    flush
    -0.55
    POSITIVE LOGITS
    ament
    1.17
    aments
    1.13
    aceutical
    1.06
    ando
    1.05
    ageddon
    1.02
    chair
    0.96
    agnetic
    0.93
    ally
    0.93
    heid
    0.91
    ophon
    0.91
    Act Density 1.431%

    No Known Activations