INDEX
    Explanations

    words related to conflict and negative impacts like war, injury, disease, and danger

    themes related to conflict, suffering, and moral implications of actions

    New Auto-Interp
    Negative Logits
     referen
    -0.66
     Liv
    -0.62
    ofi
    -0.62
     Marketplace
    -0.61
    abase
    -0.61
     trademark
    -0.60
     Electoral
    -0.59
     Ri
    -0.58
    ynchron
    -0.58
     Socket
    -0.57
    POSITIVE LOGITS
    flies
    0.97
    killers
    0.88
    istically
    0.86
    bows
    0.86
    lessly
    0.84
    worms
    0.83
    lessness
    0.82
    bugs
    0.82
    seekers
    0.82
    ously
    0.81
    Act Density 0.363%

    No Known Activations