INDEX
    Explanations

    phrases related to loss or danger to human life

    references to human lives and their significance in various contexts, particularly those involving risk or loss

    Negative Logits
    Token           Logit
    "CAST"          -0.70
    "ane"           -0.70
    "NetMessage"    -0.64
    "atorial"       -0.62
    "Marginal"      -0.62
    "ripp"          -0.60
    "ractive"       -0.60
    "ggles"         -0.59
    " IDE"          -0.58
    "iasis"         -0.58
    Positive Logits
    Token           Logit
    " lives"        0.86
    "chool"         0.84
    "journal"       0.83
    "lihood"        0.82
    "sole"          0.81
    "©¶æ"           0.77
    " Forever"      0.76
    "ynthesis"      0.74
    "pun"           0.73
    "bage"          0.72
    Activation Density: 0.014%

    No Known Activations