INDEX
    Explanations

    phrases related to the value of human lives

    New Auto-Interp
    Negative Logits
     elek
    -0.91
     uhr
    -0.91
     kask
    -0.88
     karton
    -0.86
     silikon
    -0.86
     kram
    -0.84
     naer
    -0.84
     makro
    -0.83
     quoc
    -0.81
     moza
    -0.81
    POSITIVE LOGITS
     lives
    1.15
    lives
    1.04
     Lives
    0.99
     LIVES
    0.96
    Lives
    0.95
     life
    0.87
    life
    0.85
     lived
    0.80
    Life
    0.78
     LIFE
    0.78
    Act Density 0.062%

    No Known Activations