INDEX
    Explanations

    phrases related to human existence or characteristics

    references to the concept of being human

    New Auto-Interp
    Negative Logits
    Els
    -0.73
     Mines
    -0.71
    NRS
    -0.70
    éĹĺ
    -0.70
     Passage
    -0.64
    ories
    -0.63
     redundancy
    -0.62
    ICE
    -0.62
    yss
    -0.62
     Deadly
    -0.61
    POSITIVE LOGITS
     alive
    0.91
    hood
    0.81
     who
    0.81
     lived
    0.81
     judged
    0.79
     inhab
    0.78
     born
    0.76
     endowed
    0.75
    atos
    0.74
     beings
    0.74
    Act Density 0.030%

    No Known Activations