INDEX
    Explanations

    words related to physical or emotional suffering or hardship

    references to the concept of "grueling" experiences or conditions

    New Auto-Interp
    Negative Logits
     Doctrine
    -0.67
     FISA
    -0.66
    itect
    -0.63
     Izan
    -0.62
     Mirror
    -0.62
     harbor
    -0.61
     Surveillance
    -0.60
     IMAGES
    -0.59
     Partners
    -0.59
     Perception
    -0.58
    POSITIVE LOGITS
    ppo
    1.00
     gru
    0.97
    eling
    0.96
    ppy
    0.93
    ache
    0.89
    ¸
    0.89
    esome
    0.88
    grim
    0.85
    ppe
    0.83
    arse
    0.83
    Act Density 0.016%

    No Known Activations