INDEX
    Explanations

    phrases related to negative descriptions or criticism

    descriptors related to negative experiences or qualities

    New Auto-Interp
    Negative Logits
    RIC
    -0.75
    riz
    -0.74
    INTON
    -0.69
     scholarship
    -0.67
    rigan
    -0.66
    lite
    -0.65
    rique
    -0.64
    work
    -0.64
    leaders
    -0.64
    rist
    -0.63
    POSITIVE LOGITS
    éĹĺ
    0.83
    士
    0.82
     Logic
    0.71
    dayName
    0.70
     Archangel
    0.70
     Brach
    0.70
     Stress
    0.69
     Tyrann
    0.68
     Beasts
    0.68
     Beast
    0.68
    Act Density 0.040%

    No Known Activations