INDEX
    Explanations

    words related to mental health and unstable behavior

    words related to chaos or disorder

    New Auto-Interp
    Negative Logits
     sponsors
    -0.69
     sponsor
    -0.64
     treatment
    -0.60
    wait
    -0.58
    resources
    -0.57
     color
    -0.57
     sponsorship
    -0.56
     provider
    -0.55
     commercials
    -0.55
     author
    -0.55
    POSITIVE LOGITS
    anged
    4.21
    anging
    2.73
    anges
    1.76
    ange
    1.58
    angs
    1.52
    ANGE
    1.48
    angers
    1.39
    ANG
    1.38
    ang
    1.37
    anger
    1.34
    Act Density 0.007%

    No Known Activations