INDEX
    Explanations

    references to mental health

    New Auto-Interp
    Negative Logits
     ***!
    -1.00
     للمعارف
    -0.99
     كومونز
    -0.97
    })();
    
    -0.94
     */
    
    
    -0.94
     myſelf
    -0.94
    ValueStyle
    -0.93
     MonoBehaviour
    -0.93
    batore
    -0.93
     الحره
    -0.92
    POSITIVE LOGITS
     mental
    0.75
    ta
    0.73
     Mental
    0.72
    Mental
    0.70
    mental
    0.68
    y
    0.64
    5
    0.64
    1
    0.62
    ure
    0.61
    ва
    0.59
    Act Density 0.005%

    No Known Activations