INDEX
    Explanations

    instances of emotional or traumatic events, specifically focusing on loss, suffering, or danger

    New Auto-Interp
    Negative Logits
    Geplaatst
    -0.70
    alnız
    -0.63
     Porn
    -0.62
    üyada
    -0.60
    ControllerAdvice
    -0.58
     EconPapers
    -0.57
    gebras
    -0.55
    Hochspringen
    -0.55
    EndInit
    -0.55
     pardon
    -0.54
    POSITIVE LOGITS
     فريبيس
    0.87
    !("{
    0.55
    ]})
    0.51
     للمعارف
    0.49
    <sup>
    0.48
    webElement
    0.48
    رشف
    0.48
    [][]
    0.48
    masını
    0.47
     Baus
    0.47
    Act Density 0.235%

    No Known Activations