INDEX
    Explanations

    phrases related to personal struggles or challenges

    expressions of frustration or dissatisfaction

    NEGATIVE LOGITS
    "ppings"         -0.64
    "geries"         -0.64
    " Bowen"         -0.58
    " Chao"          -0.56
    " subsequent"    -0.54
    " Combine"       -0.53
    " repeated"      -0.53
    "rompt"          -0.53
    "ries"           -0.52
    " Milton"        -0.52
    POSITIVE LOGITS
    "âĢ"       1.29
    "âĺ"       1.26
    "ðŁij"     1.15
    "âľ"       1.10
    "âĿ"       1.08
    " ðŁij"    1.05
    ".ãĢį"     1.05
    "ðŁ"       1.04
    "â"        1.01
    " âĢ"      1.00
    Act Density 0.373%

    No Known Activations