INDEX
    Explanations

    phrases that express an individual’s sense of responsibility or reflection on their actions and experiences

    New Auto-Interp
    Negative Logits
    تقاوى
    -0.65
     hende
    -0.58
     مرئيه
    -0.57
    RenderAtEndOf
    -0.54
    -0.53
    ']
    
    -0.53
     zda
    -0.51
    '},
    
    -0.50
    หน้านี้
    -0.50
     dessus
    -0.50
    POSITIVE LOGITS
     got
    2.25
    got
    1.81
    Got
    1.60
     Got
    1.57
     GOT
    1.50
    GOT
    1.25
     gota
    1.02
    gota
    1.00
     gotcha
    0.97
     gott
    0.96
    Act Density 0.176%

    No Known Activations