INDEX
    Explanations

    phrases related to evaluation or judgment, especially in relation to individuals or their actions

    references to opinions and assessments about individuals, particularly in the context of leadership and public perception

    New Auto-Interp
    Negative Logits
    zbollah
    -0.81
    '/
    -0.68
    iae
    -0.61
    pora
    -0.60
    anguage
    -0.58
    izo
    -0.56
    oola
    -0.54
    imum
    -0.54
    ather
    -0.51
     (/
    -0.51
    POSITIVE LOGITS
     him
    2.49
    him
    1.80
     his
    1.65
     HIM
    1.63
    his
    1.54
     Him
    1.52
    His
    1.49
     he
    1.40
    He
    1.38
     His
    1.24
    Act Density 1.191%

    No Known Activations