INDEX
    Explanations

    references to political figures, specifically the president

    Negative Logits
    ĥ½              -2.42
    ŀ               -1.80
    Ĥ               -1.74
    ¢               -1.72
    ĵ               -1.69
    ij              -1.69
    ¬               -1.68
    backs           -1.66
     Respondents    -1.60
    Ĵ               -1.60
    Positive Logits
    doms            1.92
    coat            1.91
    zilla           1.74
    brush           1.71
    sheet           1.70
    liness          1.65
    esses           1.63
    ee              1.59
    urally          1.59
    们              1.58
    Activation Density 0.021%

    No Known Activations