INDEX
    Explanations

    expressions related to social interactions and conflicts

    comma usage in sentences with contrasting ideas or clauses

    New Auto-Interp
    Negative Logits
    tains
    -0.88
    itational
    -0.74
    ãĥīãĥ©
    -0.68
    nergy
    -0.67
    qqa
    -0.66
    ãĥ¯
    -0.65
    tesy
    -0.63
    cellence
    -0.63
    luence
    -0.62
    eatures
    -0.62
    POSITIVE LOGITS
     fearing
    1.31
     afraid
    1.14
     worried
    1.03
     preferring
    1.01
     ashamed
    1.00
     believing
    0.99
     realizing
    0.98
     wondering
    0.98
     feared
    0.97
     thinking
    0.97
    Act Density 0.396%

    No Known Activations