INDEX
    Explanations

    phrases related to expressing preferences or opinions

    expressions of concern and emotional support within interpersonal relationships

    New Auto-Interp
    Negative Logits
    etheless
    -0.93
    ãĤ´ãĥ³
    -0.87
    ¥ŀ
    -0.84
    ortium
    -0.81
    *:
    -0.78
    Indeed
    -0.75
    âĦ¢:
    -0.74
    surprisingly
    -0.73
    UGC
    -0.72
    éŃĶ
    -0.72
    POSITIVE LOGITS
    ,'"
    1.40
     â̦"
    1.35
    .")
    1.28
    .'"
    1.28
     ..."
    1.23
    ?'"
    1.22
    ',"
    1.20
    !'"
    1.14
    ),"
    1.14
    ,"
    1.13
    Act Density 0.917%

    No Known Activations