INDEX
    Explanations

    phrases related to physical exertion and effort

    phrases that convey intense emotional experiences or struggles

    New Auto-Interp
    Negative Logits
    etheless
    -0.81
     respectively
    -0.62
     «
    -0.61
     briefly
    -0.59
    ãĤ´ãĥ³
    -0.58
    ģĸ
    -0.57
     Alternate
    -0.57
    described
    -0.56
     Released
    -0.56
    ¥ŀ
    -0.56
    POSITIVE LOGITS
    .")
    1.44
     â̦"
    1.34
    )."
    1.29
    !"
    1.28
     ..."
    1.27
    ),"
    1.20
    ?"
    1.19
    )"
    1.19
     ['
    1.15
    ,'"
    1.14
    Act Density 1.552%

    No Known Activations