INDEX
    Explanations

    actions involving physical force or effort, such as pushing, pulling, grabbing, and tugging

    actions involving physical force or manipulation

    New Auto-Interp
    Negative Logits
     Surviv
    -0.77
    mun
    -0.74
    ãĥīãĥ©ãĤ´ãĥ³
    -0.73
     Broadcast
    -0.70
    Deal
    -0.70
    ãĥİ
    -0.69
    league
    -0.69
    天
    -0.68
    mberg
    -0.67
     Fallen
    -0.67
    POSITIVE LOGITS
     unconscious
    0.84
     joints
    0.79
     torches
    0.78
     glide
    0.78
    lasses
    0.77
     jerk
    0.76
     limp
    0.74
     downwards
    0.74
     gently
    0.73
     fists
    0.73
    Act Density 0.141%

    No Known Activations