INDEX
    Explanations

    actions involving raising or lifting objects or limbs

    New Auto-Interp
    Negative Logits
    ahoo
    -0.18
    onis
    -0.16
     prime
    -0.15
    igure
    -0.14
     Herr
    -0.14
    bard
    -0.14
     nerves
    -0.13
     brains
    -0.13
    brain
    -0.13
    acho
    -0.13
    POSITIVE LOGITS
    .raise
    0.22
    Raises
    0.20
    .scalablytyped
    0.20
     arms
    0.20
    raised
    0.19
     Raise
    0.19
     raise
    0.19
    arms
    0.19
    raises
    0.19
     raised
    0.18
    Act Density 0.020%

    No Known Activations