INDEX
    Explanations

    references to scratching or related actions

    New Auto-Interp
    Negative Logits
     perc
    -0.17
    ška
    -0.16
     Hawk
    -0.16
    ropa
    -0.16
    ight
    -0.16
    elda
    -0.15
    Wire
    -0.15
    urm
    -0.15
    ٳ
    -0.15
    uitka
    -0.14
    POSITIVE LOGITS
    scratch
    0.15
    217
    0.15
    817
    0.15
    illation
    0.15
    267
    0.15
    .dtp
    0.14
    apter
    0.14
     Scratch
    0.14
     scratch
    0.14
       
    0.14
    Act Density 0.010%

    No Known Activations