INDEX
    Explanations

    various forms of the verb "back" in different contexts

    New Auto-Interp
    Negative Logits
     Rouge
    -0.16
    ad
    -0.15
    esi
    -0.15
    ATS
    -0.15
    isti
    -0.15
    DM
    -0.15
    on
    -0.15
    uya
    -0.15
    اش
    -0.14
    aten
    -0.14
    POSITIVE LOGITS
     backing
    0.32
    Backing
    0.26
     backed
    0.23
    /back
    0.22
     backs
    0.21
    -backed
    0.21
    haul
    0.20
    =back
    0.20
    (back
    0.20
    plash
    0.19
    Act Density 0.014%

    No Known Activations