INDEX
    Explanations

    terms related to bending or curving actions

    New Auto-Interp
    Negative Logits
     del
    -0.51
     Sal
    -0.48
     bodem
    -0.48
    Drawing
    -0.47
    Джерела
    -0.44
    カウン
    -0.44
     figyel
    -0.43
    leden
    -0.43
    loten
    -0.43
     drew
    -0.43
    POSITIVE LOGITS
     bend
    0.82
     repos
    0.74
     Theſe
    0.73
    Bend
    0.72
     تانيه
    0.71
     Cæsar
    0.71
     bends
    0.70
     myſelf
    0.69
     bent
    0.68
     Huguen
    0.68
    Act Density 2.319%

    No Known Activations