INDEX
    Explanations

    references to the concept of "free" in various contexts

    New Auto-Interp
    Negative Logits
    c
    -0.18
    la
    -0.17
    tra
    -0.17
    elastic
    -0.17
    re
    -0.16
    so
    -0.16
    ร
    -0.16
    ry
    -0.15
    st
    -0.15
    revision
    -0.15
    POSITIVE LOGITS
    bie
    0.27
    bies
    0.27
    bsd
    0.19
    esktop
    0.17
     à¹Ĩ
    0.16
    -floating
    0.16
    hold
    0.16
    zeitig
    0.16
     dÃłng
    0.15
    -ÑĤаки
    0.15
    Act Density 0.050%

    No Known Activations