INDEX
    Explanations

    terms related to scanning functions or activities

    New Auto-Interp
    Negative Logits
    kan
    -0.16
    ksen
    -0.16
    geber
    -0.15
    keit
    -0.15
    chnitt
    -0.15
    acomment
    -0.15
    strand
    -0.15
    unker
    -0.14
    ëĵł
    -0.14
    inction
    -0.14
    POSITIVE LOGITS
    olini
    0.18
    æıı
    0.18
    /sc
    0.17
    pst
    0.17
    iero
    0.17
    =sc
    0.16
    orama
    0.16
    warz
    0.16
    lations
    0.15
    crow
    0.15
    Act Density 0.033%

    No Known Activations