INDEX
    Explanations

    questions and confirmations within discussions

    New Auto-Interp
    Negative Logits
    leo
    -0.14
    εÏģι
    -0.13
    ainter
    -0.13
    ibre
    -0.12
     Logic
    -0.12
    wij
    -0.12
    ylko
    -0.12
    ynos
    -0.12
     --------↵
    -0.12
     [/
    -0.12
    POSITIVE LOGITS
    gross
    0.15
    umm
    0.15
    QUI
    0.15
    chor
    0.14
    -www
    0.13
    istrovstvÃŃ
    0.13
    tü
    0.13
     sk
    0.13
    ãģªãģŁ
    0.13
     Williamson
    0.13
    Act Density 0.003%

    No Known Activations