INDEX
    Explanations

    BSD-2-Clause-Patent name

    New Auto-Interp
    Negative Logits
     individuel
    -1.13
     tikai
    -1.12
     pittores
    -1.07
     bambou
    -1.02
    áis
    -1.01
     sidor
    -1.00
     Wład
    -0.99
     hunde
    -0.99
    های
    -0.98
     cyclisme
    -0.98
    POSITIVE LOGITS
    ↵↵↵↵↵↵↵↵↵↵↵↵↵
    1.13
    ↵↵↵↵↵↵↵↵↵↵↵↵
    1.05
    ↵↵↵↵↵↵↵↵↵↵↵
    1.05
    ↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵
    1.03
     nörd
    0.98
    ↵↵↵↵↵↵↵↵↵↵
    0.97
    ↵↵↵↵↵↵↵
    0.93
    icona
    0.93
    𝗩
    0.92
     what
    0.88
    Act Density 0.043%

    No Known Activations