    Negative Logits
    Token             Logit
    " peri"           -0.08
    "ไม"              -0.08
    ".apps"           -0.07
    " Möglichkeit"    -0.07
    " snippets"       -0.07
    " Protein"        -0.07
    " sembl"          -0.07
    "_strip"          -0.07
    " випад"          -0.06
    "ني"              -0.06
    Positive Logits
    Token             Logit
    " loud"           0.22
    " Loud"           0.15
    " loudly"         0.12
    " louder"         0.12
    " aloud"          0.09
    " Lou"            0.09
    " lou"            0.08
    "Lou"             0.08
    " Лу"             0.08
    "large"           0.07
    Activation Density: 0.003%

    No Known Activations