INDEX
    Explanations

    punctuation

    New Auto-Interp
    Negative Logits
     somme
    -0.08
    isite
    -0.08
    -0.08
     característica
    -0.08
     palma
    -0.08
     Polski
    -0.08
     gedaan
    -0.08
     aut
    -0.08
     sprang
    -0.08
     pisa
    -0.08
    POSITIVE LOGITS
     Bab
    0.08
    Ар
    0.08
    (?
    0.07
     menet
    0.07
    cp
    0.07
    rap
    0.07
    0.07
    arp
    0.07
    seq
    0.07
    Bab
    0.07
    Act Density 0.216%

    No Known Activations