INDEX
    Explanations
    NEGATIVE LOGITS
    " выступ"         -0.08
    " Lel"            -0.08
    "dracht"          -0.08
    "('.')"           -0.08
    "ilever"          -0.08
    "acula"           -0.08
    "יאל"             -0.07
    " fundamentally"  -0.07
    " who's"          -0.07
    "astră"           -0.07
    POSITIVE LOGITS
    " tailored"       0.10
    " biệt"           0.10
    " speciale"       0.10
    " niche"          0.09
    " Augenmerk"      0.09
    " khusus"         0.09
    " specializes"    0.09
    " specialized"    0.09
    "-special"        0.09
    " boutiques"      0.09
    Activation Density: 0.218%

    No Known Activations