INDEX
    Explanations

    long, specific names and terms, potentially including surnames and locations

    New Auto-Interp
    Negative Logits
    epsfig
    -0.55
    lüğ
    -0.55
     vsak
    -0.52
    leyeb
    -0.52
    ceğim
    -0.52
     oseb
    -0.52
    lenmiş
    -0.52
    ceğini
    -0.51
     Septembre
    -0.51
    ceğ
    -0.49
    POSITIVE LOGITS
     kele
    0.89
     seksi
    0.88
     sula
    0.88
     lele
    0.85
     antik
    0.85
     ille
    0.84
     maksi
    0.84
     kosme
    0.84
     silikon
    0.84
     kollek
    0.84
    Act Density 0.298%

    No Known Activations