INDEX
    Explanations

    prepositions and their related forms

    New Auto-Interp
    Negative Logits
    yle
    -0.16
    uros
    -0.15
    ÃĹ↵↵
    -0.15
    tober
    -0.15
    xies
    -0.14
    åľŁ
    -0.14
     dagen
    -0.14
    peria
    -0.14
    velt
    -0.13
    hci
    -0.13
    POSITIVE LOGITS
    olla
    0.17
    izz
    0.16
    æĢĿãģĦ
    0.16
    inati
    0.15
    azzo
    0.14
     pÅĻÃŃležit
    0.14
    voor
    0.14
    omination
    0.14
    .jd
    0.14
     lit
    0.14
    Act Density 0.034%

    No Known Activations