INDEX
    Explanations

    articles following prepositions/verbs

    New Auto-Interp
    Negative Logits
     diğer
    0.63
     যেসব
    0.62
    ренные
    0.58
     самые
    0.56
    ските
    0.55
     diejenigen
    0.55
    这些
    0.53
     seus
    0.53
     самых
    0.52
     যেগুলো
    0.52
    POSITIVE LOGITS
     isang
    1.94
     a
    1.87
     an
    1.83
     একটি
    1.80
     sebuah
    1.76
     einem
    1.72
     unui
    1.68
     एक
    1.63
     einer
    1.59
     einen
    1.59
    Act Density 0.899%

    No Known Activations