    Explanations

    noun-adjective or noun-adverb pairings

    Negative Logits
    pita  0.43
     দেখাতে  0.42
    öyle  0.40
     răm  0.40
     bikin  0.40
     compuesta  0.40
     quần  0.39
     Dieu  0.38
     amacıyla  0.38
     maken  0.38
    Positive Logits
     ца  0.40
    edent  0.39
     атмос  0.39
    ceff  0.39
     Tension  0.38
    ucher  0.38
     tension  0.37
     suff  0.37
     अने  0.37
     accum  0.36
    Act Density 0.005%

    No Known Activations