INDEX
    Explanations

    verb or adjective followed by preposition

    New Auto-Interp
    Negative Logits
    और
    0.80
     وغ
    0.75
    ֩
    0.75
    そして
    0.75
    এবং
    0.74
     وع
    0.74
    0.71
     અને
    0.71
     এবং
    0.70
    0.68
    POSITIVE LOGITS
    .].
    1.29
    .).
    1.29
    ."
    1.24
    ].
    1.23
    .</
    1.21
    .}
    1.21
     unless
    1.20
    .]
    1.19
    ."""
    1.18
    ?).
    1.18
    Act Density 0.121%

    No Known Activations